R is a well-known and increasingly popular tool in the data science field. It is a programming language and software environment designed primarily for statistical computing, so its interface and structure are well suited to scientific tasks. Moreover, R has one of the most developed package ecosystems, with thousands of packages that solve a wide variety of problems.
Although there are many general-purpose…Continue
SmartEDA, an R package for exploratory data analysis, is now available on CRAN. The package includes multiple custom functions that perform initial exploratory analysis on any input data, describing the structure and the relationships present in the data. The generated output can be obtained in both summary and graphical form. The graphical form or charts…Continue
Added by Dayanand on May 22, 2018 at 2:00am — No Comments
Summary: Researchers in Synthetic Neuro Biology are proposing to solve the AGI problem by building a brain in the laboratory. This is not science fiction. They are virtually at the door of this capability. Increasingly these researchers are presenting at major AGI conferences. Their argument is compelling.
Added by William Vorhies on May 21, 2018 at 3:00pm — No Comments
It is hard to imagine that some data element could contain less information than a bit (a digit equal to either 0 or 1). Yet examples are abundant. Indeed, I am wondering if we should create a unit of information called the microbit, or the nanobit.
The first examples that come to my mind are some irrational numbers such as Pi: its digits are widely believed to be indistinguishable from pure noise, thus carrying essentially no information. While there is not enough data storage in the…Continue
Added by Vincent Granville on May 21, 2018 at 8:00am — No Comments
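One standard way to make "less than a bit" concrete is Shannon entropy, which may or may not be the measure the teaser above has in mind. The sketch below is a minimal illustration under that assumption: the function name `binary_entropy` and the probability 0.001 are chosen here for the example and do not come from the article.

```python
import math


def binary_entropy(p: float) -> float:
    """Shannon entropy, in bits, of a binary symbol that equals 1 with probability p."""
    if p in (0.0, 1.0):
        return 0.0  # a certain outcome carries no information
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


# A fair binary digit carries exactly one bit.
fair = binary_entropy(0.5)

# A heavily skewed digit (assumed here: 1 with probability 0.001)
# carries only a small fraction of a bit on average.
skewed = binary_entropy(0.001)

print(f"fair digit:   {fair:.4f} bits")
print(f"skewed digit: {skewed:.4f} bits")
```

Under this measure, a stored binary digit that is almost always 0 averages roughly a hundredth of a bit, which is one sense in which a data element can hold less than a bit.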
In this column, we would like to elaborate on the concept of data security.
Although security is often related to privacy, the two are not synonyms. Data security can be defined as the set of policies and techniques that ensure the confidentiality, availability and integrity of data at all times. Data privacy, on the other hand, refers to the fact that the parties accessing and using the data do so only in ways that comply with the agreed-upon purposes of data…Continue
Added by Bart Baesens on May 21, 2018 at 2:30am — No Comments
To SQL or not To SQL: that’s the question!
Lemahieu W., vanden Broucke S., Baesens B.
This article is based upon our upcoming book Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data, www.pdbmbook.com See also our corresponding YouTube channel with free video lectures :…Continue
Added by Bart Baesens on May 20, 2018 at 9:00pm — No Comments
I recently read a very popular article entitled 5 Reasons “Logistic Regression” should be the first thing you learn when becoming a Data Scientist. Here I provide my opinion on why this should not be the case.
It is nice to have logistic regression on your resume, as many jobs request it, especially in some fields such as biostatistics. And if you learned the details during your college classes, good for you. However, for a beginner, this is not the first thing you should…Continue
These pictures were posted on Quora by Oleg Sergeykin, former Structural Analysis Engineer at Boeing. His philosophy is that data science is actually an iterative process. It is never possible to complete a DS project in a single pass. A data scientist constantly tries new ideas and changes steps of his pipeline.…Continue
Added by Capri Granville on May 20, 2018 at 1:00pm — No Comments
Many are free. They are available online. They are offered by Princeton, Georgia Tech, Harvard, Columbia, Stanford, and Penn State.
Machine learning is the hottest subject of our time and data scientist is the sexiest job of today, but implementing these buzzwords in real-life business is the most important need. The real need for today’s time and business is to clarify,…Continue
This is the new book by Andrew Ng, still in progress. Andrew Yan-Tak Ng is a computer scientist and entrepreneur. He is one of the most influential minds in Artificial Intelligence and Deep Learning. Ng founded and led Google Brain and was formerly VP & Chief Scientist at Baidu, where he built the company's Artificial Intelligence Group into a team of several thousand people. He is an adjunct professor (formerly associate professor and Director of the AI Lab) at Stanford University. Ng is also an early…Continue
Added by Capri Granville on May 20, 2018 at 9:00am — No Comments
Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week.
Added by Vincent Granville on May 19, 2018 at 12:00pm — No Comments
The Explainable Machine Learning Challenge is a collaboration between Google, FICO and academics at Berkeley, Oxford, Imperial, UC Irvine and MIT, to generate new research in the area of algorithmic explainability. Teams will be challenged to create machine learning models with both high accuracy and explainability; they will use a real-world financial dataset provided by FICO. Designers and end users of machine learning algorithms will both benefit from more interpretable and…Continue
Added by Capri Granville on May 19, 2018 at 11:00am — No Comments
The Practical Guide to Storing, Managing and Analyzing Big and Small Data -- Cambridge University Press.
This comprehensive textbook teaches the fundamentals of database design, modeling, systems, data storage, and the evolving world of data warehousing, governance and more. Written by experienced educators and experts in big data, analytics, data quality, and data integration, it provides an up-to-date approach to database management. This full-color, illustrated text has a…Continue
Technology is known to shift landscapes, even change the game. We saw that when the internet exploded in scale and popularity, as computers became smarter, and as the world went through its digital transformation. An easy example is traditional marketing, which now borders on the irrelevant, unable to hold a candle to its more modern counterparts.
Added by Jay Nair on May 18, 2018 at 4:30pm — No Comments
A single query optimization tip can boost your database performance by 100x. At one point, we advised one of our customers that had a 10TB database to use a date-based multi-column index. As a result, their date range query sped up…Continue
Added by Luba Belokon on May 17, 2018 at 2:30am — No Comments
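To illustrate the kind of tip described in the teaser above, here is a minimal sketch of a multi-column index that includes a date column, checked against a date-range query. Everything specific in it is an assumption made for the example: the `orders` table, its columns, the index name `idx_orders_customer_date`, and the use of SQLite (chosen only because it ships with Python). The customer's actual database engine and schema are not stated in the teaser.

```python
import sqlite3

# In-memory database with a hypothetical orders table (names are illustrative).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "id INTEGER PRIMARY KEY, customer_id INTEGER, created_at TEXT, amount REAL)"
)
rows = [
    (i % 50, f"2018-{1 + i % 12:02d}-{1 + i % 28:02d}", float(i))
    for i in range(5000)
]
conn.executemany(
    "INSERT INTO orders (customer_id, created_at, amount) VALUES (?, ?, ?)", rows
)

# A date-range query restricted to one customer.
query = (
    "SELECT SUM(amount) FROM orders "
    "WHERE customer_id = ? AND created_at BETWEEN ? AND ?"
)
params = (7, "2018-03-01", "2018-05-31")


def plan(sql, args):
    """Return SQLite's query plan as one string (the detail column of each row)."""
    cursor = conn.execute("EXPLAIN QUERY PLAN " + sql, args)
    return " | ".join(row[3] for row in cursor)


plan_before = plan(query, params)

# Multi-column index: equality column first, then the date column used for the range.
conn.execute(
    "CREATE INDEX idx_orders_customer_date ON orders (customer_id, created_at)"
)
plan_after = plan(query, params)

print("before:", plan_before)
print("after: ", plan_after)
```

Comparing the two plans shows whether the engine switches from scanning the whole table to searching the index. The size of any speedup depends on the data volume and the engine, so this sketch makes no claim about reproducing the figure quoted in the teaser.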
After all, the term Machine Learning was coined based on the way the human (or animal) brain learns, meaning that somehow, machines could also benefit from a similar kind of learning.
But human beings, successful ones for sure, know how to un-learn. In my case, although I have been fascinated by mathematics since my very early years, the school system's training (as in training an algorithm in ML) failed on me. It failed not because I did not succeed at school (I…Continue
Added by Vincent Granville on May 16, 2018 at 4:06pm — No Comments
The words Data and Data Science have taken the business world by storm. Nowadays, improving business productivity and performance depends greatly on collecting and analyzing data. Businesses have been processing data for ages, but the introduction of the Internet of Things (IoT) has been a game changer. Data collected through IoT is analyzed using different techniques than data collected traditionally. Furthermore, Data Scientists require more sophisticated skills for analyzing IoT…Continue
Added by VAMSI NELLUTLA on May 16, 2018 at 1:30pm — No Comments
I've got a big digital mouth. Last time, I wrote on frequencies using R, noting cavalierly that I'd done similar development in Python/Pandas. I wasn't lying, but the pertinent work I dug up from…Continue
Added by steve miller on May 16, 2018 at 12:30pm — No Comments
Demystifying key buzzwords like artificial intelligence, machine learning, artificial neural networks and deep learning is a simple and a complex task at the same time.
Let us attempt to cut through the thick confusion about how all-encompassing terms like artificial intelligence, machine learning, and deep learning relate to each other. Machine learning, Blockchain and…Continue
Added by Vinod Sharma on May 16, 2018 at 12:00am — No Comments