Interesting survey produced by 365datascience.com, summarized in a few infographics, and based on LinkedIn profiles.. Summary: R (53%) and Python (53%) are the programmin...
Published in 2013, but still very interesting, and different from most data science books. Authors: Ian Langmore and Daniel Krasner.. This book focuses more on the stat...
Sad news about the Theano framework. Here is the announcement that some developers found in their mailbox two days ago. Dear users and developers, After almost ten yea...
You will find here nine interesting topics that you won’t learn in college classes. Most have interesting applications in business and elsewhere. They are not espec...
This article was written by Saurav Kaushik. Saurav is a Data Science enthusiast, currently in the final year of his graduation at MAIT, New Delhi. He loves to use machi...
In this article, an R-hadoop (with rmr2) implementation of Distributed KMeans Clustering will be described with a sample 2-d dataset. First the dataset shown below i...
And 92 percent of all (positive) integers have a factor under 1,000. And how many have a factor under 6? Can you guess the answer? Read more to find out. Clearly, the vas...
By Reiichiro Nakano. There are a number of visualizations that frequently pop up in machine learning. Scikit-plot is a humble attempt to provide aesthetically-challenged ...
Time series forecasting is different from other machine learning problems. The key difference is the fixed sequence of observations and the constraints and additional str...
Most of the articles on extreme events are focusing on the extreme values. Very little has been written about the arrival times of these events. This article fills the ga...