This article was contributed by Statistical Future.
Whether you want them to or not, credit and its availability play a major role in everyone’s life, even if you never experience it directly. For the average person, credit scores are mainly going to be used for three things: buying a house, buying a car, and using credit cards.
In the world of business, things get exponentially more complicated, and that complexity also ended up leading to a horrible housing crash and recession…Continue
This article was written by Tristan Handy. Tristan is the founder and president of Fishtown Analytics, which helps startups implement advanced analytics.
I’m very confident of that, because today, everyone needs analytics. Not just product, not just marketing, not just finance… sales, fulfillment, everyone at a startup needs analytics today.…Continue
Added by Emmanuelle Rieuf on April 10, 2017 at 11:00am — No Comments
This article was written by Michael Grogan. Michael is a data scientist and statistician, with a profound passion for statistics and programming.
In a previous tutorial, I elaborated on how an ARIMA model can be implemented using R. The model was fitted on a stock price dataset, with a (0,1,0) configuration being used for ARIMA.
Here, I detail how to implement an ARIMA model in Python using the…Continue
Added by Emmanuelle Rieuf on April 9, 2017 at 11:00am — No Comments
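For readers who want a feel for the (0,1,0) configuration mentioned in the excerpt above before following the link, here is a minimal sketch. An ARIMA(0,1,0) model without a constant is a random walk, so fitting it comes down to differencing the series once and estimating the noise variance. This is plain Python, not whichever library the truncated original goes on to name; the function names and the sample prices are hypothetical.

```python
import math

def fit_random_walk(series):
    """Fit ARIMA(0,1,0) without a constant: y[t] = y[t-1] + e[t]."""
    # First difference: the "1" in the (0,1,0) configuration.
    diffs = [b - a for a, b in zip(series, series[1:])]
    # Noise variance estimate, assuming zero-mean innovations (no drift).
    sigma2 = sum(d * d for d in diffs) / len(diffs)
    return {"last": series[-1], "sigma2": sigma2}

def forecast_random_walk(model, steps):
    """Point forecasts and their standard errors for 1..steps ahead."""
    # With no AR or MA terms, every point forecast is the last observed value.
    point = [model["last"]] * steps
    # Forecast variance grows linearly with the horizon h.
    std_err = [math.sqrt(model["sigma2"] * h) for h in range(1, steps + 1)]
    return point, std_err

# Made-up prices, for illustration only.
prices = [100.0, 101.0, 99.0, 102.0, 103.0]
model = fit_random_walk(prices)
point, std_err = forecast_random_walk(model, steps=3)
```

The flat point forecast with widening error bands is what makes a (0,1,0) fit a useful baseline against which richer configurations can be compared.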
This overview is intended for beginners in the fields of data science and machine learning.…Continue
Added by Emmanuelle Rieuf on April 6, 2017 at 12:30pm — No Comments
This article was posted by S. Richter-Walsh.
A Brief Introduction:
Linear regression is a classic supervised statistical technique for predictive modelling which is based on the linear hypothesis:
where y is the response or outcome variable, m is the gradient of the linear…Continue
Added by Emmanuelle Rieuf on April 4, 2017 at 6:00pm — No Comments
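As a rough illustration of the linear hypothesis described in the excerpt above, the sketch below fits a straight line to a handful of points. The excerpt names y as the response and m as the gradient; the symbol for the intercept is cut off, so the code simply calls it `intercept`. Fitting by ordinary least squares is an assumption here, as are the function name and the sample data.

```python
def fit_line(xs, ys):
    """Fit y = m * x + intercept by ordinary least squares (one predictor)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Gradient: covariance of x and y divided by the variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    m = sxy / sxx
    # The fitted line always passes through the point of means.
    intercept = mean_y - m * mean_x
    return m, intercept

# Made-up points lying on a straight line, for illustration only.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
m, intercept = fit_line(xs, ys)
```

Once `m` and `intercept` are known, a prediction for a new x is just `m * x + intercept`.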
This video was posted on YouTube by Sirajology. He explains the basics of recurrent neural networks. Then you code your own RNN in 80 lines of Python (plus white-space) that predicts the sum of two binary numbers after training.
Code for this video:…Continue
Added by Emmanuelle Rieuf on April 4, 2017 at 12:00pm — No Comments
Added by Emmanuelle Rieuf on April 2, 2017 at 6:31pm — No Comments
An Essential Reference for Intermediate and Advanced R Programmers
Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of…Continue
Added by Emmanuelle Rieuf on March 30, 2017 at 3:30am — No Comments
About the Textbook:
Providing a broad but in-depth introduction to neural networks and machine learning in a statistical framework, this book provides a single, comprehensive resource for study and further research. All the major…Continue
Added by Emmanuelle Rieuf on March 29, 2017 at 5:00pm — No Comments
This article was posted on Intellipaat.
Hadoop is an open-source framework developed in Java, dedicated to storing and analyzing large sets of unstructured data. It is a highly scalable platform that allows multiple concurrent tasks to run on anything from a single server to thousands of servers without any delay.
It consists of a distributed file system that allows transferring data and files in split seconds between different nodes. Its ability to process efficiently even if a node…Continue
This article was posted on Data Flair. Below is a quick overview of the original article.
This tutorial provides an introduction to Apache Spark: its ecosystem components, the Spark abstraction (RDD), and transformations and actions. The…Continue
Added by Emmanuelle Rieuf on March 27, 2017 at 4:00pm — No Comments
This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics.
At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting…Continue
Added by Emmanuelle Rieuf on March 24, 2017 at 12:30pm — No Comments
Machine learning is a very hot topic for many key reasons, not least because it provides the ability to automatically obtain deep insights, recognize unknown patterns, and create high-performing predictive models from data, all without requiring explicit programming…Continue
Added by Emmanuelle Rieuf on March 22, 2017 at 3:00pm — No Comments
Here are three eBooks available for free.
Edited by Abdelhamid Mellouk and Abdennacer Chebira
Machine Learning can be defined in various ways, all related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some human-like intelligent behaviour.
Machine Learning addresses more specifically the ability to…
This is a nice collection of free eBooks to learn the ropes on topics covering Hadoop, machine learning, Spark, analytics, and more.
The Little Bee series of books provides an overview of the hot topics in data and analytics, giving you a snapshot of each technology and its potential benefit to your organisation. These books will not make you an expert, but they will improve your understanding and open the door to new ideas.
The subject of data and…Continue
Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — No Comments
This article was written by ML bot2 on Machine Learning in Action.
Text mining (deriving information from text) is a wide field which has gained popularity with the huge volume of text data being generated. A number of applications, such as sentiment analysis, document classification, topic classification, text summarization, and machine translation, have been automated using machine learning…Continue
Added by Emmanuelle Rieuf on March 17, 2017 at 9:15am — No Comments
This article was posted by Vik Paruchuri.
Working with large JSON datasets can be a pain, particularly when they are too large to fit into memory. In cases like this, a combination of command line tools and Python can make for an efficient way to explore and analyze the data. In this post, we’ll look at how to leverage tools like Pandas to explore and map out police activity in Montgomery County, Maryland. We’ll start with a look at the…Continue
Added by Emmanuelle Rieuf on March 17, 2017 at 9:00am — No Comments
This article was written by Roopam Upadhyay. Roopam is a seasoned professional of advanced analytics with more than a decade of experience in statistical modeling, data science, predictive analytics, optimization, and business consulting.
How do machines learn? They learn…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments
From NodeXLExcelAutomator, recently updated.
The graph represents a network of 3,210 Twitter users whose tweets in the requested range contained "data science" or "#datascience", or who were replied to or mentioned in those tweets.
According to this source, the top influencers…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments
This infographic was posted by Robert Kelley on Dataiku.
Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they are about crunching numbers, and creativity flourishes in a collaborative environment. One key to a collaborative environment is having a shared set of terms and…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 11:30am — No Comments