# Emmanuelle Rieuf's Blog (178)

### Ranking All 50 States by Average Credit Score of its Citizens

Whether you want it to or not, credit and its availability plays a major role in everyone’s life, whether or not you directly experience it. For the average person, credit scores are mainly going to be used for three things: buying a house, buying a car, and using credit cards.

In the world of business, things get exponentially more complicated and it also ended up leading to a horrible housing crash and recession…

Continue

Added by Emmanuelle Rieuf on April 15, 2017 at 9:30am — 1 Comment

### The Startup Founder’s Guide to Analytics

This article was written by Tristan Handy. Tristan is the founder and president of Fishtown Analytics: helping startups implement advanced analytics.

I’m very confident of that, because today, everyone needs analytics. Not just product, not just marketing, not just finance… sales, fulfillment, everyone at a startup needs analytics today.…

Continue

Added by Emmanuelle Rieuf on April 10, 2017 at 11:00am — No Comments

### Implement an ARIMA model using statsmodels (Python)

In this article was written by Michael Grogan. Michael is a data scientist and statistician, with a profound passion for statistics and programming.

In a previous tutorial, I elaborated on how an ARIMA model can be implemented using R. The model was fitted on a stock price dataset, with a (0,1,0) configuration being used for ARIMA.

Here, I detail how to implement an ARIMA model in Python using the…

Continue

Added by Emmanuelle Rieuf on April 9, 2017 at 11:00am — No Comments

### Introduction to Anomaly Detection

In this article, Data Scientist Pramit Choudhary provides an introduction to both statistical and machine learning-based approaches to anomaly detection in Python. Introduction: Anomaly Detection

This overview is intended for beginners in the fields of data science and machine learning.…

Continue

Added by Emmanuelle Rieuf on April 6, 2017 at 12:30pm — No Comments

### Implementing the Gradient Descent Algorithm in R

A Brief Introduction:

Linear regression is a classic supervised statistical technique for predictive modelling which is based on the linear hypothesis:

## y = mx + c

where is the response or outcome variable, m is the gradient of the linear…

Continue

Added by Emmanuelle Rieuf on April 4, 2017 at 6:00pm — No Comments

### Build a Recurrent Neural Net in 5 min

This video was posted on Youtube by Sirajology. He explains the basics of recurrent neural networks. Then you code your own RNN in 80 lines of python (plus white-space) that predicts the sum of two binary numbers after training.

Code for this video:…

Continue

Added by Emmanuelle Rieuf on April 4, 2017 at 12:00pm — No Comments

## Book Description

AI and Deep Learning are transforming the way we understand software, making computers more intelligent than we could even imagine just a decade ago. Deep Learning algorithms are being used across a broad range of industries – as the…

Continue

Added by Emmanuelle Rieuf on April 2, 2017 at 6:31pm — No Comments

### Book: Advanced R (Chapman & Hall/CRC The R Series)

An Essential Reference for Intermediate and Advanced R Programmers

Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of…

Continue

Added by Emmanuelle Rieuf on March 30, 2017 at 3:30am — No Comments

### Book: Neural Networks and Statistical Learning

Providing a broad but in-depth introduction to neural network and machine learning in a statistical framework, this book provides a single, comprehensive resource for study and further research. All the major…

Continue

Added by Emmanuelle Rieuf on March 29, 2017 at 5:00pm — No Comments

Hadoop is an open-source framework developed in Java, dedicated to store and analyze the large sets of unstructured data. It is a highly scalable platform which allows multiple concurrent tasks to run from single to thousands of servers without any delay.

It consists of a distributed file system that allows transferring data and files in split seconds between different nodes. Its ability to process efficiently even if a node…

Continue

Added by Emmanuelle Rieuf on March 28, 2017 at 7:30am — 1 Comment

### Apache Spark Introduction – A Comprehensive Guide for beginners

This article was posted on Data Flair. Below is a quick overview of the original article.

1.Objective

This tutorial provides introduction to Apache Spark, what are its ecosystem components, Spark abstraction – RDD, transformation and action. The…

Continue

Added by Emmanuelle Rieuf on March 27, 2017 at 4:00pm — No Comments

### Getting Started with Deep Learning

This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics.

At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting…

Continue

Added by Emmanuelle Rieuf on March 24, 2017 at 12:30pm — No Comments

## Introduction:

Machine learning is a very hot topic for many key reasons, and because it provides the ability to automatically obtain deep insights, recognize unknown patterns, and create high performing predictive models from data, all without requiring explicit programming…

Continue

Added by Emmanuelle Rieuf on March 22, 2017 at 3:00pm — No Comments

### Free Machine Learning eBooks - March 2017

MACHINE LEARNING

Edited by Abdelhamid Mellouk and Abdennacer Chebira

Machine Learning can be defined in various ways related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some Human Like intelligent behaviour.

Machine Learning addresses more specifically the ability to…

Continue

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — 5 Comments

### Little Bee books: Tough topics simply explained

This is a nice collection of free eBooks to learn the ropes on topics covering Hadoop, machine learning, Spark, analytics, and more.

The Little Bee series of books provides an overview of the hot topics in data and analytics, giving you a snapshot of each technology and its potential benefit to your organisation. These books will not make you an expert, but they will improve your understanding and open the door to new ideas.

The subject of data and…

Continue

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — No Comments

### Email Spam Filtering : A python implementation with scikit-learn

This article was written by ML bot2 on Machine Learning in Action.

Text mining (deriving information from text) is a wide field which has gained popularity with the huge text data being generated. Automation of a number of applications like sentiment analysis, document classification, topic classification, text summarization, machine translation, etc has been done using machine learning…

Continue

Added by Emmanuelle Rieuf on March 17, 2017 at 9:15am — No Comments

### Python & JSON: Working with large datasets using Pandas

Introduction

Working with large JSON datasets can be a pain, particularly when they are too large to fit into memory. In cases like this, a combination of command line tools and Python can make for an efficient way to explore and analyze the data. In this post, we’ll look at how to leverage tools like Pandas to explore and map out police activity in Montgomery County, Maryland. We’ll start with a look at the…

Continue

Added by Emmanuelle Rieuf on March 17, 2017 at 9:00am — No Comments

### Intuitive Machine Learning : Gradient Descent Simplified

How do machines learn? They learn…

Continue

Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments

### Top Data Science Tweets and Influencers on Twitter

From NodeXLExcelAutomator, recently updated..

The graph represents a network of 3,210 Twitter users whose tweets in the requested range contained "data science" or #datascience", or who were replied to or mentioned in those tweets.

According to this source, the top influencers…

Continue

Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments

### An Introduction to Key Data Science Concepts- Infographic

This infographic was posted by Robert Kelley on Dataiku.

Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they are about crunching numbers, and creativity flourishes in a collaborative environment. One key to a collaborative environment is having a shared set of terms and…

Continue

Added by Emmanuelle Rieuf on March 15, 2017 at 11:30am — No Comments

2017

2016

1

2

3

4

5

6