Emmanuelle Rieuf's Blog (152)

Machine Learning Skills Among Data Scientists

This article was posted by Bob E. Hayes on CustomerThink. Bob, PhD, is Chief Research Officer at Appuri. He is a scientist, blogger, and author on CEM and data science.

Data scientists have a variety of different skills that they bring to bear on Big Data projects. These skills cut across Subject Matter Expertise, Technology, Programming, Math & Modeling and Statistics. One valuable…

Added by Emmanuelle Rieuf on April 18, 2017 at 9:00am — No Comments

The Startup Founder’s Guide to Analytics

This article was written by Tristan Handy. Tristan is the founder and president of Fishtown Analytics, which helps startups implement advanced analytics.

I’m very confident of that, because today, everyone needs analytics. Not just product, not just marketing, not just finance… sales, fulfillment, everyone at a startup needs analytics today.…

Added by Emmanuelle Rieuf on April 10, 2017 at 11:00am — No Comments

Implement an ARIMA model using statsmodels (Python)

This article was written by Michael Grogan. Michael is a data scientist and statistician with a profound passion for statistics and programming.

In a previous tutorial, I elaborated on how an ARIMA model can be implemented using R. The model was fitted on a stock price dataset, with a (0,1,0) configuration being used for ARIMA.
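As a rough illustration of what a (0,1,0) configuration means: the series is differenced once (d = 1) and has no autoregressive or moving-average terms, so the model reduces to a random walk. The sketch below is not the author's code and does not use statsmodels; the function names and the price values are hypothetical, chosen only to show the idea.

```python
def difference(series):
    """First-order differencing: the d=1 step of ARIMA(p, d, q)."""
    return [b - a for a, b in zip(series, series[1:])]


def forecast_arima_010(series, steps, with_drift=False):
    """Forecast a random walk, i.e. ARIMA(0,1,0).

    With no AR or MA terms, the differenced series is modeled as noise
    around a constant. That constant is zero by default, or the mean of
    the differences when with_drift is True.
    """
    if len(series) < 2:
        raise ValueError("need at least two observations to difference")
    diffs = difference(series)
    drift = sum(diffs) / len(diffs) if with_drift else 0.0
    last = series[-1]
    # Each step ahead adds one more unit of drift to the last observation.
    return [last + drift * h for h in range(1, steps + 1)]


# Hypothetical price series, for illustration only.
prices = [100.0, 102.0, 101.0, 105.0, 108.0]
print(forecast_arima_010(prices, 3))
print(forecast_arima_010(prices, 3, with_drift=True))
```

Without a drift term, every forecast equals the last observed value; with drift, each step ahead adds the average historical change.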

Here, I detail how to implement an ARIMA model in Python using the…

Added by Emmanuelle Rieuf on April 9, 2017 at 11:00am — No Comments

Ideas on interpreting machine learning

This article was written by Patrick Hall, Wen Phan, and SriSatish Ambati.

Introduction …

Added by Emmanuelle Rieuf on April 6, 2017 at 9:00am — No Comments

Build a Recurrent Neural Net in 5 min

This video was posted on YouTube by Sirajology. He explains the basics of recurrent neural networks. Then you code your own RNN in 80 lines of Python (plus whitespace) that predicts the sum of two binary numbers after training.

Code for this video:…

Added by Emmanuelle Rieuf on April 4, 2017 at 12:00pm — No Comments

Book: Advanced R (Chapman & Hall/CRC The R Series)

An Essential Reference for Intermediate and Advanced R Programmers

Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of…

Added by Emmanuelle Rieuf on March 30, 2017 at 3:30am — No Comments

Book: Neural Networks and Statistical Learning

About the Textbook:

Providing a broad but in-depth introduction to neural networks and machine learning in a statistical framework, this book provides a single, comprehensive resource for study and further research. All the major…

Added by Emmanuelle Rieuf on March 29, 2017 at 5:00pm — No Comments

What is Hadoop?

This article was posted on Intellipaat. 

Hadoop is an open-source framework developed in Java for storing and analyzing large sets of unstructured data. It is a highly scalable platform that runs multiple concurrent tasks across anywhere from a single server to thousands of servers.

It consists of a distributed file system that allows transferring data and files in split seconds between different nodes. Its ability to process efficiently even if a node…

Added by Emmanuelle Rieuf on March 28, 2017 at 7:30am — 1 Comment

Apache Spark Introduction – A Comprehensive Guide for beginners

This article was posted on Data Flair. Below is a quick overview of the original article.

1. Objective

This tutorial provides an introduction to Apache Spark, its ecosystem components, and the Spark abstraction (RDD), including transformations and actions. The…

Added by Emmanuelle Rieuf on March 27, 2017 at 4:00pm — No Comments

Getting Started with Deep Learning

This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics.

At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting…

Added by Emmanuelle Rieuf on March 24, 2017 at 12:30pm — No Comments

Machine Learning: An In-Depth Guide - Overview, Goals, Learning Types, and Algorithms

This article was written by Alex Castrounis. Alex is the founder of InnoArchiTech.

Introduction:

Machine learning is a very hot topic for many key reasons, and because it provides the ability to automatically obtain deep insights, recognize unknown patterns, and create high performing predictive models from data, all without requiring explicit programming…

Added by Emmanuelle Rieuf on March 22, 2017 at 3:00pm — No Comments

Free Machine Learning eBooks - March 2017

Here are three eBooks available for free.

MACHINE LEARNING

Edited by Abdelhamid Mellouk and Abdennacer Chebira

Machine Learning can be defined in various ways related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some human-like intelligent behaviour.

Machine Learning addresses more specifically the ability to…

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — 5 Comments

Little Bee books: Tough topics simply explained

This is a nice collection of free eBooks to learn the ropes on topics covering Hadoop, machine learning, Spark, analytics, and more.

The Little Bee series of books provides an overview of the hot topics in data and analytics, giving you a snapshot of each technology and its potential benefit to your organisation. These books will not make you an expert, but they will improve your understanding and open the door to new ideas.

The subject of data and…

Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — No Comments

Python & JSON: Working with large datasets using Pandas

This article was posted by Vik Paruchuri. 

Introduction

Working with large JSON datasets can be a pain, particularly when they are too large to fit into memory. In cases like this, a combination of command line tools and Python can make for an efficient way to explore and analyze the data. In this post, we’ll look at how to leverage tools like Pandas to explore and map out police activity in Montgomery County, Maryland. We’ll start with a look at the…
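One common way to handle data that does not fit into memory is to process it one record at a time rather than loading the whole file. The following is a minimal sketch of that idea using only the Python standard library. It assumes the data is stored as newline-delimited JSON (one object per line), which may not match the dataset used in the original article, and the `category` field and sample records are hypothetical.

```python
import io
import json
from collections import Counter


def count_by_field(lines, field):
    """Tally the values of `field` across newline-delimited JSON records.

    Only one record is held in memory at a time, so `lines` can be an
    open file handle on a file far larger than available RAM.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        counts[record.get(field, "unknown")] += 1
    return counts


# Hypothetical sample standing in for a large file on disk.
sample = io.StringIO(
    '{"id": 1, "category": "traffic"}\n'
    '{"id": 2, "category": "parking"}\n'
    '{"id": 3, "category": "traffic"}\n'
)
counts = count_by_field(sample, "category")
print(counts.most_common())
```

In practice, the `io.StringIO` object would be replaced by a file opened with `open(path)`, since iterating over a file handle also yields one line at a time.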

Added by Emmanuelle Rieuf on March 17, 2017 at 9:00am — No Comments

Intuitive Machine Learning: Gradient Descent Simplified

This article was written by Roopam Upadhyay. Roopam is a seasoned advanced analytics professional with more than a decade of experience in statistical modeling, data science, predictive analytics, optimization, and business consulting.

How do machines learn? They learn…

Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments

Top Data Science Tweets and Influencers on Twitter

From NodeXLExcelAutomator, recently updated.

The graph represents a network of 3,210 Twitter users whose tweets in the requested range contained "data science" or "#datascience", or who were replied to or mentioned in those tweets.

According to this source, the top influencers…

Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments

An Introduction to Key Data Science Concepts - Infographic

This infographic was posted by Robert Kelley on Dataiku. 

Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they are about crunching numbers, and creativity flourishes in a collaborative environment. One key to a collaborative environment is having a shared set of terms and…

Added by Emmanuelle Rieuf on March 15, 2017 at 11:30am — No Comments

Book: Evaluating Machine Learning Models

Data science today is a lot like the Wild West: there’s endless opportunity and excitement, but also a lot of chaos and confusion. If you’re new to data science and applied machine learning, evaluating a machine-learning model can seem pretty overwhelming. Now you have help. With this O’Reilly report,…

Added by Emmanuelle Rieuf on March 10, 2017 at 8:00am — No Comments

The ROI of Machine Learning in Business - Infographics

This infographic comes from TechEmergence, a market research firm specializing in the applications and implications of artificial intelligence and machine learning. TechEmergence's team recently polled 30 artificial intelligence researchers and executives on the criteria a company needs to meet to derive maximum value from using machine learning to solve business problems:

 …

Added by Emmanuelle Rieuf on March 6, 2017 at 10:30am — No Comments

Book: Data for Business Performance

Long title: The Goal-Question-Metric (GQM) Model to Transform Business Data into an Enterprise Asset.

Today, digitization is dramatically changing the business landscape, and many progressive organizations have started to treat data as a valuable business…

Added by Emmanuelle Rieuf on February 22, 2017 at 12:30pm — No Comments

© 2017 Data Science Central