This article was posted by Bob E. Hayes on CustomerThink. Bob E. Hayes, PhD, is Chief Research Officer at Appuri. He is a scientist, blogger and author on CEM and data science.
Data scientists have a variety of different skills that they bring to bear on Big Data projects. These skills cut across Subject Matter Expertise, Technology, Programming, Math & Modeling and Statistics. One valuable…Continue
Added by Emmanuelle Rieuf on April 18, 2017 at 9:00am — No Comments
This article was written by Tristan Handy. Tristan is the founder and president of Fishtown Analytics: helping startups implement advanced analytics.
I’m very confident of that, because today, everyone needs analytics. Not just product, not just marketing, not just finance… sales, fulfillment, everyone at a startup needs analytics today.…Continue
Added by Emmanuelle Rieuf on April 10, 2017 at 11:00am — No Comments
This article was written by Michael Grogan. Michael is a data scientist and statistician, with a profound passion for statistics and programming.
In a previous tutorial, I elaborated on how an ARIMA model can be implemented using R. The model was fitted on a stock price dataset, with a (0,1,0) configuration being used for ARIMA.
Here, I detail how to implement an ARIMA model in Python using the…Continue
Added by Emmanuelle Rieuf on April 9, 2017 at 11:00am — No Comments
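As a rough illustration of the (0,1,0) configuration mentioned in the teaser above: an ARIMA(0,1,0) model is a random walk, and with a constant term it becomes a random walk with drift. The sketch below is not the author's code and does not use the library from the original article (its name is cut off in the excerpt). It uses only the Python standard library, the function names are hypothetical, and the price series is made up for the example.

```python
from statistics import mean


def fit_arima_010(series):
    """Estimate the drift of an ARIMA(0,1,0) model with a constant.

    Differencing once (d=1) leaves only a constant plus noise, so the
    drift estimate is simply the mean of the first differences.
    """
    diffs = [b - a for a, b in zip(series, series[1:])]
    return mean(diffs)


def forecast_arima_010(series, steps, drift=None):
    """Forecast `steps` periods ahead: last value plus h times the drift."""
    if drift is None:
        drift = fit_arima_010(series)
    last = series[-1]
    return [last + drift * h for h in range(1, steps + 1)]


# Made-up prices, purely for illustration (not data from the article).
prices = [100.0, 101.0, 103.0, 102.0, 104.0]
drift = fit_arima_010(prices)
forecast = forecast_arima_010(prices, steps=3)
print(drift, forecast)
```

Passing `drift=0` gives the plain random-walk forecast, where every future value equals the last observed price.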
This article was written by
Added by Emmanuelle Rieuf on April 6, 2017 at 9:00am — No Comments
This video was posted on YouTube by Sirajology. He explains the basics of recurrent neural networks. Then you code your own RNN in 80 lines of Python (plus white-space) that predicts the sum of two binary numbers after training.
Code for this video:…Continue
Added by Emmanuelle Rieuf on April 4, 2017 at 12:00pm — No Comments
An Essential Reference for Intermediate and Advanced R Programmers
Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of…Continue
Added by Emmanuelle Rieuf on March 30, 2017 at 3:30am — No Comments
About the Textbook:
Providing a broad but in-depth introduction to neural network and machine learning in a statistical framework, this book provides a single, comprehensive resource for study and further research. All the major…Continue
Added by Emmanuelle Rieuf on March 29, 2017 at 5:00pm — No Comments
This article was posted on Intellipaat.
Hadoop is an open-source framework developed in Java, designed to store and analyze large sets of unstructured data. It is a highly scalable platform that allows multiple concurrent tasks to run on anywhere from a single server to thousands of servers without any delay.
It consists of a distributed file system that allows transferring data and files in split seconds between different nodes. Its ability to process efficiently even if a node…Continue
This article was posted on Data Flair. Below is a quick overview of the original article.
This tutorial provides an introduction to Apache Spark: its ecosystem components, the Spark abstraction (RDD), transformations, and actions. The…Continue
Added by Emmanuelle Rieuf on March 27, 2017 at 4:00pm — No Comments
This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics.
At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting…Continue
Added by Emmanuelle Rieuf on March 24, 2017 at 12:30pm — No Comments
Machine learning is a very hot topic for many key reasons, not least because it provides the ability to automatically obtain deep insights, recognize unknown patterns, and create high-performing predictive models from data, all without requiring explicit programming…Continue
Added by Emmanuelle Rieuf on March 22, 2017 at 3:00pm — No Comments
Here are three eBooks available for free.
Edited by Abdelhamid Mellouk and Abdennacer Chebira
Machine Learning can be defined in various ways, all relating to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some human-like intelligent behaviour.
Machine Learning addresses more specifically the ability to…
This is a nice collection of free eBooks to learn the ropes on topics covering Hadoop, machine learning, Spark, analytics, and more.
The Little Bee series of books provides an overview of the hot topics in data and analytics, giving you a snapshot of each technology and its potential benefit to your organisation. These books will not make you an expert, but they will improve your understanding and open the door to new ideas.
The subject of data and…Continue
Added by Emmanuelle Rieuf on March 20, 2017 at 4:00pm — No Comments
This article was posted by Vik Paruchuri.
Working with large JSON datasets can be a pain, particularly when they are too large to fit into memory. In cases like this, a combination of command line tools and Python can make for an efficient way to explore and analyze the data. In this post, we’ll look at how to leverage tools like Pandas to explore and map out police activity in Montgomery County, Maryland. We’ll start with a look at the…Continue
Added by Emmanuelle Rieuf on March 17, 2017 at 9:00am — No Comments
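To make the memory concern in the teaser above concrete, here is a minimal sketch of processing records one at a time instead of loading a whole file. It is not the approach from the original post, which combines command line tools with Pandas. It assumes the data is stored as JSON Lines (one JSON object per line), and the function name, field name and sample records are hypothetical.

```python
import io
import json
from collections import Counter


def count_by_field(lines, field):
    """Tally the values of `field` across JSON Lines records.

    Only one record is held in memory at a time, so the input can be
    far larger than available RAM.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        counts[record.get(field, "unknown")] += 1
    return counts


# Made-up records standing in for a large file (assumption: JSON Lines).
sample = io.StringIO(
    '{"kind": "Citation"}\n'
    '{"kind": "Warning"}\n'
    '\n'
    '{"kind": "Warning"}\n'
)
counts = count_by_field(sample, "kind")
print(counts.most_common())
```

With a real file, the same function can be given an open file handle in place of the `io.StringIO` object, since both yield one line per iteration.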
This article was written by Roopam Upadhyay. Roopam is a seasoned professional of advanced analytics with more than a decade of experience in statistical modeling, data science, predictive analytics, optimization, & business consulting.
How do machines learn? They learn…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments
From NodeXLExcelAutomator, recently updated.
The graph represents a network of 3,210 Twitter users whose tweets in the requested range contained "data science" or "#datascience", or who were replied to or mentioned in those tweets.
According to this source, the top influencers…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 12:00pm — No Comments
This infographic was posted by Robert Kelley on Dataiku.
Here at Dataiku, we frequently stress the importance of collaboration in building a successful data team. In short, successful data science and analytics are just as much about creativity as they are about crunching numbers, and creativity flourishes in a collaborative environment. One key to a collaborative environment is having a shared set of terms and…Continue
Added by Emmanuelle Rieuf on March 15, 2017 at 11:30am — No Comments
Data science today is a lot like the Wild West: there’s endless opportunity and excitement, but also a lot of chaos and confusion. If you’re new to data science and applied machine learning, evaluating a machine-learning model can seem pretty overwhelming. Now you have help. With this O’Reilly report,…Continue
Added by Emmanuelle Rieuf on March 10, 2017 at 8:00am — No Comments
Added by Emmanuelle Rieuf on March 6, 2017 at 10:30am — No Comments
Long title: The Goal-Question-Metric (GQM) Model to Transform Business Data into an Enterprise Asset.
Today, digitization is dramatically changing the business landscape, and many progressive organizations have started to treat data as a valuable business…Continue
Added by Emmanuelle Rieuf on February 22, 2017 at 12:30pm — No Comments