All Blog Posts (3,236)

47 New External Data Science / Machine Learning Resources and Articles

Starred articles are candidates for the picture of the week. A comprehensive list of all past resources is found here. We are in the process of automatically categorizing them using indexation and automated tagging…


Added by Vincent Granville on October 26, 2016 at 7:30am — No Comments

Accelerated Computing and Deep Learning

Guest blog post by Jen-Hsun Huang, Founder, President and CEO at NVIDIA, originally entitled "The Intelligent Industrial Revolution".

A New Era of Computing

Intelligent machines powered by AI computers that can learn, reason and interact with people are no…


Added by Vincent Granville on October 25, 2016 at 5:00pm — No Comments

Catching up on Big Data & Healthcare

The health sector is characterized by the management of huge data volumes. What if those data were processed and provided to health professionals and their patients, or even to the health system at large? Not only one, two, three…


Added by Ernesto Mislej on October 25, 2016 at 7:37am — No Comments

How to Intelligently Apply Data Integration and Visual Analytics Tools

Data integration requires merging data from different sources, stored using different technologies. Companies build a “data warehouse” where aggregated data can be stored and retrieved. This is particularly useful for researchers looking to big data to aid in their investigation and corporations usually during…


Added by Dante Munnis on October 25, 2016 at 7:00am — No Comments

22 Great Blogs Posted in the last 12 Months

This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published. These articles are at least 6 months old but no more than 12 months old. The previous digest in this series was posted here a while back. Below is our fourth edition.…


Added by Vincent Granville on October 24, 2016 at 5:00pm — No Comments

Recurrent Neural Nets – The Third and Least Appreciated Leg of the AI Stool

Summary:  Convolutional Neural Nets are getting all the press but it’s Recurrent Neural Nets that are the real workhorse of this generation of AI.

We’ve paid a lot of attention lately to…


Added by William Vorhies on October 24, 2016 at 3:53pm — No Comments

Migrating an Excel Spreadsheet Directly to HDFS and Spark 2.0.1 (Part 2)

In a recent post, we reviewed a path to leverage legacy Excel data and import CSV files through MySQL into Spark 2.0.1. This may apply frequently in…


Added by Marc Borowczak on October 23, 2016 at 5:57am — No Comments

Classifications in R: Response Modeling/Credit Scoring/Credit Rating using Machine Learning Techniques

This article was written by Ariful Mondal. Ariful is a senior manager, data science and big data analytics consultant at Tata Consultancy Services.

1. Introduction

This is an attempt to showcase some worked-out examples of Machine Learning (ML) using German Credit Data. Though we have selected credit…


Added by Emmanuelle Rieuf on October 22, 2016 at 9:30am — No Comments

Home Internet Data Usage - FAQS

Data transfer - what is it?

Whilst you are online, everything is about the transfer of data. Emails and web pages are basically files: when you read one or log on, you are in essence downloading the file, or transferring it to your screen, so that you can view it. If you watch a film or play a game online, these activities send data backward and forward in…


Added by Glen Johnson on October 22, 2016 at 9:30am — No Comments

Structural Accommodation

A theme in my blogs is how the "structure" of data - rather than just the "content" - affects what that data can say and is capable of doing. In particular, I suggest that certain structures tend to reinforce certain contents; this means that a structural imposition can have an effect similar to a contextual…


Added by Don Philip Faithful on October 22, 2016 at 5:30am — No Comments

Data Integration Tools – Market Study

This post is a brief review of the leading Data Integration tools in the market. It draws heavily on the Gartner 2016 report and on peer reviews from my circle.


The Market

The data integration tool market was worth approximately $2.8 billion at the end of 2015, an increase of 10.5% from the end of 2014 [2016 Gartner Report – Data Integration Tools].

Key data integration responsibilities:

  1. Data acquisition for…

Added by Kashif Saiyed on October 21, 2016 at 7:30pm — No Comments

Weekly Digest, October 24

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.


  • Use data to drive decisions—and your career. Advance your knowledge through…

Added by Vincent Granville on October 21, 2016 at 12:00pm — No Comments

Cultural Institutions of New York City: Data, Analysis, R Code, Visu

Contributed by Rob Castellano. He is currently in the NYC Data Science Academy 12-week full-time Data Science…


Added by SupStat on October 21, 2016 at 9:30am — No Comments

Data Science for Internet of Things methodology - Evolving CRISP-DM - Part Two

This set of blog posts is part of the book/course on Data Science for the Internet of Things. We welcome your comments at jjb at cantab dot net. Jean-Jacques Bernard has been a founding member of the Data Science for Internet of Things Course.

Please email at ajit.jaokar at if you are interested in joining the course.

You can find the first…


Added by Jean-Jacques Bernard on October 19, 2016 at 11:30pm — No Comments

DSC Top Resources

Here is our updated list of top Data Science Central (DSC) resources, including reference articles and tutorials, top categories, tools and techniques, as well as several useful links (jobs, events, training, webinars, books and so on) and information about our popular newsletter. You will also find information about blogging with us, or where to find us on Facebook, LinkedIn or Twitter.

  • Article: …

Added by Vincent Granville on October 19, 2016 at 8:00pm — No Comments

What is BuDAI?

Data science[1] (covering data mining and related practices) is a multidisciplinary field that requires knowledge of a number of different skills, practices, and technologies, including but not limited to machine learning, pattern recognition, mathematics, programming, algorithms, statistics, and databases. In the…

Added by Khosrow Hassibi on October 19, 2016 at 10:00am — No Comments

Top 9 Big Data Security Issues You Should Watch For

Understanding that big data comes from such a huge pool of devices, constantly collecting new data, gives you an idea where security threats may come from. Each device that is online, from phones to tablets to computers to smart appliances, has the potential of…


Added by Shezagary on October 18, 2016 at 9:30pm — No Comments

11 Great Hadoop, Spark and Map-Reduce Articles

This reference is a part of a new series of DSC articles, offering selected tutorials, references/resources, and interesting articles on subjects such as deep learning, machine learning, data science, deep data science, artificial intelligence, Internet of Things, algorithms, and related topics. It is designed for the busy reader who does not have a lot of time digging into long lists of advanced publications.…


Added by Vincent Granville on October 18, 2016 at 7:30pm — No Comments

How To Implement Machine Learning Algorithm Performance Metrics From Scratch With Python

This article was written by Jason Brownlee. Jason is the editor-in-chief at has a Masters and PhD in Artificial Intelligence, has published books on Machine Learning and has written operational code that is running in production.

After you make predictions, you need to know if they are any good.

There are standard measures that we can use to summarize how good a set of predictions actually is.

Knowing how good a set of predictions is,…
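As a rough illustration of the kind of standard measure this teaser refers to, here is a minimal sketch of classification accuracy written from scratch in Python. The function name `accuracy_metric` and the sample labels are assumptions made up for this example; they are not taken from the original article.

```python
def accuracy_metric(actual, predicted):
    """Return the percentage of predictions that match the actual labels."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    if not actual:
        raise ValueError("cannot score an empty set of predictions")
    # Count the positions where the predicted label equals the actual label
    correct = sum(1 for a, p in zip(actual, predicted) if a == p)
    return correct * 100.0 / len(actual)


# Hypothetical labels, invented for illustration only
actual = [0, 0, 1, 1, 1]
predicted = [0, 1, 1, 1, 0]
print(accuracy_metric(actual, predicted))
```

Accuracy is only one such measure, and it can be misleading when the classes are heavily imbalanced; it is shown here because it is the simplest to state.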


Added by Emmanuelle Rieuf on October 18, 2016 at 10:30am — No Comments

What is Data Science? 24 Fundamental Articles Answering This Question

Many people new to data science might believe that this field is just about R, Python, Hadoop, SQL, and traditional machine learning techniques or statistical modeling. Below you will find fundamental articles that show how modern, broad and deep the field is. Some data scientists are actually doing none of the above. In my case, I don't even code, but instead, I make various applications talk to each other, in a machine-to-machine communication framework. It is true though that most data…


Added by Vincent Granville on October 18, 2016 at 9:00am — No Comments
