
Featured Blog Posts – January 2018 Archive (84)

Creating Your First Machine Learning Classifier with Sklearn

This article was written by Kasper Fredenslund.

At the end of the post you will know how to:

  • Import and transform data from a .csv file to use with sklearn
  • Inspect the dataset and select relevant features
  • Train different classifiers on the data using sklearn
  • Analyse the results with the intention of improving your model
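As a rough illustration of the steps listed above (not the original author's code), here is a minimal sketch. The inline CSV text stands in for a real .csv file, and the column names `length`, `width`, and `label`, as well as the helper `train_and_score`, are invented for this example. It assumes pandas and scikit-learn are installed.

```python
import io

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical data standing in for a .csv file on disk.
CSV_TEXT = """length,width,label
1.0,0.9,small
1.2,1.1,small
1.4,1.0,small
1.5,1.3,small
1.7,1.2,small
1.9,1.4,small
5.0,4.8,large
5.2,5.1,large
5.4,5.0,large
5.5,5.3,large
5.7,5.2,large
5.9,5.4,large
"""

# Step 1: import the data (pd.read_csv also accepts a file path).
df = pd.read_csv(io.StringIO(CSV_TEXT))

# Step 2: inspect the dataset before choosing features.
summary = df.describe()
class_counts = df["label"].value_counts()


def train_and_score(frame):
    """Train two classifiers on the same split and return their test accuracy."""
    X = frame[["length", "width"]]  # selected features
    y = frame["label"]              # target column
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=0
    )
    models = {
        "logistic_regression": LogisticRegression(),
        "knn": KNeighborsClassifier(n_neighbors=3),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)  # Step 3: train each classifier
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


# Step 4: compare results to decide what to improve next.
results = train_and_score(df)
print(class_counts)
print(results)
```

With a real dataset you would replace `CSV_TEXT` with a file path and pick the feature columns after looking at `summary` and `class_counts`. Comparing several classifiers on the same train/test split keeps the accuracy numbers comparable.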

Importing data



Added by Vincent Granville on January 16, 2018 at 9:30am — No Comments

Data Science is Changing and Data Scientists will Need to Change Too – Here’s Why and How

Summary:  Deep changes are underway in how data science is practiced and successfully deployed to solve business problems and create strategic advantage.  These same changes point to major changes in how data scientists will do their work.  Here’s why and how.


There’s a sea change underway in data science.  It’s changing how…


Added by William Vorhies on January 16, 2018 at 8:14am — 10 Comments

Moving Data from the Basement to the Boardroom

According to IDG, 35% of companies with effective data grow faster year-on-year. However, many still take little notice of data quality or ongoing data management, pushing data into the deepest corners to be eventually abandoned and forgotten.

If this…


Added by Martin Doyle on January 16, 2018 at 4:00am — No Comments

Machine Learning Explained: Understanding Supervised, Unsupervised, and Reinforcement Learning

Once we start delving into the concepts behind Artificial Intelligence (AI) and Machine Learning (ML), we come across copious amounts of jargon related to this field of study. Understanding this jargon and how it affects the study of ML goes a long way toward comprehending the work that researchers and data scientists have done to bring AI to its current state.

In this article, I will be providing you with a comprehensive definition of supervised,…


Added by Ronald van Loon on January 16, 2018 at 1:30am — 2 Comments

Selected Recent Articles from Top DSC Contributors - Part 3

This is a new series, featuring great content from our top contributors. Some of these articles are rather technical in nature, but many are business-oriented and written in simple English. The entire series consists of about 120 articles. We intend to publish a new set every two weeks or so. Click here to check out the…


Added by Vincent Granville on January 15, 2018 at 6:00pm — No Comments

Who will get hired at Google?

In this script, we will explore the open roles at Google and try to see what common attributes Google is looking for in future employees.

This dataset contains text information about job location, title, department, minimum and preferred qualifications and the responsibilities of the position. Using this dataset we will try to answer the following questions:

  1. Where are the open roles?
  2. What are the minimum and preferred educational qualifications needed…

Added by Ann Rajaram on January 15, 2018 at 5:30pm — 2 Comments

Relational Algebra Is the Root of SQL Problems

There is no doubt that SQL is the most widely used working language for processing structured data. Not only is the language adopted by all relational database products, but its implementation is the goal of many newly invented big data platforms. But in many respects SQL isn’t so convenient to use in handling various computational and query demands. The procedurality issue stated in the last article is just a superficial one. SQL’s problems are rooted in its theoretical foundation, the relational…


Added by JIANG Buxing on January 14, 2018 at 10:30pm — 5 Comments

One important overlooked point here is that significant value

In the article, Data Science Should Monitor Big Brother by Arjan Haring, one important…


Added by Peter Bruce on January 14, 2018 at 11:00am — No Comments

Weekly Digest, January 15

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions


Added by Vincent Granville on January 13, 2018 at 6:00pm — 1 Comment

7 Books About Machine Learning, Statistics, and Python

This is part of a book series by Packt Publishing. The first book is entitled Statistics for Machine Learning and is described below.

Book Description

Complex statistics in Machine Learning worry a lot of developers. Knowing statistics helps you build strong…


Added by Capri Granville on January 13, 2018 at 9:30am — No Comments

7 Visualizations You Should Learn in R

This article was posted by Dikesh Jariwala on R Bloggers.

With the ever-increasing volume of data, it is impossible to tell stories without visualizations. Data visualization is the art of turning numbers into useful knowledge.

R Programming lets you learn this art by offering a set of inbuilt functions and libraries to build visualizations and present data. Before the…


Added by Emmanuelle Rieuf on January 13, 2018 at 9:00am — 5 Comments

Avoiding the IOT ‘Twister’ Business Strategy

Most organizations’ IOT Strategy looks like a game of ‘Twister’ with progress across important IOT capabilities such as architecture, technology, data, analytics and governance; variables comprising a series of random investments and decisions.…


Added by Bill Schmarzo on January 13, 2018 at 5:00am — No Comments

The DeepMind Strategy - How AI is Revolutionizing Business Models

This article was written by Francesco Corea.

Image Credit: …


Added by Amelia Matteson on January 11, 2018 at 2:30pm — No Comments

Machine Learning Guide and Tutorial for Software Engineers

This article was written by Nam Vu on GitHub. 

What is it? 

This is my multi-month study plan for going from mobile developer (self-taught, no CS degree) to machine learning engineer. My main goal was to find an approach to studying Machine Learning that is mainly hands-on and abstracts most of the Math for the beginner. This approach is unconventional because it’s a top-down and results-first approach designed for…


Added by Emmanuelle Rieuf on January 11, 2018 at 1:00pm — No Comments

Market Mix Modeling – 101: Part 2

In my last article, I introduced you to the concept of Market Mix Modeling. If you want a refresher, please click on this link. This article is a continuation of my first article, Market Mix Modeling – 101. In this article, I would like…


Added by Ridhima Kumar on January 11, 2018 at 2:30am — No Comments

Want to become a Data Scientist? Try Feynman Technique

Many blogs and articles have been written on how to become a Data Scientist. The list normally goes like this:

  • Study descriptive statistics, hypothesis testing, probability
  • Learn types of Machine learning algorithms – Supervised, Unsupervised
  • Learn Python, R, SAS, SQL
  • Apply machine learning techniques using Python, R, SAS
  • Learn Data Visualization

While there is nothing wrong with the path illustrated above, it is not the…


Added by Venkat Raman on January 11, 2018 at 12:30am — 9 Comments

Automated Driving: How will it affect me?

Contributed by David Richard Steinmetz. He enrolled in the NYC Data Science Academy 12-week full-time Data Science Bootcamp program, which ran from July 5th to September 23rd, 2016. This post is based on his second project, R Shiny, due in the 4th week of the program. The original article can be found here.

Automated driving is coming, and it will affect you…


Added by NYC Data Science Academy on January 10, 2018 at 1:30pm — 1 Comment

Number Theory: Nice Generalization of the Waring Conjecture

The Waring conjecture - actually a problem associated with a number of conjectures, many now being solved - is one of the most fascinating mathematical problems. This article covers new aspects of this problem, with a generalization and new conjectures, some with a tentative solution, and a new framework to tackle the problem. Yet it is written in simple English and accessible to the layman.

I also review a number of famous related mathematical conjectures, including one with a $1…


Added by Vincent Granville on January 10, 2018 at 1:00pm — 2 Comments

Design Patterns for Deep Learning Architectures - with Free eBook

This article comes from DeepLearningPatterns.

Deep Learning can be described as a new machine learning toolkit that has a high likelihood of leading to more advanced forms of artificial intelligence. The evidence for this is in the sheer number of breakthroughs that have occurred since the beginning of this decade. There is a newfound optimism in the air and we are now again in a new AI spring.

Unfortunately, the current state of deep learning appears in many ways to be…


Added by Emmanuelle Rieuf on January 10, 2018 at 11:30am — No Comments

Best DSC Forum Questions - Part 6

This is a new series, featuring forum questions (new and old) that are still popular today. These questions were selected manually based on popularity, removing outdated material. The entire series consists of about 160 questions -- most with answers, sometimes several answers. We intend to publish a new set every two weeks or so. The previous edition was posted …


Added by Vincent Granville on January 10, 2018 at 11:00am — No Comments

