Subscribe to DSC Newsletter

August 2017 Blog Posts (121)

Understanding Neural Network: A beginner’s guide

Neural network or artificial neural network is one of the frequently used buzzwords in analytics these days. Neural network is a machine learning technique which enables a computer to learn from the observational data. Neural network in computing is inspired by the way biological nervous system process information.

Biological neural networks consist of…


Added by Ashish Sukhadeve on August 6, 2017 at 7:00am — 45 Comments

Data Science Simplified Part 5: Multivariate Regression Models

In the last article of this series, we discussed the story of Fernando. A data scientist who wants to buy a car. He uses Simple Linear Regression model to estimate the price of the car.…


Added by Pradeep Menon on August 6, 2017 at 5:30am — 1 Comment

Data Science Simplified Part 4: Simple Linear Regression Models

In the previous posts of this series, we discussed the concepts of statistical learning and hypothesis testing. In this article, we dive into linear regression models.

Before we dive in, let us recall some important aspects of statistical learning.

Independent and Dependent…


Added by Pradeep Menon on August 6, 2017 at 5:30am — 3 Comments

Data Science Simplified Part 2: Key Concepts of Statistical Learning

In the first article of this series, I had touched upon key concepts and processes of Data Science. In this article, I will dive in a bit deeper. First, I will define what is Statistical learning. Then, we will dive into key concepts in Statistical learning. Believe me; it is simple.

As per …


Added by Pradeep Menon on August 6, 2017 at 5:30am — 2 Comments

Operational Data and Social Justice

I spotted an interesting book in my local library recently:  The Final Report of the Truth and Reconciliation Commission of Canada [1].  I thought to myself, our government spent considerable resources on this commission.  I should at least browse through the final report.  I flipped through the first few pages.  I found a note saying that the contents are public domain.  In this blog, I reproduce some of the contents of the report to create a setting for my discussion on operational data. …


Added by Don Philip Faithful on August 6, 2017 at 5:00am — No Comments

Do you want to hire a Data Scientist?

As mentioned by Tom Davenport few years back,Data Scientist is still a hottest job of century. Data scientists are those elite people who solve business problems by analyzing tons of data and communicate the results in a very compelling way to senior leadership and persuade them to take action.

They have the critical responsibility to understand the data and help business get more knowledgeable about their customers.

The importance of Data Scientists has rose to top due to two…


Added by Sandeep Raut on August 6, 2017 at 5:00am — 1 Comment

Weekly Digest, August 7

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.


Added by Vincent Granville on August 5, 2017 at 7:30pm — No Comments

When Not to Use Deep Learning

Guest blog by Pablo Cordero. Pablo is currently a postdoc at UCSC’s systems biology group, doing applied machine learning research in the context of cell biology and regenerative medicine, particularly looking at single-cell measurements. He is also consulting from time to time. He can be contacted here

I know…


Added by Amelia Matteson on August 5, 2017 at 11:30am — No Comments

Data Science Simplified Part 1: Principles and Process

In 2006, Clive Humbly, UK Mathematician, and architect of Tesco’s Clubcard coined the phrase “Data is the new oil. He said the following:



Added by Pradeep Menon on August 5, 2017 at 2:10am — 5 Comments

Apache Spark Streaming

Introduction to Apache Spark Streaming

A data stream is an unbounded sequence of data arriving continuously. Streaming divides continuously flowing input data into discrete units for further processing. Stream processing is low latency processing and analyzing of streaming data.

Spark Streaming was added to Apache spark in 2013, an extension of the core Spark API that provides scalable, high-throughput and fault-tolerant stream…


Added by Shreya Gupta on August 5, 2017 at 2:00am — No Comments

Data Science Simplified Part 3: Hypothesis Testing

Edward Teller, the famous Hungarian-American physicist, once quoted:

“A fact is a simple statement that everyone believes. It is innocent, unless found guilty. A hypothesis is a novel suggestion that no one wants to believe. It is guilty, until…

Added by Pradeep Menon on August 5, 2017 at 2:00am — 2 Comments

R code and reproducible model development with DVC

DVC is an open source tool that could help with achieving code simplicity, readability and faster model development.The idea is to track files/data dependencies during model development in order to facilitate reproducibility and track data files versioning.  However, DVC is a language agnostic tool and can be used with any programming language. Here we will describe how we can…


Added by Marija Zoldin on August 4, 2017 at 12:30am — No Comments

Deep Learning with TensorFlow in Python: Convolution Neural Nets

The following problems appeared in the assignments in the Udacity course Deep Learning (by Google). The descriptions of the problems are taken from the assignments (continued from the last post).

Classifying the letters with notMNIST dataset with Deep Network

Here is how some sample images from the dataset look like:


Let’s try to get the best performance using a multi-layer model!…


Added by Sandipan Dey on August 3, 2017 at 10:30pm — No Comments

Thursday News: Machine Learning, NLP, Data Lakes, R, Python, Hadoop...

Here is our list of featured articles and resources posted since Monday:


Added by Vincent Granville on August 3, 2017 at 8:00am — No Comments

Dummy errors when using neuralnet package in R

Ok, so you read a bunch of stuff on how to do Neural Networks and how many layers or nodes you should add, and etc... But when you start to implement the actual Neural Network you face a ton of dummy errors that stop your beautiful inspirational programming.

This post talks about some errors you might face when using the neuralnet package in R.…


Added by Renata Ghisloti Duarte Souza Gra on August 2, 2017 at 10:30am — No Comments

3 Ways To Drive Effective HR Insights Using Data Analytics

Image Source:

Data Analytics has become the buzzword in the world of business today. An effective Human Resource or HR management is one of the most important corporate responsibilities to ensure smooth business operations. Therefore, the growing role of data analytics in HR can not be ignored at all. Broadly…


Added by Shantanu Chaturvedi on August 2, 2017 at 1:30am — No Comments

Architectural Primer: Conversation as a Platform (CAAP) on Azure

According to Mckinsey report, Artificial Intelligence (AI) is poised to unleash digital disruption and companies needs to start preparing…


Added by Pradeep Menon on August 1, 2017 at 5:00am — No Comments

Machine Learning Translation and the Google Translate Algorithm

Years ago, it was very time consuming to translate the text from an unknown language. Using simple vocabularies with word-for-word translation was hard for two reasons: 1) the reader had to know the grammar rules and 2) needed to keep in mind all language versions while translating the whole sentence.

Now, we…


Added by Luba Belokon on August 1, 2017 at 5:00am — No Comments

Demystifying Data Lake Architecture

According to Gartner, 80% of successful CDOs will have value creation or revenue generation as their Number 1 priority through 2021.

To create the maximum value out the organization’s data landscape, traditional decision support system architecture are no longer adequate. New architectural patterns need to be developed to harness the…


Added by Pradeep Menon on August 1, 2017 at 5:00am — 3 Comments

Modeling the growth of a sunflower with golden angle and Fibonacci numbers in R

  • In this article, a mathematical model for the growth of a sunflower (shown below) will be described (reference: the video lectures of Prof. Jeffrey R Chesnov from Coursera Course on Fibonacci numbers).


  • New florets are created close to center.
  • Florets move radially out with constant speed as the sunflower grows.
  • Each new…

Added by Sandipan Dey on August 1, 2017 at 1:30am — No Comments

Blog Topics by Tags

Monthly Archives













  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service