April 2017 Blog Posts (92)

Open Source Deep Learning Frameworks and Visual Analytics

Deep Learning gets more and more traction. It basically focuses on one section of Machine Learning: Artificial Neural Networks. This article explains why Deep Learning is a game changer in analytics, when to use it, and how Visual Analytics allows business analysts to leverage the analytic models built by a (citizen) data scientist.

What is Deep Learning and Artificial Neural Networks?

Deep Learning is the modern buzzword for artificial neural networks, one of many concepts…


Added by Kai Waehner on April 23, 2017 at 9:00am — 1 Comment

Weekly Digest, April 24

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions


Added by Vincent Granville on April 22, 2017 at 6:00am — No Comments

Beyond SMAC – Digital twister of disruption!!

Have you seen the 1996 movie Twister, about tornadoes disrupting neighborhoods? It shows a group of people trying to perfect a device called Dorothy, which carries hundreds of sensors to be released into the center of a twister so that proper data can be collected to create a more advanced warning system and save people.

Today if we apply the…


Added by Sandeep Raut on April 22, 2017 at 4:30am — No Comments

Variable Reduction: An art as well as Science

Variable reduction is a crucial step for accelerating model building without losing the potential predictive power of the data. With the advent of Big Data and sophisticated data mining techniques, the number of variables encountered is often tremendous, making variable selection or dimension reduction techniques imperative to produce models with acceptable accuracy and generalization. The temptation to build an ecological model using all available information (i.e., all variables) is hard to…


Added by Valiance Solutions on April 21, 2017 at 9:20pm — No Comments

Deep Learning Meets Recommendation Systems

Contributed by Wann-Jiun Ma.  


Almost everyone loves to spend their leisure time watching movies with family and friends. We have all had the experience of sitting on the couch to choose a movie for the next two hours, only to find that we still have not picked one after 20 minutes. It is so disappointing. We definitely need a computer agent to provide movie recommendations to us when we need to…


Added by NYC Data Science Academy on April 21, 2017 at 11:00am — No Comments

Mapping NYC Common Core Scores

Contributed by David Letzler. 

Introduction: A Brief History and Description of the Common Core

In 2009 the National Governors’ Association and the Council of Chief State School Officers resolved to develop a set of national education standards known as the Common Core. These were intended to unify what had been to that point a set of highly localized state education standards.  The hope of the Common Core was that a…


Added by NYC Data Science Academy on April 21, 2017 at 10:30am — No Comments

Avoiding Look Ahead Bias in Time Series Modelling

Any time series classification or regression forecast involves predicting Y at 't+n' given the X and Y information available up to time 't'. Obviously, no data scientist or statistician can deploy such a system without backtesting and validating the model's performance on historical data. Using future actual information in the training data, which could be termed "Look Ahead Bias", is probably the gravest mistake a data scientist can make. Even the sentence “we cannot make use future…


Added by Rohit Walimbe on April 21, 2017 at 6:00am — No Comments

How to Sabotage a Successful Data Solution - Data Hoarding, A Ticking Time Bomb

Poor data governance often leads to a failed data state. Too many times companies, and even silos within a company, ignore the importance of data governance. Such neglect leads to increased cost, decreased compliance, and even a complete failure of a data solution.

A key data governance principle to adhere to is data usage. This is a simple one. If…


Added by Eric Mayberry on April 20, 2017 at 9:00am — 2 Comments

Thursday News: ML, AI, Python, Dataviz, Hadoop and more

Here is our new selection of featured articles and resources posted since Monday.


Added by Vincent Granville on April 20, 2017 at 8:54am — No Comments

50 Important Things You Need to Know About Data Science

According to IBM, the world generates 2.5 quintillion bytes of data every day. A decent chunk of those quintillion bytes is made up of people asking the experts how to break into and excel in the dynamic, lucrative field of data science. An even larger chunk of those bytes consists of convoluted, contradicting answers to that question.

This is, on one hand, a great thing. Multiple prominent data science innovators are out there giving you free advice on your most pressing questions,…


Added by Lauren Delapenha on April 20, 2017 at 6:30am — 1 Comment

50 Important Things You Need to Know About Data Science

This is a guest post by Lauren Delapenha. She is an editor at, a one-stop resource for learning about the rapidly-evolving field of data science through comprehensive education and career guides.

According to IBM, the world generates 2.5 quintillion bytes of data every day. A decent chunk of those quintillion bytes is made up of people asking the experts how to break into and excel in the dynamic, lucrative field of data science. An even…


Added by Shay Pal on April 19, 2017 at 4:30pm — No Comments

How analytics is helping CPG companies drive growth

Consumer Packaged Goods (CPG) companies are grappling with many challenges owing to economic uncertainty, price consciousness, changing demographics, and the rise of discounters, coupled with fast-changing retail needs. The growth of leading CPG players is stagnant and there is fierce competition as a large number of small firms are venturing into the CPG and retail…

Added by Ashish Sukhadeve on April 19, 2017 at 9:30am — No Comments

Fact or Fiction? Smart Data Visualization Tells the Tale

If you are considering a Business Intelligence solution, you ought to give some consideration to the concept of Smart Data Visualization and review your prospective solution to determine its capabilities in that regard. Smart Data Visualization provides many benefits to the organization and to the business users, who will leverage the selected BI tools to gather, analyze,…


Added by Arbuda Dave on April 19, 2017 at 2:30am — No Comments

Artificial Intelligence: Look Ma, No Hands!


Added by Rekha Joshi on April 18, 2017 at 12:30pm — No Comments

Detecting Fake News, Fake Reviews, Fake Accounts, Fake Pictures

A while back, I was reading an article posted on Facebook, about Clovis people found alive and well living in Florida, with a picture featuring tribesmen (see below.) The quality of the picture was poor, and the URL was very suspicious:, as to make it appear that it was from It turned out that the picture (and thus the whole story) was fake: these people are real…


Added by Vincent Granville on April 18, 2017 at 12:00pm — 2 Comments

Machine Learning Skills Among Data Scientists

This article was posted by Bob E. Hayes on CustomerThink. Bob E. Hayes, PhD, is Chief Research Officer at Appuri. He is a scientist, blogger and author on CEM and data science.

Data scientists have a variety of different skills that they bring to bear on Big Data projects. These skills cut across Subject Matter Expertise, Technology, Programming, Math & Modeling and Statistics. One valuable…


Added by Emmanuelle Rieuf on April 18, 2017 at 9:00am — No Comments

There’s Nothing Like a Huge Public Failure to Boost Interest in AI

Summary:  We are swept up by the rapid advances in AI and deep learning, and tend to laugh off AI’s failures as good fodder for YouTube videos.  But those failures are starting to add up.  It’s time to take a hard look at the weaknesses in AI and where that’s leading us.



Added by William Vorhies on April 18, 2017 at 8:04am — No Comments

How to Lie with Data

We expect that data scientists and analysts should be objective and base their conclusions on data. Now while the name of the job implies that “data” is the fundamental material that is used to do their jobs, it is not impossible to lie with it. Quite the opposite – the data scientist is affected by unconscious biases, peer pressure, urgency, and if that’s not enough – there are inherent risks in the process of data analysis and interpretation that lead to lying. It happens all the time…


Added by Karolis Urbonas on April 17, 2017 at 10:30am — 3 Comments

Learn under the hood of Gradient Descent algorithm using excel

When I first started out learning about machine learning algorithms, it turned out to be quite a task to gain an intuition of what the algorithms are doing. Not just because it was difficult to understand all the mathematical theory and notations, but it was also plain boring. When I turned to online tutorials for answers, I could again only see equations or high level explanations without going through the detail in a majority of the cases.…


Added by Jahnavi Mahanta on April 17, 2017 at 9:30am — 6 Comments

Streaming Event Modeling

Data modeling has been a fixture of enterprise architecture since the 1970s, with ANSI-defined conceptual, logical, and physical data schemas. As data models developed, so did the availability of templates for business use. Retail banks use similar data models, as do other industries. A shared approach to data modeling advanced the discussion and planning of solutions.

Growth in unstructured data has led to development of tools to search data to identify context or bring…


Added by Paul Stanton on April 16, 2017 at 2:00pm — No Comments
