Featured Blog Posts – April 2017 Archive (86)

Weekly Digest, April 24

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions


Added by Vincent Granville on April 22, 2017 at 6:00am — No Comments

Beyond SMAC – Digital twister of disruption!!

Have your seen the 1996 movie Twister, based on tornadoes disrupting the neighborhoods? A group of people were shown trying to perfect the devices called Dorothy which has hundreds of sensors to be released in the center of twister so proper data can be collected to create a more advanced warning system and save people.

Today if we apply the…


Added by Sandeep Raut on April 22, 2017 at 4:30am — No Comments

Variable Reduction: An art as well as Science

Variable reduction is a crucial step for accelerating model building without losing the potential predictive power of the data. With the advent of Big Data and sophisticated data mining techniques, the number of variables encountered is often tremendous making variable selection or dimension reduction techniques imperative to produce models with acceptable accuracy and generalization. The temptation to build an ecological model using all available information (i.e., all variables) is hard to…


Added by Valiance Solutions on April 21, 2017 at 9:20pm — No Comments

Deep Learning Meets Recommendation Systems

Contributed by Wann-Jiun Ma.  


Almost everyone loves to spend their leisure time to watch movies with their family and friends. We all have the same experience when we sit on our couch to choose a movie that we are going to watch and spend the next two hours but can't even find one after 20 minutes. It is so disappointing. We definitely need a computer agent to provide movie recommendation to us when we need to…


Added by NYC Data Science Academy on April 21, 2017 at 11:00am — No Comments

Mapping NYC Common Core Scores

Contributed by David Letzler. 

Introduction: A Brief History and Description of the Common Core

In 2009 the National Governors’ Association and the Council of Chief State School Officers resolved to develop a set of national education standards known as the Common Core. These were intended to unify what had been to that point a set of highly localized state education standards.  The hope of the Common Core was that a…


Added by NYC Data Science Academy on April 21, 2017 at 10:30am — No Comments

Avoiding Look Ahead Bias in Time Series Modelling

Any time series classification or regression forecasting involves the Y prediction at 't+n' given the X and Y information available till time T. Obviously no data scientist or statistician can deploy the system without back testing and validating the performance of model in history. Using the future actual information in training data which could be termed as "Look Ahead Bias" is probably the gravest mistake a data scientist can make. Even the sentence “we cannot make use future…


Added by Rohit Walimbe on April 21, 2017 at 6:00am — No Comments

How to Sabotage a Successful Data Solution - Data Hoarding, A Ticking Time Bomb

Poor data governance often leads to a failed data state. Too many times companies, and even silos within a company, ignore the importance of data governance. Such neglect leads to increased cost, decreased compliance, and even a complete failure of a data solution.

A key data governance principal to adhere to is data usage. This is a simple one. If…


Added by Eric Mayberry on April 20, 2017 at 9:00am — 2 Comments

50 Important Things You Need to Know About Data Science

According to IBM, the world generates 2.5 quintillion bytes of data every day. A decent chunk of those quintillion bytes is made up of people asking the experts how to break into and excel in the dynamic, lucrative field of data science. An even larger chunk of those bytes consists of convoluted, contradicting answers to that question.

This is, on one hand, a great thing. Multiple prominent data science innovators are out there giving you free advice on your most pressing questions,…


Added by Lauren Delapenha on April 20, 2017 at 6:30am — 1 Comment

How analytics is helping CPG companies drive growth

Consumer Packaged companies (CPG) are grappling with a lot of challenges owing to economic uncertainty, price consciousness, changing demographics, rise of discounters coupled with fast-changing retail needs. The growth of leading CPG players is stagnant and there is a fierce competition as a large number of small firms are venturing into the CPG and retail…

Added by Ashish Sukhadeve on April 19, 2017 at 9:30am — No Comments

Fact or Fiction? Smart Data Visualization Tells the Tale

If you are considering a Business Intelligence solution, you ought to give some consideration to the concept of Smart Data Visualization  and review your prospective solution to determine its capabilities in that regard. Smart Data Visualization provides many benefits to the organization and to the business users, who will leverage the selected BI tools to gather, analyze,…


Added by Arbuda Dave on April 19, 2017 at 2:30am — No Comments

Artificial Intelligence: Look Ma, No Hands!


Added by Rekha Joshi on April 18, 2017 at 12:30pm — No Comments

Detecting Fake News, Fake Reviews, Fake Accounts, Fake Pictures

A while back, I was reading an article posted on Facebook, about Clovis people found alive and well living in Florida, with a picture featuring tribesmen (see below.) The quality of the picture was poor, and the URL was very suspicious: baynews9.com.ddwg.clonezone.link, as to make it appear that it was from Baynews9.com. It turned out that the picture (and thus the whole story) was fake: these people are real…


Added by Vincent Granville on April 18, 2017 at 12:00pm — 2 Comments

Machine Learning Skills Among Data Scientists

This article was posted by Bob E. Hayes on Customer think. Bob, PhD is Chief Research Officer at Appuri. He a scientist, blogger and author on CEM and data science.

Data scientists have a variety of different skills that they bring to bear on Big Data projects. These skills cut across Subject Matter Expertise, Technology, Programming, Math & Modeling and Statistics. One valuable…


Added by Emmanuelle Rieuf on April 18, 2017 at 9:00am — No Comments

There’s Nothing Like a Huge Public Failure to Boost Interest in AI

Summary:  We are swept up by the rapid advances in AI and deep learning, and tend to laugh off AI’s failures as good fodder for YouTube videos.  But those failures are starting to add up.  It’s time to take a hard look at the weaknesses in AI and where that’s leading us.



Added by William Vorhies on April 18, 2017 at 8:04am — No Comments

How to Lie with Data

We expect that data scientists and analysts should be objective and base their conclusions on data. Now while the name of the job implies that “data” is the fundamental material that is used to do their jobs, it is not impossible to lie with it. Quite the opposite – the data scientist is affected by unconscious biases, peer pressure, urgency, and if that’s not enough – there are inherent risks in the process of data analysis and interpretation that lead to lying. It happens all the time…


Added by Karolis Urbonas on April 17, 2017 at 10:30am — 3 Comments

Learn under the hood of Gradient Descent algorithm using excel

When I first started out learning about machine learning algorithms, it turned out to be quite a task to gain an intuition of what the algorithms are doing. Not just because it was difficult to understand all the mathematical theory and notations, but it was also plain boring. When I turned to online tutorials for answers, I could again only see equations or high level explanations without going through the detail in a majority of the cases.…


Added by Jahnavi Mahanta on April 17, 2017 at 9:30am — 6 Comments

Streaming Event Modeling

Data modeling has been a fixture of enterprise architecture since the 1970’s with ANSI defined conceptual, logical, and physical data schema.    As data models developed, so did the availability of templates for business use.   Retail banks use similar data models, as do other industries.  A shared approach to data modeling advanced the discussion and planning of solutions. 

Growth in unstructured data has led to development of tools to search data to identify context or bring…


Added by Paul Stanton on April 16, 2017 at 2:00pm — No Comments

The applications of Artificial Intelligence (AI) in the Telecoms industry


Last week, I spoke at the Swiss Mobile Association. The event was held at one of the oldest  cross-functional research institutes Gottlieb Duttweiler Institute just outside Zurich. Prior to being involved in IoT and AI, I worked for many…


Added by ajit jaokar on April 16, 2017 at 12:00pm — No Comments

Seasons in Binary Star Planetary Systems

Here are a few challenges for the mathematically inclined - most data scientists are. This is just fun problems if you have some time to kill. The first problem is about seasons in binary star planetary systems: it has implications on whether such planets are inhabitable. It is also related to time series with double periodicity.  The next problems are related to infinite products, with an emphasis on building a prime-generating or at least prime-detection function. Large prime numbers are…


Added by Vincent Granville on April 16, 2017 at 11:00am — No Comments

A to Z of Analytics

Analytics has taken world by storm & It it the powerhouse for all the digital transformation happening in every industry.

Today everybody is generating tons of data – we as consumers leaving digital footprints on social media,IoT generating millions of records from sensors, Mobile phones are used from morning till we sleep. All these variety…


Added by Sandeep Raut on April 15, 2017 at 6:30pm — 5 Comments

Featured Monthly Archives












© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service