Michael Walker's Blog (96)

Caveat Data Scientist: Public Trust Low for Science

A new paper entitled "Gaining Trust as well as Respect in Communicating Science Topics" published in the Proceedings of the National…

Added by Michael Walker on September 24, 2014 at 5:07pm — No Comments

Big Data is Stupid Data

The "Big Data" marketing hype obscures the fact that more actionable, valuable insights are likely to be found in the right, smaller "Smart Data" sets than in large data sets.



While the term "Big Data" is properly defined as data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process the data within a tolerable elapsed time - the…

Added by Michael Walker on September 19, 2014 at 11:10am — 2 Comments

Big Data Technology Vendor Consolidation

The "big data" technology vendor market is ripe for consolidation. The myriad of vendors and technologies is causing market confusion. Matt Turck created a nice visualization entitled "…

Added by Michael Walker on September 12, 2014 at 9:09am — 2 Comments

Data Scientists by Education 2014

According to a recent survey by Burtch Works.

Added by Michael Walker on August 25, 2014 at 12:10pm — No Comments

Advanced and Predictive Analytics Vendor Ratings 2014

How will Advanced & Predictive Analytics (APA) affect you? Our inaugural Advanced and Predictive Analytics report can help answer that question! APA is growing, and developing a plan for your organization now is critical!

The 2014 Wisdom of Crowds® Advanced and Predictive Analytics market study contains everything you need to assess this dynamic market…

Added by Michael Walker on July 23, 2014 at 2:30pm — No Comments

Law and Ethics of Online Human Subject Experiments

Facebook data scientists recently conducted an online experiment on 689,003 unwitting Facebook users - likely including children under the age of 18 - to see if they could manipulate and change user emotions. One group had positive words like “love” and “nice” filtered out of their News Feeds. Another group had negative words like “hurt” and “nasty” filtered. The result - published in a paper entitled…

Added by Michael Walker on July 3, 2014 at 11:56am — No Comments

Data Science Summer Reading List 2014

Added by Michael Walker on June 12, 2014 at 7:00pm — No Comments

NoSQL & NewSQL Database Adoption 2014

While MongoDB has been the most popular NoSQL database over the past few years, it appears Cassandra has been the most popular over the past six months. Many assert that Cassandra has superior scalability and better data management features and is faster, and that MongoDB has more moving parts and complexity to cause…

Added by Michael Walker on June 4, 2014 at 4:00pm — 1 Comment

Data science profession: accreditation, code of conduct

Data scientists are the lion kings of data pros while salaries for business intelligence and data warehousing pros are stagnating. 

Actual data scientist salaries are much higher considering many garden…

Added by Michael Walker on May 27, 2014 at 8:30pm — 1 Comment

Bad Data Science and Woody Allen

"Life imitates art far more than art imitates life." - Oscar Wilde



In Woody Allen's 1973 iconoclastic movie "Sleeper" a man (health food store owner) wakes up two hundred years in the future. For breakfast…

Added by Michael Walker on May 12, 2014 at 3:00pm — 1 Comment

The Deadly Data Science Sin of Confirmation Bias

Confirmation bias occurs when people actively search for and favor information or evidence that confirms their preconceptions or hypotheses while ignoring or slighting adverse or mitigating evidence. It is a type of cognitive bias (pattern of deviation in judgment that occurs in…

Added by Michael Walker on April 24, 2014 at 7:30pm — 5 Comments

The Haboob Clouds Hadoop's Future

Hadoop is an open source framework for storing massive amounts of data on clusters of commodity hardware.



Haboob is a dense dust storm that moves fast…

Added by Michael Walker on March 23, 2014 at 9:03am — 3 Comments

The Texas Sharpshooter Deception

I received a call from an old client who stated that his analytics team had suffered a recent string of failures that alarmed the firm and cost it money. He asked me to review and audit the team's work and analytical processes in an attempt to understand and remedy the failures. The data crunching technology was…

Added by Michael Walker on March 12, 2014 at 9:00pm — No Comments

Forecasting with the Baum-Welch Algorithm and Hidden Markov Models

Leonard Baum and Lloyd Welch designed a probabilistic modelling algorithm to detect patterns in Hidden Markov Processes. They built upon the theory of probabilistic functions of a …

Added by Michael Walker on February 24, 2014 at 10:02pm — 1 Comment

Data Silos Obstruct Quest for Competitive Advantage

Data and information silos are a significant problem for organizations getting full value from data. Data silos are separate databases or data files that are not part of an organization's enterprise-wide…

Added by Michael Walker on February 11, 2014 at 8:00pm — 2 Comments

Predicting the Super Bowl

NFL 2013 Team Expected Points Added (EPA) per game - Defense by Offense

The atavistic love of sport, strategy,…

Added by Michael Walker on January 27, 2014 at 7:00pm — No Comments

Markov Logic Networks for Better Decisions

One important goal of data science is to help decision makers make better decisions. Markov…

Added by Michael Walker on January 15, 2014 at 12:24pm — 1 Comment

Boosting Algorithms for Better Predictions

Boosting is a supervised learning algorithm based on …

Added by Michael Walker on January 1, 2014 at 11:30am — 1 Comment

The Data Bug

Regarding defects in human-built systems, the term "bug" appears…

Added by Michael Walker on December 16, 2013 at 9:30am — No Comments

Lambda Architecture for Big Data Systems

Big data analytical ecosystem architecture is in the early stages of development. Unlike traditional data warehouse / business intelligence (DW/BI) architecture, which is designed for structured,…

Added by Michael Walker on December 4, 2013 at 7:57am — 3 Comments

© 2019 Data Science Central ®