Michael Walker's Blog Posts Tagged 'Data' (74)

Data Science Summer Reading List 2014

Added by Michael Walker on June 12, 2014 at 7:00pm — No Comments

Data science profession: accreditation, code of conduct

Data scientists are the lion kings of data pros, while salaries for business intelligence and data warehousing pros are stagnating.

Actual data scientist salaries are much higher considering many garden…

Added by Michael Walker on May 27, 2014 at 8:30pm — 1 Comment

Bad Data Science and Woody Allen

"Life imitates art far more than art imitates life." - Oscar Wilde



In Woody Allen's iconoclastic 1973 movie "Sleeper," a man (a health food store owner) wakes up two hundred years in the future. For breakfast…

Added by Michael Walker on May 12, 2014 at 3:00pm — 1 Comment

The Deadly Data Science Sin of Confirmation Bias

Confirmation bias occurs when people actively search for and favor information or evidence that confirms their preconceptions or hypotheses while ignoring or slighting adverse or mitigating evidence. It is a type of cognitive bias (pattern of deviation in judgment that occurs in…

Added by Michael Walker on April 24, 2014 at 7:30pm — 5 Comments

The Haboob Clouds Hadoop's Future

Hadoop is an open source framework for storing massive amounts of data on clusters of commodity hardware.

A haboob is a dense dust storm that moves fast…

Added by Michael Walker on March 23, 2014 at 9:03am — 3 Comments

The Texas Sharpshooter Deception

I received a call from an old client who said his analytics team had suffered a recent string of failures that was alarming the firm and costing money. He asked me to review and audit the team's work and analytical processes in an attempt to understand and remedy the failures. The data crunching technology was…

Added by Michael Walker on March 12, 2014 at 9:00pm — No Comments

Data Silos Obstruct Quest for Competitive Advantage

Data and information silos are a significant obstacle for organizations trying to get full value from their data. Data silos are separate databases or data files that are not part of an organization's enterprise-wide…

Added by Michael Walker on February 11, 2014 at 8:00pm — 2 Comments

The Data Bug

Regarding defects in human-built systems, the term "bug" appears…

Added by Michael Walker on December 16, 2013 at 9:30am — No Comments

Lambda Architecture for Big Data Systems

Big data analytical ecosystem architecture is in the early stages of development. Unlike traditional data warehouse / business intelligence (DW/BI) architecture, which is designed for structured,…

Added by Michael Walker on December 4, 2013 at 7:57am — 3 Comments

Models vs. Experiments

At the Rose Professional Data Science Practice, we see many organizations spending a majority of…

Added by Michael Walker on November 17, 2013 at 8:30am — No Comments

Sensors Here, There and Everywhere

Smart organizations are using the power of data science and data produced by embedded sensors and machine devices to better measure performance, discover patterns, prevent problems, and improve…

Added by Michael Walker on October 29, 2013 at 11:30am — 3 Comments

The Professionalization of Data Science

There has been much discussion and debate about the definition of data science and the new rare breed of sexy bird called the data scientist. The …

Added by Michael Walker on October 22, 2013 at 10:13pm — 2 Comments

Fooled by Twitter Data

Data scientists must always remember that data sets are not objective: they are selected, collected, filtered, structured and analyzed by human design. Naked and hidden biases in selecting,…

Added by Michael Walker on October 7, 2013 at 9:14pm — No Comments

High Performance Computing + Data Science = Competitive Advantage

High Performance Computing (HPC) plus data science allows public and private organizations to get…

Added by Michael Walker on September 17, 2013 at 12:28pm — 1 Comment

Prescriptive Analytics

Business analytics comes in three (3) general flavors: descriptive, predictive and prescriptive. See: …

Added by Michael Walker on August 27, 2013 at 2:00pm — No Comments

Batch vs. Real Time Data Processing

Batch data processing is an efficient way of processing high volumes of data, in which a group of transactions is collected over a period of time. Data is collected, entered, processed and then the batch results are…

Added by Michael Walker on August 13, 2013 at 2:30pm — 2 Comments

Hadoop Falcon and Data Lifecycle Management

Data management in the Hadoop ecosystem is still in the early stages of development. The goal…

Added by Michael Walker on July 30, 2013 at 12:13pm — No Comments

Demand for Data Scientists and the Datification of Business

Source: EMC2 Survey.

You cannot improve and manage what you cannot…

Added by Michael Walker on July 16, 2013 at 1:00pm — 2 Comments

Data Science Summer Reading List 2013

Machine Learning: A Probabilistic Perspective, by Kevin Murphy.

Boosting: Foundations and Algorithms, by Robert E. Schapire.

Models Behaving Badly: Why Confusing Illusion with Reality Can Lead to Disaster, by Emanuel Derman.

Doing Data Science, by Cathy O'Neil and Rachel…

Added by Michael Walker on July 9, 2013 at 3:00pm — 1 Comment

Data Scientists vs. Data Engineers

More and more frequently we see organizations make the mistake of mixing and confusing team roles on a data science or "big data" project - resulting in over-allocation of responsibilities assigned to …

Added by Michael Walker on July 2, 2013 at 12:01pm — 9 Comments
