Apache Hadoop Admin Tips and Tricks
In this post I will share some tips I learned after using the Apache Hadoop environment for some years, and doing many many workshops and… Read More »Apache Hadoop Admin Tips and Tricks
In this post I will share some tips I learned after using the Apache Hadoop environment for some years, and doing many many workshops and… Read More »Apache Hadoop Admin Tips and Tricks
This big picture view lays the foundation of our book Data Science for the Internet of Things. (Co-authored by Ajit Jaokar, Jean Jacques Bernard and… Read More »Data Science for Internet of Things – The Big Picture
The list below is a (non-comprehensive) selection of what I believe should be taught first, in data science classes, based on 30 years of business… Read More »The First Things you Should Learn as a Data Scientist – Not what you Think
AI and machine learning are everywhere. Most decisions affecting every aspect of our lives are being made based on anomalies, classifications, and predictions. Even governmental… Read More »Are YOU the Outlier?
It is crucial to ask the right questions and/or understand the problem, prior to beginning data analysis. Below is a list of 20 questions you… Read More »20 Questions to Ask Prior to Starting Data Analysis
My daughter just started a business analytics Master’s program. For the probability sequence of the core statistics course, one of her assignments is to calculate… Read More »Poker, Probability, Monte Carlo, and R
Black hat data science consists of techniques designed to fool existing algorithms (Google search, Amazon rankings, and so on), compromising or tampering with the metrics… Read More »Black Hat Data Science
You know who you are. A high-calibre machine learning magician, a well-versed wrangler of data… but you want a bit more from your role. That… Read More »Are You Ready To Become A Chief Data Scientist?
Predictive analytics uses current and historical data in order to determine the probability of a particular outcome. This is a particularly powerful approach when it… Read More »The Role of Predictive Analytics in Medical Diagnosis
R is a well-known and increasingly popular tool in the Data Science field. It is a programming language and a software environment primarily designed for… Read More »Top 20 R Libraries for Data Science in 2018 [Infographic]