Machine learning has the ability to automate a lot of jobs in the future. It is very easy to talk about this automation when it isn’t your job that will be automate...
Although I deal with many different types of metrics, I believe they can be generally classified as follows: 1) time use; 2) alignment; 3) production; 4) performance; 5) ...
Mathematical Olympiads are popular among high school students. However, there is nothing similar for college students, except maybe IMC. Even IMC is not popular. It focu...
In this presentation we will explore how innovative companies are modernizing their tech stack and workflows to allow data scientists to spend more of their time doing a...
The list below is a (non-comprehensive) selection of what I believe should be taught first, in data science classes, based on 30 years of business experience. This is a f...
In this post I will share some tips I learned after using the Apache Hadoop environment for some years, and doing many many workshops and courses. The information here ...
This big picture view lays the foundation of our book Data Science for the Internet of Things. (Co-authored by Ajit Jaokar, Jean Jacques Bernard and Sukanya Mandal) We ad...
The list below is a (non-comprehensive) selection of what I believe should be taught first, in data science classes, based on 30 years of business experience. This is a f...
Apache Spark™ has become the de-facto data processing and AI engine in enterprises today due to its speed, ease of use, and sophisticated analytics. As the first Unifie...
AI and machine learning are everywhere. Most decisions affecting every aspect of our lives are being made based on anomalies, classifications, and predictions. Even gover...