In this article, Data Scientist Pramit Choudhary provides an introduction to both statistical and machine learning-based approaches to anomaly detection in Python. Introd...
This Python cheat sheet was produced by DataCamp. Click on the image to zoom in. The original, published here, is available as a PDF document. The translation from PDF to...
This infographic comes from TechEmergence. TechEmergence is a market research firm specializing in the applications and implications of artificial intelligence/machine ...
Multicollinearity (Collinearity) is not a new term especially when dealing with multiple regression models. This phenomenon of relationship in between one response variab...
This is a curated list of the most cited deep learning papers (since 2012) posted by Terry Taewoong Um. Source for picture: What is deep learning and how does it work? ...
I recently read an MIT Sloan Management Review article by Clayton Christensen’s recent book titled “The Hard Truth About Business Model Innovations.” While the art...
It’s a complete tutorial on data wrangling or manipulation with R. This tutorial covers one of the most powerful R package for data wrangling i.e. dplyr. This packa...
As we all know CRISP DM stands for Cross Industry Standard Process for Data Mining is a process model that outlines the most common approach to tackle data driven problem...
In 2017, it’s clear that the hospitality industry needs to prepare for even more disruptions that will unfold over the coming year. Airbnb, OTAs, start-ups, big data co...
Random Forests algorithm has always fascinated me. I like how this algorithm can be easily explained to anyone without much hassle. One quick example, I use very frequent...