Ensemble methods take several machine learning techniques and combine them into one predictive model. It is a two step process: Generate the Base Learners: Choose any c...
Introduction Interactive notebooks are experiencing a rise in popularity. How do we know? They’re replacing PowerPoint in presentations, shared around organizations, an...
Summary: Whether you’re a data scientist building an implementation case to present to executives or a non-data scientist leader trying to figure this out there’s a...
Machine Learning (ML) has been among the top strategies for almost every organization – whoever adopts the new methodology early and quickly establishes the corpora...
Logistic regression is regressing data to a line (i.e. finding an average of sorts) so you can fit data to a particular equation and make predictions for your data. This ...
This article, written by the Facebook research team, was written by Ben Letham, Brian Karrer, Guilherme Ottoni and Eytan Bakshy. It features some of the techniques used b...
This is a simple overview of the k-NN process. Perhaps the most challenging step is finding a k that’s “just right”. The square root of n can put you i...
Summary: Based on a McKinsey study we reported that 47% of companies had at least one AI/ML implementation in place. Looking back at the data and the dominance of RPA...
Determining sample sizes is a challenging undertaking. For simplicity, I’ve limited this picture to the one of the most common testing situation: testing for differ...
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, decision trees, ensembles, correlati...