We propose simple solutions to important problems that all data scientists face almost every day. In short, a toolbox for the handyman, useful to busy professionals in an...
This is part 3 of a 3 part series: “How to make your mark on the world as a talented, socially conscious data scientist.” In the first post in this series, we explor...
This is the fourth article in a DeepTech Series by Margaretta Colangelo and Dmitry Kaminskiy. Dmitry Kaminskiy, General Partner at Deep Knowledge Ventures, is based i...
This article was written by Michael Grogan. It is often the case that a dataset contains significant outliers – or observations that are significantly out of range from...
Companies are ramping up their adopting of cloud-based Software-as-a-Service applications. Public cloud spending is expected to reach over $206 billion in 2019. Large ent...
Last time, I posted Part 2 of a blog trilogy on data programming with Python. That article revolved on showcasing NumPy, a comprehensive library created in 2005 that ...
PREFACE This article has 4 sections: Introduction: Introduction to the gambler’s ruin problem Methodology: How the simulation will be carried out Pseudocode: Summar...
Majority of modern companies deal with processes which they want to be automated. This need can be caused by various reasons, in particular, due to the routine, repetitiv...
Introduction to topic model: In machine learning and natural language processing, a topic model is a type of statistical model for discovering the a...
Interaction plots are used to understand the behavior of one variable depends on the value of another variable. Interaction effects are analyzed in regression analysis, D...