Three fallacies about Covid-19
It is surprising to see the level of innumeracy in the population, even in college-educated professionals. People still have blind faith in so-called experts and…
This rubric covers the use of statistical tools working on large datasets to create models and derive inferences, as well as coverage of the field in its entirety. This differs from machine learning primarily in that the latter focuses on functional gradient analysis or neural networks (kernels) to derive models.
How oversampling yielded great results for classifying cases of sexual harassment. The problem: overcoming an imbalanced data set. When it comes to data science, sexual… Read More »Overcoming an Imbalanced Dataset using Oversampling.
In my previous article, we analyzed Turkey's COVID-19 data and selected the cubic model for predicting the spread of the disease. In this article,… Read More »Model Selection: Adjusted Coefficient of Determination-Variance Tradeoff
(This article is now a chapter of my github proto-book Bayesuvius) Simpson’s paradox is a recurring nightmare for all statisticians overseeing a clinical trial for… Read More »Simpson’s Paradox, the Bane of Clinical Trials
Correlation is a measure of linear association between two variables X and Y, while linear regression is a technique to make predictions, using the following model: Y = a0 + a1 X1 +… Read More »Difference Between Correlation and Regression in Statistics
The following graphic is based on Sam Priddy’s excellent DSC/Tableau Webinar How to Accelerate and Scale Your Data Science Workflows. Sam covered many interesting points for… Read More »How to Communicate Data
I just uploaded a new chapter to my github proto-book “Bayesuvius”. This chapter deals with Reinforcement Learning (RL) done right, i.e., with Bayesian Networks 🙂… Read More »Added Chapter on Reinforcement Learning to my book “Bayesuvius” on Bayesian Networks
Data scientist ranks third on the list of LinkedIn emerging jobs of 2020. Similarly, it ranks first on Glassdoor’s hottest jobs of 2020. The data… Read More »Why Data Science is a hot Career in 2020
What is a commodity? Commodities are basic raw materials with certain standards that are used with other goods; commodities are often the basis of the… Read More »Data As Commodity: For Data Science Professional
Recently, Netflix launched a Rs 199 ($2.8) mobile-only monthly plan in India. CyberMedia Research (CMR) reported that after the release of the new mobile plan, Netflix… Read More »The Rise and Rise of Price Analytics!