Problem statement Price and quantity sold are the two determinants of business revenue/profit. At higher price the revenue is expected to be high. But this is not the cas...
It has been popularly noted that artificial intelligence would be like the ultimate version of Google. With recent advancements in research and technology, Artificial Int...
Introduction Automated machine learning is a fundamental shift to machine learning and data science. Data science as it stands today, is resource-intensive, expensive ...
Naive Bayes is a deceptively simple way to find answers to probability questions that involve many inputs. For example, if you’re a website owner, you might be inte...
Lecture notes for the Statistical Machine Learning course taught at the Department of Information Technology, University of Uppsala (Sweden.) Updated in March 2019. Aut...
A little less than a year ago, I posted a blog on generating multivariate frequencies with the Python Pandas data management library, at the same time showcasing Pyt...
Hadoop and Spark are software frameworks from Apache Software Foundation that are used to manage ‘Big Data’. There is no particular threshold size which classi...
Summary: Finally there are tools that let us transcend ‘correlation is not causation’ and identify true causal factors and their relative strengths in our models....
So many fascinating and deep results have been written about the number (1 + SQRT(5)) / 2 and its related sequence – the Fibonacci numbers – that it would tak...