Scale-Invariant Clustering and Regression
The impact of a change of scale, for instance using years instead of days as the unit of measurement for one variable in a clustering… Read More »Scale-Invariant Clustering and Regression
Author and Publisher at MLtechniques.com. Machine learning scientist, mathematician, book author (Wiley), patent owner, former post-doc at Cambridge University, former VC-funded executive, with 20+ years of corporate experience including CNET, NBC, Visa, Wells Fargo, Microsoft, eBay. Vincent also founded and co-founded a few start-ups, including one with a successful exit (Data Science Central acquired by TechTarget).
In this article, we show that the issue with polynomial regression is not over-fitting, but numerical precision. Even if done right, numerical precision still remains… Read More »Deep Dive into Polynomial Regression and Overfitting
In Part 1 of this article (see here) we featured the two results below, as well as a simple way to prove these formulas. Here,… Read More »Two Beautiful Mathematical Results – Part 2
Here are some recommended books for data science practitioners, covering machine learning, deep learning, business intelligence, forecasting, text analytics and much more. Book Repository for… Read More »Reading List for Data Scientists
I wrote about this long ago (see here in 2014), and so did many other practitioners. This new post shows more maturity I think, a more… Read More »Four Types of Data Scientists
This question was recently posted on Quora, and generated a lot of answers. Here is mine: What differentiates a real doctor from a fake doctor?… Read More »How do you identify an actual data scientist?
For a person from a non-statistical background, the most confusing aspect of statistics is always the fundamental statistical tests, and when to use which.… Read More »Statistical Tests – When to use Which ?
This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers,… Read More »Programming Languages for Data Science and ML – With Source Code Illustrations
Summary: There are some interesting use cases where combining CNNs and RNN/LSTMs seems to make sense and a number of researchers pursuing this. However, the latest… Read More »Combining CNNs and RNNs – Crazy or Genius?