Subscribe to DSC Newsletter
.

Vincent Granville's Blog (1,559)

A Plethora of Machine Learning Articles: Part 2

Source:…

Continue

Added by Vincent Granville on March 3, 2021 at 4:44pm — No Comments

A Plethora of Machine Learning Articles: Part 1

Source: see here

In Part 1 of this short series, I have included the most interesting articles that I wrote in the last few years. This part focuses on the business analytics / BI /…

Continue

Added by Vincent Granville on February 21, 2021 at 2:30pm — No Comments

Maximum runs in Bernoulli trials: simulations and results

Bernoulli trials are random experiments with two possible outcomes: "yes" and "no" (in the case of polls),  "success" and "failure" (in the case of gambling or clinical trials). The trials are independent from each other: for instance tossing a coin multiple times, or testing the success of a new drug against a specific…

Continue

Added by Vincent Granville on February 15, 2021 at 11:00pm — No Comments

More Surprising Math Images

To zoom in on any picture, click on the image to get a higher resolution.

This a follow up to my previous article here, where you can find additional, very different images, the theory behind it, and relevance to machine learning techniques. What is surprising is that all these images were produced with a formula with a single parameter λ, and…

Continue

Added by Vincent Granville on February 7, 2021 at 7:30pm — No Comments

Beautiful Mathematical Images

To zoom in on any picture, click on the image to get a higher resolution.

Figure 1: The pillow basins (see section 3)

The topic discussed here is closely related to optimization techniques in machine…

Continue

Added by Vincent Granville on February 2, 2021 at 10:30am — 1 Comment

Can a Diploma from a Lower Ranking University Hurt your Data Science Career Prospects?

Here I specifically discuss the case of a PhD degree from a third-tier university, though to some extent, it also applies to master degrees. Many professionals joining companies such as Facebook, Microsoft, or Google in a role other than a programmer, typically have a PhD degree, although there are many exceptions. It is still possible to learn data science on the job, especially if you have a quantitative background (say in physics or engineering) and have experience working with serious…

Continue

Added by Vincent Granville on January 28, 2021 at 7:16pm — No Comments

Moving Averages: Natural Weights, Iterated Convolutions, and Central Limit Theorem

Convolution is a concept well known to machine learning and signal processing professionals. In this article, we explain in simple English how a moving average is actually a discrete convolution, and we use this fact to build weighted moving averages with natural weights that at the limit, have a Gaussian behavior guaranteed by the Central Limit Theorem. Moving averages are nothing more than blurring filters for signal processing experts, with a Gaussian-like kernel in the case discussed…

Continue

Added by Vincent Granville on January 25, 2021 at 5:00pm — No Comments

Machine Learning / Stats / BI: Mini Translation Dictionary

Here I provide translations for various important terms, to help professionals from related backgrounds better understand each other. In particular, machine learning professionals versus statisticians.

Source for picture:…

Continue

Added by Vincent Granville on January 18, 2021 at 9:12pm — No Comments

Deep visualizations to Help Solve Riemann's Conjecture

This is the second part of my article Spectacular Visualization: The Eye of the Riemann Zeta Function, focusing on the most infamous unsolved mathematical conjecture, one that has a $1 million dollar price attached to it. I used the word deep not in the sense of deep neural networks, but because the implications of these…

Continue

Added by Vincent Granville on January 5, 2021 at 9:00pm — No Comments

Spectacular Visualization: The Eye of the Riemann Zeta Function

We discuss here one of the most famous unsolved mathematical conjectures of all times, one among seven that has a $1 million award attached to it, see here. It is known as the Riemann Hypothesis and abbreviated as RH. Of course I did not solve it (yet), but the material presented here offers a new…

Continue

Added by Vincent Granville on January 2, 2021 at 11:30am — No Comments

Opening a New Restaurant in Covid Times

I am a data scientist, and decided to open a restaurant last November, 10 days before the governor in my state banned dining-in (who knows for how long) and customers were already rare. Some data scientists in managerial positions dream about exiting the corporate world and envied me, at least before the Covid, when I told them my plan.

Here I explore the options and opportunities available, and this article reflects my optimism. I will also discuss analytics in some detail. The…

Continue

Added by Vincent Granville on December 22, 2020 at 9:44pm — No Comments

Amazing Things You Did Not Know You Could Do in Excel

I have included a lot of Excel spreadsheets in the numerous articles and books that I have written in the last 10 years, based either on real life problems or simulations to test algorithms, and featuring various machine learning techniques. It is time to create a new blog series focusing on these useful techniques that can easily be handled with Excel. Data scientists typically use programming languages and other visual tools for these techniques, mostly because they are unaware that it can…

Continue

Added by Vincent Granville on December 16, 2020 at 8:00pm — No Comments

All-purpose, Robust, Fast, Simple Non-linear Regression

Announcements

  • Watch APEXX W3: The Data Science Workstation, and learn how an NVIDIA-certified BOXX workstation can accelerate your workflow. Access video here
  • Use real-time anomaly detection reference patterns to combat fraud | Google. Read full article.
  • Merrimack College offers three online…
Continue

Added by Vincent Granville on December 16, 2020 at 9:22am — No Comments

New Tests of Randomness and Independence for Sequences of Observations

There is no statistical test that assesses whether a sequence of observations, time series, or residuals in a regression model, exhibits independence or not. Typically, what data scientists do is to look at auto-correlations and see whether they are close enough to zero. If the data follows a Gaussian distribution, then absence of auto-correlations implies independence. Here however, we are dealing with non-Gaussian observations. The setting is similar to testing whether a pseudo-random…

Continue

Added by Vincent Granville on December 2, 2020 at 4:30pm — No Comments

Covid-19: My Predictions for 2021

Here I share my predictions as well as personal opinion about the pandemic. My thoughts are not derived from running sophisticated models on vast amounts of data. Much of the data available has major issues anyway, something I am also about to discuss. There are some bad news and some good news. This article discusses what I believe are the good news and bad news, as well a some attempt at explaining people behavior and reactions, and resulting consequences. My opinion is very different from…

Continue

Added by Vincent Granville on November 29, 2020 at 10:30pm — No Comments

Introducing an All-purpose, Robust, Fast, Simple Non-linear Regression

The model-free, data-driven technique discussed here is so basic that it can easily be implemented in Excel, and we actually provide an Excel implementation. It is surprising that this technique does not pre-date standard linear regression, and is rarely if ever used by statisticians and data scientists. It is related to kriging and nearest neighbor interpolation, and apparently first mentioned in 1965 by Harvard scientists working on GIS (geographic information systems). It was referred…

Continue

Added by Vincent Granville on November 23, 2020 at 6:00pm — No Comments

Interesting Application of the Poisson-Binomial Distribution

While the Bernoulli and binomial distributions are among the first ones taught in any elementary statistical course, the Poisson-Binomial is rarely mentioned. It is however one of the simplest discrete distributions, with applications in survey analysis, see here. In this article, we are dealing with…

Continue

Added by Vincent Granville on November 10, 2020 at 6:30pm — 2 Comments

Thursday News, October 29

Here is our selection of featured articles and technical resources posted since Monday:

Announcements

DSC Articles

Continue

Added by Vincent Granville on October 29, 2020 at 9:00am — No Comments

Weekly Digest, October 26

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Featured Resources and Technical…

Continue

Added by Vincent Granville on October 25, 2020 at 12:00pm — No Comments

Thursday News, October 22

Here is our selection of featured articles and technical resources posted since Monday:

Announcement

Technical Resources

Continue

Added by Vincent Granville on October 22, 2020 at 8:30am — No Comments

Monthly Archives

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service