
This is not a discussion of whether the data is flawed, or whether we are comparing apples to oranges (given the way statistics are gathered in different countries). These are of course fundamental questions, but here I will only use data (provided by Google) that everyone seems to more or less agree with, and I am not questioning it…

Added by Vincent Granville on July 18, 2021 at 7:30pm — 1 Comment

The original version of the central limit theorem (CLT) assumes *n* independent and identically distributed (i.i.d.) random variables *X*₁, ..., *X*ₙ with finite variance. Let *S*ₙ = *X…*

Added by Vincent Granville on July 15, 2021 at 7:30pm — No Comments
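The classical i.i.d. setting mentioned in the central limit theorem excerpt above can be checked numerically. The following is a minimal sketch, not taken from the article: it assumes Uniform(0, 1) terms, and the function name `standardized_sums`, the sample sizes, and the seed are all hypothetical choices made for illustration. It standardizes the sum of *n* terms and checks that the result behaves roughly like a standard normal variable.

```python
import random
import statistics


def standardized_sums(n, trials, seed=0):
    """Return `trials` draws of (S_n - n*mu) / (sigma * sqrt(n)).

    S_n is the sum of n i.i.d. Uniform(0, 1) variables (an assumption
    made for this sketch), so mu = 1/2 and sigma^2 = 1/12.
    """
    rng = random.Random(seed)
    mu = 0.5
    sigma = (1 / 12) ** 0.5
    draws = []
    for _ in range(trials):
        s_n = sum(rng.random() for _ in range(n))
        draws.append((s_n - n * mu) / (sigma * n ** 0.5))
    return draws


z = standardized_sums(n=50, trials=5000)

# For a standard normal variable: mean 0, standard deviation 1,
# and about 68% of the mass within one standard deviation of 0.
mean_z = statistics.fmean(z)
std_z = statistics.pstdev(z)
share_within_one = sum(abs(v) <= 1 for v in z) / len(z)

print(f"mean={mean_z:.3f}  std={std_z:.3f}  P(|Z|<=1)={share_within_one:.3f}")
```

The simulation only covers the independent case; it says nothing about the non-independent variables that the article title refers to.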


If you are wondering how much a data scientist earns, whether you are a hiring manager or looking for a job, there are plenty of websites…

Added by Vincent Granville on July 7, 2021 at 7:49pm — No Comments

This is part of a series featuring the following aspects of machine learning:

- Mathematics, simulations, benchmarking algorithms based on synthetic data (in short,…

Added by Vincent Granville on June 19, 2021 at 7:26am — No Comments

In this article, I describe the various steps involved in managing a machine learning process from beginning to end. Depending on which company you work for, you may or may not be involved in all the steps. In larger companies, you typically focus on one or two specialized aspects of a project. In small companies, you may be involved in all…

Added by Vincent Granville on June 12, 2021 at 7:00pm — No Comments

As a machine learning professional, I have worked for several startups ranging from zero to 600 employees, as well as companies such as eBay, Wells Fargo, Visa and Microsoft. Here I share my experience. A brief summary can be found in my conclusions, at the bottom of this article.

It is not easy to define what a startup is. The first…

Added by Vincent Granville on June 3, 2021 at 6:30pm — 1 Comment

In this article, I illustrate the concept of an asymmetric key with a simple example. Rather than discussing algorithms such as RSA (still widely used, for instance to set up a secure website), I focus on a system that is easier to understand, based on random permutations. I discuss how to generate these random permutations and compound them, and how to…

Added by Vincent Granville on June 1, 2021 at 7:30pm — No Comments
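The building blocks named in the cryptography excerpt above (generating random permutations and compounding them) can be sketched in a few lines. This is only an illustration under stated assumptions, not the article's system: the helper names `random_permutation`, `compose`, `invert`, and `permute` are hypothetical, and the example is a plain reversible scramble rather than an asymmetric-key scheme.

```python
import random


def random_permutation(n, rng):
    """Return a uniformly random permutation of 0..n-1 (Fisher-Yates via shuffle)."""
    perm = list(range(n))
    rng.shuffle(perm)
    return perm


def compose(p, q):
    """Compound two permutations: apply q first, then p."""
    return [p[q[i]] for i in range(len(p))]


def invert(p):
    """Return the inverse permutation, so that compose(p, invert(p)) is the identity."""
    inverse = [0] * len(p)
    for i, image in enumerate(p):
        inverse[image] = i
    return inverse


def permute(p, message):
    """Scramble a message: the symbol at position i moves to position p[i]."""
    out = [None] * len(message)
    for i, symbol in enumerate(message):
        out[p[i]] = symbol
    return out


rng = random.Random(42)  # fixed seed so the sketch is reproducible
n = 8
a = random_permutation(n, rng)
b = random_permutation(n, rng)
ab = compose(a, b)  # compound permutation: b, then a

message = list("ABCDEFGH")
scrambled = permute(ab, message)
recovered = permute(invert(ab), scrambled)

print("".join(scrambled), "->", "".join(recovered))
```

Applying `b` and then `a` one after the other gives the same result as applying the compound permutation `ab` once, which is why compounding is a convenient way to build new permutations from existing ones.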

Many of us have solid foundations in math or an interest in learning more, and are passionate about solving difficult problems in our free time. Of course, most of us are not professional mathematicians, but we may bring some value to help solve some of the most challenging mathematical conjectures, especially the ones that can be…

Added by Vincent Granville on May 25, 2021 at 7:30pm — No Comments

This is part of a series featuring the following aspects of machine learning:

- Mathematics, simulations, benchmarking algorithms based on synthetic data (in short, experimental data science)
- Opinions, for instance about the value of a PhD in our field, or the use of some…

Added by Vincent Granville on May 19, 2021 at 6:26pm — No Comments

*Orbit of the Riemann zeta function in the complex plane (see also here)*

Despite my long statistical and machine learning career both in academia and in…

Added by Vincent Granville on May 9, 2021 at 8:00am — 1 Comment


Here is what I learned from practicing machine learning in business settings for over two decades, and prior to that in academia. Back in the nineties, it was known as computational…

Added by Vincent Granville on May 3, 2021 at 9:00pm — No Comments

Added by Vincent Granville on April 29, 2021 at 6:57pm — No Comments

Here are some off-the-beaten-path options to consider when looking for a first job, a new job, or extra income by leveraging your machine learning experience. Many were offers that came to my mailbox at some point in the last 10 years, mostly from people looking at my LinkedIn profile. Thus the importance of growing your network and visibility, write…

Added by Vincent Granville on April 19, 2021 at 4:30pm — No Comments

We describe here a methodology that applies to any statistical test, and illustrate it in the context of assessing independence between successive observations in a data set. After reviewing a few standard approaches, we discuss our methodology, its benefits, and its drawbacks. The data used here for illustration purposes has known theoretical…

Added by Vincent Granville on April 7, 2021 at 9:00pm — No Comments

*Source: See article #5, in section 1*

Part 2 of this short series focused on fundamental techniques (see here). In Part 3, you will find several…

Added by Vincent Granville on April 5, 2021 at 6:59pm — No Comments

Chaos is defined in many ways, with each scientific field and each expert having its own definition. We share here a few of the most common metrics used to quantify the level of chaos in univariate time series or data sets. We also introduce a new, simple definition based on metrics that are familiar to everyone. Generally speaking, chaos…

Added by Vincent Granville on March 28, 2021 at 3:00pm — No Comments

Added by Vincent Granville on March 21, 2021 at 8:30pm — No Comments


There are numerous examples in machine learning, statistics, mathematics and deep learning that require an algorithm to solve some complicated equations: for instance, maximum likelihood…

Added by Vincent Granville on March 7, 2021 at 6:30pm — No Comments

Added by Vincent Granville on March 3, 2021 at 4:44pm — No Comments


In Part 1 of this short series, I included the most interesting articles that I wrote in the last few years. This part focuses on the business analytics / BI /…

Added by Vincent Granville on February 21, 2021 at 2:30pm — No Comments

- Covid-19: Fundamental Statistics that are Ignored
- Central Limit Theorem for Non-Independent Random Variables
- Salary Trends for Data Scientists and Machine Learning Professionals
- More Fun Math Problems for Machine Learning Practitioners
- The Machine Learning Process in 7 Steps
- The Pros and Cons of Working for a Startup
- Simple Introduction to Public-Key Cryptography and Cryptanalysis: Illustration with Random Permutations






