I am investigating if some numbers like Pi or SQRT(2) are normal in base 2 or 10, that is, whether any sequence of digits appear with the expected frequency in their decimal representation. Actually, I am even more interested in the nested square root representation (see here) where "digits" are either 0 (with probability 0.43), 1 (with probability 0.30) or 2 (with probability 0.23.) Pretty much all numbers have that particular distribution for the nested square root representations.

A famous result claims that pretty much all numbers are normal, in the sense that the set of non-normal numbers, though infinite, has a Lebesgue measure equal to 0. Few non-normal numbers are known, and these cases are artificially manufactured, such as the number 0.10110111011110... It is easy to prove, for instance, that numbers not containing the digit 5 in their base-10 representation (a subset of non-normal numbers) are very rare - so rare that the probability for any real number to have this property, is actually 0: Among the numbers with *n* digits, a proportion *p* = (9/10)^*n* do not contain the digit 5; and as *n* tends to infinity, *p* tends to zero. But this raises interesting questions (**paradoxes**):

- Since a uniform distribution for the digits, is just one example of distribution among many possible distributions, one would think that normal numbers are incredibly rare, as they would be far outnumbered by numbers having any kind of digit distribution, other than uniform. However the converse is true.
- Consider the set S' of numbers in [0, 1] with all digits being duplicated, such as 0.22774499990033..., and let S be the interval [0,1]. There is a one-to-one mapping (bijection) between S and S'. Yet the Lebesgue measure of S is equal to 1. How come can the Lebesgue measure of S' be zero? (S' is a subset of non-normal numbers, for instance no number in S' contain 123 anywhere in its decimal representation.)
- Thus it seems that pretty much all numbers are "random" in some sense, which is counter-intuitive.

How do you explain these paradoxes? Is it possible to have two sets S and S' with a bijection between them, have one of them with a non-zero Lebesgue measure, and the other either with a zero measure or un-measurable? Is it possible that the probability for any real number to belong to S' is zero, yet S' is allowed to have a strictly positive measure?

These are questions for people interested in measure theory and random numbers. Read more about this subject, here.

**DSC Resources**

- Services: Hire a Data Scientist | Search DSC | Classifieds | Find a Job
- Contributors: Post a Blog | Ask a Question
- Follow us: @DataScienceCtrl | @AnalyticBridge

Popular Articles

- Difference between Machine Learning, Data Science, AI, Deep Learnin...
- What is Data Science? 24 Fundamental Articles Answering This Question
- Hitchhiker's Guide to Data Science, Machine Learning, R, Python
- Advanced Machine Learning with Basic Excel

a proportionIf a number between 0 and 1, with n digits, this feature, and its the decimal representation is (a1)...(an) for the first n digits, then (a1)

© 2020 Data Science Central ® Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Upcoming DSC Webinar**

- Natural Language Trends in Visual Analysis - Aug 6

In this latest Data Science Central webinar, Vidya will discuss how natural language can be leveraged in various aspects of the analytical workflow ranging from smarter data transformations, visual encodings, autocompletion to supporting analytical intent. More recently, chatbot systems have garnered interest as conversational interfaces for a variety of tasks. Machine learning approaches have proven to be promising for approximating the heuristics and conversational cues for continuous learning in a chatbot interface. Register today.

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Statistics -- New Foundations, Toolbox, and Machine Learning Recipes
- Book: Classification and Regression In a Weekend - With Python
- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Upcoming DSC Webinar**

- Natural Language Trends in Visual Analysis - Aug 6

In this latest Data Science Central webinar, Vidya will discuss how natural language can be leveraged in various aspects of the analytical workflow ranging from smarter data transformations, visual encodings, autocompletion to supporting analytical intent. More recently, chatbot systems have garnered interest as conversational interfaces for a variety of tasks. Machine learning approaches have proven to be promising for approximating the heuristics and conversational cues for continuous learning in a chatbot interface. Register today.

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions

## You need to be a member of Data Science Central to add comments!

Join Data Science Central