.

I am investigating if some numbers like Pi or SQRT(2) are normal in base 2 or 10, that is, whether any sequence of digits appear with the expected frequency in their decimal representation. Actually, I am even more interested in the nested square root representation (see here) where "digits" are either 0 (with probability 0.43), 1 (with probability 0.30) or 2 (with probability 0.23.) Pretty much all numbers have that particular distribution for the nested square root representations.

A famous result claims that pretty much all numbers are normal, in the sense that the set of non-normal numbers, though infinite, has a Lebesgue measure equal to 0. Few non-normal numbers are known, and these cases are artificially manufactured, such as the number 0.10110111011110... It is easy to prove, for instance, that numbers not containing the digit 5 in their base-10 representation (a subset of non-normal numbers) are very rare - so rare that the probability for any real number to have this property, is actually 0: Among the numbers with *n* digits, a proportion *p* = (9/10)^*n* do not contain the digit 5; and as *n* tends to infinity, *p* tends to zero. But this raises interesting questions (**paradoxes**):

- Since a uniform distribution for the digits, is just one example of distribution among many possible distributions, one would think that normal numbers are incredibly rare, as they would be far outnumbered by numbers having any kind of digit distribution, other than uniform. However the converse is true.
- Consider the set S' of numbers in [0, 1] with all digits being duplicated, such as 0.22774499990033..., and let S be the interval [0,1]. There is a one-to-one mapping (bijection) between S and S'. Yet the Lebesgue measure of S is equal to 1. How come can the Lebesgue measure of S' be zero? (S' is a subset of non-normal numbers, for instance no number in S' contain 123 anywhere in its decimal representation.)
- Thus it seems that pretty much all numbers are "random" in some sense, which is counter-intuitive.

How do you explain these paradoxes? Is it possible to have two sets S and S' with a bijection between them, have one of them with a non-zero Lebesgue measure, and the other either with a zero measure or un-measurable? Is it possible that the probability for any real number to belong to S' is zero, yet S' is allowed to have a strictly positive measure?

These are questions for people interested in measure theory and random numbers. Read more about this subject, here.

**DSC Resources**

- Services: Hire a Data Scientist | Search DSC | Classifieds | Find a Job
- Contributors: Post a Blog | Ask a Question
- Follow us: @DataScienceCtrl | @AnalyticBridge

Popular Articles

- Difference between Machine Learning, Data Science, AI, Deep Learnin...
- What is Data Science? 24 Fundamental Articles Answering This Question
- Hitchhiker's Guide to Data Science, Machine Learning, R, Python
- Advanced Machine Learning with Basic Excel

a proportionIf a number between 0 and 1, with n digits, this feature, and its the decimal representation is (a1)...(an) for the first n digits, then (a1)

- What is cloud management? Everything you need to know
- Best practices for defining a cloud monitoring strategy
- cloud infrastructure
- Cloud automation
- Enabling collaboration a rising analytics trend
- Can tech self-correct the skills gap by upskilling employees?
- recurrent neural networks
- 5 benefits and challenges of IT/OT convergence
- Flexify.IO CTO says multi-cloud migration within easy reach
- Infor spins off EAM business to Hexagon but retains a stake

Posted 13 July 2021

© 2021 TechTarget, Inc. Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions

## You need to be a member of Data Science Central to add comments!

Join Data Science Central