(I will give you a hint. It’s in the name.)

**Scientists!**

This post is intended as a response to an interesting discussion on LinkedIn started by analytics manager at Ford, Michael Cavaretta. https://www.linkedin.com/posts/michael-cavaretta-ph-d-795a965_indus.... He asked what hires make the best data scientists.

In my real-world experience in the data science field, I find that physicists make the best data scientists. Why? Because physicists work at the intersection of math and reality. They have been trained to hypothesize ideas about how things work, and then to design experiments to test these hypothesizes, gather the data, interpret it, and communicate the results. This is mainly what we are asking data scientists to do. (I do not believe model building is the sole role of data scientists, but that’s for another post.) Physicists can also work independently and can learn new techniques on their own.

While I specifically call out physicists from my experience, all physical scientists, chemists, astrophysicists, and even (sometimes) biologists should have the same skill sets and expertise. Physical scientists perform the best when you have a lot of data and you do not know what to do with it. They are good with uncertainty and can propose hypotheses to move analyses forward when working in a vacuum.

This doesn’t mean you cannot hire individuals with different educational backgrounds. Here are some cases when it would be a good idea to hire:

- Social Scientists – When investigating human behavior. Social scientists have experience with different tool sets than physical scientists, but the process is the same and statistics is still statistics. I find individuals with these backgrounds excel at survey analysis, behavioral analysis, and marketing applications.
- Engineers – While engineers study a lot of physics and math and are quite competent in both fields, engineers are trained to apply discoveries in pure science disciplines in the real world. They work with certainty and well-defined systems, or very bad things could happen (Think of engineering a bridge… how do you want that engineer to think about it? Would you want them to try some things and see what works? No.). For this reason, engineers tend to struggle with the “unknown” involved in a lot of data science work. However, engineers working as data scientists can still excel, especially in industrial applications and operations research work, where conditions are controlled and systems well defined.
- Computer Scientists – Computer scientists work at the interface of the computer & the real world and are great for developing applications & data structures that surround data scientists. The also tend to be very good at optimizing data science algorithms and deploying them optimally. However, they tend to struggle with what is actually inside the data and what that data means. They make good data engineers, architects and developers of ML consuming applications, and ML model deployers and optimizers, but not necessarily good data analysts.
- Anyone with a questioning attitude and critical thinking skills – I am not trying to be a naysayer and tell people they cannot be a data scientist. I believe almost anyone can become a great data scientist. The fundamental characteristics are a questioning attitude, critical thinking skills, and a constant curiosity to learn.

What background do you think makes for the best data scientist? Let me know in the comments below.

© 2020 Data Science Central ® Powered by

Badges | Report an Issue | Privacy Policy | Terms of Service

**Most Popular Content on DSC**

To not miss this type of content in the future, subscribe to our newsletter.

- Book: Statistics -- New Foundations, Toolbox, and Machine Learning Recipes
- Book: Classification and Regression In a Weekend - With Python
- Book: Applied Stochastic Processes
- Long-range Correlations in Time Series: Modeling, Testing, Case Study
- How to Automatically Determine the Number of Clusters in your Data
- New Machine Learning Cheat Sheet | Old one
- Confidence Intervals Without Pain - With Resampling
- Advanced Machine Learning with Basic Excel
- New Perspectives on Statistical Distributions and Deep Learning
- Fascinating New Results in the Theory of Randomness
- Fast Combinatorial Feature Selection

**Other popular resources**

- Comprehensive Repository of Data Science and ML Resources
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- 100 Data Science Interview Questions and Answers
- Cheat Sheets | Curated Articles | Search | Jobs | Courses
- Post a Blog | Forum Questions | Books | Salaries | News

**Archives:** 2008-2014 |
2015-2016 |
2017-2019 |
Book 1 |
Book 2 |
More

**Most popular articles**

- Free Book and Resources for DSC Members
- New Perspectives on Statistical Distributions and Deep Learning
- Time series, Growth Modeling and Data Science Wizardy
- Statistical Concepts Explained in Simple English
- Machine Learning Concepts Explained in One Picture
- Comprehensive Repository of Data Science and ML Resources
- Advanced Machine Learning with Basic Excel
- Difference between ML, Data Science, AI, Deep Learning, and Statistics
- Selected Business Analytics, Data Science and ML articles
- How to Automatically Determine the Number of Clusters in your Data
- Fascinating New Results in the Theory of Randomness
- Hire a Data Scientist | Search DSC | Find a Job
- Post a Blog | Forum Questions

## You need to be a member of Data Science Central to add comments!

Join Data Science Central