
Data Scientist Demographics: 2015 versus 2013 - How Things Changed (or Not)

Here we compare statistics for two well-known data science websites, 2015 versus 2013. The 2013 data can be found here. Below are the same stats for these two web properties, as of today. From a methodology point of view, comparing two (or more) websites across two time periods is much better than comparing a single website across two time periods, as it allows you to detect, assess, and eliminate noise and variance in your analysis.

Very few statistically significant changes are observed between 2013 and 2015:

  • One property (SDC) is losing US traffic (down to 39%, from 47%), while the other one is growing in the US.
  • One property is getting a bigger share of visitors with a graduate education in 2015, while the other one (SDC) stays flat.
  • One property sees significant growth in Asian visitors (in the US), while the other one (SDC) stays flat.
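
One way to gauge whether a shift such as the drop in US traffic share from 47% to 39% is more than noise is a two-proportion z-test. The sketch below is illustrative only: the sample sizes behind these percentages are not given in this article, so the 1,000 visitors per year used here is a hypothetical assumption, and the function name is made up for this example.

```python
import math

def two_proportion_z(p1, n1, p2, n2):
    """Return (z, two-sided p-value) for H0: the two proportions are equal."""
    # Pooled proportion under the null hypothesis of no change
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    # Standard error of the difference between the two sample proportions
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided p-value from the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# US traffic share: 47% (2013) versus 39% (2015), as quoted above.
# ASSUMPTION: 1,000 sampled visitors per year (hypothetical, not from Quantcast).
z, p = two_proportion_z(0.47, 1000, 0.39, 1000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

The conclusion depends heavily on the assumed sample sizes: with much smaller samples, the same 8-point difference may no longer be significant.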

SDC stands for Smart Data Collective, while the other property is a DSC (Data Science Central) channel. SDC is the second chart in both articles.

The data comes from Quantcast.com and is subject to error. It compares two of the very few large data science properties that have been on Quantcast for a long time. In my opinion, the most significant insight is that the gender imbalance among data scientists in the US is still at an all-time high in 2015. The same can be said about minorities. How can we change this? Even more worrisome is the fact that the few female data scientists, even today, hold on average less managerial job titles.

Finally, you can check the same stats on Comscore.com, Compete.com, or Alexa.com. Surprisingly, the stats from Alexa.com are completely wrong about non-US traffic: the Quantcast.com and Google Analytics numbers (both based on tracking code) are totally different from Alexa.com's (not based on tracking code). So Amazon, please: since you own Alexa.com and have tons of great data scientists, fix this problem (hint: detect and eliminate fake traffic from your statistics, especially fake traffic from Indian IP addresses, which tends to have an abnormally high number of pages per session). According to Quantcast, SDC gets 11% of its traffic from India (see table below); according to Alexa, this proportion is 26% as of today. The latter number is wrong.
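
The hint about abnormally high pages per session could be turned into a simple filter. Here is a minimal sketch, assuming per-session page counts are available. The session data, the function name, and the cutoff of 3.5 on the modified z-score are all illustrative assumptions, not the method used by Alexa or Quantcast.

```python
import statistics

def flag_suspicious_sessions(pages_per_session, threshold=3.5):
    """Return indices of sessions whose page count is an extreme high outlier.

    Uses a modified z-score built from the median and the median absolute
    deviation (MAD), so the outliers themselves barely affect the baseline.
    """
    median = statistics.median(pages_per_session)
    mad = statistics.median([abs(x - median) for x in pages_per_session])
    if mad == 0:
        return []  # no spread to measure against, so flag nothing
    return [
        i for i, x in enumerate(pages_per_session)
        if 0.6745 * (x - median) / mad > threshold
    ]

# HYPOTHETICAL page counts for ten sessions; two of them are bot-like.
sessions = [2, 3, 1, 4, 2, 3, 2, 85, 3, 120]
print(flag_suspicious_sessions(sessions))  # → [7, 9]
```

Only high-side outliers are flagged, since the concern here is inflated page counts; a real filter would likely combine this with other signals such as IP ranges.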

© 2017 Data Science Central