People often ask “What technology & tool skills do I need to develop to be a data scientist?”. We decided to go straight to the source of job descriptions & check what requirements are being asked for while people hire data scientists. We analyzed about 1660 job postings which had “Data Scientists” in the job title and decided to further search for specific technology & tools skills that are required in those job descriptions.
Our first analysis involved understanding what programming and scripting languages were required in these Data Scientist job postings. Unsurprisingly, the number one listed programming language was Python with 1014 job postings because of its inherent friendliness to data analysis & supporting libraries such as NumPy, SciPy & Pandas. The second most popular language was Java, followed by C++, Perl, Ruby & C#.
We then performed a similar analysis on statistical tools required for these jobs. The tool most required in these job postings was R (1077 Job postings), followed by SAS, Matlab, SPSS, Stata & Minitab.
Here are some job Description for Data Scientists to give you a flavor of the skills required: