The first one is about the difference between Data Science, Data Analysis, Big Data, Data Analytics, and Data Mining:
The source for this one is, according to a tweet, onthe.io. I could not find the article in question, though this website is very interesting, but anyway, I love the above picture,…Continue
Of course each data scientist is different, so please take this criticism with a grain of salt. By a long stretch, they don't apply to all data scientists.
Interesting article by Regina Nuzzo, posted in Nature.com. Indeed, it's not just p-values that are being questioned, but even the Fisher-Neyman-Pearson (FNP) paradigm and the concept of maximum likelihood estimates (MLE).
Here's an extract published on the American Statistical…Continue
Added by L.V. on September 24, 2015 at 12:00pm — No Comments
Added by L.V. on September 20, 2015 at 1:30pm — No Comments
Very interesting compilation published here, with a strong machine learning flavor (maybe machine learning book authors - usually academics - are more prone to making their books available for free). Many are O'Reilly books freely available. Here we display those most relevant to data science. I haven't checked all the sources, but they seem legit. If you find some issue, let us know in the…Continue
Many products or published articles based on data science are heavily regulated, and illegal to perform or publish or sell without a special license, especially in US. You may be doing research and development on a topic considered as classified by the US government. Here a few examples:
Here's one of the main differences between data engineering and data science: ETL (Extract / Load / Transform) is for data engineers, or sometimes data architects or DBA's.
DAD (Discover / Access / Distill) is for data scientists. Sometimes data engineers do DAD, sometimes data scientists do ETL, but it's rather rare, and when they do it, it's purely internal…Continue
Added by L.V. on September 6, 2015 at 3:30pm — No Comments
Here we compare statistics about two well known top data science websites, 2015 vs. 2013. The 2013 data can be found here. Below are the same stats for these two web properties, as of today. From a methodology point of view, comparing two (or more) websites on two different time periods is much better than comparing just one website on…Continue
Added by L.V. on September 5, 2015 at 3:30pm — No Comments
Here's a selection from Udacity's website. Initially, I intended to post questions from Google or Microsoft hiring managers and recruiters, but you can find these questions by doing a Google search, or…Continue
Added by L.V. on September 5, 2015 at 12:00pm — No Comments
And for software engineers or data analysts as well, in random order: