Originally published by in 2013, it still is a goldmine for all machine learning professionals. The algorithms are broken down in several categories. Here we provide a high-level summary, a much longer and detailed version…Continue
These are quotes by a well known data scientist, posted on his Facebook account and elsewhere, over the last three years. It includes both quotes related to data science as well as on how to become successful and happy.
Better: write "No SQL…Continue
Added by L.V. on October 19, 2015 at 4:30pm — No Comments
Very interesting data compiled and analyzed by O'Reilly, using statistical models such as Lasso regression to predict salary based on different factors. It reminds me our own analysis based on simulated (but realistic) data, to assess whether having Python or R (or both) commands a bigger salary, and what is the extra boost provided by these skills,…Continue
Added by L.V. on October 12, 2015 at 12:00pm — No Comments
Very interesting collection of resources compiled by DistrictDataLabs, featuring books, online courses, articles across multiple categories: data science, probability and statistics, machine learning, R, Python, big data, DataViz, and NLP.
Here's my favorite section: books, courses and articles for ML (Machine Learning)…Continue
Added by L.V. on October 11, 2015 at 2:00pm — No Comments
Your best references to do your job or get started in data science.Continue
Originally posted on KDNuggets. We've added some blogs that were missing in the original list, and eliminated some that aren't worth mentioning, hoping to make this list less biased. For a much bigger list, click here.…Continue
Added by L.V. on October 10, 2015 at 3:30pm — No Comments
Click on each link below to read all articles posted on the DSC network by these great bloggers.These authors are listed in random order, with the top contributors (in terms of volume and quality) featured at the top. You are welcome to connect with them.
Please email us at [email protected] if we missed you, or feel free to add a comment in the comments section below. This is just the beginning of a man-made 'data science' list that we want to grow over time. All…Continue
We have published many articles on this subject, for instance:Continue
The first one is about the difference between Data Science, Data Analysis, Big Data, Data Analytics, and Data Mining:
The source for this one is, according to a tweet, onthe.io. I could not find the article in question, though this website is very interesting, but anyway, I love the above picture,…Continue
Of course each data scientist is different, so please take this criticism with a grain of salt. By a long stretch, they don't apply to all data scientists.
Interesting article by Regina Nuzzo, posted in Nature.com. Indeed, it's not just p-values that are being questioned, but even the Fisher-Neyman-Pearson (FNP) paradigm and the concept of maximum likelihood estimates (MLE).
Here's an extract published on the American Statistical…Continue
Added by L.V. on September 24, 2015 at 12:00pm — No Comments
Added by L.V. on September 20, 2015 at 1:30pm — No Comments
Very interesting compilation published here, with a strong machine learning flavor (maybe machine learning book authors - usually academics - are more prone to making their books available for free). Many are O'Reilly books freely available. Here we display those most relevant to data science. I haven't checked all the sources, but they seem legit. If you find some issue, let us know in the…Continue
Many products or published articles based on data science are heavily regulated, and illegal to perform or publish or sell without a special license, especially in US. You may be doing research and development on a topic considered as classified by the US government. Here a few examples:
Here's one of the main differences between data engineering and data science: ETL (Extract / Load / Transform) is for data engineers, or sometimes data architects or DBA's.
DAD (Discover / Access / Distill) is for data scientists. Sometimes data engineers do DAD, sometimes data scientists do ETL, but it's rather rare, and when they do it, it's purely internal…Continue
Added by L.V. on September 6, 2015 at 3:30pm — No Comments
Here we compare statistics about two well known top data science websites, 2015 vs. 2013. The 2013 data can be found here. Below are the same stats for these two web properties, as of today. From a methodology point of view, comparing two (or more) websites on two different time periods is much better than comparing just one website on…Continue
Added by L.V. on September 5, 2015 at 3:30pm — No Comments
Here's a selection from Udacity's website. Initially, I intended to post questions from Google or Microsoft hiring managers and recruiters, but you can find these questions by doing a Google search, or…Continue
Added by L.V. on September 5, 2015 at 12:00pm — No Comments
And for software engineers or data analysts as well, in random order:
Originally posted by Avinash Kaushik - the Google data science evangelist. The list below shows 8 of them.
#12: Almost all reporting is off custom reports.
#11: Close to zero aggregated analysis exists, everything's…
Added by L.V. on August 27, 2015 at 8:30pm — No Comments
These authors are leading bloggers on various niche data science and big data publisher websites. Very few of these bloggers are posting on more than one website.
The authors below are displayed in a totally random order.