Learn how to utilize multi-core, high-memory machines to dramatically accelerate your computations in R and Python, without any complex or time-consuming setup.
Added by Anna Anisin on January 28, 2015 at 12:30pm — No Comments
We’ve created a Domino project with starter code in R and Python for participating in the Data Science Bowl.
Get a jump start in the competition with our starter project by training your models on massive hardware and running multiple experiments in parallel while keeping track of…Continue
Added by Anna Anisin on January 13, 2015 at 3:00pm — No Comments
We all know that calculating error bounds on metrics derived from very large data sets has been problematic for a number of reasons. In more traditional statistics one can put a confidence interval or error bound on most metrics (e.g., mean), parameters (e.g., slope in a regression), or classifications (e.g., confusion matrix and the Kappa statistic).
For many machine learning applications, an error bound could be very important.…Continue
Added by Anna Anisin on December 14, 2014 at 3:33pm — No Comments
It seems like more and more companies are very interested in either, improving or setting up their analytical capabilities. All these companies are quite attracted to Hadoop, Spark or other similar solutions, not necessarily because they solve real problems they’re facing, but because they are shiny, trendy pieces of technology.
Hadoop, Spark and others are…Continue
Added by Anna Anisin on October 28, 2014 at 2:30pm — No Comments