Alan Turing was the first to present the idea of simulating machine thinking. It's been more than 60 years since Turing's groundbreaking paper on the imitation game, "Computing Machinery and Intelligence," came out. The world has changed rapidly since then.
Today's machines have become very powerful. They can actually think, which supports the idea Turing presented in the 1950s. However, machine thinking may be different. Turing argued that just because the thinking can be…
We’ve created a Domino project with starter code in R and Python for participating in the Data Science Bowl.
Get a jump start in the competition with our starter project by training your models on massive hardware and running multiple experiments in parallel while keeping track…
Added by Anna Anisin on January 13, 2015 at 3:00pm — No Comments
We all know that calculating error bounds on metrics derived from very large data sets has been problematic for a number of reasons. In more traditional statistics, one can put a confidence interval or error bound on most metrics (e.g., the mean), parameters (e.g., the slope in a regression), or classifications (e.g., a confusion matrix and the Kappa statistic).
For many machine learning applications, an error bound could be very important.…
Added by Anna Anisin on December 14, 2014 at 3:33pm — No Comments
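As a small illustration of the "confidence interval on a mean" idea mentioned in the post above, here is a minimal sketch using only the Python standard library. It is not taken from the post: the function name `mean_confidence_interval`, the sample values, and the choice of a normal approximation with z = 1.96 (roughly a 95% interval) are all assumptions made for the example.

```python
import math
import statistics


def mean_confidence_interval(values, z=1.96):
    """Return (mean, lower, upper) for the sample mean.

    Assumption: a normal approximation is acceptable, so the margin is
    z times the standard error; z=1.96 corresponds to roughly 95%.
    """
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = statistics.fmean(values)
    # Standard error of the mean: sample standard deviation / sqrt(n)
    std_err = statistics.stdev(values) / math.sqrt(n)
    margin = z * std_err
    return mean, mean - margin, mean + margin


# Hypothetical sample, used only to exercise the function
sample = [2.1, 2.5, 1.9, 2.4, 2.2, 2.8, 2.0, 2.3]
mean, low, high = mean_confidence_interval(sample)
print(f"mean={mean:.3f}, approx. 95% CI=({low:.3f}, {high:.3f})")
```

For small samples, a t-distribution multiplier would usually be preferred over the fixed z value used here.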
When you use Twitter, how do you know when you are being presented with something credible instead of something totally bogus? The answer is, unless you spend a lot of time researching each tweet, you probably don’t. However, one thing is certain: we rely on what we read on Twitter to be true.
Twitter is one of the fastest and most effective ways we disseminate news across our world. If this…
Added by Renette Youssef on December 8, 2014 at 4:00pm — No Comments
This post is excerpted from DataScience Hacks by the author himself.
Apache Spark is another Apache-licensed top-level project that can perform large-scale data processing much faster than Hadoop (I am referring to MR1.0 here). This is possible thanks to the Resilient Distributed Dataset (RDD) concept behind this fast data processing. An RDD is basically a collection of objects,…
If I want to build a house, wouldn't it be wise to learn carpentry? Does the analogy hold for data-analytic multivariate models? Or is it simply enough to let a machine do it, with no knowledge by the machine operator of how to interpret the results from those modeling efforts? Or is it true, as one person has recently asserted, that he could replicate ALL statistical procedures and techniques using MapReduce, without knowing anything about statistics and probability, or the vast collection…
Data science might be one of the hottest buzzwords of 2013. But is it only a marketing gimmick? I don’t think so. In my opinion, data science can be the best protocol for revealing what’s happening every day in the real world.
Data science incorporates mathematics, statistics, computer science and…
Added by Yuanjen Chen on February 6, 2014 at 6:30pm — No Comments
Smart organizations are using the power of data science and data produced by embedded sensors and machine devices to better measure performance, discover patterns, prevent problems, and improve…
High Performance Computing (HPC) plus data science allows public and private organizations to get…
Data Science - The Process of Capturing, Analyzing and Presenting Business Intelligence with Skill - DataReality
Business analytics comes in three (3) general flavors: descriptive, predictive and prescriptive. See: …
Added by Michael Walker on August 27, 2013 at 2:00pm — No Comments
There are various offerings out there if you want to use machine learning in your analysis nowadays. Nick Wilson spent his internship at BigML comparing three SaaS machine learning services (BigML, Prior Knowledge and Google Prediction API), with WEKA as a benchmark. He wrote a series of blog posts about his findings. In his final post he gives a summary of his work, with links to the individual blog posts for details. He let me re-blog his summary here.
Added by Jos Verwoerd on September 13, 2012 at 3:37am — No Comments
Interested in using your skills for a good cause, a great challenge and a $25,000 prize? How about predicting the future of ALS patients?
Added by Neta Zach on August 13, 2012 at 2:52pm — No Comments
Like many industries, the infrastructure/security/compliance function within large telecom companies is becoming more data-driven. Here are 3 powerful use cases that vividly bring out new possibilities in telecom big data.
Telecom use case 1: Contact centre text mining and telecom bandwidth throttling…
Added by derick.jose on July 17, 2012 at 2:11am — No Comments