A traditional business problem customized here to data science.
1. Identify the problem
Added by Vincent Granville on March 10, 2014 at 9:00pm — No Comments
In this article we are going to discuss Bayesian inference and statistics. In order to gain understanding of Bayesian inference we will use the experiment of coin tossing. Hence it is only appropriate to have a quick discussion about the power of…
To begin with, let me pay my tribute to the master of randomness and the most flippant of them all – the act of tossing a coin.…Continue
Added by Roopam Upadhyay on March 10, 2014 at 7:30am — No Comments
The Big Data Yawn
Over the past couple of months we have met with a number of oil and gas executives to demonstrate our Oil and Gas Solution built on Data-Tactics’ Big Data Engine (BDE). During these conversations it has become obvious that the very mention of "Big Data" produces an involuntary physiological response among business leaders - eye rolls and yawns. It appears that big data has reached the Gartner "trough of disillusionment". These executives have heard from a bewildering…Continue
Added by Sullexis LLC on March 10, 2014 at 6:00am — No Comments
In-memory database technology is fashionable in recent years as the price of RAM drops substantially and gigabyte chips become affordable. By taking advantage of the cost-performance value of RAM, leading edge database developers are boosting the performance of next-generation databases with in-memory technology. However, many developers who intend to adopt in-memory technology only think of speed in terms of RAM, and do not exploit the true power of in-memory technology.
The success rate of a start depends on the innovation and presentation of its products. Data analytics significantly improves these odds when used with proper business understanding. We have tried stimulating the same in this business case. This case walks through different level of analytics a business adopts to differentiate its product in market. Because of the complexity of the case we have broken it down to 3 part. Click on the link to view each part. We strongly recommend to view the…Continue
Added by Tavish Srivastava on March 9, 2014 at 8:39pm — No Comments
This is an area of data science that the public is less familiar with. This example involves small data, simulations, and 18 years old crowdsourcing.
It's an attempt to explain the cause of the TWA Flight 800 that exploded near New York on July 17, 1996. I raised the possibility that a potential cause for the Malaysia Airlines flight that went missing last week, was being hit by a missile (accidental or not). Likewise, many people still believe that TWA 800 was destroyed by a…Continue
Added by Vincent Granville on March 9, 2014 at 10:30am — No Comments
I have always had a great interest in how businesses organize in order to get things done. Here I raise some discussion points intended to stimulate debate.
Principle of Systemic Domains
Not that long ago, I was completing a graduate degree in “critical” disability studies. The critical part deserves to be in quotations since it is probably subject to interpretation and all sorts of misinterpretation. I am going to suggest that in critical…Continue
Added by Don Philip Faithful on March 8, 2014 at 8:15am — No Comments
Apache Sqoop is a connectivity tool to perform data transfer between Hadoop and traditional databases (RDBMS) which contains structured data. Using sqoop, one can import data to Hadoop Distributed File System from RDBMS likeOracle, Teradata, MySQL, etc… and also export the data from Hadoop to any RDBMS in form of CSV file or direct export to databases.
There is a possibility of writing mapreduce programs that would use jdbc connectors…Continue
Added by Pavan Kumar N on March 8, 2014 at 1:12am — No Comments
Here we discuss the secret metric mix used by air controllers to decide which flights get cancelled, as the administration has accepted to reveal how the "cancellator" algorithm works.
It is interesting that they call it "predictive modeling" (a branch of statistics) when actually it is "mathematical optimization" (a branch of operations research). They use Monte Carlo simulations to identify the best scenarios. Note that both predictive modeling and mathematical optimization (applied…Continue
Operations research (including Monte Carlo…Continue
Added by Mirko Krivanek on March 7, 2014 at 10:00am — No Comments
How do you know if an outlier is the result of a data glitch, or a real data point -- indeed maybe not an outlier. Difficult question to answer, but the chart below shows that in some cases, the outlier is not an error.
In this example, you could argue that we are not…Continue
Wils Corrigan, Ph.D., director of data science at Anametrix, recently attended the Predictive Analytics Innovation Summit in San Diego.…
Added by Ryan Montano on March 6, 2014 at 11:00am — No Comments
As we move to a more…Continue
Added by Kyle Albert on March 5, 2014 at 10:06pm — No Comments
Added by Vincent Granville on March 5, 2014 at 3:30pm — No Comments
I just came across this blog and thought it was an interesting point. i disagree. but its worth a discussion :)
While companies complain about lack of analytic talent, professionals complain about lack of jobs. Everyone wants to work for Facebook, LinkedIn, Google, Intel, Apple, Twitter or some hot start-up. It creates fierce competition getting a job interview, let alone a job. But companies that do not belong to this circle see very few candidates applying for their data scientist open positions; in addition, they are only hiring what I call technical developers (defined by a narrow set of technical…Continue
Case studies are the best way to learn analytics. Here is a case study which lays out how analytics plays a role in getting an edge to a business. This case study has been broken down into 3 parts which cover independent aspects. The first part ( …Continue
Added by Tavish Srivastava on March 2, 2014 at 10:05pm — No Comments
Recommender systems are among the most fun and profitable applications of data science in the big data world. Training data (corresponding to the historical search, browse, purchase, and customer feedback patterns of your customers) can be converted into golden opportunities for ROI (i.e., Return…Continue
Added by Kirk Borne on February 28, 2014 at 2:30pm — No Comments
Data Alchemy is delighted to bring you this guest post from Jim Sterne, an international consultant who focuses on measuring the value of the Web as a medium for creating and strengthening customer relationships. He has written eight books on using the Internet for marketing, is the founding president and current chairman of the Digital Analytics Association, produces the eMetrics Summit and sits on Anametrix’s Board of…Continue
Added by Ryan Montano on February 28, 2014 at 8:30am — No Comments