The emerging "Data Stack" or "Data Layer" is in full transition and can be viewed and defined many different ways. The ability to capture, analyze and learn from data generated at unprecedented scale, combined with…Continue
Added by Michael Walker on October 10, 2012 at 10:52am — No Comments
Added by Vincent Granville on October 9, 2012 at 12:30pm — No Comments
Why do people selling supposedly great ROI on the stock market MUST disclaim that "Past performance IS NOT indicative of future results", or else face litigation?
If I had to tell that statement to my clients, nobody would buy. Indeed, I tell the opposite (as in the subject line), because I'm confident about my predictions…Continue
Added by Vincent Granville on October 4, 2012 at 8:30am — No Comments
Analyzes of texts put lights in two main types of information “facts and opinions”. Most current treatment methods of textual information aim to extract and use factual information, this is the case for example of research we do on the web. Analysis of opinions is concerned about feelings and emotions expressed in the texts, it has grown much today because of the space taken from the web in our society, and the very large volume of daily comments expressed by consumers with the advent of the…Continue
Added by Vincent Granville on October 2, 2012 at 7:56am — No Comments
Analytics, big data, data science... All of them were posted in major news oultets (WSJ, MIT, etc.) in the last 10 days. Enjoy the reading!Continue
Added by Vincent Granville on October 1, 2012 at 1:30pm — No Comments
Text analytics, sometimes alternately referred to as text data mining or text mining, refers to the process of deriving high-quality information from text. High-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning. Text mining usually involves the process of structuring the input…Continue
Added by Venky Rao on September 29, 2012 at 2:28pm — No Comments
Weekly digest from Data Science Central and Analytic Bridge:Continue
Added by Vincent Granville on September 26, 2012 at 2:15pm — No Comments
Added by Michael Walker on September 26, 2012 at 10:00am — No Comments
Working and valorising Big Data is business-as-usual for companies that have built their business model on it. For companies that don’t compete on analytics, that is for whom analytics is not a core element of their strategy, it’s a huge challenge.
But Big Data is the talk of town nowadays. I think that a part of the growing management interest is due to two factors:
Added by Patrick Glenisson on September 23, 2012 at 11:26am — No Comments
With the launch last week of Datascape I thought it would be worth putting an MD’s perspective on the product – how we got here, what the philosophy is that lies behind it, and where we hope to go with it. For a more formal view of the academic and commercial background see our Immersive Data Visualisation white…Continue
Added by David Burden on September 24, 2012 at 5:58am — No Comments
Hey good lookin’. Yep, I’m talking to you, or at least the data scientists reading this. (The rest of you are incredibly good looking, intelligent, and clearly have good taste, as well.)
TheHarvard Business Review has…Continue
The goal of Data Analytics (big and small) is to get actionable insights resulting in smarter decisions and better business outcomes. How you architect business technologies and design data analytics processes to get valuable, actionable insights varies.
It is critical to design and build a data warehouse / business intelligence…
Added by Michael Walker on September 19, 2012 at 11:57am — No Comments
In today's post, we dive into understanding Association Rules for Market Basket Analysis and discuss three numeric measures that should be considered before deciding to act on / make a business decision based on associations that have been observed in the data: (1) Support (2) Confidence and (3) Lift.
Association rules are typically written in the format:
Left hand side Implies Right hand…
There are various offerings out there if you want to use machine learning in your analysis nowadays. Nick WIlson spent his internship at BigML comparing three SaaS Machine Learning Services (BigML, Prior Knowledge and Google Prediction API), with WEKA as a benchmark. He wrote a series of blog posts about his findings. In his final post he gives a summary of his work, with links to the different blog posts for details. He let me re-blog his summary here.
Added by Jos Verwoerd on September 13, 2012 at 3:37am — No Comments
The goal is to design and build a data warehouse / business intelligence (BI) architecture that provides a flexible, multi-faceted analytical ecosystem for each unique organization.
A traditional BI architecture has analytical processing first pass through a data warehouse.
In the new, modern BI architecture, data reaches users…Continue
Added by Michael Walker on September 12, 2012 at 11:53am — No Comments
Lift and Gain Charts are a useful way of visualizing how good a predictive model is. In SPSS, a typical gain chart appears as follows:
In today's post, we will attempt to understand the logic behind generating a gain chart and then discuss how gain and lift charts are interpreted.
To do this,…