Text analytics, sometimes alternately referred to as text data mining or text mining, refers to the process of deriving high-quality information from text. High-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning. Text mining usually involves the process of structuring the input…Continue
Added by Venky Rao on September 29, 2012 at 2:28pm — No Comments
AnalyticTalent, the official Job Board forDataScienceCentral.com and the Data Science community, reached a milestone last month with AnalyticTalent receiving more than 70,000 job searches!
This number is a great indicator of the reach and opportunity that AnalyticTalent represents. We…
Added by Vincent Granville on September 29, 2012 at 8:30am — No Comments
Weekly digest from Data Science Central and Analytic Bridge:Continue
Added by Vincent Granville on September 26, 2012 at 2:15pm — No Comments
Added by Michael Walker on September 26, 2012 at 10:00am — No Comments
With the launch last week of Datascape I thought it would be worth putting an MD’s perspective on the product – how we got here, what the philosophy is that lies behind it, and where we hope to go with it. For a more formal view of the academic and commercial background see our Immersive Data Visualisation white…Continue
Added by David Burden on September 24, 2012 at 5:58am — No Comments
Working and valorising Big Data is business-as-usual for companies that have built their business model on it. For companies that don’t compete on analytics, that is for whom analytics is not a core element of their strategy, it’s a huge challenge.
But Big Data is the talk of town nowadays. I think that a part of the growing management interest is due to two factors:
Added by Patrick Glenisson on September 23, 2012 at 11:26am — No Comments
Hey good lookin’. Yep, I’m talking to you, or at least the data scientists reading this. (The rest of you are incredibly good looking, intelligent, and clearly have good taste, as well.)
TheHarvard Business Review has…Continue
The goal of Data Analytics (big and small) is to get actionable insights resulting in smarter decisions and better business outcomes. How you architect business technologies and design data analytics processes to get valuable, actionable insights varies.
It is critical to…
Added by Michael Walker on September 19, 2012 at 11:57am — No Comments
In today's post, we dive into understanding Association Rules for Market Basket Analysis and discuss three numeric measures that should be considered before deciding to act on / make a business decision based on associations that have been observed in the data: (1) Support (2) Confidence and (3) Lift.
Association rules are typically written in the format:
Left hand side Implies Right hand…
There are various offerings out there if you want to use machine learning in your analysis nowadays. Nick WIlson spent his internship at BigML comparing three SaaS Machine Learning Services (BigML, Prior Knowledge and Google Prediction API), with WEKA as a benchmark. He wrote a series of blog posts about his findings. In his final post he gives a summary of his work, with links to the different blog posts for details. He let me re-blog his summary here.
Added by Jos Verwoerd on September 13, 2012 at 3:37am — No Comments
The goal is to design and build a data warehouse / business intelligence (BI) architecture that provides a flexible, multi-faceted analytical ecosystem for each unique organization.
A traditional BI architecture has analytical processing first pass through…Continue
Added by Michael Walker on September 12, 2012 at 11:53am — No Comments
Lift and Gain Charts are a useful way of visualizing how good a predictive model is. In SPSS, a typical gain chart appears as follows:
In today's post, we will attempt to understand the logic behind generating a gain chart and then discuss how gain and lift charts are interpreted.
To do this,…
Data scientists are the new astronauts. Everyone wants to become one. And it is not difficult to understand the reason for this.
In this age of “Big data”, more and more businesses are relying on people who can make sense of the vast amounts of information generated around us – people who can use sophisticated tools and complex-sounding statistical techniques to derive insights from larger and larger mounds of data.
Businesses have started to understand the power of data. They…Continue
Added by Gaurav Vohra on September 10, 2012 at 11:29pm — No Comments
This is about how to boost your analytic career and/or revenue by leveraging our professional network to the fullest extent.
We invite you to post blogs, or participate in forums (including answering questions asked by peers) on DataScienceCentral and…Continue
This Saturday, I've noticed that Facebook now displays a few new boxes on everyone's profile page (not just me). The box that worries me most is the one that shows all the places where you've traveled and where you've lived, including your current location.
To compound the problem, the box in question clearly…Continue
These are the articles that I enjoyed reading this week:
Added by Vincent Granville on September 8, 2012 at 8:30pm — No Comments
Copyright © SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Used with permission.
Companies are looking increasingly to take advantage of Big Data, especially textual information, those generated via user tools by web or desktop applications. The analysts specialized in this subject believe that 70% of information of interest to business are nestled in word documents, excel, email, etc. These data are not predefined in a model and cannot be perfectly stored in relational tables. They occur most often in the very free form, but contain dates, numbers, key words,…Continue
Added by Michel Bruley on September 2, 2012 at 10:57pm — No Comments
Do you agree with this? I don't, I think this Forbes article is using a provocative title to get you to read it. While assembler programmers in the seventies were eventually replaced by compilers and programming language interpreters, I believe that real statisticians and data scientists can't fully be replaced by machines or software. When they are,…Continue