Vincent Granville's Blog (1,564)

Interview with Drew Rockwell, CEO of Lavastorm

1. Short Bio

I started my career in the communications industry, where I spent 20 years with a Tier 1 carrier in probably 15 different jobs across the entire organization: Marketing, Advertising, Product Management, Operations, Sales, General Management, Strategy and Business Development. I basically experienced a multi-billion-dollar business from many different functional areas, at increasingly responsible…


Added by Vincent Granville on February 9, 2012 at 2:32pm — No Comments

[Job] Data Scientist, Analytics - Twitter (San Francisco, CA)

About this Job

We're looking for highly motivated individuals to help us extract meaning from Twitter's massive dataset. As a Data Scientist for the Analytics team, you'll use statistical analysis and data mining techniques to help us better understand how users engage with Twitter, determine whether new and experimental features should be launched, and measure Twitter's success across the entire organization. You should be passionate about finding insights in data and using…


Added by Vincent Granville on February 9, 2012 at 8:20am — No Comments

DataScienceCentral.com Announces Content Partnership with Visual.ly to Expand Big Data Ecosystem

The online hub for big data practitioners also adds industry thought leader, Richard Snee, as advisor and guest editor

LOS ANGELES, Feb. 1, 2012 /PRNewswire via…


Added by Vincent Granville on February 1, 2012 at 8:50am — No Comments

Another idea to help Amazon increase ROI

When you do a book search on Amazon using keywords, the search result page shows 16 results. If you've bought one (or more) of these books, it still shows up on the search result page. It would be easy to demote a book that you already purchased (e.g. by showing it at the bottom of the page) and replace it with another book that you haven't purchased yet. Sounds to me like an obvious, simple optimization trick. Maybe Amazon's statisticians use very complex models that change…
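A minimal sketch of the trick described above, in Python. It assumes a ranked list of book identifiers and a set of the customer's past purchases; the function name `rerank_search_page` and its parameters are hypothetical, and this is not Amazon's actual logic.

```python
def rerank_search_page(ranked_books, purchased, page_size=16):
    """Fill the result page with books the customer has not bought yet.

    ranked_books: book ids in relevance order (hypothetical input).
    purchased:    set of book ids the customer already bought.
    Returns (fresh, demoted): the books to show in the main slots, and the
    already-purchased books that would otherwise have taken up a slot.
    """
    original_page = ranked_books[:page_size]
    # Keep relevance order, skip purchased books, and backfill from lower ranks.
    fresh = [b for b in ranked_books if b not in purchased][:page_size]
    # Purchased books from the original page can be shown at the bottom instead.
    demoted = [b for b in original_page if b in purchased]
    return fresh, demoted
```

With a page size of 3, a ranking of `["a", "b", "c", "d", "e"]` and `"b"` already purchased, the main slots become `["a", "c", "d"]` and `"b"` moves to the bottom.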


Added by Vincent Granville on January 25, 2012 at 2:50pm — No Comments

News from the Wharton Customer Analytics Initiative

Wharton Customer Analytics Initiative

Added by Vincent Granville on January 20, 2012 at 7:32am — No Comments

Missing Values Imputation with Bayesian Networks

A New Approach to Missing Values Processing with Bayesian Networks

Download the white paper here (39.5 MB).

With the abundance of “big data” in the field of analytics, and all the challenges today’s immense data volume is causing, it may not be particularly fashionable or pressing to discuss missing values. After all, who cares…


Added by Vincent Granville on January 12, 2012 at 1:46pm — No Comments

Sales forecasts: how to improve accuracy while simplifying models?

The solution is simple: leverage external data, and simplify your predictive model.

Back in 2000 I was working with GE's analytic team to improve sales forecasts for NBC Internet, a web portal owned by NBC. The sales / finance people were using a very basic formula to predict next month's sales, based mostly on the previous month's sales. With GE, we started to develop more…


Added by Vincent Granville on January 3, 2012 at 10:54am — No Comments

Social Networks: How to Measure User Engagement?

Here are a few interesting KPIs:

  • Average session duration
  • Proportion of active versus passive behavior (active = posting a blog, inviting a thread; passive = browsing, reading articles)
  • New members per week, with breakdown US vs. non-US
  • Average number of groups per member
  • Average number of friends per member
  • Number of page views per user (per member) per month
  • Time to last visit (average; per user; per…
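
As an illustration, three of the KPIs listed above can be computed from session logs. This is only a sketch: the record layout (keys `user`, `minutes`, `page_views`, `active_actions`, `passive_actions`) is a hypothetical assumption, and the records are assumed to cover a single month.

```python
from collections import defaultdict

def engagement_kpis(sessions):
    """Compute a few engagement KPIs from one month of session records.

    Each record is a dict with hypothetical keys:
    user, minutes, page_views, active_actions, passive_actions.
    """
    if not sessions:
        raise ValueError("at least one session is required")

    # Average session duration, in minutes.
    avg_session_minutes = sum(s["minutes"] for s in sessions) / len(sessions)

    # Proportion of active behavior (posting) versus all recorded actions.
    active = sum(s["active_actions"] for s in sessions)
    passive = sum(s["passive_actions"] for s in sessions)
    total_actions = active + passive
    active_share = active / total_actions if total_actions else 0.0

    # Page views per user over the period covered by the records.
    views_per_user = defaultdict(int)
    for s in sessions:
        views_per_user[s["user"]] += s["page_views"]
    avg_page_views_per_user = sum(views_per_user.values()) / len(views_per_user)

    return {
        "avg_session_minutes": avg_session_minutes,
        "active_share": active_share,
        "avg_page_views_per_user": avg_page_views_per_user,
    }
```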

Added by Vincent Granville on December 20, 2011 at 8:17am — No Comments

Only One-Third of Companies Making Effective Use of Data

HOPKINTON, Mass., Dec. 5, 2011 -- EMC Corporation today unveiled the findings of the largest-ever global survey of the data science community. Spanning the United States, the United Kingdom, France, Germany, India and China, the EMC Data Science Study reveals and quantifies a rampant scarcity across the globe for the prerequisite skills necessary for a company to capitalize on the opportunities found at the intersection of Big Data and data analytics. Only one-third of companies are…


Added by Vincent Granville on December 12, 2011 at 8:33am — No Comments

Healthcare fraud detection still uses cave-man data mining techniques

The Washington Education Association (WEA, in Washington State) is partnering with Aon Hewitt (Illinois), a verification company, to eliminate a specific type of health insurance fraud: teachers reporting non-qualifying people as dependents, such as an unemployed friend with no health insurance. The fraud is used by "nice" people (teachers) to provide health insurance to people who would otherwise have none, by reporting them as a spouse or children.

Interestingly, I saw the letter sent to…


Added by Vincent Granville on December 7, 2011 at 5:16pm — No Comments

Interview with Kaggle CEO Anthony Goldbloom

For those who haven't heard of Kaggle before, Kaggle is a team of people who provide the functionality and support to host data mining contests. Here is how it works: suppose that you are working for a telco and wish to implement a new churn prediction model. Rather than running this project in-house, you submit your data to Kaggle. What happens next is that, hopefully, many…


Added by Vincent Granville on December 5, 2011 at 10:41am — No Comments

Online advertising: a solution to optimize ad relevancy

When you see Google ads on Google search result pages or elsewhere, the ads that are displayed in front of your eyes (should) have been carefully selected in order to maximize the chance that you convert and generate ad revenue for Google. The same is true on Facebook, Yahoo, Bing, LinkedIn and on all ad networks.

If you think that you see irrelevant ads, either they are priced very cheaply, or Google's ad relevancy algorithm is not working well.

Ad scoring algorithms used to be very simple,…


Added by Vincent Granville on December 3, 2011 at 8:43am — No Comments

Why and how you should build a data dictionary for big data sets

One of the most valuable tools that I've used, when performing exploratory analysis, is building a data dictionary. It offers the following advantages:

  • Identify areas of sparsity and areas of concentration in high-dimensional data sets
  • Identify outliers and data glitches
  • Get a good sense of what the data contains, and where to spend time (or not) in further data mining

What is a data dictionary?

A data dictionary is a table…


Added by Vincent Granville on December 2, 2011 at 8:41am — No Comments

EMC Tries To Unify Big Data Analytics | InformationWeek

EMC Greenplum Modular Data Computing Appliance puts SQL and Hadoop in the same box, but is it a truly cohesive platform?


Added by Vincent Granville on December 2, 2011 at 8:40am — No Comments

Connecting with the Social Analytics Experts

Social Media Tips for Analytics Professionals 

From Text and Data Mining to Market Research and Social Media Consulting, few are more influential than today’s guests. In advance of the West Coast Text Analytics Summit (Nov. 10-11, San Jose), Text Analytics News caught up with four analytics leaders who are helping…


Added by Vincent Granville on December 2, 2011 at 8:39am — No Comments

How to detect a pattern? Problem and solution.

Check the three charts below: only one shows no pattern and is truly random. Which one?

Chart #1


Chart #2



Added by Vincent Granville on December 2, 2011 at 8:38am — No Comments

Statisticians Have Large Role to Play in Web Analytics | American Statistical Association

Read my full interview for AMSTAT at http://magazine.amstat.org/blog/2011/09/01/webanalytics/. You will also find my list of recommended books. Here is a copy of the interview, in case the original article (posted on AMSTAT News) disappears.

(Dr. Granville's Interview for AMSTAT)

Vincent Granville is chief scientist at a publicly traded company and the founder of AnalyticBridge. He has consulted on…


Added by Vincent Granville on December 2, 2011 at 8:37am — No Comments

Classification accuracy using different bags of words

In this post I show how the accuracy of the classifier is influenced by the bag of words. 

The test has been done on a naive classifier but it returns good information about the data set.


The above image is an example of the bag of words used…


Added by Vincent Granville on December 2, 2011 at 8:36am — No Comments

Lifetime value of an e-mail blast: much longer than you think

See below an example of an AnalyticBridge email campaign that was monitored over a period of about 600 days. It clearly shows that 20% of all clicks originate after day #3. Yet most advertisers and publishers ignore clicks occurring after day #3. Not only did 20% of all clicks occur after day #3, but the best clicks (in terms of conversions) occurred several weeks after the email blast. Also, note an organic spike occurring on day #23 in the chart below -…
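
The share of clicks arriving after a chosen cutoff day is straightforward to compute from a per-day click count. The sketch below assumes a hypothetical mapping from day number (days since the blast) to clicks; the function name and the sample numbers in the usage note are illustrative, not the campaign's actual data.

```python
def late_click_share(clicks_by_day, cutoff_day):
    """Return the fraction of all clicks that occurred after cutoff_day.

    clicks_by_day: dict mapping day number since the email blast to the
    number of clicks recorded on that day (hypothetical input format).
    """
    total = sum(clicks_by_day.values())
    if total == 0:
        return 0.0
    late = sum(n for day, n in clicks_by_day.items() if day > cutoff_day)
    return late / total
```

For example, with made-up counts `{1: 50, 2: 20, 3: 10, 10: 15, 23: 5}` and a cutoff of day 3, the function returns 0.2: a report that stops tracking at day 3 would miss 20% of the clicks.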


Added by Vincent Granville on December 2, 2011 at 8:35am — No Comments

IBM's vice president of Big Data Products explains the role of data scientists | ComputerWorld

What is a data scientist? A data scientist is someone who analyzes an organization's big data to discover actionable trends that lead to business results. Data scientists look at what questions business people need to ask to remain competitive. They work directly with C-level executives, advising them on how to drive maximum value from big data and integrate new information. In many ways, a …


Added by Vincent Granville on December 2, 2011 at 8:33am — No Comments
