Subscribe to DSC Newsletter
Vincent Granville
  • Male
  • Issaquah, WA
  • United States
Share on Facebook
Share

Vincent Granville's Friends

  • Olga Collins
  • Rob Conn
  • Bhaskara Reddy B
  • Kimberly Stewart
  • Ramesh Kumar Singh
  • Mahsa Hassankashi
  • Travis Meeks
  • José Bautista
  • Ken Craig, CIR
  • matt corkum
  • Keith Maclaren
  • George F. Hart
  • Ken Mooney
  • Charles Fox
  • Antoine Bruyns

Vincent Granville's Discussions

Will GDPR kill business in Ireland?

Started 21 hours ago 0 Replies

Companies such as Facebook and Apple use Ireland as a tax shelter, with great benefits for them. As billion of dollars in lawsuits could result from GDPR, will Ireland still be able to attract big…Continue

Can AI Robots Beat Professional Stock Traders?

Started on Monday 0 Replies

I am not talking here about high frequency trading, which has been done by automated algorithms long ago, and that humans can not beat (if done well!) But more like, if you allow a robot or a human…Continue

Which Data Science Techniques Should I Introduce to my Team Members?

Started this discussion. Last reply by Yann MAINVIS May 17. 1 Reply

This is an interesting question asked by a director managing a team of scientists, who wants to keep his team updated on data science techniques. Below is the email exchange in question:Hey Vincent.…Continue

GDPR and Internet Security: are they Incompatible?

Started this discussion. Last reply by Charlie Bickerton on Tuesday. 1 Reply

In order to detect intrusion, bots, spammers, scammers, and fight Internet crime in general, you need to store and monitor a number of metrics for a certain amount of time, at the individual level…Continue

 

Vincent Granville's Page

Profile Information

Short Bio
Data science pioneer, founder, author, CEO, investor, with broad spectrum of domain expertise, technical knowledge, and proven success in bringing measurable added value to companies ranging from startups to fortune 100, across multiple industries (finance, Internet, media, IT, security), domains (data science, operations research, machine learning, computer science, business intelligence, statistics, applied mathematics, growth hacking, IoT) and roles (data scientist, founder, CFO, CEO, HR, product development, marketing, media buyer, operations, management consulting).

Vincent developed and deployed new techniques such as hidden decision trees (for scoring and fraud detection), automated tagging, indexing and clustering of large document repositories, black-box, scalable, simple, noise-resistant regression known as the Jackknife Regression (fit for black-box, real-time or automated data processing), model-free confidence intervals, bucketisation, combinatorial feature selection algorithms, detecting causation not correlations, automated exploratory data analysis with data dictionaries, data videos as a visualization tool, automated data science, and generally speaking, the invention of a set of consistent robust statistical / machine learning techniques that can be understood, implemented, interpreted, leveraged and fine-tuned by the non-expert. Vincent also invented many synthetic metrics (for instance, predictive power and L1 goodness-of-fit) that work better than old-fashioned stats, especially on badly-behaved sparse big data. Some of these techniques have been implemented in a Map-Reduce Hadoop-like environment. Some are concerned with identifying true signal in an ocean of noisy data.

Vincent is a former post-doctorate of Cambridge University and the National Institute of Statistical Sciences. He was among the finalists at the Wharton School Business Plan Competition and at the Belgian Mathematical Olympiads. Vincent has published 40 papers in statistical journals (including Journal of Number Theory, IEEE Pattern analysis and Machine Intelligence, Journal of the Royal Statistical Society, Series B), a Wiley book on data science, and is an invited speaker at international conferences. He also holds a few patents on scoring technology, and raised $6 MM in VC funding for his first startup. Vincent also created the first IoT platform to automate growth and content generation for digital publishers, using a system of API's for machine-to-machine communications, involving Hootsuite, Twitter, and Google Analytics.

Vincent's profile is accessible here and includes top publications, presentations, and work experience with Visa, Microsoft, eBay, NBC, Wells Fargo, and other organisations.

Follow me on Twitter at @AnalyticBridge.
My Web Site Or LinkedIn Profile
http://www.linkedin.com/in/vincentg
Professional Status
C-Level
Years of Experience:
15
Your Company:
Data Science Central, AnalyticBridge
Industry:
Internet
Your Job Title:
Executive Data Scientist, Co-Founder
How did you find out about DataScienceCentral?
Tim Matteson
Interests:
Networking, New venture, Recruiting, Other
What is your Favorite Data Mining or Analytical Website?
http://www.datasciencecentral.com
What Other Analytical Website do you Recommend?
http://www.analyticbridge.com

Bio

Data science pioneer, founder, author, CEO, investor, with broad spectrum of domain expertise, technical knowledge, and proven success in bringing measurable added value to companies ranging from startups to fortune 100, across multiple industries (finance, Internet, media, IT, security) and domains (data science, operations research, machine learning, computer science, business intelligence, statistics, applied mathematics, growth hacking, IoT).

Vincent developed and deployed new techniques such as hidden decision trees (for scoring and fraud detection), automated tagging, indexing and clustering of large document repositories, black-box, scalable, simple, noise-resistant regression known as the Jackknife Regression (fit for black-box, real-time or automated data processing), model-free confidence intervals, bucketisation, combinatorial feature selection algorithms, detecting causation not correlations, and generally speaking, the invention of a set of consistent robust statistical / machine learning techniques that can be understood, implemented, interpreted, leveraged and fine-tuned by the non-expert. Vincent also invented many synthetic metrics (for instance, predictive power and L1 goodness-of-fit) that work better than old-fashioned stats, especially on badly-behaved sparse big data. Some of these techniques have been implemented in a Map-Reduce Hadoop-like environment. Some are concerned with identifying true signal in an ocean of noisy data.

Vincent is a former post-doctorate of Cambridge University and the National Institute of Statistical Sciences. He was among the finalists at the Wharton School Business Plan Competition and at the Belgian Mathematical Olympiads. Vincent has published 40 papers in statistical journals and is an invited speaker at international conferences. Vincent also created the first IoT platform to automate growth and content generation for digital publishers, using a system of API's for machine-to-machine communications, involving Hootsuite, Twitter, and Google Analytics.

Vincent's profile is accessible at http://bit.ly/1jWEfMP and includes top publications, presentations, and work experience with Visa, Microsoft, eBay, NBC, Wells Fargo, and other organisations.

Latest Activity

George Joseph liked Vincent Granville's blog post Why Logistic Regression should be the last thing you learn when becoming a Data Scientist
30 minutes ago
Vincent Granville replied to Fabrice JOURDAN's discussion How to check/optimize cross validation with randomforest on imbalanced classes ?
"Your data set is a bit small. The classic solution is to over-sample under-represented classes. I've been doing it routinely but on data sets with 50+ million observations, where the class "fraud" (versus "non fraud")…"
2 hours ago
Vincent Granville commented on ajit jaokar's blog post Data Science for Internet of Things - The Big Picture
"To get a higher resolution of the picture in Jaap's comment, click on it."
4 hours ago
Vincent Granville posted a discussion

Will GDPR kill business in Ireland

Companies such as Facebook and Apple use Ireland as a tax shelter, with great benefits for them. As billion of dollars in lawsuits could result from GDPR, will Ireland still be able to attract big companies, or keep the ones that have elected to relocate there? What will be the impact on jobs? Was GDPR primarily designed to penalize Ireland?Also, we have seen recently large publishers (for instance, the Los Angeles Time) decide to block access to all European visitors. Will this get worse, or…See More
21 hours ago
ANISH XAVIER liked Vincent Granville's blog post The First Things you Should Learn as a Data Scientist - Not what you Think
yesterday
Hassine Saidane replied to Vincent Granville's discussion Update about our data Science Certification in the group Data Science Certification
"Hello Professor Granville, How can I get a copy of the certification document? Thanks and best regards."
yesterday
Vincent Granville posted a blog post

Weekly Digest, May 28

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.Featured Resources and Technical ContributionsApache Hadoop Admin Tips and Tricks …See More
yesterday
Tim Matteson liked Vincent Granville's discussion Which Data Science Techniques Should I Introduce to my Team Members?
yesterday
Narayan Prasai commented on Vincent Granville's group Data Science Certification
"Thanks for accepting me into this active group of talented people. I am looking forward to interacting, learning & contributing to the group. Best, Rayan"
yesterday
Mohamed Mokhtar liked Vincent Granville's blog post The First Things you Should Learn as a Data Scientist - Not what you Think
yesterday
alon begin liked Vincent Granville's blog post 27 Great Resources About Logistic Regression
Friday
Gustavo Mirapalheta liked Vincent Granville's blog post The First Things you Should Learn as a Data Scientist - Not what you Think
Friday
Vincent Granville posted blog posts
Thursday
Vincent Granville commented on Vincent Granville's blog post Why Logistic Regression should be the last thing you learn when becoming a Data Scientist
"Dear R Bohn, We do not delete comments and we like to have contrasted opinions. My main argument is that logistic regression should not be taught first. If you are working with a team of statisticians, and the analysis performed correctly, and the…"
Thursday
R Bohn commented on Vincent Granville's blog post Why Logistic Regression should be the last thing you learn when becoming a Data Scientist
"I am sorry to report that this article is nonsense.  It's not the conclusion - use it or don't use it, there are now many alternatives to logistic regression. (Which in the machine learning world is a "linear classifier."…"
Thursday
Emma Muhleman, CFA, CPA liked Vincent Granville's blog post My Data Science Book - Table of Contents
Thursday

Comment Wall (14 comments)

You need to be a member of Data Science Central to add comments!

Join Data Science Central

At 6:24am on October 01, 2017, Nitesh Choudhary gave Vincent Granville a gift
Gift
Your posts are very informative and I have learned a lot from them. Thanks for sharing!
At 1:37pm on June 23, 2016, Bill Bahl said…

Dr. Granville,

I enjoyed your white paper on Building Dashboards that Flow and could not agree more with minimalism. One thing that seems to be missing from the dashboard packages I've seen is control charts.  At least for the process owner, my personal opinion is a control chart should be the first chart.  If the process is not stable and predicable, statistical analysis seems futile.  Before I retired (two months ago) we started including these in the process owners' LEAN PIT boards.  We generated them in Minitab.  It only takes a few clicks once the data is paste into Minitab.  Bill Bahl

At 12:05pm on February 11, 2016, Dean Pangelinan said…

Dr. Granville,

Regarding the passerelle options for the Data Science certification program, does the notation of "IEEE Computer Science Society - Member" refer to Associate Membership in the IEEE Computer Society, or to full IEEE Membership with additional membership in the IEEE Computer Science Society?

Please advise, at your earliest convenience.

--  Dean Pangelinan

At 5:05pm on June 21, 2015, Sankara Kumaravel gave Vincent Granville a gift
Gift
Dear Dr.Vincent, Thanks for preserving such a nice professional web page for Data Analytics, this is really help for the novice like me.
At 5:28am on June 15, 2015, Lissy Able said…

Hi Vincent,

Can you suggest some points or links about serious data quality issue with the information pulled.

Thanks

Lissy

At 3:38pm on March 11, 2015, Donald Tynes said…

Vincent,

I recently was hired as a data scientist. As a new hire, leading the department of Business Intelligence, I am faced with self-posed questions such as, "What do I need to accomplish in the first 5 days?" And, "What should I accomplish in the first month?" And, of course, "How do I develop a long-term plan for transforming the business into a data-driven organization?" To make the problem of determining how I should focus my attention even more complicated, I have a single employee whom I want to groom to understand the algorithms that I am implementing. Also, I have a CEO who only agreed to hire for this position because the CIO, CFO, and COO encouraged him to do so, but he is highly skeptical of what data science can do for the organization; this complicates matters too because it puts on me a pressure to be dazzling right out-of-the-box. 

I have given these questions considerable thought. I am on day 3 of my new job. I have decided to orient myself on the business' data, query tools, and self-service tools, such as QlikView. I have so many ideas, I have difficulty in choosing a single direction in which I should run. I must note that I want to be significantly impactful while minimizing disruptions in the business' daily functions. To that end, I keep thinking, "run a clustering analysis! Discover the patterns and trends in the company's data to begin the model-building process."

What advice would you give a young data scientist on his 4th day on the job (as it is for me, tomorrow)? 

At 5:22am on December 3, 2014, Harvey Summers said…

I thought you might like this site: http://rpsychologist.com/d3/CI/ 

Interpreting Confidence Intervals

an interactive visualization

At 11:29pm on October 31, 2014, Philippe Van Impe said…

Being from Belgium, you are welcome to join our meetup group about data sciences http://www.meetup.com/Brussels-Data-Science-Community-Meetup/

At 12:21pm on September 25, 2014, Christian Block said…

Hello Vincent, 

I just found DataScienceCentral and wanted to say thank you for putting it together! I'm looking forward to reading through more of the content and checking out your book (which I have ordered).  

Best Regards,

Christian Block

At 6:19am on August 5, 2014, Nasir M. Uddin, Ph.D. said…

Hi Vincent:

This is a great platform to be informed with any latest updates in the field of data science - thank you so much for such a platform.

Regards, Nasir 

Vincent Granville's Videos

  • Add Videos
  • View All

Vincent Granville's Blog

Weekly Digest, May 28

Posted on May 26, 2018 at 8:00am 0 Comments

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

The First Things you Should Learn as a Data Scientist - Not what you Think

Posted on May 24, 2018 at 1:00pm 0 Comments

The list below is a (non-comprehensive) selection of what I believe should be taught first, in data science classes, based on 30 years of business experience. This is a follow up to my article Why logistic regression should be taught last.

I am not sure whether these topics below are even discussed in data camps or college…

Continue

Thursday News: Logistic Regression, AI, R, NLP, ML, Courses, Books

Posted on May 24, 2018 at 8:00am 0 Comments

Here is our selection of featured articles and resources posted since Monday:

Featured Resources

Continue

From Petabytes to Nanobits, with Application to Blockchain

Posted on May 21, 2018 at 8:00am 0 Comments

It is hard to imagine that some data element could contain less information than a bit (a digit equal to either 0 or 1.) Yet examples are abundant. Indeed, I am wondering if we should create a unit of information called microbit, or nanobit.

The first examples that come to my mind are some irrational numbers such as Pi: it's digits are widely believed to be indistinguishable from pure noise, thus carrying essentially no information. While there is not enough data storage in the…

Continue
 
 
 

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2018   Data Science Central™   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service