
Vincent Granville's Blog – August 2014 Archive (15)

Frozen versus liquid analytics - or the competitive advantage of data science

In this article, I further emphasize the difference between data scientists and other analytic practitioners. I wrote last week that statisticians are to data scientists what astronomers are to physicists: there's some overlap, but less than most people think. Here, I elaborate on this theme.

Other disciplines such as data mining and machine learning…


Added by Vincent Granville on August 29, 2014 at 8:30am — No Comments

33 unusual problems that can be solved with data science

Here is a non-exhaustive list of curious problems that could greatly benefit from data analysis. If you think you can't get a job as a data scientist (because you only apply to jobs at Facebook, LinkedIn, Twitter or Apple), here's a way to find or create new jobs, broaden your horizons, and make Earth a better world not just for human beings, but for all living creatures. Even beyond Earth indeed. Help us grow this list of 33 problems to 100+.

The actual number is higher than 33, as…


Added by Vincent Granville on August 28, 2014 at 9:00pm — 7 Comments

Weekly Digest - September 1

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Sponsored Announcements…


Added by Vincent Granville on August 27, 2014 at 2:30pm — No Comments

Synthetic criterion to choose the right variables for your predictive algorithm

The success of any big data or data science initiative is determined by the kind of data that you collect, and how you analyze it. In this article, we describe a simple criterion to select great metrics out of dozens, hundreds or even millions of potential predictors, sometimes called features or rules by machine learning professionals, or independent variables by statisticians.…


Added by Vincent Granville on August 26, 2014 at 3:30pm — No Comments

Should we all have two ID's (one for backup purposes)?

With ID theft occurring at high levels every day (if you believe the media, you'd think 50 million people have experienced critical ID theft in the US alone over the last two years), and given how hard it is to recover from this crime, one wonders: should we all have multiple ID's?

If your answer is yes, then the next big question is: how do you implement this? You'd think that if your primary ID has been compromised, chances are that your secondary ID has been compromised as well, unless these…


Added by Vincent Granville on August 25, 2014 at 4:48pm — No Comments

The most expensive data science textbook

This rudimentary statistics textbook, entitled Statistics: The Art and Science of Learning from Data (3rd Edition), sells on Amazon for $157.79. I am not sure whether everyone sees the same price as I do (maybe prices are user-customized) or whether the price changes over time, but it seems stable. Below is a screenshot.

Surprisingly, this book is meant for first-year…


Added by Vincent Granville on August 21, 2014 at 7:30pm — 7 Comments

Why Zipf's law explains so many big data and physics phenomena

Zipf's law states that in many settings (that we are going to explore), the volume or size of entities is inversely proportional to a power s (s > 0) of their ranking. This has important implications in predictive modeling, discussed below. The processes that create this type of dynamic are not well understood. It is the purpose of this article to explain the underlying mechanics. The traditional example for the Zipf distribution is the distribution of Internet…


Added by Vincent Granville on August 21, 2014 at 3:00pm — 12 Comments
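
As a rough illustration of the statement in the Zipf's law entry above (size inversely proportional to a power s of the rank), here is a minimal Python sketch. The function name zipf_sizes, its parameters, and the normalization to a fixed total are assumptions made for illustration only; they are not taken from the article.

```python
def zipf_sizes(n, s=1.0, total=1.0):
    """Return n sizes where the size at rank r is proportional to 1 / r**s.

    Hypothetical helper: n is the number of ranked entities, s > 0 is the
    exponent, and total is the (assumed) sum of all sizes.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if s <= 0:
        raise ValueError("s must be positive")
    # Unnormalized weights: rank 1 gets 1, rank 2 gets 1 / 2**s, and so on.
    weights = [1.0 / rank ** s for rank in range(1, n + 1)]
    norm = sum(weights)
    # Rescale so that the sizes add up to the requested total.
    return [total * w / norm for w in weights]


if __name__ == "__main__":
    sizes = zipf_sizes(5, s=1.0, total=100.0)
    for rank, size in enumerate(sizes, start=1):
        print(rank, round(size, 2))
```

Under these assumptions, the ratio between the sizes at rank 1 and rank r equals r**s, so with s = 1 the top entity is twice as large as the second and three times as large as the third.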

High Value Talent Analytics Projects: “Fishing” or Solving Business Problems?

Guest blog post by Pasha Roberts, Chief Scientist, Talent Analytics, Corp. | @pasharoberts

Over the years, our…


Added by Vincent Granville on August 20, 2014 at 5:30pm — No Comments

Machine Learning: Interview with Spencer Greenberg, CEO of Rebellion Research

Spencer Greenberg holds a B.S. Magna Cum Laude in Applied Mathematics & Computer Science from Columbia University, and a Ph.D. in Machine Learning from NYU. Prior to Rebellion Research, he was a software developer at Neuberger Berman, LLC and an engineer at The Investigative Project for Terrorism. Spencer has been interviewed on CNBC, Bloomberg News, Canada’s BNN, and in the Wall Street Journal. He has also lectured at Columbia…


Added by Vincent Granville on August 20, 2014 at 5:00pm — No Comments

Weekly Digest - August 25

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Sponsored Announcements

  • Predictive Analytics World for Government shares insights, case studies and advice on how predictive analytics can help government agencies increase transparency, measure…

Added by Vincent Granville on August 20, 2014 at 9:30am — No Comments

Weekly Digest - August 18

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Sponsored Announcement

AnyLogic Modeling and Simulation software is utilized by globally recognized organizations to ensure better, more profitable decision making. Annually, these organizations gather to…


Added by Vincent Granville on August 12, 2014 at 7:00pm — No Comments

Black-box Confidence Intervals: Excel and Perl Implementation

Confidence interval is abbreviated as CI. In this new article (part of our series on robust techniques for automated data science) we describe an implementation both in Excel and Perl, and discuss our popular model-free confidence interval technique introduced in our original Analyticbridge article, as part of our (open source) intellectual property…


Added by Vincent Granville on August 8, 2014 at 12:00pm — 22 Comments

Weekly Digest - August 11

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Sponsored Announcement

Predictive…


Added by Vincent Granville on August 6, 2014 at 10:30am — No Comments

Announcement for Data Science Central Subscribers

Moving forward, we plan to offer additional features that will only be accessible from our newsletter:

This includes, but is not limited to, the following:

  • Password-protected Excel spreadsheets and other protected documents. The password will be provided in our weekly digest. 
  • Discounts offered exclusively to our subscribers (they typically expire after some time), on conferences and training
  • Brand new articles and resources published for the first time…

Added by Vincent Granville on August 2, 2014 at 10:00am — No Comments

Machine Learning and Data Science Cheat Sheet

You can download the new machine learning cheat sheet here (PDF format, 14 pages).

Originally published in 2014 and viewed more than 200,000 times, this is the oldest data science cheat sheet - the mother of all the numerous cheat sheets that are so popular nowadays. I decided to update it in June 2019. While the first half, dealing with installing components…


Added by Vincent Granville on August 1, 2014 at 12:30pm — 35 Comments
