All Blog Posts (1,223)

Predictions - Effect of unique number of target classes on accuracy

When we perform classification-type machine learning, the target variable is a categorical (nominal) variable that has a set of unique values, or classes. It could be a simple two-class target variable like "approve application?" with classes (values) of "yes" or "no". Sometimes the classes indicate ranges like "Excellent", "Good", etc. for a target variable like satisfaction score. We might also convert continuous variables like test scores (1 - 100) into classes like grades (A, B, C…


Added by Kumaran Ponnambalam on October 30, 2014 at 7:00am — 1 Comment

Weekly Digest - November 3

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 



Added by Vincent Granville on October 29, 2014 at 4:00pm — No Comments

Announcement to Data Science Central subscribers

If you haven't checked out our newsletter recently, I invite you to do so. The next weekly digest will announce our upcoming Data Science 2.0. book, and a complimentary copy (eBook) will be offered to our subscribers later on.

To make sure that you benefit from these exclusive advantages, check whether you receive our messages:

The sender (the name we use in the "From" field) is usually Data Science Central, and all messages have our physical address…


Added by Vincent Granville on October 28, 2014 at 11:30am — No Comments

Linked Data meets Data Science

As a long-term member of the Linked Data community, which has evolved from W3C's Semantic Web, I have found the latest developments around Data Science more and more attractive because of their complementary perspectives on similar challenges. Both disciplines work on questions like these:

  • How to extract meaningful information from large amounts of data?
  • How to connect pieces of information to other pieces in…

Added by Andreas Blumauer on October 28, 2014 at 12:27am — No Comments

Data Science 2.0.

This is an announcement regarding my upcoming book: Data Science 2.0. The subtitle is Automation, survival kit, career resources.

Just like our first book, it will first be available as a…


Added by Vincent Granville on October 27, 2014 at 1:30pm — 2 Comments

Data science versus statistics, to solve problems: case study

In this article, I compare two approaches (with their advantages and drawbacks) to compute a simple metric: the number of unique visitors ("uniques") per year for a website. I use the words user and visitor interchangeably.

Source for picture: …


Added by Vincent Granville on October 27, 2014 at 9:30am — 6 Comments

The Richness and Reality of World Data

I’ve been thinking a lot about data, where it comes from, and what it looks like.  I can’t help it.  I’ve been a data geek for almost 15 years.  And I find data beautiful.  Not necessarily in its raw form, mind you. Then it’s just messy and more often than not a pain to deal with, especially when it gets really, really big.  But when smart, creative people start to clean it up and use it in different ways to find the hidden stories that make sense, it can help us learn things in ways that we…


Added by Anne Russell on October 27, 2014 at 6:30am — No Comments

My Data Science Apprenticeship Project


Any author would like to know if his/her article will be successful or not. Here is an attempt to deal with this task.

Data and tools

  1. We obtained the 5,000 most significant articles (Analytic Bridge and Data Science Central) from here (…

Added by Nikitinsky Nikita on October 26, 2014 at 10:30am — No Comments

Qualitative Engine for Organizational Simulations

Given the nature of the community, presumably many visitors already have a strong understanding of the nature of quantitative data. Perhaps more mysterious is the idea of qualitative data, especially since it can sometimes be expressed in quantitative terms. For instance, "stress" as an internal response to an externality differs from person to person; yet it would be possible to canvass a large number of people and express stress levels as an aggregate based on a perceptual gradient: minimal,…


Added by Don Philip Faithful on October 25, 2014 at 6:37am — No Comments

banned on Google

This happened tonight, shortly after Facebook took the same decision. Even itself is banned, see picture below. This happens only with Chrome, but not with other browsers such as IE or Firefox. The ban will probably be lifted in several hours.

This brings interesting…


Added by Vincent Granville on October 24, 2014 at 11:00pm — No Comments

How Zipfian Academy Graduate Alex Mentch became a Data Scientist at Facebook

Zipfian Academy has graduated more than 50 alumni, placing graduates into data science roles at Facebook, Twitter, Airbnb, Tesla, Uber, Square,…


Added by Molly Larkin on October 24, 2014 at 10:22am — 3 Comments

New batch of 38 selected resources and articles - October 24

Starred articles were potential candidates for our picture of the week published in our weekly digest. Enjoy our new selection of articles and resources (R, data science, Python, machine learning etc.) Comments are from Vincent Granville. For a full…


Added by Amy on October 24, 2014 at 9:30am — No Comments

Prescriptive versus Predictive Analytics - A Distinction without a Difference?

Summary:  Is the addition of “Prescriptive” analytics to our nomenclature really worthwhile or are we just confusing our customers?

I admit to being annoyed when this or that industry wag tries to coin a new term to describe some portion of the discipline we are already practicing.  Some of these folks I think are…


Added by William Vorhies on October 23, 2014 at 10:00am — 6 Comments

SMBs and the Cloud: Things to Consider Before Adopting Cloud Strategies

For a very long time, businesses had their documents filed in folders and stored in huge metal cabinets. But thanks to advances in technology, they were eventually coded and stored digitally. As we advance through the Age of Information, the traditional digital storage devices like the floppy, compact, and flash…


Added by Kyle Albert on October 23, 2014 at 5:51am — No Comments

What should a financial report look like?

It should be detailed, featuring appropriate and properly scaled business charts, organized in a meaningful way and commented:…


Added by Andrej Lapajne on October 23, 2014 at 4:00am — 1 Comment

Fake traffic un-detected by Google Analytics

Or to put it differently, when your metrics lie to you: how to find out, and what should you do? 

The purpose of this article is to make Google aware of the problem so that they can fix their Google Analytics reports (by filtering out the fake traffic). This scheme also impacts many companies computing website rankings. Tons of websites now have their traffic…


Added by Vincent Granville on October 22, 2014 at 6:30pm — 3 Comments

Weekly Digest - October 27

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 



Added by Vincent Granville on October 22, 2014 at 3:00pm — No Comments

50 Face Recognition APIs

Interesting listing published on Mashape. Only the top 12 are listed below. It would be nice to have a separate blog for voice recognition APIs. I've been thinking of using voice, rather than a passport or driving license, as a more secure ID. The voice has a texture unique to each individual.…


Added by Mirko Krivanek on October 21, 2014 at 1:30pm — No Comments

Meta-list of data set repositories for cool data science projects

Listings featuring free, great, real-life, big or small data sets are published regularly. Our Data Science Apprenticeship webpage lists various data sets that anyone can use, not just our candidates: some are real, some are simulated data sets to study the behavior of algorithms such as Jackknife regression. More are listed…


Added by Amy on October 21, 2014 at 11:00am — 1 Comment

Harvesting wild data: A pickers guide.

The original Old World genus names for fruit trees were all taken from women's names, reflecting the implicit importance of the relation between a fruit-bearing tree, its fruit, and the human condition. It is a simple and elegant description of the relation between human perception and the reality of living. Now we take liberties with fruit, as it is available from all over the world at any time of year. To look at a lovely ripe peach on a wonderful late summer day and to taste its wonderful essence overshadows the long rich story of…


Added by Sigmond Axel on October 21, 2014 at 10:05am — No Comments

© 2014   Data Science Central
