Vincent Granville's Blog – October 2014 Archive (17)

Do we really have a data obesity problem?

The constant search for something bigger might be part of the American culture. However, big data is often critical: without real-time credit card fraud detection - a big data application - no store would accept credit cards.

A few people have recently questioned the value of big data, predicting that it is going to get smaller in the future. While most of these would-be oracles are traditional statisticians working on small data and worried about their…

Added by Vincent Granville on October 31, 2014 at 9:00pm — No Comments

Data Science Apprenticeship: Announcing our First Graduate

Nikitinsky Nikita is the first to complete our Data Science Apprenticeship (DSA), using NLP, web crawling, statistical techniques and Python to cluster our content into top categories: click here to check his project.

To be fair, our intern …

Added by Vincent Granville on October 31, 2014 at 10:30am — No Comments

Weekly Digest - November 3

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on October 29, 2014 at 4:00pm — No Comments

Announcement to Data Science Central subscribers

If you haven't checked out our newsletter recently, I invite you to do so. The next weekly digest will announce our upcoming Data Science 2.0. book, and a complimentary copy (eBook) will be offered to our subscribers later on.

To make sure that you benefit from these exclusive advantages, check whether you are receiving our messages:

The sender (the name we use in the "From" field) is usually Data Science Central, and all messages have our physical address…

Added by Vincent Granville on October 28, 2014 at 11:30am — No Comments

Data Science 2.0.

This is an announcement regarding my upcoming book: Data Science 2.0. The subtitle is Automation, survival kit, career resources.

Just like our first book, it will first be available as a free PDF document to members of our community. It will…

Added by Vincent Granville on October 27, 2014 at 1:30pm — 16 Comments

Data science versus statistics, to solve problems: case study

In this article, I compare two approaches (with their advantages and drawbacks) to compute a simple metric: the number of unique visitors ("uniques") per year for a website. I use the words user and visitor interchangeably.

Added by Vincent Granville on October 27, 2014 at 9:30am — 7 Comments

Bit.ly banned on Google

This happened tonight, shortly after Facebook made the same decision. Even Bit.ly itself is banned (see picture below). This happens only with Chrome, not with other browsers such as IE or Firefox. The ban will probably be lifted in several hours.

This raises interesting questions:

  • Bit.ly is a widely used URL shortener and…

Added by Vincent Granville on October 24, 2014 at 11:00pm — No Comments

Fake traffic undetected by Google Analytics

Or to put it differently, when your metrics lie to you: how to find out, and what should you do? 

The purpose of this article is to make Google aware of the problem, so that they can fix their Google Analytics reports (by filtering out the fake traffic). This scheme also impacts many companies computing website rankings. Tons of websites now have their traffic…

Added by Vincent Granville on October 22, 2014 at 6:30pm — 4 Comments

Weekly Digest - October 27

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on October 22, 2014 at 3:00pm — No Comments

Weekly Digest - October 20

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on October 15, 2014 at 4:30pm — No Comments

Is data science a new paradigm, or recycled material?

Data science is the result of a new paradigm taking place in IT. The question was raised recently, and here I explain how and why data science is part of this new paradigm, and not recycled material.

New arsenal of techniques and metrics

Many data science techniques are very different from, if not the opposite of, old techniques that were…

Added by Vincent Granville on October 14, 2014 at 8:30pm — 4 Comments

2-D random walks: simulation, video with R source code, curious facts

We have produced a 90-second video (click on this link to view the video) showing a 'random walk' (a particular case of a Markov process) evolving over 400,000 steps. Figure 1 below shows the last frame (out of 2,000 frames, each one with 200 new steps).…

Added by Vincent Granville on October 12, 2014 at 10:00am — 2 Comments
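
The random walk described in the entry above (400,000 steps, split into 2,000 frames of 200 new steps each) can be sketched in a few lines. This is only a minimal illustration in Python, not the R source code from the article. It assumes a simple lattice walk where each step moves one unit north, south, east or west with equal probability; the actual step distribution used in the video is not stated in the teaser. The function name `simulate_walk` and its parameters are hypothetical.

```python
import random


def simulate_walk(n_frames=2000, steps_per_frame=200, seed=0):
    """Simulate a 2-D lattice random walk.

    Returns the (x, y) position reached at the end of each frame.
    Assumption: unit steps in the four axis directions, equally likely.
    """
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    moves = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    x = y = 0
    frame_ends = []
    for _ in range(n_frames):
        for _ in range(steps_per_frame):
            dx, dy = rng.choice(moves)
            x += dx
            y += dy
        frame_ends.append((x, y))  # one snapshot per frame
    return frame_ends


# 2,000 frames x 200 steps = 400,000 steps in total
frames = simulate_walk()
print(len(frames), "frames; final position:", frames[-1])
```

Each entry of `frames` could be handed to a plotting library to draw one frame of a video. Because the next position depends only on the current one, the walk is a Markov process, as noted in the teaser.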

Popular predictive apps and APIs

We want to create a repository of apps / APIs that provide forecasts and predictions for a broad class of questions that concern all human beings. We would also like to create our own predictive API, so if you are interested in this project, contact us at [email protected]: we might have a gig for you! Here's an example of what we did in the past. We also designed apps for stock market predictions (see…

Added by Vincent Granville on October 10, 2014 at 12:30pm — No Comments

Weekly Digest - October 13

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on October 8, 2014 at 3:00pm — No Comments

200 Top Bloggers on Data Science Central

Lists of top data science bloggers, authors, websites, or Twitter profiles worth following are now a popular topic, sure to attract lots of attention. We've published our share, including 

Added by Vincent Granville on October 7, 2014 at 5:00pm — 2 Comments

Top 30 DSC blogs, based on new scoring technology

Most of you will read this article to discover the most popular blogs, but the real purpose here is to show what goes wrong with many data science projects, even ones as simple as this one, and how it can easily be fixed. In the process, we created a new popularity score, much more robust than any ranking used in similar articles (top bloggers, popular books, best websites, etc.). This scoring, based on a decay function, could be incorporated in recommendation engines.…

Added by Vincent Granville on October 4, 2014 at 9:00am — No Comments

Weekly Digest, October 6

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. 

Featured

Added by Vincent Granville on October 1, 2014 at 3:00pm — No Comments

© 2019   Data Science Central ®