Here's the content. Click here to check the full table of content. The final chapter was produced on time, and the expected publication date is still by March 2014. Publisher is Wiley.…
Added by Vincent Granville on December 30, 2013 at 4:00pm — No Comments
A few data sets are accessible from our data science apprenticeship web page.
This is another provocative KDNuggets blog post: the data scientist is reduced to three circles, missing the biggest, most important one that encompasses all three of them..
My answer: This Venn diagram…Continue
Search engines (Google, Microsoft), social networks (Twitter, Facebook, LinkedIn), financial institutions, Amazon, Apple, eBay, the health care industry, engineering companies (Boeing, Intel, Oil industry), retail analytics, mobile analytics, marketing agencies, data science vendors (for instance, Pivotal, Teradata, Tableau, SAS, Alpine Labs), environment, utilities government and defense routinely hire…Continue
Added by Vincent Granville on December 29, 2013 at 11:00am — No Comments
Facebook uses an algorithm to estimate the proportion of text in an image embedded into a Facebook ad.
The algorithm works well and it is well designed (it's not an issue of bad data science per se), but it results in 50% of my ads being rejected, see sample ad and rejection notice below. This is an example where better…Continue
Added by Vincent Granville on December 25, 2013 at 5:30pm — No Comments
At 12 years old, Capri might be the youngest data scientist hired by a tech company. So what is she working on?
First, she helps promote data science and computer science among teens. She is the future generation of tech-savvy…Continue
Added by Vincent Granville on December 20, 2013 at 3:30pm — No Comments
I need to finish the last chapter of my Data Science book by December 30. It will include
Added by Vincent Granville on December 18, 2013 at 4:00pm — No Comments
It is interesting to see what Harvard considers to be data science. They use Python in all projects / training (there's nothing wrong with that, though exposure to other languages - R, Stata, SQL - would be great, in top of Python). It is too traditional, and too heavy in statistics in particular. I did not see anything about machine-to-machine communications (e.g. keyword bidding), processing real time data, the…Continue
Visualization continues to mature and focus more on the data first than on novel designs and size. People improved on existing forms and got better at analysis. Readerships seemed to be more ready and eager to explore more data at a time. Fewer spam graphics landed in my inbox.
So all in all, 2013 was a pretty good year for…Continue
Added by Vincent Granville on December 18, 2013 at 10:30am — No Comments
It all started as a little data science project, possibly a job interview question for applicants: How would you compute the number of entries on Wikipedia.
The idea was to use large…Continue
Added by Vincent Granville on December 11, 2013 at 8:00pm — No Comments
Chapter 7: Launching Your New Data Science Career
This discussion was posted on our large LinkedIn group (100,000+ members) by our friend Gregory, pictured below. It has generated a tremendous volume of great comments by a number top leaders. Below is some of my comments. You can read and participate in the discussion by clicking here.…Continue
Added by Vincent Granville on December 7, 2013 at 10:00am — No Comments
What kind of data is salable? How can data scientists independently make money by selling data that is automatically generated: raw data, research data (presented as customized reports), or predictions. In short, using an automated data generation / gathering or prediction system, working from home with no boss and no employee, and possibly no direct interactions with clients. An alternate career path that many of…Continue