Proposal for an Apprenticeship in Data Science

For motivated students who can learn on their own, here's an option that I would like to offer: the possibility to become an expert data scientist in less than six months, for a cost well below $10,000, and with guaranteed job opportunities.

The program would be open to everyone without screening, but the degree and the guaranteed jobs would be offered only to students who successfully complete selected projects. If you don't succeed, you don't pay.

The program would contain three parts:

Part I: Online training:

A 20-page booklet containing all the info you need to jump-start your data science career, written in simple English:

  • how to download Python, Perl, Java, R, get sample programs, get started with writing an efficient web crawler, get started with Linux, Cygwin, Excel (including logistic regression)
  • Hadoop, MapReduce, NoSQL, their limitations, and more modern technologies
  • how to find data sets or download very large data sets for free on the web
  • how to analyze data: from understanding business requirements to maintaining an automated (machine talking to machine) web / database application in production mode - a 12-step process
  • how to develop your first "Analytics as a Service" application and scale it
  • big data algorithms, and how to make them more efficient and more robust (application in computational optimization: how to efficiently test trillions of trillions of multivariate vectors to design good scores)
  • basics of statistics, Monte Carlo methods, cross-validation, robustness, sampling, design of experiments
  • tons of startup ideas for analytic people
  • reference data science book available for free (2nd Edition)
  • basics of Perl, Python, real time analytics, distributed architecture and general programming practices
  • data visualization, dashboards and how to communicate like a management consultant
  • tips for future consultants
  • tips for future entrepreneurs
  • rules of thumb, best practices, craftsmanship secrets, and why data science is an art
  • additional online resources
  • lift and other metrics to measure success, metric selection, use of external data, and making data silos communicate via fuzzy merging and statistical techniques
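
As an illustration of the lift metric mentioned in the last item, here is a minimal sketch in Python. It assumes the common definition of lift: the success rate among the top-scored fraction of cases divided by the success rate over all cases. The function name `lift_at_top`, its parameters, and the sample numbers are hypothetical and are not taken from the booklet.

```python
from typing import Sequence


def lift_at_top(scores: Sequence[float], outcomes: Sequence[int],
                fraction: float = 0.1) -> float:
    """Positive rate in the top-scored fraction divided by the overall positive rate.

    Hypothetical helper for illustration; outcomes are 1 (success) or 0 (failure).
    """
    if len(scores) != len(outcomes) or len(scores) == 0:
        raise ValueError("scores and outcomes must be non-empty and of equal length")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")

    overall_rate = sum(outcomes) / len(outcomes)
    if overall_rate == 0:
        raise ValueError("lift is undefined when there are no positive outcomes")

    # Rank cases from highest to lowest score and keep the top fraction.
    ranked = sorted(zip(scores, outcomes), key=lambda pair: pair[0], reverse=True)
    top_n = max(1, int(len(ranked) * fraction))
    top_rate = sum(outcome for _, outcome in ranked[:top_n]) / top_n

    return top_rate / overall_rate


# Made-up example: 10 scored cases, 3 of which are successes.
example_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
example_outcomes = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
example_lift = lift_at_top(example_scores, example_outcomes, fraction=0.2)
print(f"lift in the top 20%: {example_lift:.2f}")
```

A lift above 1 means the score concentrates successes near the top of the ranking; a lift close to 1 means the score does little better than picking cases at random.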

Part II: Potential projects to be completed:

  • hacking and reverse-engineering projects, for instance a captcha attack
  • web crawling projects: how many Facebook accounts are duplicates or dead? Or categorize tweets
  • taxonomy creation or improving an existing taxonomy
  • optimal pricing for bid keywords on Google
  • create a web app that provides (in real time) better-than-average trading signals
  • find low-frequency and Botnet fraud cases in a sea of data
  • internship in computational marketing with a data science start-up
  • automated plagiarism detection
  • estimating the number of entries (articles) on Wikipedia
  • use web crawlers to assess whether Google Search (1) favors its products over competitors [is this an unfair business practice?], (2) favors local over non-local results, and (3) returns different results to web robots and humans. Identify other biases and patterns in Google search results.
  • creation of an RSS feed exchange
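
The proposal does not say how the Wikipedia estimate should be made. One possible approach, offered here only as an assumption, is capture-recapture (the Lincoln-Petersen estimator): draw two independent random samples of articles, count how many articles appear in both, and estimate the total as the product of the sample sizes divided by the overlap. The sketch below simulates this on a made-up population of integer IDs; the function name and all numbers are hypothetical, and it assumes articles can be sampled roughly uniformly at random.

```python
import random


def lincoln_petersen(first_sample: set, second_sample: set) -> float:
    """Estimate population size from two independent random samples.

    Hypothetical helper: N is estimated as n1 * n2 / m, where m is the
    number of items that appear in both samples.
    """
    overlap = len(first_sample & second_sample)
    if overlap == 0:
        raise ValueError("samples share no items; the estimate is undefined")
    return len(first_sample) * len(second_sample) / overlap


if __name__ == "__main__":
    # Simulated stand-in for a large collection of articles (assumption:
    # the true size is unknown in practice and is fixed here only for the demo).
    rng = random.Random(42)
    true_size = 100_000
    population = range(true_size)

    first = set(rng.sample(population, 2_000))
    second = set(rng.sample(population, 2_000))

    estimate = lincoln_petersen(first, second)
    print(f"true size: {true_size}, estimated size: {estimate:.0f}")
```

The estimate is noisy when the overlap is small, so larger samples (or repeated sampling rounds) give a tighter result.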

Part III: Students successfully completing two projects

  • would be featured in the largest data science community 
  • would receive help finding a job or advice about jump-starting their own company
  • would get an endorsement from a leading data scientist
  • may be hired by sponsor companies funding this project

How to enroll?

If interested, join our Data Science Apprenticeship group to receive updates about our program and schedule, and to receive an invitation to participate as well as free training material, when the program is open.


Comment by Cicero Neves on January 18, 2013 at 7:05pm

I would be very interested too.

Comment by Chandrasekhara S. ("C.S.") Ganti on December 19, 2012 at 7:35pm

Pavan Kumar, the link for the ebook works fine; I downloaded it previously and again just now. I have also shared the book on your wall on Data Science Central. Please take a look at it.

Comment by Tuhin Chattopadhyay on December 19, 2012 at 7:26pm

Dear Vincent,

I am interested in joining this programme. Kindly provide me with detailed information.



Comment by Pavan Kumar on December 19, 2012 at 6:29pm

I am unable to download the updated data science pdf book from this link. http://www.datasciencecentral.com/page/data-science-book

Can anyone please help?

Comment by RAJEEV RANJAN on December 19, 2012 at 5:41pm

It would be great to have some information about what type of educational and work experience background you would like applicants to have.

Comment by WK on December 19, 2012 at 1:37pm

This would be great. I'm in!

Comment by phenomena on December 19, 2012 at 1:11pm

I am very interested in learning more about this formal apprenticeship program!

Comment by Vickie Comrie on December 19, 2012 at 1:06pm

I sincerely hope this apprenticeship program takes off!  I would be thrilled to be a part of it!



Comment by Umair Siddiq on November 13, 2012 at 12:32pm

What would be the prerequisites to get into this program? Would it be good for somebody with little or no database or coding skills?

Comment by Pavan Kumar on November 2, 2012 at 9:59pm

Hi... Is this program still on? Anyone already started working on this apprenticeship?
