Proposal for an Apprenticeship in Data Science

For motivated students who can learn on their own, here's an option that I would like to offer: the possibility of becoming an expert data scientist in less than six months, at a cost well below $10,000, and with guaranteed job opportunities.

The program would be open to everyone without screening, but the degree and the guaranteed jobs would be offered only to students who successfully complete selected projects. If you don't succeed, you don't pay.

The program would contain three parts:

Part I: Online training:

A 20-page booklet containing all the info you need to jump-start your data science career, written in simple English:

  • how to download Python, Perl, Java, and R; get sample programs; get started with writing an efficient web crawler; get started with Linux, Cygwin, and Excel (including logistic regression)
  • Hadoop, MapReduce, NoSQL, their limitations, and more modern technologies
  • how to find data sets or download very large data sets for free on the web
  • how to analyze data: from understanding business requirements to maintaining an automated (machine talking to machine) web / database application in production mode - a 12-step process
  • how to develop your first "Analytics as a Service" application and scale it
  • big data algorithms, and how to make them more efficient and more robust (application in computational optimization: how to efficiently test trillions of trillions of multivariate vectors to design good scores)
  • basics of statistics, Monte Carlo methods, cross-validation, robustness, sampling, design of experiments
  • tons of startup ideas for analytic people
  • reference data science book available for free (2nd edition)
  • basics of Perl, Python, real-time analytics, distributed architecture and general programming practices
  • data visualization, dashboards and how to communicate like a management consultant
  • tips for future consultants
  • tips for future entrepreneurs
  • rules of thumb, best practices, craftsmanship secrets, and why data science is an art
  • additional online resources
  • lift and other metrics to measure success, metric selection, use of external data, and making data silos communicate via fuzzy merging and statistical techniques
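To give a flavor of the last item, here is a minimal sketch of how lift could be computed for a scoring model. It is only an illustration, not material from the booklet: the function name `lift_at_top`, the `fraction` parameter, and the sample data are all hypothetical. Lift is taken here as the response rate among the top-scored records divided by the overall response rate.

```python
def lift_at_top(scores, labels, fraction=0.1):
    """Return the lift of the top-scored fraction of records.

    scores: model scores, higher meaning more likely to respond.
    labels: 1 for a response (success), 0 otherwise.
    fraction: share of records, ranked by score, to evaluate.
    """
    if len(scores) != len(labels) or not scores:
        raise ValueError("scores and labels must be non-empty and the same length")
    overall_rate = sum(labels) / len(labels)
    if overall_rate == 0:
        raise ValueError("lift is undefined when there are no responses")
    # Keep at least one record in the top group.
    k = max(1, int(len(scores) * fraction))
    # Rank records from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_rate = sum(label for _, label in ranked[:k]) / k
    return top_rate / overall_rate


# Hypothetical data: ten records, two responders, both scored highest.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
print(lift_at_top(scores, labels, fraction=0.2))
```

A lift of 1 means the scores do no better than picking records at random; the higher the lift, the better the scores concentrate the responders at the top of the ranking.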

Part II: Potential projects to be completed:

  • hacking and reverse-engineering projects, for instance a captcha attack
  • web crawling projects: how many Facebook accounts are duplicate or dead? Or categorize Tweets 
  • taxonomy creation or improving an existing taxonomy
  • optimal pricing for bid keywords on Google
  • create a web app that provides (in real time) better-than-average trading signals
  • find low-frequency and botnet fraud cases in a sea of data
  • internship in computational marketing with a data science start-up
  • automated plagiarism detection
  • estimating the number of entries (articles) on Wikipedia
  • use web crawlers to assess whether Google Search (1) favors its own products over competitors' [is this an unfair business practice?], (2) favors local over non-local results, and (3) returns different results to web robots and humans. Identify other biases and patterns in Google search results.
  • creation of an RSS feed exchange
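As a rough idea of what a starting point for one of these projects might look like, here is a minimal sketch for automated plagiarism detection. It is an assumption on my part, not a prescribed method: it compares two texts using overlapping word sequences (shingles) and Jaccard similarity. The function names, the shingle size of three words, and the sample sentences are all hypothetical.

```python
def shingles(text, n=3):
    """Return the set of overlapping n-word sequences found in text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets: size of intersection over size of union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similarity(text_a, text_b, n=3):
    """Score between 0 (no shared shingles) and 1 (identical shingle sets)."""
    return jaccard(shingles(text_a, n), shingles(text_b, n))


# Hypothetical texts: the second one changes only the last word.
original = "the quick brown fox jumps"
suspect = "the quick brown fox sleeps"
print(similarity(original, suspect))
```

A real project would need much more than this (crawling candidate sources, handling paraphrasing, scaling to many documents), but a pairwise score like this one is a reasonable first building block.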

Part III: Students successfully completing two projects

  • would be featured in the largest data science community 
  • would receive help finding a job or advice about jump-starting their own company
  • would get an endorsement from a leading data scientist
  • may be hired by sponsor companies funding this project

How to enroll?

If interested, join our Data Science Apprenticeship group to receive updates about our program and schedule. Once the program opens, you will also receive an invitation to participate as well as free training material.


Tags: predictive modeling



Comment by Conrad Montlouis on March 7, 2013 at 8:57pm

I would be very interested!  How to?

Comment by Steven Paul Sanderson II on March 4, 2013 at 4:29pm

Very Interested in this Dr. G

Comment by alex negash on February 20, 2013 at 5:30pm

I'm very excited about the apprenticeship program. What a great way to learn new skills!

Comment by Terrance Campbell on February 19, 2013 at 12:08pm

Vincent, I am interested in the data science apprenticeship.  Please let me know when the program starts.

Comment by Justin Mao-Jones on February 18, 2013 at 8:52am

I'm in.  What's next?

Comment by Elise Ralph on February 16, 2013 at 3:36pm

This is terrific. I feel like it's just what I need to step into a new community and grow. I'm really looking forward to participating in this.

Comment by Reginald Ezeh on February 14, 2013 at 10:20am

Same here. I can't wait to start this wonderful program. It's a novel approach to learning.

Please let me know when this program starts

Comment by Lito P. Cruz on February 13, 2013 at 1:36pm

Hi Vincent,

I am ready to start the program. Are the materials ready yet?

I am good to go so please kick start me.


Comment by L. Shane Hall on February 4, 2013 at 10:44am
I am definitely interested in this program. Please provide details to [email protected]

Comment by Michael Griffiths on February 3, 2013 at 10:38am
Hi Vincent,
Is this course due to start soon? If so, could you send me the details to [email protected]
