Note: for the most recent updates, click here.

At the request of many prospective participants, here's an update about our DSA (Data Science Apprenticeship). Also, I have added a few large data sets, new projects and more material. Click here for details.

If you have already earned a data science certificate or diploma, but were never asked to develop and use your own API in batch mode, and to harvest and work on a data set with at least 50 million observations in a distributed environment, then it's time to learn the real stuff that will land you a real job!

Figure: Klein bottle, a surface with only one side.

Click here for a general overview of our apprenticeship. We have published the data and source code for our big data keyword correlation API. Read the material and download the three files (post a comment if you have questions and I'll reply ASAP): it will teach you how APIs work, and how to write your first API from scratch!

Our next API example will come with the source code of a web crawler, and will illustrate how to detect copyright infringement or how to detect the original, first version of an article published in multiple news outlets (doing a better job than Google).

All the training material will be offered for free to everyone. We have not yet put everything into a nice booklet, but some of the content is already available:

A few data sets available for download, from the following articles:

The following articles will be included in our curriculum, so you can start reading them now

List of potential projects for students:

Starred items (*) are recent additions.

We are still in the process of writing our small booklet to teach you all the fundamentals (computer science, statistics, business analytics, Python, MapReduce, big data, etc.) in 20 pages. Also, feel free to check our Data Science eBook, 2nd Edition. A much more comprehensive, curated and easy-to-read version is published by Wiley (April 2014) and costs less than $30.


Replies to This Discussion

I would be up for that. I'm in the health sector.

I'm ready! Let's get started! How do we get going? I'm in the financial sector.

Great! Thank you so much for all your hard work.

I am really looking forward to this apprenticeship.  But the question is when?

I'd like to help put the 2nd edition together into a nice PDF format. Let me know if I can help!

Any idea when this eBook is expected to be finalized? Thanks.

I'd like to contribute too.

Philip Best said:

I'd like to help put the 2nd edition together into a nice PDF format. Let me know if I can help!

I am patiently waiting for its commencement... Awesome!

How do I join?

I am new here. I am very interested in more details about this program.

I would like to join this program. Please let me know how.

Is that 20-page booklet available for download? The one about the crawler, AaaS, etc.

