Note: for the most recent updates, click here.
At the request of many prospective participants, here's an update about our DSA (Data Science Apprenticeship). Also, I have added a few large data sets, new projects and more material. Click here for details.
If you have already earned a data science certificate or diploma, but was not requested to develop and use your own API in batch mode, and harvest/work on a data set with at least 50 million observations in a distributed environment, then it's time to learn the real stuff that will land you a real job!
Click here for a general overview of our apprenticeship. We have published the data and source code for our big data keyword correlation API. Read the material and download the three files (and post your comments if you have questions, I'll reply ASAP): it will teach you how API's work, and how to write your first API from scratch!
Our next API example will come with the source code of a web crawler, and will illustrate how to detect copyright infringement or how to detect the original, first version of an article published in multiple news outlets (doing a better job than Google).
All the training material will be offered for free to everyone. We have not yet put everything into a nice booklet, but some of the content is already available:
A few data sets available for download, from the following articles:
The following articles will be included in our curriculum, so you can start reading them now
List of potential projects for students:
Starred items (*) are recent additions.
We are still in the process of writing our small booklet to teach you all the fundamentals (computer science, statistics, business analytics, Python, Map Reduce, big data etc.) in 20 pages. Also, feel free to check our Data Science eBook - 2nd Edition. A much more comprehensive, curated and easy-to-read version is published by Wiley (April 2014) and costs less than $30.
Hello. I would like to join this Appreticeship. How do I get started?
Hi everyone, I just found out about this and it sounds like a world of fun. I will definitely be participating. Is this performed in a group or is everyone just self learning at their own pace?
All geared up...Dates please !!
Great ready to start.
The same for me. How can I join?
Xia Lu said:
Thank you for this effort. Can you please post the link to the booklet for DIY? The "get started here" link just redirects to this page.
This is an excellent program offered by DSC. Superior to most every USA University program offered in the US. Suggest obtaining the book and download everything that is available now.
Dr. F. N. Kautzmann III, Ph.D., Litt.D. Data Science Certification.