You can download the new machine learning cheat sheet here (PDF format, 14 pages.)
Originally published in 2014 and viewed more than 200,000 times, this is the oldest data science cheat sheet - the mother of all the numerous cheat sheets that are so popular nowadays. I decided to update it in June 2019. While the first half, dealing with installing components on your laptop and learning UNIX, regular expressions, and file management hasn't changed much, the second half, dealing with machine learning, was rewritten entirely from scratch. It is amazing how things have changed in just five years!
Source for picture: see here (original) or here (PDF)
Written for people who have never seen a computer in their life, it starts with the very beginning: buying a laptop! You can skip the first half and jump to sections 5 and 6 if you are already familiar with UNIX. This new cheat sheet will be included in my upcoming book Machine Learning: Foundations, Toolbox, and Recipes to be published in September 2019, and available (for free) to Data Science Central members exclusively. This cheat sheet is 14 pages long.
Content
1. Hardware
2. Linux environment on Windows laptop
3. Basic UNIX commands
4. Scripting languages
5. Python, R, Hadoop, SQL, DataViz
6. Machine Learning
To not miss this type of content in the future, subscribe to our newsletter. For related articles from the same author, click here or visit www.VincentGranville.com. Follow me on on LinkedIn, or visit my old web page here.
Comment
Excellent !!
Covers almost all the MUST have things for Data Science.
Thanks for sharing.
Many thanks.
It seems like there is a little error: Basic web crawler * could not be accessed, which gives the message that "Our apologies – this page was not found". Would you please help check it? Thanks.
Excellent collation!!! As beginner to the site and DS journey this is well organized stuff. Looking forward use this as I march ahead in learning more about data science.
'This Article' is now "https://github.com/gumption/Python_for_Data_Science"
Very nice summary, and some very useful links. I am in the process of putting together a dashboard and found your 10 Features all Dashboards Should Have post a helpful checklist.
Do you find github/bitbucket/etc. a useful data science tool at this point?
Wonderful tutorial!
Things that not learn in the classroom...
Congratulations Vincent!!!
AMAZING! Thank you a lot!
Vincent, thanks a lot!
Excellent compilation - just one niggle though - avoid using Cygwin on Windows or Homebrew on MacOSX - it may mess up some big data frameworks. It's probably best to use Linux with a Virtualbox - it also allows you to use nice IDEs like Dataiku.
Great work though - thanks Vincent!
@Vincent, what do you think about use only opensource softaware office (Writer, Calc) instead Excel, Word? some negative experience with these?
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles
You need to be a member of Data Science Central to add comments!
Join Data Science Central