Subscribe to DSC Newsletter

Machine Learning and Data Science Cheat Sheet

You can download the new machine learning cheat sheet here (PDF format, 14 pages.) 

Originally published in 2014 and viewed more than 200,000 times, this is the oldest data science cheat sheet - the mother of all the numerous cheat sheets that are so popular nowadays. I decided to update it in June 2019. While the first half, dealing with installing components on your laptop and learning UNIX, regular expressions, and file management hasn't changed much, the second half, dealing with machine learning, was rewritten entirely from scratch. It is amazing how things have changed in just five years!

Source for picture: see here (original) or here (PDF)

Written for people who have never seen a computer in their life, it starts with the very beginning: buying a laptop! You can skip the first half and jump to sections 5 and 6 if you are already familiar with UNIX. This new cheat sheet will be included in my upcoming book Machine Learning: Foundations, Toolbox, and Recipes to be published in September 2019, and available (for free) to Data Science Central members exclusively. This cheat sheet is 14 pages long.

Content

1. Hardware

2. Linux environment on Windows laptop

3. Basic UNIX commands

4. Scripting languages

5. Python, R, Hadoop, SQL, DataViz

6. Machine Learning

  • Algorithms
  • Getting started
  • Applications
  • Data sets and sample projects

To not miss this type of content in the future, subscribe to our newsletter. For related articles from the same author, click here or visit www.VincentGranville.com. Follow me on on LinkedIn, or visit my old web page here.

Views: 241191

Comment

You need to be a member of Data Science Central to add comments!

Join Data Science Central

Comment by Janet Dobbins on October 6, 2014 at 9:53am

Thanks for this list, Vincent, I have listed your blog as a resource on our site too.  http://www.statistics.com/data-science/

Comment by Kalyanaraman K on August 7, 2014 at 3:34pm
One of the important and useful post. Thanks.
Comment by Vincent Granville on August 7, 2014 at 10:11am

Jeffrey: Is using a Windows machine critical?

Maybe not. I recommend Windows because I'm used to it and I use Excel quite a lot, as well as Word, and it has many apps such as Perl / Cygwin, easy to work with. Files are compatible with what I get from clients, though file formats is easy to address.

If you work with Excel spreadsheets on your Mac, no reasons to buy a Windows machine. Ideally, a Unix laptop with Excel is good. Personally, I don't like Microsoft products, don't use IE, don't use Outlook, and don't use most Microsoft features on my Laptop.

Comment by Jeffrey Keeton on August 7, 2014 at 9:07am

Planning to purchase a new laptop...what are everybody's thoughts on using a macbook (pro or air)? Is using a Windows machine critical?

Comment by Nasir M. Uddin, Ph.D. on August 5, 2014 at 6:21am

Very useful information - thank you very much. Regards, Nasir 

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service