Which machine learning algorithm should I use?

By Hui Li, Principal Staff Scientist, Data Science, at SAS.

A typical question asked by a beginner, when facing a wide variety of machine learning algorithms, is “which algorithm should I use?” The answer to the question varies depending on many factors, including:

The size, quality, and nature of data.
The available computational time.
The urgency of the task.
What you want to do with the data.

Even an experienced data scientist cannot tell which algorithm will perform the best before trying different algorithms. We are not advocating a one and done approach, but we do hope to provide some guidance on which algorithms to try first depending on some clear factors.

The machine learning algorithm cheat sheet

Click on the picture below to zoom in.

To read more, click here.

The article describes when using one of the following algorithms:

Linear regression and Logistic regression
Linear SVM and kernel SVM
Trees and ensemble trees
Neural networks and deep learning
k-means/k-modes, GMM (Gaussian mixture model) clustering
DBSCAN
Hierarchical clustering
PCA, SVD and LDA

DSC Resources

Services: Hire a Data Scientist | Search DSC | Classifieds | Find a Job
Contributors: Post a Blog | Ask a Question
Follow us: @DataScienceCtrl | @AnalyticBridge

Which machine learning algorithm should I use?

Leave a Reply Cancel reply