Abstract: In this paper, we propose hybrid principal component analysis (HPCA) to extract the appearance features of a face and an inter-age group variation-based classifier (IAG...
Why Python? Python is a multipurpose programming language widely used for data science, which has been termed the sexiest job of this century. Data scientists mine through t...
This tutorial shows how to implement dashboards in R using the new “flexdashboard” package. This new library leverages these libraries and all...
I love it when I get feedback from a blog that I’ve written. I appreciate the different perspectives and insights that others bring to a topic of interest. The sectio...
This article is from Win-Vector LLC. In this article we will discuss the machine learning method called “decision trees”, moving quickly over the usual “how decision...
Summary: Dealing with imbalanced datasets is an everyday problem. SMOTE (Synthetic Minority Oversampling TEchnique) and its variants are techniques for solving this pr...
Guest blog by Pablo Cordero. Pablo is currently a postdoc at UCSC’s systems biology group, doing applied machine learning research in the context of cell biology and r...
In this two-part series, we will explore text clustering and how to get insights from unstructured data. It will be quite powerful and industrial strength. The first part...
So here are my three principal lessons that you won’t easily find in books. 1. Evaluation Is Key. The main goal in data analysis/machine learning/data scienc...
The following problems appeared in the exercises in the Coursera course Image Processing (by Northwestern University). The following descriptions of the problems are ta...