t-Distributed Stochastic Neighbor Embedding (t-SNE) is a (prize-winning) technique for dimensionality reduction that is particularly well suited for the visualization of high-dimensional datasets. The technique can be implemented via Barnes-Hut approximations, allowing it to be applied on large real-world datasets. Click here for details.
According to Wikipedia, t-SNE is a nonlinear dimensionality reduction technique developed by Geoffrey Hinton and Laurens van der Maaten. It is particularly well-suited for embedding high-dimensional data into a space of two or three dimensions, which can then be visualized in a scatter plot. Specifically, it models each high-dimensional object by a two- or three-dimensional point in such a way that similar objects are modeled by nearby points and dissimilar objects are modeled by distant points.
The information below was tweeted by Rubens Zimbres:
t-SNE algo in R and Python, made with same dataset (digits from Python). It seems the discriminant power is the same, have to check. BUT time spent in computation is more than double for R.
Top DSC Resources
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge
Comment
Great Article on t-SNE, I saw another one explaning how t-SNE works https://www.displayr.com/using-t-sne-to-visualize-data-before-predi...
© 2019 Data Science Central ®
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles
You need to be a member of Data Science Central to add comments!
Join Data Science Central