Subscribe to DSC Newsletter

Vincent Granville's Blog (876)

Weekly Digest, May 1

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

Added by Vincent Granville on April 29, 2017 at 8:54am — No Comments

18 Great Blogs Posted in the last 12 Months

This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published. These articles are at least 6 month old but no more than 12 month old. The previous digest in this series was posted here a while back. 

18 Great Blogs Posted in the last 12 Months…

Continue

Added by Vincent Granville on April 27, 2017 at 8:07am — No Comments

Thursday News: AI, Python, Hadoop, ML, Data Science, Automation

Here is our selection of featured articles and resources posted since Monday:

Continue

Added by Vincent Granville on April 27, 2017 at 5:36am — No Comments

Which machine learning algorithm should I use?

By Hui Li, Principal Staff Scientist, Data Science, at SAS.

A typical question asked by a beginner, when facing a wide variety of machine learning algorithms, is “which algorithm should I use?” The answer to the question varies depending on many factors, including:

  • The size, quality, and nature of data.
  • The available computational time.
  • The urgency of the task.
  • What you want to do with the data.

Even an experienced data…

Continue

Added by Vincent Granville on April 26, 2017 at 6:30pm — 1 Comment

Book: Data Science for the Layman: No Math Added

Want to get started on data science? Our promise: no math added.

This book has been written in layman’s terms as a gentle introduction to data science and its algorithms. Each algorithm has its own dedicated chapter that explains how it works, and shows an example of a real-world application. To help you grasp key concepts, we stick to intuitive explanations and visuals.…

Continue

Added by Vincent Granville on April 26, 2017 at 3:00pm — 1 Comment

Key Machine Learning PreReq: Viewing Linear Algebra through the right lenses

Guest blog by Ashwin Rao. Ashwin is Vice President, Data Science & Optimization at Target.

The tech industry has gone berserk - everyone wants to develop “skills in Machine Learning and AI” but few are willing to put in…

Continue

Added by Vincent Granville on April 26, 2017 at 2:58pm — No Comments

Introduction to Principal Component Analysis

This formula-free summary provides a short overview about how PCA (principal component analysis) works for dimension reduction, that is, to select k features (also called variables) among a larger set of n features, with k much smaller than n. This smaller set of k features built with PCA is the best subset of k features, in the sense that it minimizes the variance of the residual noise when fitting data to a…

Continue

Added by Vincent Granville on April 26, 2017 at 8:30am — 2 Comments

Weekly Digest, April 24

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

Added by Vincent Granville on April 22, 2017 at 6:00am — No Comments

Thursday News: ML, AI, Python, Dataviz, Hadoop and more

Here is our new selection of featured articles and resources posted since Monday.

Continue

Added by Vincent Granville on April 20, 2017 at 8:54am — No Comments

Detecting Fake News, Fake Reviews, Fake Accounts, Fake Pictures

A while back, I was reading an article posted on Facebook, about Clovis people found alive and well living in Florida, with a picture featuring tribesmen (see below.) The quality of the picture was poor, and the URL was very suspicious: baynews9.com.ddwg.clonezone.link, as to make it appear that it was from Baynews9.com. It turned out that the picture (and thus the whole story) was fake: these people are real…

Continue

Added by Vincent Granville on April 18, 2017 at 12:00pm — 3 Comments

Seasons in Binary Star Planetary Systems

Here are a few challenges for the mathematically inclined - most data scientists are. This is just fun problems if you have some time to kill. The first problem is about seasons in binary star planetary systems: it has implications on whether such planets are inhabitable. It is also related to time series with double periodicity.  The next problems are related to infinite products, with an emphasis on building a prime-generating or at least prime-detection function. Large prime numbers are…

Continue

Added by Vincent Granville on April 16, 2017 at 11:00am — No Comments

Weekly Digest, April 17

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Announcement

Continue

Added by Vincent Granville on April 15, 2017 at 7:30am — No Comments

19 Great Blogs Posted in the last 12 Months

This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published. These articles are at least 6 month old but no more than 12 month old. The previous digest in this series was posted here a while back. 

19 Great Blogs Posted in the last 12 Months…

Continue

Added by Vincent Granville on April 14, 2017 at 8:00am — No Comments

Thursday News: IoT, AI, Spark, Cheat Sheets, Machine Learning, Stats

Here is our selection of featured articles and resources posted since Monday:

Continue

Added by Vincent Granville on April 13, 2017 at 8:56am — No Comments

20 Cheat Sheets: Python, ML, Data Science, R, and More

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, dataviz, and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on April 11, 2017 at 9:00am — No Comments

PySpark Cheat Sheet: Spark in Python

Apache Spark is generally known as a fast, general and open-source engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing. It allows you to speed analytic applications up to 100 times faster compared to technologies on the market today. You can interface Spark with Python through "PySpark". This is the Spark Python API exposes the Spark programming model to Python. 

The cheat sheet below was produced by…

Continue

Added by Vincent Granville on April 10, 2017 at 9:00am — No Comments

The Future of Data Science in One Picture

We have published various "one picture" articles about data science topics. Our readers find them very useful, as they convey the same amount of information in one visual, as a long article. 

The picture below is from the Data Science Field Guide published by Booz Allen Hamilton. You can download the guide…

Continue

Added by Vincent Granville on April 9, 2017 at 10:00am — No Comments

Weekly Digest, April 10

Monday newsletter published by Data Science Central. Previous editions can be found here.  The contribution flagged with a + is our selection for the picture of the week.

Featured Resources and Technical Contributions

Continue

Added by Vincent Granville on April 8, 2017 at 7:30am — No Comments

18 Great Blogs Posted in the last 12 Months

This is part of a new series of articles: once or twice a month, we post previous articles that were very popular when first published. These articles are at least 6 month old but no more than 12 month old. The previous digest in this series was posted here a while back. 

18 Great Blogs Posted in the last 12…

Continue

Added by Vincent Granville on April 7, 2017 at 7:30am — No Comments

Factoring Massive Numbers with Machine Learning Techniques

We are interested here in factoring numbers that are a product of two very large primes. Such numbers are used by encryption algorithms such as RSA, and the prime factors represent the keys (public and private) of the encryption code. Here you will also learn how data science techniques are applied to big data, including visualization, to derive insights. This article is good reading for the data scientist in training, who might not necessarily have easy access to interesting data: here the…

Continue

Added by Vincent Granville on April 6, 2017 at 8:00am — 3 Comments

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2017   Data Science Central   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service