Subscribe to DSC Newsletter

Vincent Granville's Blog – September 2019 Archive (17)

Weekly Digest, September 30

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Announcements

  • Earn a Data…
Continue

Added by Vincent Granville on September 29, 2019 at 7:00am — No Comments

Thursday News, September 26

Here is our selection of featured articles and technical resources posted since Monday:

Resources

Continue

Added by Vincent Granville on September 26, 2019 at 11:00am — No Comments

Weekly Digest, September 23

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Announcements…

Continue

Added by Vincent Granville on September 22, 2019 at 11:30am — No Comments

Thursday News, September 19

Here is our selection of featured articles and technical contributions posted since Monday:

Announcements

Continue

Added by Vincent Granville on September 19, 2019 at 9:30am — No Comments

Applications of Data Analytics

I am presenting at the upcoming NISS (National Institute of Statistical Sciences) webinar on September 27. This was my first employer in US, back in 1996. I was then completing a post-doc.

My presentation focuses on new algorithms, original applications, theoretical data science (including a new conjecture about data sets) and implications to business analytics, as well as new foundations of statistics, based on general resampling and model free, data-driven techniques. It will also…

Continue

Added by Vincent Granville on September 19, 2019 at 9:30am — No Comments

Introduction to Authorship Analysis as a Text Classification/Clustering Problem

Guest blog post by Nabanita Roy.

Introduction:

The art and science of discriminating between writing styles of authors by identifying the characteristics of the persona of the authors and examining articles authored by them is called Authorship Analysis. It aims to determine characteristics of an individual like age, gender, native language and personality traits…

Continue

Added by Vincent Granville on September 18, 2019 at 3:02pm — No Comments

Introduction to Authorship Analysis as a Text Classification/Clustering Problem

Guest blog post by Nabanita Roy.

Introduction:

The art and science of discriminating between writing styles of authors by identifying the characteristics of the persona of the authors and examining articles authored by them is called Authorship Analysis. It aims to determine characteristics of an individual like age, gender, native language and personality traits…

Continue

Added by Vincent Granville on September 18, 2019 at 3:02pm — No Comments

Weekly Digest, September 16

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Announcement

  • Building…
Continue

Added by Vincent Granville on September 14, 2019 at 8:30am — No Comments

Thursday News, September 12

This is our selection of featured articles and resources posted since Monday:

Announcement

Technical Contributions

Continue

Added by Vincent Granville on September 12, 2019 at 10:30am — No Comments

Six Degrees of Separation Between Any Two Data Sets

This is an interesting data science conjecture, inspired by the well known six degrees of separation problem, stating that there is a link involving no more than 6 connections between any two people on Earth, say between you and anyone living (say) in North Korea.   

Here the link is between any two univariate data sets of the same…

Continue

Added by Vincent Granville on September 8, 2019 at 4:00pm — 6 Comments

Weekly Digest, September 9

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.

Announcement
  • Find your…
Continue

Added by Vincent Granville on September 8, 2019 at 2:00pm — No Comments

Two New Deep Conjectures in Probabilistic Number Theory

The material discussed here is also of interest to machine learning, AI, big data, and data science practitioners, as much of the work is based on heavy data processing, algorithms, efficient coding, testing, and experimentation. Also, it's not just two new conjectures, but paths and suggestions to solve these problems. The last section contains a few new, original exercises, some with solutions, and may be useful to students, researchers, and instructors offering math and statistics classes…

Continue

Added by Vincent Granville on September 7, 2019 at 9:30pm — 1 Comment

Thursday News, September 5

Here is our selection of featured articles and technical resources posted since Monday:

Technical Resources

Continue

Added by Vincent Granville on September 5, 2019 at 7:30am — No Comments

Common Errors in Machine Learning due to Poor Statistics Knowledge

Probably the worst error is thinking there is a correlation when that correlation is purely artificial. Take a data set with 100,000 variables, say with 10 observations. Compute all the (99,999 * 100,000) / 2 cross-correlations. You are almost guaranteed to find one above 0.999. This is best illustrated in may article How to Lie with P-values (also discussing how to handle…

Continue

Added by Vincent Granville on September 2, 2019 at 3:00pm — 2 Comments

15 Articles and Tutorials about Outliers

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, ouliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on September 1, 2019 at 10:00am — No Comments

Misuses of Statistics: Examples and Solutions

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, dataviz, and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on September 1, 2019 at 9:30am — No Comments

Weekly Digest, September 2

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Featured Resources and Technical…

Continue

Added by Vincent Granville on September 1, 2019 at 6:30am — No Comments

Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service