Subscribe to DSC Newsletter

Featured Blog Posts – September 2019 Archive (61)

What are the Typical Data Scientist Profiles on LinkedIn? Survey Results

The charts below represents the main findings of some recent analysis of 1,000 data scientist LinkedIn profiles, using a web scraper. It was limited to Singapore, and for people having "data scientist" on their profile. Of course, many have a different job title especially in fields such as Fintech (quant engineer) or Healthcare (biostatistician) but the findings are interesting nevertherless and seem to apply to other locales as well.

The first chart features the educational…

Continue

Added by Capri Granville on September 12, 2019 at 10:00am — 1 Comment

“Will china dominate AI?” – is the wrong question

Introduction

 

Will china dominate AI?

When I spoke at the UK China business forum last month, I discussed this topic in response to an audience question.

In the current climate of nationalistic fervour, I see the same question asked in many guises.

For…

Continue

Added by ajit jaokar on September 9, 2019 at 11:32am — No Comments

Turn your ERP failures into optimization success

ERP. No other three letters can strike such fear into the heart of a CFO. But why is this the case? And what can leaders do to solve this problem?…

Continue

Added by Matthew Gierc on September 9, 2019 at 9:00am — No Comments

Introduction to Various Reinforcement Learning Algorithms

This article was written by Steeve Huang.

 

Reinforcement Learning (RL) refers to a kind of Machine Learning method in which the agent receives a delayed reward in the next time step to evaluate its previous action. It was mostly used in games (e.g. Atari, Mario), with performance on par with or even exceeding humans. Recently, as the algorithm evolves with the combination of Neural Networks, it is capable…

Continue

Added by Andrea Manero-Bastin on September 9, 2019 at 12:30am — No Comments

Six Degrees of Separation Between Any Two Data Sets

This is an interesting data science conjecture, inspired by the well known six degrees of separation problem, stating that there is a link involving no more than 6 connections between any two people on Earth, say between you and anyone living (say) in North Korea.   

Here the link is between any two univariate data sets of the same…

Continue

Added by Vincent Granville on September 8, 2019 at 4:00pm — 6 Comments

Weekly Digest, September 9

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.

Announcement
  • Find your…
Continue

Added by Vincent Granville on September 8, 2019 at 2:00pm — No Comments

How GANs and Adaptive Content Will Change Learning, Entertainment and More

This is the next blog in my random series on better understanding some of these advanced Artificial Intelligence and Deep Learning algorithms. This “episode” takes on Generative Adversarial Networks (GANs). Hope you enjoy my “Deep Learning” learning journey.

I originally wrote in “Transforming from Autonomous to Smart: Reinforcement Learning Basics” how…

Continue

Added by Bill Schmarzo on September 8, 2019 at 12:12pm — No Comments

Tai Chi of Civilization

It is not unusual for a person to encounter hurdles or barriers in social processes - for example, to obtain financial support to do a post-secondary program.  I drink decaffeinated coffee. Sometimes when I order coffee, perhaps due to lack of product demand I will receive a cup of coffee that is barely above room temperature.  This situation likewise extends from some kind of social process - e.g. to get the order out although it is not quite what I ordered.…

Continue

Added by Don Philip Faithful on September 8, 2019 at 6:38am — No Comments

Bayesian Machine Learning Part 5

Bayesian Machine Learning (part -5)

Introduction: Expectation-Maximization

In this blog we are going to see how Expectation-maximization algorithm works very closely. This blog is in strict continuation of the previous blog. Previously we saw how…

Continue

Added by Ashutosh vyas on September 7, 2019 at 11:42pm — No Comments

Two New Deep Conjectures in Probabilistic Number Theory

The material discussed here is also of interest to machine learning, AI, big data, and data science practitioners, as much of the work is based on heavy data processing, algorithms, efficient coding, testing, and experimentation. Also, it's not just two new conjectures, but paths and suggestions to solve these problems. The last section contains a few new, original exercises, some with solutions, and may be useful to students, researchers, and instructors offering math and statistics classes…

Continue

Added by Vincent Granville on September 7, 2019 at 9:30pm — 1 Comment

Machine Learning vs Statistics vs Statistical Learning in One Picture

Statistics, Statistical Learning, and Machine Learning are three different areas with a large amount of overlap. Despite that overlap, they are distinct fields in their own right. The following picture illustrates the difference between the three fields.

References…

Continue

Added by Stephanie Glen on September 6, 2019 at 8:47am — 1 Comment

R Users’ Salaries from the 2019 Stackoverflow Survey

Interesting analysis done in R, about salaries of R developers broken down by country, featuring salary range and median salary. 

The dataset consists of survey answers from nearly 90,000 respondents. About 5,000 of them reported using R for “extensive development work over the past year”. The first filter used reduces the dataset from 88,883 respondents to 5,048. The second filter…

Continue

Added by Capri Granville on September 5, 2019 at 7:00am — No Comments

Jobs, Unemployment and 45's Performance.

Despite the consuming controversy surrounding his presidency, POTUS 45 has been able to secure solid ratings on the performance of the economy over his so-far 30-month administration. And he certainly isn't bashful about taking credit for the successes, opining loudly and often that his tax cuts and de-regulation initiatives…

Continue

Added by steve miller on September 4, 2019 at 8:39am — No Comments

NLP Picks Bestsellers – A Lesson in Using NLP for Hidden Feature Extraction

Summary:  99% of our application of NLP has to do with chatbots or translation.  This is a very interesting story about expanding the bounds of NLP and feature creation to predict bestselling novels.  The authors created over 20,000 NLP features, about 2,700 of which proved to be predictive with a 90% accuracy rate in predicting NYT bestsellers.

 …

Continue

Added by William Vorhies on September 3, 2019 at 7:35am — 1 Comment

Common Errors in Machine Learning due to Poor Statistics Knowledge

Probably the worst error is thinking there is a correlation when that correlation is purely artificial. Take a data set with 100,000 variables, say with 10 observations. Compute all the (99,999 * 100,000) / 2 cross-correlations. You are almost guaranteed to find one above 0.999. This is best illustrated in may article How to Lie with P-values (also discussing how to handle…

Continue

Added by Vincent Granville on September 2, 2019 at 3:00pm — No Comments

Wrestling with Data Science “Second Surgeries”? Try MVE!

Please join me in Las Vegas at Hitachi Vantara’s NEXT 2019 in October where I’ll be…

Continue

Added by Bill Schmarzo on September 2, 2019 at 11:03am — No Comments

15 Articles and Tutorials about Outliers

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, ouliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on September 1, 2019 at 10:00am — No Comments

Misuses of Statistics: Examples and Solutions

This resource is part of a series on specific topics related to data science: regression, clustering, neural networks, deep learning, Hadoop, decision trees, ensembles, correlation, outliers, regression Python, R, Tensorflow, SVM, data reduction, feature selection, experimental design, time series, cross-validation, model fitting, dataviz, and many more. To keep receiving these articles, …

Continue

Added by Vincent Granville on September 1, 2019 at 9:30am — No Comments

Weekly Digest, September 2

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Featured Resources and Technical…

Continue

Added by Vincent Granville on September 1, 2019 at 6:30am — No Comments

Loop-Runtime Comparison R, RCPP, Python

The positive reactions on my last post: “Different kinds of loops in R” lead me to compare some different versions of loops in R, RCPP (C++ integration of R). To see a bigger picture, I apply the Python for-loop additionally. The comparison focuses on the runtime for non-costly tasks with different numbers of iterations. For comparison purpose I create vectors in the form of (R syntax):

Vector <- 1:k

k = (1.000, 100.000, 1.000.000)

 

The task is to…

Continue

Added by Frank Raulf on September 1, 2019 at 4:30am — 1 Comment

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service