Subscribe to DSC Newsletter

All Blog Posts (7,233)

No ML algorithms cheat sheet, please

What is a Cheat Sheet ?

Continue

Added by Venkat Raman on June 8, 2020 at 11:30pm — 1 Comment

Managing Data in Massive-Scale Vector Search Engine



How data management is done in Milvus…

Continue

Added by Kate Shao on June 8, 2020 at 11:27pm — No Comments

The Most Important Aspects of a Data Science

Often touted as one of the top jobs of the 21st century, a data scientist is an increasingly common requirement across firms of different sizes and from a variety of verticals. This comes from the fact that: 

  • Massive amounts of data are flowing into most companies
  • This data is looked at as potentially a great source of valuable strategic insights

In the course of a data science career, a data scientist handles a number of…

Continue

Added by Yoey Thamas on June 8, 2020 at 9:00pm — No Comments

Targeting Hate Speech

Summary:  Objectively identifying hateful or abusive speech on social media platforms would allow those platforms to better control it.  However to be objective and without bias that identification would have to be independent of the author especially where elected officials are involved.

 

What could be more…

Continue

Added by William Vorhies on June 8, 2020 at 2:25pm — No Comments

How COVID-19 is Changing our Relationship with Data

How COVID-19 is Changing our Relationship with Data

An increasing proportion of businesses use scientific methods to analyze data. Yet, because key decision-makers do not believe in a data-driven method, a sizable number do not turn their data into actionable information. Instead, these individuals rely on their instincts to drive decisions. If used in lieu of data,such hunch-based mindsets often lead to inefficiencies and wasted…

Continue

Added by Taymour Matin on June 8, 2020 at 10:00am — 2 Comments

Digital Strategy Series Part 2: Creating an Agile and Adaptive Business Strategy Courtesy of AI

While Strategy may not be dead, the importance of the traditional strategy function will be greatly reduced by the emergence of AI. In my previous blog “Strategy Series Part 1: "Creating a Data Strategy that Delivers Value,"  I asked:  How does one develop data and AI strategies in a world of continuous change and transformation?  Tesla provides an interesting poster child for that question: What is Tesla’s business strategy?  Is Tesla an automotive…

Continue

Added by Bill Schmarzo on June 8, 2020 at 5:30am — No Comments

Weekly Digest, June 8

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this…

Continue

Added by Vincent Granville on June 7, 2020 at 12:00pm — No Comments

Mortality and co-morbidity Investigation in Italy under Covid-19

Mortality and co-morbidity Investigation in Italy under Covid-19

Dispensing connection between characteristic, co-morbidities, and mortality outcomes among 1890 patients with Covid-19 and three cause of death: infectious, respiratory and under investigation diseases which affected mortality rates in Italy between 2010 and 2017.

by Federica…

Continue

Added by Federica Gazzelloni on June 6, 2020 at 1:12am — No Comments

Bernoulli Lattice Models - Connection to Poisson Processes

Bernouilli lattice processes may be one of the simplest examples of point processes, and can be used as an introduction to learn about more complex spatial processes that rely on advanced measure theory for their definition. In this article, we show the differences and analogies between Bernouilli lattice processes on the standard rectangular or hexagonal grid, and the Poisson process, including convergence of discrete lattice processes to continuous Poisson process, mainly in two…

Continue

Added by Vincent Granville on June 5, 2020 at 7:00am — 2 Comments

Why Address Standardization and Validation Matters & What You Can Do About It

Poor address data is a complex data quality challenge that affects customers, businesses, and mailing service. Each year, millions of dollars get wasted in resolving the consequences of poor address data. Mailers spend over $20 billion on UAA mail, while direct costs to the USPS is over $1.5 billion/year. All this unnecessary cost is the result of poor, mismanaged, invalidated address data. 

Over the years, working…

Continue

Added by Farah Kim on June 5, 2020 at 1:54am — No Comments

Reinforcement Learning (Q-learning) - Implementation using R (Part 2)

This the second part of Reinforcement Learning (Q-learning). If you would like to understand the RL, Q-learning, and key terms please read Part 1.

In this part, we will implement a simple example of Q learning using the R programming language from scratch. It is expected from you to understand the basics of R programming and complete the reading of Part 1 of this article.

Import libraries

We are coding the algorithms using the R base package…

Continue

Added by Nitin Agarwal on June 4, 2020 at 8:23pm — No Comments

Reinforcement Learning (Q-learning) - An Introduction (Part 1)

Have you heard about AI learning to play computer games on their own and giving tough competitions to expert Human gamers?

A very popular example being Deepmind whose AlphaGo program defeated the South Korean Go world champion in 2016. Other than this there are other AI agents developed with the intent of playing Atari games like…

Continue

Added by Nitin Agarwal on June 4, 2020 at 7:51pm — No Comments

Information Theory (Turbo Codes) & Bayesian Networks

An error correcting code (ECC) is a way of controlling errors in data that is being transmitted over an unreliable or noisy communication channel. In an ECC, the sender encodes the message with redundant information. The receiver is able to detect a limited number of errors, and to correct these errors without retransmission.

Turbo codes are a class of ECC that approach very closely the theoretical…

Continue

Added by Robert R. Tucci on June 4, 2020 at 4:04pm — No Comments

Thursday News, June 4

Here is our selection of featured articles and technical contributions posted since Monday:

Announcements

Technical…

Continue

Added by Vincent Granville on June 4, 2020 at 8:00am — No Comments

Deploy Your First Serverless AWS ML Solution Fast

Introduction to SageMaker

I’ve been working with AWS SageMaker for a while now and have enjoyed great success. Creating and tuning models, architecting pipelines to support both model development and real-time inference, and data lake formation have all been made easier in my opinion.  AWS has proven to be an all encompassing solution for machine learning use cases, both batch and real-time, helping me decrease time to…

Continue

Added by Jeff Diaz on June 2, 2020 at 12:00pm — No Comments

Explaining Data Science to a Non-Data Scientist

Summary:  Explaining data science to a non-data scientist isn’t as easy as it sounds.  You may know a lot about math, tools, techniques, data, and computer architecture but the question is how do you explain this briefly without getting buried in the detail.  You might try this approach.

 

We’ve all been…

Continue

Added by William Vorhies on June 2, 2020 at 9:40am — 3 Comments

Johns Hopkins Covid-19 Data and R, Part II, data.table functions and graphics, plus R-Naught.

Summary: This blog is part II of a series showcasing management and analytics of the daily U.S. Covid-19 case/death data published by the Center for Systems Science and Engineering at Johns…

Continue

Added by steve miller on June 2, 2020 at 8:33am — No Comments

Visualizing the Performance of COVID Models

by Chris McLean & Peter Bruce

Never before have statistical models received the attention they are getting now in the midst of the Coronavirus pandemic.  It is hard to read a news feed today without encountering either:

  • New predictions…
Continue

Added by Paul Derstine on June 2, 2020 at 7:06am — No Comments

Lockdowns Knock Down the Spread of COVID-19, but Only to a Point (and only early on)

by Mike Thurber and John Elder, Ph.D.

By tracking anonymized mobile phone location data and COVID-19 case reports for many countries with different policies, we studied the effect of restricting mobility on the spread of COVID-19.  We found that lockdown policies did rapidly reduce the…

Continue

Added by Paul Derstine on June 2, 2020 at 6:44am — No Comments

5 Most Popular Measures For Capturing Forecasting Error For Time Series

This article is chunk from one of my blog posts on Arima time series forecasting with Python It is a pretty extensive tutorial and until and unless you are not really interested in learning in and outs of about ARIMA time series forecasting don't bother to click. 

But I do wanted to share this list of 5 very useful metrics for a…

Continue

Added by Mohit Sharma on June 1, 2020 at 8:30pm — No Comments

Blog Topics by Tags

Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service