Subscribe to DSC Newsletter

Featured Blog Posts – May 2019 Archive (100)

R-Squared in One Picture

R-squared measures how well your data fits a regression line. More specifically, it's how much variation in the…

Continue

Added by Stephanie Glen on May 31, 2019 at 8:00am — No Comments

Apache Spark Streaming Tutorial for Beginners

Introduction

In a world where we generate data at an extremely fast rate, the correct analysis of the data and providing useful and meaningful results at the right time can provide helpful solutions for many domains dealing with data products. We can apply this in Health Care and Finance to Media, Retail, Travel Services and etc. some solid examples include Netflix providing personalized recommendations at real-time, Amazon tracking your interaction with different products on its…

Continue

Added by Divya Singh on May 30, 2019 at 8:00pm — No Comments

Simulated Significance

I pulled out a dusty copy of Thinking Stats by Allen Downey the other day. I highly recommend this terrific little read that teaches statistics with easily understood examples using Python. When I purchased the book eight years ago, the Python code proved invaluable as…

Continue

Added by steve miller on May 30, 2019 at 7:56am — No Comments

Harnessing Potential of Artificial Intelligence In Energy and Oil & Gas

The energy industry is undergoing a rapid transformation in recent past owing to the enhanced role of renewables and enhanced data-driven models making the value chain smarter. In the context of the primary constituents of this sector comprising of coal, power, renewables, solar energy, oil, and gas, there is a huge role AI can play.

We illustrate some key use cases below:

1. Smart Grid

The biggest disruption in power in recent times is in the smart grid…

Continue

Added by Mahesh Kumar CV on May 30, 2019 at 5:02am — No Comments

Basic Statistics Concepts Every Data Scientist Should know

Introduction

Data science is a multidisciplinary blend of data inference, algorithm development, and technology in order to solve analytically complex problems. At the core is data. Troves of raw information, streaming in and stored in enterprise data warehouses. Much to learn by mining it. Advanced capabilities we can build with it. Data science is ultimately about using this data in creative ways to generate business value

The broader fields of understanding what data…

Continue

Added by Divya Singh on May 29, 2019 at 8:00pm — No Comments

Build Your Intelligent Enterprise through a Data Fabric

The future offers interesting and exciting times ahead for most businesses. With data being a big influencer in the enterprise of the future, it is a matter of time before we jump into the era of intelligent enterprises.

Intelligent enterprises are going to be…

Continue

Added by Ronald van Loon on May 29, 2019 at 7:26pm — No Comments

6 Important Steps to Building a Successful Factory of the Future

What is the factory of the future? Is it a synonym to Industry 4.0, or is it a different concept in its own right? Industry 4.0 and the factory of the future might sound similar, but they are different in some ways. To begin with, the factory of the future is an elusive concept that isn’t as common as Industry 4.0.

The factory of the future is…

Continue

Added by Ronald van Loon on May 29, 2019 at 6:30pm — No Comments

Data science Coding in a weekend series of books …

After testing this idea for the last few months, we have formally launched this concept

 

The idea of ‘Data Science Coding in a weekend’ originated from meetups we conducted in London

 

The idea is simple but effective

 

We choose a complex section of code and try to learn it in detail over…

Continue

Added by ajit jaokar on May 29, 2019 at 7:52am — No Comments

10 Areas of Expertise in Data Science

The analytics market is booming, and so is the use of the keyword – Data Science. Professionals from different disciplines are using data in their day to day activities, and feel the need to master the start-of-the-art technology in order to get maximum insights from the data, and subsequently help the business to grow.

Moreover, there are professionals who want to keep them updated with this latest skills such as Machine Learning, Deep Learning, Data Science, and so either to elevate…

Continue

Added by Divya Singh on May 28, 2019 at 10:19pm — No Comments

Simple Trick to Remove Serial Correlation in Regression Models

Here is a simple trick that can solve a lot of problems.

You can not trust a linear or logistic regression performed on data if the error term (residuals) are auto-correlated. There are different approaches to de-correlate the observations, but they usually involve introducing a new matrix to take care of the resulting bias. See for instance here.  …

Continue

Added by Vincent Granville on May 28, 2019 at 9:30am — No Comments

Top AI algorithms for Healthcare



The …

Continue

Added by Max Ved on May 27, 2019 at 11:02pm — No Comments

Gentle Approach to Linear Algebra, with Machine Learning Applications

This simple introduction to matrix theory offers a refreshing perspective on the subject. Using a basic concept that leads to a simple formula for the power of a matrix, we see how it can solve time series, Markov chains, linear regression, data reduction, principal components analysis (PCA) and other machine learning problems. These problems are usually solved with more advanced matrix calculus, including eigenvalues, diagonalization, generalized inverse matrices, and other types of matrix…

Continue

Added by Vincent Granville on May 27, 2019 at 2:00pm — 1 Comment

Real Time Computer Vision is Likely to be the Next Killer App but We’re Going to Need New Chips

Summary:  Real Time Computer Vision (RTCV) that requires processing video DNNs at the edge is likely to be the next killer app that powers a renewed love affair with our mobile devices.  The problem is that current GPUs won’t cut it and we have to wait once again for the hardware to catch up.

 

 The entire…

Continue

Added by William Vorhies on May 27, 2019 at 8:47am — 1 Comment

Questions to ask while implementing Anomaly detection system

A.I. based automated Anomaly detection system is gaining popularity nowadays due to the increase in data generated from various devices and the increase in ever evolving sophisticated threats from hackers etc. Anomaly detection systems can be applied across various business scenarios like monitoring financial transactions of a fintech company, highlighting fraudulent activities in a network, e-commerce price glitches among millions of products, and so on. Anomaly detection system can work…

Continue

Added by Avinash Udaykumar on May 27, 2019 at 2:30am — No Comments

Web Scraping

All the data we need today is already available on the internet, which is great news for data scientists. The only barrier to using this data is the ability to access it. There are some platforms that even include APIs (such as Twitter) that support data collection from web pages, but it is not possible to crawl most web pages using this advantage..

This…

Continue

Added by Packt Publishing on May 26, 2019 at 7:57pm — No Comments

Azure Machine Learning concepts - an Introduction

Azure Machine Learning concepts - an Introduction

Introduction

Last week, we launched a free book called Classification and Regression in a weekend. The idea of the ‘in a weekend’ series of books is to study one complex section of code in a weekend to master the concept. This week. we plan to launch a book called “An…

Continue

Added by ajit jaokar on May 26, 2019 at 10:00am — No Comments

Data Science Central Monday Digest, May 27

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this…

Continue

Added by Vincent Granville on May 26, 2019 at 8:30am — No Comments

Create Transformed, N-Dimensional Polygons with Covariance Matrix

The covariance matrix has many interesting properties, and it can be found in mixture models, component analysis, Kalman filters, and more. Developing an intuition for how the covariance matrix operates is useful in understanding its practical implications. This article will focus on a few important properties, associated proofs, and then some interesting practical applications, i.e., extracting transformed polygons from a Gaussian mixture's covariance matrix.

I have often found that…

Continue

Added by Rohan Kotwani on May 26, 2019 at 7:30am — No Comments

Data Science Jobs - eBook Summary

As a senior datascience professional and analytics manager, I get countless requests for job search advice, resume feedback and heart-breaking stories from brilliant students who are unable to snag a job in this exciting field. There are tons of books on how to learn the skills to become a data scientist/ data analyst, but none to prepare folks for the frustrating job search.

I've repeated this advice to dozens of people, most of whom found their dream datascience job with…

Continue

Added by Ann Rajaram on May 25, 2019 at 12:00pm — No Comments

Cross Validation in One Picture

Cross Validation explained in one simple picture. The method shown here is k-fold cross validation, where data is split into k folds (in this example, 5 folds). Blue balls represent training data; 1/k (i.e. 1/5) balls are held back for model testing.

Monte Carlo cross validation works the same way, except that the balls would be chosen with replacement. In other words, it would be possible for a ball to appear in more than one sample.…

Continue

Added by Stephanie Glen on May 25, 2019 at 8:30am — No Comments

Featured Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service