Subscribe to DSC Newsletter

Featured Blog Posts – June 2020 Archive (52)

23 sources of data bias for #machinelearning and #deeplearning

In the paper A survey on bias and fairness in machine learning.- the authors outline 23 types of bias in data for machinelearning. The source is good – so below is an actual representation because I found it useful as it is

full paper link below

 

1) Historical Bias. Historical bias is the already…

Continue

Added by ajit jaokar on June 30, 2020 at 12:02pm — 1 Comment

Racial Bias in Modeling – Not as Simple as You Might Think

Summary:  Bias in modeling has long been a public concern that is now amplified and focused on the disparate treatment models may cause for African Americans.  Defining and correcting the bias presents difficult issues for data scientists that need to be carefully thought through before reaching conclusions.

 …

Continue

Added by William Vorhies on June 29, 2020 at 11:31am — No Comments

The other key slide of your quarterly analysis

Profit analysis is an integral part of analysis drills, which are conducted at regular frequency. Both finance and marketing departments are equally engaged in this activity — though both have a different approach to it. Variance…

Continue

Added by saurabh ajmera on June 29, 2020 at 6:00am — No Comments

Weekly Digest, June 29

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Continue

Added by Vincent Granville on June 28, 2020 at 12:00pm — No Comments

QC Paulinesia, !(yet another book about quantum computing)

Today I uploaded to github the first version of my book “QC Paulinesia” about Quantum Computing. The title is a play on the surname of the famous quantum scientist Wolfgang Pauli and the word "Polynesia". The book is based on a paper with the same name that I wrote in 2004. 

Here is  its github repo. Here is the pdf for the book. Let me quote the current…

Continue

Added by Robert R. Tucci on June 27, 2020 at 1:00pm — No Comments

IoT Anomaly detection - algorithms, techniques and open source implementation

Background

Anomaly detection for IoT is one of the archetypal applications for IoT.

Anomaly detection techniques are also used outside of IoT.

In my teaching at the #universityofoxford - we use anomaly detection as a use case because it brings together many of…

Continue

Added by ajit jaokar on June 26, 2020 at 10:00am — 1 Comment

Getting Historical Data from Yahoo Finance in R

Yahoo Finance (below) is a website that gives us a good deal of information about the financial market, including information about stock trading.

To get the numbers for a specific stock, all you have to do is to search for its Yahoo code (ex: VALE - see figure below). This is not necessarily the company’s…

Continue

Added by Norton Trevisan Roman on June 26, 2020 at 10:00am — No Comments

Using Neural Networks to Predict Cimate Change, Droughts, and Conflict Displacements

The Problem

Millions of people are forced to leave their current area of residence or community due to resource shortage and natural disasters such as droughts, floods. Our project partner, UNHCR, provides assistance and protection for those who are forcibly displaced inside Somalia.

The goal of this project was to create a solution that quantifies…

Continue

Added by Omdena Community Members on June 25, 2020 at 3:30am — No Comments

Generative Adversarial Networks (GANs) & Bayesian Networks

Generative Adversarial Networks (GANs) software is software for producing forgeries and imitations of data (aka synthetic data, fake data). Human beings have been making fakes, with good or evil intent, of almost everything they possibly can, since the beginning of the human race. Thus, perhaps not too surprisingly, GAN software has been widely used since it was first proposed in this amazingly recent 2014 paper. To gauge how widely GAN software has been used so far, see, for example, this…

Continue

Added by Robert R. Tucci on June 24, 2020 at 11:30pm — No Comments

Creating Music by Machine Learning

© 2020 by Radna Intellectual Ventures, LLC.

In Sweden, there's something called the Allemansrätten, which literally translates…

Continue

Added by Andrew Vo on June 24, 2020 at 12:00pm — 1 Comment

Introduction to Gradient Decent

The gradient decent approach is used in many algorithms to minimize loss functions. In this introduction we will see how exactly a gradient descent works. In addition, some special features will be pointed out. We will be guided by a practical example.…

Continue

Added by Frank Raulf on June 24, 2020 at 7:00am — No Comments

Eight ways in which data science is helping in the fight against COVID19

Given the scale of its impact and the kind of alteration that it brought into our lives, COVID19 is one of the most unprecedented crises of our times. Although it is not the only pandemic that humanity has been through, COVID19 is occurring in the time of the fourth industrial revolution where everyone and everything is one click way, and where the excess of data and computing has allowed machines to be more intelligent than ever. In the age of deep tech and data, data science is definitely…

Continue

Added by Ghida Ibrahim on June 24, 2020 at 6:30am — 1 Comment

Free book – for #datascience interviews - Guide to competitive programming

Recently Springer made some good books on maths free to download.

Competitive programming strategies are useful for many data science interviews and they help to improve your maths foundations.  There are not many books on this subject (although there are many good websites and YouTube resources).

So, I hope you find this book…

Continue

Added by ajit jaokar on June 23, 2020 at 12:00pm — 5 Comments

External Data Sources Are More Critical Than Ever in Supply Chain Management

External Data Sources Are More Critical Than Ever in Supply Chain Management.

Your company's historical data are not contextual enough for our COVID world.

Danillo Pereira, CAO

June 22, 2020

The…

Continue

Added by Betsy Romeri on June 22, 2020 at 9:49am — No Comments

Weekly Digest, June 22

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Continue

Added by Vincent Granville on June 21, 2020 at 2:00pm — No Comments

Simple Correlational Analysis on Socioeconomic Factors Impacting Covid-19 Outbreak in US Counties

Background

As part of my PhD work, I recently had to analyze any dataset(s) of my interest and present findings.  I ended up conducting a study on US County-wise Covid-19 data.  I wanted to share my key findings through this blog.

Study Question

The primary question I wanted to address through data analysis was “Do counties’ socioeconomic factors such as population size, poverty rate, unemployment rate, education percent and…

Continue

Added by Murali Kashaboina on June 20, 2020 at 3:16pm — 2 Comments

Enhancing Satellite Imagery Through Super-Resolution

The power of deep learning paired with collaborative human intelligence to increase the quality of crop cultivation imagery through super resolution.

 

The Problem

The focus in this article lies in…

Continue

Added by Omdena Community Members on June 20, 2020 at 9:00am — No Comments

Herodotus Event Data Object - Glimpse Inside

I dropped out of university as a literature major in the 1980s.  (I needed time to find myself.)  Before I left, I read The Histories by the ancient Greek historian Herodotus.  I decided to name my new data object the Herodotus Event Data Object or HEDO.  A number of HEDOs can be stitched together to form codified narrative.  Those who follow my blogs might recall that I previously developed a system of codified narrative called BERLIN.  HEDOs represent my reinvention of BERLIN.  I…

Continue

Added by Don Philip Faithful on June 19, 2020 at 1:41pm — No Comments

A Brief Summary of the Science of Data Analytics

Data Science and Data Analytics are two most common terminologies in today’s data driven world. How a business uses big data to make key business decisions is critical to the future success of the firm. Data is collected into raw form and processed according to the requirement of a company and then this data is utilized for the decision making purpose. This process helps the businesses to grow & expand their operations in the market. Data or information is typically in raw format. The… Continue

Added by Imran Wyne on June 19, 2020 at 7:55am — No Comments

Misleading Graphs Part 2: Ladders, Spaghetti, and Other Ways to Ruin a Graph

If you've spent any time with modeling data, you'll know that there are many pitfalls to be had when it comes to data presentation (I addressed some common pitfalls in Misleading Graphs Part 1). Misleading graphs can be the result of incorrect data collection, ignorance of the basic "rules" of data presentation (like labeling axes), or even deliberate attempts to mislead. A fourth…

Continue

Added by Stephanie Glen on June 18, 2020 at 6:00am — No Comments

Featured Monthly Archives

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2020   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service