Featured Blog Posts – March 2020 Archive (60)

How to combine all csv files from the same folder into one data frame automatically with R?



Have you ever experienced a situation where you want to import and combine hundreds of datasets? Doing this manually takes too much time. A simple R program, on the other hand, solves the problem easily and quickly. Therefore, I would like to share two methods of combining all small files in a directory into a single dataset…

Continue

Added by Michaelino Mervisiano on March 23, 2020 at 4:00am — 1 Comment
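
The idea in the teaser above can be sketched in a few lines. The post itself uses R and its two methods are not shown in this excerpt, so the following is only a Python analog under stated assumptions: every file in the folder has a header row, and the helper name `combine_csv_files` and the extra `source_file` column are hypothetical, chosen for illustration.

```python
import csv
from pathlib import Path


def combine_csv_files(folder):
    """Read every *.csv file in `folder` and return (header, rows).

    Hypothetical helper: assumes each file has a header row. Columns
    missing from a given file are filled with empty strings.
    """
    header = []  # union of column names, in first-seen order
    rows = []    # one dict per data row, across all files
    for path in sorted(Path(folder).glob("*.csv")):
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle)
            for name in reader.fieldnames or []:
                if name not in header:
                    header.append(name)
            for row in reader:
                # Remember which file each row came from (illustrative column).
                row["source_file"] = path.name
                rows.append(row)
    if rows and "source_file" not in header:
        header.append("source_file")
    # Fill gaps so that every row carries every column.
    rows = [{name: row.get(name, "") for name in header} for row in rows]
    return header, rows
```

Files are read in sorted name order so that the combined result is reproducible from run to run.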

Machine Learning at the Edge

Edge computing moves workloads from centralized locations to remote locations, and it can provide faster responses from AI applications. Edge computing devices are increasingly deployed for monitoring and control of real-world processes such as people tracking, vehicle recognition, and pollution monitoring. The data collected at the devices is transported to centralized cloud servers over data pipelines and is used to train machine learning models. Training models needs lot…

Continue

Added by Janardhanan PS on March 23, 2020 at 12:18am — No Comments

An introduction to Statistical Inference and Hypothesis testing

Inference

In a previous blog (The difference between statistics and data science), I discussed the significance of statistical inference. In this section, we expand on these ideas.

The goal of statistical inference is to make a statement about something that is not observed within a certain level of…

Continue

Added by ajit jaokar on March 22, 2020 at 2:22pm — No Comments

Weekly Digest, March 23

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Continue

Added by Vincent Granville on March 22, 2020 at 10:00am — No Comments

Quantitative Reductionism and the Pandemic of COVID-19

The COVID-19 pandemic has caused organizations to find different ways to maintain their operations amid calls for increased social distance. In some cases, employees have been asked to work from home in order to reduce the amount of social interaction in the workplace. Due to this emerging distance between these workers and their employers, I suggest that quantitative reduction will become more likely: for these employees, performance will be increasingly unrelated to their social presence…

Continue

Added by Don Philip Faithful on March 22, 2020 at 4:57am — No Comments

Speaker Diversity Analytics with the Microsoft AI Face API

How do we increase diversity amongst our event speakers?  To improve something, we must first measure it.  The Face API allows us to collect some of these demographics from past events that we may not otherwise be able to collect, which may allow us greater insight into how we can improve these trends.

The Facial Recognition API is part of the Microsoft AI Cognitive Services suite.  The Face service detects human faces in an image and returns the rectangle coordinates of their…

Continue

Added by alicia moniz on March 21, 2020 at 9:30am — No Comments

Statistical Uncertainty: Why are Covid-19 Figures so Varied?

If you've been keeping up on the statistics for Covid-19 in the last week (and who hasn't?), you've probably noticed a wide variety of projections for deaths in the United States, ranging from the "best-case" scenario (327 people) to the "doomsday" figure (2.2 million). Recent statistics published include:

  • 327 to 1.6 million (Former CDC director Tom Frieden, cited in the…

Continue

Added by Stephanie Glen on March 21, 2020 at 8:00am — 1 Comment

COVID-19 Forecast: China as a Reference

With the onset of the COVID-19 outbreak, we look at the data and use a simple model inspired by China’s example to predict when the outbreak will abate in various countries.

Forecasts: https://www.codoma.tech/blog/covid19-forecast/ (updated daily)

The forecasts tell us that we, every human being, need to keep following the measures (social distancing,…

Continue

Added by Mohamed A. Maksoud on March 20, 2020 at 11:30am — 4 Comments

Reasons to file for a patent, even if your employer does not offer patent bonuses.

What is the Measure of a Successful Patent

What passes for wisdom from a (modestly) successful “Patent Ace”…

Continue

Added by Russell W Anderson on March 20, 2020 at 11:00am — No Comments

Relevancy Modeling with Symbolic AI

In a time of crisis, everyone scrambles for the latest news, looking for signals of change on which to base decisions that mitigate risk. The average person uses common sense to infer relevancy. For example, as the coronavirus crisis evolved, many people decided to hoard bathroom tissue. As much as this is common sense for many, it simply does not make sense. However, when such behavior is evaluated against aggregated common sense, we can eliminate many of these signals.

In financial modeling, analyst…

Continue

Added by Sing Koo on March 18, 2020 at 9:00am — No Comments

Time Series and Decentralized Networks - Upcoming free webinar

Date: 3/24/2020

Time: 9:00 AM PDT

Duration: 60 Minutes

Summary…

Continue

Added by Vincent Granville on March 18, 2020 at 8:49am — No Comments

5 Most Important Skills of a Data Scientist

Data scientist was dubbed the sexiest job of the 21st century, and with good reason. In LinkedIn's 2020 Emerging Jobs Report, artificial intelligence was named one of the ‘Jobs of Tomorrow’ due to its strong presence. Furthermore, the potential application of data science in multiple industries has attracted people from all…

Continue

Added by Angelia Toh Choon Muay on March 18, 2020 at 5:30am — 1 Comment

The Role of VDR in Enterprise Data Sharing

With gigantic amounts of data regarding products and/or services, sales, marketing, customers, etc. changing hands on a daily basis, data sharing is essential for an enterprise. Regardless of whether data is distributed within different parts of the organization or with other entities, doing so in a secure fashion is critical for carrying out an effective and efficient digital operation.

In other words -…

Continue

Added by Joydeep Bhattacharya on March 18, 2020 at 3:00am — No Comments

ACID to BASE Transformation

In chemistry, we have strong and weak ACIDs. There are strong and weak BASEs as well. In Relational Database Management Systems (RDBMS), ACID stands for Atomicity, Consistency, Isolation and Durability. The volume of ACID transactions handled by RDBMS has undergone a big data transition with the evolution of mobile devices. SQL stands for Structured Query Language, but commonly refers to traditional RDBMS. An RDBMS is not designed to be distributed, and it is optimized for space, not for speed of access.…

Continue

Added by Janardhanan PS on March 16, 2020 at 9:16pm — No Comments
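
To make the "Atomicity" in ACID concrete, here is a minimal sketch using Python's built-in `sqlite3` module. It is not taken from the post: the `accounts` table, the account names, and the `transfer` and `demo` helpers are hypothetical and serve only to show that a failed transaction leaves no partial update behind.

```python
import sqlite3


def transfer(conn, src, dst, amount):
    """Move `amount` from `src` to `dst` as one atomic transaction."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            (balance,) = conn.execute(
                "SELECT balance FROM accounts WHERE name = ?", (src,)
            ).fetchone()
            if balance < 0:
                # Abort: both UPDATE statements above are undone together.
                raise ValueError("insufficient funds")
        return True
    except ValueError:
        return False


def demo():
    """Run one valid and one invalid transfer on an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
    conn.executemany(
        "INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 0)]
    )
    conn.commit()
    ok = transfer(conn, "alice", "bob", 30)       # succeeds and is committed
    failed = transfer(conn, "alice", "bob", 500)  # overdraws, so it is rolled back
    balances = dict(conn.execute("SELECT name, balance FROM accounts"))
    return ok, failed, balances
```

After `demo()` runs, the second transfer has left both balances exactly as the first transfer committed them, which is the all-or-nothing behavior the "A" in ACID refers to.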

Mid-Career Switching into Data Science and Data Engineering

Summary: Mid-career switching into data science is difficult. But if you already have substantial database experience, becoming a Data Engineer can be a great career in itself, or even the first stepping stone to becoming a Data Scientist.

One of the most common requests we get is for advice on mid-career…

Continue

Added by William Vorhies on March 16, 2020 at 11:26am — No Comments

Weekly Digest, March 16

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Continue

Added by Vincent Granville on March 15, 2020 at 4:30pm — No Comments

COVID 19 - How AI and next generation manufacturing technologies (FMS) can lead to more local manufacturing jobs if we invest in educating the workforce to work with AI

Introduction

Due to COVID19, we already have 1000 deaths in Italy.

Two months ago, these folks, many of them elderly, would have been celebrating Christmas and looking forward to the next Christmas – knowing that northern Italy has one of the best healthcare systems in the world.

But apart from the human cost, the…

Continue

Added by ajit jaokar on March 14, 2020 at 1:00pm — No Comments

What are Data Pipelines?

If you have studied the temporal parallelism used to speed up CPU execution, you have come across instruction pipelines, also known as pipeline processing. In pipeline processing, many instructions are in different stages of execution at the same time. The term "Data Pipeline" is a misnomer for a high-bandwidth communication channel used to transport data between a source system and a destination. In certain cases the destination is called a sink. Pipelines by definition allow flow of a fluid…

Continue

Added by Janardhanan PS on March 14, 2020 at 6:51am — No Comments

Formalizing the Field of Data Engineering

Much like we have Chemical Engineering, Electrical Engineering and Mechanical Engineering, it is time to formalize the field of Data Engineering. This is a special two-part series on trends and requirements leading to the formalization of the field of Data Engineering.

“Data is the new oil…in much the same way that oil fueled economic growth in the 20th century, data will fuel economic growth in the 21st century.”

To further raise…

Continue

Added by Bill Schmarzo on March 14, 2020 at 5:00am — No Comments

Data Science Lessons from The Coronavirus

As I write this blog, we are still in the early stages of the coronavirus crisis.  It is a scary situation which has caused hoarding, panic, fake news, lies, stock market turmoil and irrational behaviors.  All indications suggest we will survive this crisis, but it is truly not one of mankind’s best moments.

But learnings abound in…

Continue

Added by Bill Schmarzo on March 13, 2020 at 5:30am — No Comments

© 2020 Data Science Central ®