All Blog Posts (6,993)

How to combine all csv files from the same folder into one data frame automatically with R?



Have you ever experienced a situation where you want to import and combine hundreds of datasets? Doing this manually takes too much time. A simple R program, on the other hand, can solve the problem easily and quickly. Therefore, I would like to share two methods of combining all small files in a directory into a single dataset…
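As a rough illustration of the idea in this teaser, here is a minimal Python sketch (standard library only) that reads every CSV file in a folder and stacks the rows into one table. It is not necessarily either of the two R methods the post describes; the function name `combine_csv_files`, the `source_file` column, and the sample files are assumptions made up for this example.

```python
import csv
import tempfile
from pathlib import Path


def combine_csv_files(folder):
    """Read every *.csv file in `folder` and return all rows as one list of dicts."""
    combined = []
    # Sort the paths so the row order is reproducible across runs.
    for path in sorted(Path(folder).glob("*.csv")):
        with path.open(newline="") as handle:
            for row in csv.DictReader(handle):
                # Hypothetical extra column: remember which file each row came from.
                row["source_file"] = path.name
                combined.append(row)
    return combined


# Usage with two tiny hypothetical files written to a temporary folder.
with tempfile.TemporaryDirectory() as folder:
    Path(folder, "a.csv").write_text("id,value\n1,10\n2,20\n")
    Path(folder, "b.csv").write_text("id,value\n3,30\n")
    rows = combine_csv_files(folder)
    print(len(rows), "rows combined")
```

The sketch assumes all files share the same header. In R, a comparable pattern would list the files with `list.files()` and bind the results of `read.csv()` together.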

Added by Michaelino Mervisiano on March 23, 2020 at 4:00am — 1 Comment

Machine Learning at the Edge

Edge computing moves workloads from centralized locations to remote locations, and it can provide faster responses from AI applications. Edge computing devices are increasingly being deployed for monitoring and control of real-world processes such as people tracking, vehicle recognition, and pollution monitoring. The data collected at the devices is transported to centralized cloud servers over data pipelines and is used to train machine learning models. Training models needs lot…

Added by Janardhanan PS on March 23, 2020 at 12:18am — No Comments

Extract Data from Website to Excel Automatically

To extract data from websites, you can take advantage of data extraction tools like Octoparse. These tools can pull data from websites automatically and save it in many formats, such as Excel, JSON, CSV, or HTML, or send it to your own database via APIs. It only takes a few minutes to extract thousands of lines of data, and the best part is…

Added by Erika Foo on March 22, 2020 at 9:30pm — No Comments

An introduction to Statistical Inference and Hypothesis testing

Inference

In a previous blog (The difference between statistics and data science), I discussed the significance of statistical inference. In this section, we expand on these ideas.

The goal of statistical inference is to make a statement about something that is not observed within a certain level of…

Added by ajit jaokar on March 22, 2020 at 2:22pm — No Comments

Weekly Digest, March 23

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  …

Added by Vincent Granville on March 22, 2020 at 10:00am — No Comments

Quantitative Reductionism and the Pandemic of COVID-19

The COVID-19 pandemic has caused organizations to find different ways to maintain their operations amid calls for increased social distance. In some cases, employees have been asked to work from home in order to reduce the amount of social interaction in the workplace. Due to this emerging distance between these workers and their employers, I suggest that quantitative reduction will become more likely: for these employees, performance will be increasingly unrelated to their social presence…

Added by Don Philip Faithful on March 22, 2020 at 4:57am — No Comments

Speaker Diversity Analytics with the Microsoft AI Face API

How do we increase diversity amongst our event speakers?  To improve something, we must first measure it.  The Face API allows us to collect some of these demographics from past events that we may not otherwise be able to collect, which may allow us greater insight into how we can improve these trends.

The Facial Recognition API is part of the Microsoft AI Cognitive Services suite.  The Face service detects human faces in an image and returns the rectangle coordinates of their…

Added by alicia moniz on March 21, 2020 at 9:30am — No Comments

Statistical Uncertainty: Why are Covid-19 Figures so Varied?

If you've been keeping up on the statistics for Covid-19 in the last week (and who hasn't?), you've probably noticed a wide variety of projections for deaths in the United States, ranging from the "best-case" scenario  (327 people) to the "doomsday" figure (2.2 million). Recent statistics published include:

  • 327 to 1.6 million (Former CDC director Tom Frieden, cited in the…

Added by Stephanie Glen on March 21, 2020 at 8:10am — 1 Comment

COVID-19 Forecast: China as a Reference

With the onset of the COVID-19 outbreak, we look at the data and use a simple model inspired by China’s example to predict when the outbreak will abate in various countries.

Forecasts: https://www.codoma.tech/blog/covid19-forecast/ (updated daily)

The forecasts tell us that we, every human being, need to keep following the measures (social distancing,…

Added by Mohamed A. Maksoud on March 20, 2020 at 11:30am — 4 Comments

Reasons to file for a patent, even if your employer does not offer patent bonuses.

What is the Measure of a Successful Patent

What passes for wisdom from a (modestly) successful “Patent Ace”…

Added by Russell W Anderson on March 20, 2020 at 11:00am — No Comments

How Technology tops HR trends in 2020

The world of today is digital! Every business, every person across the world has felt the impact of a digital transformation, and the field of HR has been no exception to this rule.

The theme of HR technology trends of late has been the force of digitalization: digitally-forward initiatives, strategies, and tools are taking the place of traditional practices of HR…

Added by Yoey Thamas on March 19, 2020 at 11:36pm — No Comments

Thursday News, March 19

Here is our selection of featured resources and articles posted since Monday.

Resources

Added by Vincent Granville on March 19, 2020 at 12:30pm — No Comments

Relevancy Modeling with Symbolic AI

In times of crisis, everyone scrambles for the latest news, looking for signals of change in order to make decisions that mitigate risk. The average person uses common sense to draw inferences about relevancy. For example, as the coronavirus crisis evolved, many decided to hoard bathroom tissue. As much as this is common sense for many, it simply does not make sense. However, when it is evaluated against aggregated common sense, we can eliminate many of these signals.

In financial modeling, analyst…

Added by Sing Koo on March 18, 2020 at 9:00am — No Comments

Time Series and Decentralized Networks - Upcoming free webinar

Date: 3/24/2020

Time: 9:00 AM PDT

Duration: 60 Minutes

Summary…

Added by Vincent Granville on March 18, 2020 at 8:49am — No Comments

5 Most Important Skills of a Data Scientist

Data scientist was dubbed the sexiest job of the 21st century, and with good reason. In LinkedIn's 2020 Emerging Jobs Report, artificial intelligence was named one of the ‘Jobs of Tomorrow’ due to its strong presence. Furthermore, the potential application of data science in multiple industries has attracted people from all…

Added by Angelia Toh Choon Muay on March 18, 2020 at 5:30am — 1 Comment

The Role of VDR in Enterprise Data Sharing

With gigantic amounts of data regarding products and/or services, sales, marketing, customers, etc. exchanging hands on a daily basis, data sharing is essential for an enterprise. Regardless of whether data is distributed within different parts of the organization or with other entities, doing so in a secure fashion is critical for carrying out an effective and efficient digital operation. 

In other words -…

Added by Joydeep Bhattacharya on March 18, 2020 at 3:00am — No Comments

Understanding Covid-19 Statistics (in plain English)

My original intent with this article was to write about how to understand statistics in general. However, with the global pandemic on everyone's minds right now, it seems blithe to write an article on understanding statistics without a nod to current events. If you're uncomfortable or unfamiliar with statistics, you might find the facts and figures surrounding Covid-19 hard to decipher. Let's break down the key statistics into plain English and shed a little light on a few…

Added by Stephanie Glen on March 17, 2020 at 6:30am — 5 Comments

ACID to BASE Transformation

In chemistry, we have strong and weak ACIDs. There are also strong and weak BASEs. In Relational Database Management Systems (RDBMS), ACID stands for Atomicity, Consistency, Isolation and Durability. The volume of ACID transactions handled by RDBMS has undergone a big data transition with the evolution of mobile devices. SQL stands for Structured Query Language, but commonly refers to traditional RDBMS. An RDBMS is not designed to be distributed, and it is optimized for space, not for speed of access.…
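To make the "A" in ACID concrete, here is a small Python sketch using the standard-library `sqlite3` module. It is only an illustration of atomicity in a relational database, not code from the post; the `accounts` table, the `transfer` helper, and the sample balances are hypothetical.

```python
import sqlite3

# In-memory database with a hypothetical accounts table.
# The CHECK constraint forbids negative balances.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER NOT NULL CHECK (balance >= 0))"
)
conn.executemany(
    "INSERT INTO accounts (name, balance) VALUES (?, ?)",
    [("alice", 100), ("bob", 50)],
)
conn.commit()


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst as one atomic transaction."""
    try:
        # The connection context manager commits if the block succeeds
        # and rolls back if an exception escapes it.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The debit violated the CHECK constraint, so the credit was undone too.
        return False


ok = transfer(conn, "alice", "bob", 30)       # succeeds
failed = transfer(conn, "alice", "bob", 500)  # would overdraw alice, so it is rolled back
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
print(ok, failed, balances)
```

In the failing transfer the credit to `bob` runs first and is then rolled back when the debit fails, so no partial update is ever left in the table.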

Added by Janardhanan PS on March 16, 2020 at 9:16pm — No Comments

Mid-Career Switching into Data Science and Data Engineering

Summary: Mid-career switching into data science is difficult. But if you already have substantial database experience, becoming a Data Engineer can be a great career by itself, or even the first stepping stone to becoming a Data Scientist.

One of the most common requests we get is for advice on mid-career…

Added by William Vorhies on March 16, 2020 at 11:26am — No Comments

Best Practices in Data Migration & Cleansing

In developing global business practices, many corporate houses are adopting better technological solutions and systems. This mandates smooth data migration from older to newer systems while maintaining data integrity. Many organizations face a tough time in migration initiatives due to complicated and sluggish procedures owing to IT issues.

Most data migration efforts are challenging as well as time consuming due to the inherent nature of such an undertaking. When moving from one…

Added by Shweta Bedi on March 16, 2020 at 1:32am — No Comments

© 2020 Data Science Central ®