Subscribe to DSC Newsletter

Featured Blog Posts – September 2019 Archive (61)

Significance Level vs Confidence level vs Confidence Interval

You may have figured out already that statistics isn't exactly a science. Lots of terms are open to interpretation, and sometimes there are many words that mean the same thing—like "mean" and "average"—or sound like they should mean the same thing, like significance level and confidence level.  

Although they sound very similar, significance level and confidence level are in fact two completely different concepts. Confidence levels and confidence…

Continue

Added by Stephanie Glen on September 30, 2019 at 12:00pm — No Comments

Correlation does not equal causation but How exactly do you determine causation?

 

 

Introduction

 

Co-relation does not equal causation – is a mantra drilled into a Data Scientist from an early age

That’s fine ..

But very few talk of the follow-on question ..

How exactly do you determine causation?

This problem is…

Continue

Added by ajit jaokar on September 30, 2019 at 6:33am — 2 Comments

Design Principles for Big Data Performance

The evolution of the technologies in Big Data in the last 20 years has presented a history of battles with growing data volume. The challenge of big data has not been solved yet, and the effort will certainly continue, with the data volume continuing to grow in the coming years. The original relational database system (RDBMS) and the associated OLTP  (Online Transaction Processing) make it so easy to work with data using SQL in all aspects, as long as the data size is small enough to…
Continue

Added by Stephanie Shen on September 29, 2019 at 4:00pm — 1 Comment

Creating a slicer that filters multiple columns in Power BI

Power BI provides slicers for a single column, but there are scenarios where it could be useful to consolidate alternative filters for multiple columns in a single slicer. Technically, this is not possible in Power BI through the standard visualizations, but you can use a particular data modeling technique to obtain the desired result.

Consider the case of a Customer table with a geographical hierarchy with ContinentCountry,…

Continue

Added by Adetayo Kolade on September 29, 2019 at 10:30am — No Comments

Weekly Digest, September 30

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Announcements

  • Earn a Data…
Continue

Added by Vincent Granville on September 29, 2019 at 7:00am — No Comments

Using a Bathroom Faucet to Teach Neural Network Basic Concepts

They say that the best ideas sometimes come to you while you are in the shower, and this idea of how to explain two important Neural Network concepts – Backpropagation and Stochastic Gradient Descent – actually did come to me as I was trying to set the perfect water temperature for my morning shower.

As I was struggling to adjust the two shower handles – one handle that controlled scolding hot and the other handle that controlled flash freezing – it occurred to me that I was a simple…

Continue

Added by Bill Schmarzo on September 29, 2019 at 4:29am — 6 Comments

Discover How IoT Escalates Vehicle Fleet Safety.

With intense urbanization, the transportation industry works round the clock to suffice the demands of…

Continue

Added by Sanjeev Verma on September 27, 2019 at 1:30am — No Comments

TensorFlow 1.x vs 2.x. – summary of changes

Overview of changes TensorFlow 1.0 vs TensorFlow 2.0

Earlier this year, Google announced TensorFlow 2.0, it is a major leap from the existing TensorFlow 1.0. The key differences are as follows:

 

Ease of use: Many old libraries (example tf.contrib) were removed, and some consolidated. For example, in TensorFlow1.x the model could be made using Contrib, layers, Keras or estimators, so many options for the same task confused many new users.…

Continue

Added by ajit jaokar on September 25, 2019 at 11:30pm — 2 Comments

How Fog Computing is changing the BigData paradigm for IoT device?

The new era of BigData and advances in technology have made significant transitions towards the high functionality of IoT devices. The popularity of IoT devices has led to more easier methods for BigData collection, analysis, and distribution at a rapid rate. According to a report by…

Continue

Added by Smith Johnson on September 25, 2019 at 7:30pm — No Comments

Bayesian Machine Learning

Bayesian Machine Learning (part -6)

Probabilistic Clustering – Gaussian Mixture Model

Continuing our discussion on probabilistically clustering of our data, where we left out discussion on part 4 of our Bayesian inference series. As we have seen the modelling theory of…

Continue

Added by Ashutosh vyas on September 25, 2019 at 9:30am — No Comments

Scraping eBay using BeautifulSoup in Python

Even though Amazon is the leader in e-commerce marketplaces – eBay still has its fair share in the online retail industry. Brands selling online should be monitoring prices on eBay as well to gain a competitive advantage. 

Extracting data from eBay at a huge scale regularly is a challenging problem for data scientists. Here is an example of scraping eBay using python to identify prices of mobile phones. 

Lets us imagine a use case where you need…

Continue

Added by Sandra Moraes on September 24, 2019 at 8:00pm — No Comments

Top 12 Data Science Use Cases in Government

Introduction

Big data analytics has been applied to different spheres of human life. One of the best features of data analytics is its adaptability and wide application specter. We have come through the whole series of articles concerning data science application in various spheres that are…

Continue

Added by Igor Bobriakov on September 24, 2019 at 6:31am — No Comments

Using Machine Learning to Solve Business Problems

This article was written by Soft Media Lab.

It has the following sections.

Contents

  • What is Machine Learning?
  • How to apply machine learning to…
Continue

Added by Andrea Manero-Bastin on September 24, 2019 at 6:30am — No Comments

Attacks against machine learning — an overview

This article was written by Elie Bursztein.

 

This blog post survey the attacks techniques that target AI (artificial intelligence) systems and how to protect against them.

At a high level, attacks against classifiers can be broken down into three types:

  • Adversarial inputs, which are specially crafted inputs that have been developed with the aim of…
Continue

Added by Andrea Manero-Bastin on September 24, 2019 at 2:30am — No Comments

We See in 3D – So Should Our CNN Models

Summary:  Autonomous vehicles (AUVs) and many other systems that need to accurately perceive the world around them will be much better off when image classification moves from 2D to 3D.  Here we examine the two leading approaches to 3D classification, Point Clouds and Voxel Grids.

 

One of the well-known…

Continue

Added by William Vorhies on September 23, 2019 at 2:24pm — No Comments

Automation Eliminates Expensive Data Entry Errors

Organizations rarely regard data entry as a key strategic operation. In spite of the rise in automation, companies across the world rely on manual data entry for procurement. They employ staff to transfer data from requisition orders, purchase orders, and invoices into enterprise resource planning (ERP) systems such as SAP, Oracle, PeopleSoft, Netsuite, and many others. 

Transcription and transposition errors are…

Continue

Added by Brady Behrman on September 23, 2019 at 6:30am — No Comments

Executive Mandate #1:  Become Value Driven, Not Data Driven

I hate it when I hear senior executives state that they want to become data-driven, as if somehow having data is value in of itself.  Now, one can hardly blame the unenlightened executive whose only perspectives on data are associated with statements like “Data is really the new oil” (Wall Street Journal) or “…

Continue

Added by Bill Schmarzo on September 23, 2019 at 5:21am — 2 Comments

Artificial Neural Networks in a Nutshell

According to Wikipedia, an ANN is based on a collection of connected units or nodes called artificial neurons, which loosely model the neurons in a biological brain. Each connection, like the synapses in a biological brain, can transmit a signal to other neurons. An artificial neuron that receives a signal then processes it and can signal neurons connected to it.

In ANN implementations, the "signal" at a connection is a real number, and the output of each neuron is computed by some…

Continue

Added by Capri Granville on September 23, 2019 at 4:30am — No Comments

Designing Apparel with Neural Style Transfer

One of the things that excite the most about the predictive analytics of today is how quickly and bluntly it surpasses the existing benchmarks. Moreover, it penetrates the industries and activities where human creativity has traditionally dominated, adding a futuristic touch to the music, fine arts, fashion or architecture. Following the popular trend, we decided…

Continue

Added by Olha Zhydik on September 23, 2019 at 3:30am — No Comments

Stylometric Feature Engineering Techniques in Authorship Analysis

Technical paper, published in IEEE Xplore.

Abstract:

Authorship analysis (AA) is the study of unveiling the hidden properties of authors from textual data. It extracts an author's identity and sociolinguistic characteristics based on the reflected writing styles in the text. The process is essential for various areas,…
Continue

Added by Capri Granville on September 22, 2019 at 1:30pm — No Comments

Featured Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service