All Blog Posts (6,484)

Top 6 Data Science Use Cases in Design

Industries today can apply data science to improve their efficiency, productivity, and overall success. The opportunities are wide-ranging, from advanced calculations for the business to customer service…

Added by Igor Bobriakov on October 3, 2019 at 3:17am — No Comments

Surprising Uses of Synthetic Random Data Sets

I have used synthetic data sets many times for simulation purposes, most recently in my articles Six degrees of Separations between any two Datasets and How to Lie with p-values. Many applications (including the data…

Added by Vincent Granville on October 2, 2019 at 10:00am — No Comments

Want to be data-driven? Better start thinking about DataOps

Everyone’s talking data. Data is the key to unlocking insight, the secret sauce that will…

Added by Lewis Wynne-Jones on October 2, 2019 at 4:30am — No Comments

Is There a Difference Between Open Data and Public Data?

Yep. And it’s a big one.



There is a general consensus that when we talk about open data we are referring to any piece of data or content that is free to…

Added by Lewis Wynne-Jones on October 1, 2019 at 9:00am — No Comments

Creating maps in R using ggplot2 and maps libraries

Guest blog by Michael Grogan.

Here is how we can use the maps, mapdata and ggplot2 libraries to create maps in R.



In this particular example, we’re going to create a world map showing the points of Beijing and Shanghai, both cities in China. For this particular…

Added by Vincent Granville on October 1, 2019 at 7:41am — No Comments

Making Sense of the Wild World of Hadoop

If you still can’t figure out what exactly Hadoop is, don’t worry, you’re not alone. But hopefully this multi-part blog series will help! Hadoop is very difficult to understand if, like most people, you’re gathering bits and pieces of info here and there. And, sorry Wikipedia, but your page isn’t very helpful if you aren’t a…

Added by Michael Burke on September 30, 2019 at 1:44pm — No Comments

Significance Level vs Confidence level vs Confidence Interval

You may have figured out already that statistics isn't exactly a science. Lots of terms are open to interpretation, and sometimes there are many words that mean the same thing—like "mean" and "average"—or sound like they should mean the same thing, like significance level and confidence level.  

Although they sound very similar, significance level and confidence level are in fact two completely different concepts. Confidence levels and confidence…

Added by Stephanie Glen on September 30, 2019 at 12:00pm — No Comments
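
The terms in the post above can be related with a small Python sketch. It is only an illustration, not the author's method: it assumes the usual convention that the confidence level equals one minus the significance level, and it uses a normal approximation for the interval (a t-distribution would be more appropriate for a sample this small). The function name `confidence_interval` and the sample values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def confidence_interval(sample, alpha=0.05):
    """Return (confidence level, (low, high)) for the sample mean.

    alpha is the significance level. The interval is two-sided and uses
    a normal approximation, which is an assumption of this sketch.
    """
    confidence_level = 1 - alpha
    # Critical value that leaves alpha / 2 in each tail of the standard normal.
    z = NormalDist().inv_cdf(1 - alpha / 2)
    centre = mean(sample)
    half_width = z * stdev(sample) / sqrt(len(sample))
    return confidence_level, (centre - half_width, centre + half_width)


# Hypothetical measurements, made up for illustration only.
sample = [4.9, 5.1, 5.0, 5.3, 4.8, 5.2, 5.0, 4.7]

level, (low, high) = confidence_interval(sample, alpha=0.05)
print(f"significance level: 0.05, confidence level: {level:.0%}")
print(f"confidence interval for the mean: ({low:.3f}, {high:.3f})")
```

The significance level and the confidence level are single numbers chosen before looking at the data, while the confidence interval is a range computed from the data. Raising `alpha` lowers the confidence level and narrows the interval.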

Correlation does not equal causation but How exactly do you determine causation?

 

 

Introduction

 

"Correlation does not equal causation" is a mantra drilled into a data scientist from an early age.

That’s fine.

But very few talk about the follow-on question:

How exactly do you determine causation?

This problem is…

Added by ajit jaokar on September 30, 2019 at 6:33am — 2 Comments

How Artificial Intelligence Will Affect Design In Future

There is a lot of ongoing controversy over the impact of Artificial Intelligence on design and development. Design is a complex process, since only humans have the capability to set the context and create what users are really looking for. But researchers and designers are constantly working on the technology in an attempt to find out how AI will affect the future of the design industry. …

Added by Ashok Sharma on September 29, 2019 at 7:30pm — No Comments

Design Principles for Big Data Performance

The evolution of Big Data technologies over the last 20 years has been a history of battles with growing data volume. The challenge of big data has not been solved yet, and the effort will certainly continue as data volume keeps growing in the coming years. The original relational database system (RDBMS) and the associated OLTP (Online Transaction Processing) make it easy to work with data using SQL in all aspects, as long as the data size is small enough to…

Added by Stephanie Shen on September 29, 2019 at 4:00pm — 1 Comment

Creating a slicer that filters multiple columns in Power BI

Power BI provides slicers for a single column, but there are scenarios where it could be useful to consolidate alternative filters for multiple columns in a single slicer. Technically, this is not possible in Power BI through the standard visualizations, but you can use a particular data modeling technique to obtain the desired result.

Consider the case of a Customer table with a geographical hierarchy with Continent, Country,…

Added by Adetayo Kolade on September 29, 2019 at 10:30am — No Comments

Weekly Digest, September 30

Monday newsletter published by Data Science Central. Previous editions can be found here. The contribution flagged with a + is our selection for the picture of the week. To subscribe, follow this link.  

Announcements

  • Earn a Data…

Added by Vincent Granville on September 29, 2019 at 7:00am — No Comments

Using a Bathroom Faucet to Teach Neural Network Basic Concepts

They say that the best ideas sometimes come to you while you are in the shower, and this idea of how to explain two important Neural Network concepts – Backpropagation and Stochastic Gradient Descent – actually did come to me as I was trying to set the perfect water temperature for my morning shower.

As I was struggling to adjust the two shower handles – one handle that controlled scalding hot and the other handle that controlled flash freezing – it occurred to me that I was a simple…

Added by Bill Schmarzo on September 29, 2019 at 4:29am — 4 Comments

Wikis for publishing scholarly articles on data science and software

By now you may already know that adding scholarly articles to the English version of Wikipedia is difficult due to the "notability" concept and tight control from anonymous editors (see this article). In recent years, entire Wikipedia topics and articles dedicated to software and data…

Added by jwork.ORG on September 28, 2019 at 4:30am — No Comments

Discover How IoT Escalates Vehicle Fleet Safety.

With intense urbanization, the transportation industry works round the clock to meet the demands of…

Added by Sanjeev Verma on September 27, 2019 at 1:30am — No Comments

How Web scraping and Big Data Analytics can be used to impact the Media and Entertainment industry

Every industry in the world is moving towards data-driven decision making, then one of the most popular and …

Added by Sandra Moraes on September 26, 2019 at 6:56pm — No Comments

Thursday News, September 26

Here is our selection of featured articles and technical resources posted since Monday:

Resources

Added by Vincent Granville on September 26, 2019 at 11:00am — No Comments

TensorFlow 1.x vs 2.x. – summary of changes

Overview of changes TensorFlow 1.0 vs TensorFlow 2.0

Earlier this year, Google announced TensorFlow 2.0, a major leap from the existing TensorFlow 1.0. The key differences are as follows:

 

Ease of use: Many old libraries (for example, tf.contrib) were removed, and some were consolidated. For example, in TensorFlow 1.x a model could be built using contrib, layers, Keras, or estimators; so many options for the same task confused many new users.…

Added by ajit jaokar on September 25, 2019 at 11:30pm — 2 Comments

How Fog Computing is changing the BigData paradigm for IoT device?

The new era of BigData and advances in technology have driven significant transitions towards highly functional IoT devices. The popularity of IoT devices has led to easier methods for rapid BigData collection, analysis, and distribution. According to a report by…

Added by Smith Johnson on September 25, 2019 at 7:30pm — No Comments

Bayesian Machine Learning

Bayesian Machine Learning (part 6)

Probabilistic Clustering – Gaussian Mixture Model

Continuing our discussion on probabilistic clustering of our data, we pick up where we left off in part 4 of our Bayesian inference series. As we have seen the modelling theory of…

Added by Ashutosh vyas on September 25, 2019 at 9:30am — No Comments

© 2019 Data Science Central ®