Featured Blog Posts – August 2015 Archive (69)

Simpson’s Paradox in the Age of Real Time Analytics

Summary:  Simpson’s Paradox.  A source of risk for real time analytics and for the citizen data scientist.

Most of us practicing the predictive arts know to look for sources of bias in our data.  There are seven that are common, the first six of which are:

  1. Confirmation…
Continue

Added by William Vorhies on August 31, 2015 at 9:41am — 4 Comments

Who moved the median?

Guest blog post by .

Introduction

As catchy as the title of this post is (a device I often use to get audience attention, win approvals, or start an argument), the real title should be "What moved the median?". This is a long read compared to my earlier post because I drill down really deep into…

Continue

Added by Vincent Granville on August 31, 2015 at 7:30am — No Comments

How NASA experiments with knowledge discovery

NASA is using big data to make complex knowledge more readily available. Learn how graph visualization can help turn a large corpus of documents into concrete insights.

A database of lessons learned

Even in a mature and knowledge-driven organization like NASA, finding an answer to a common business issue can be frustrating. Past surveys at NASA have shown that most people have trouble finding the…

Continue

Added by Jean Villedieu on August 31, 2015 at 4:14am — 1 Comment

A Plethora of Open Data Repositories (i.e., thousands!)

Open data repositories are valuable for many reasons, including:

(1) they provide a source of insight and transparency into the domains and organizations that are represented by the data sets;

(2) they enable value creation across a variety of domains, using the data as the “fuel” for innovation, government transformation, new ideas, and new businesses;

(3) they offer a rich variety of data sets for data scientists to sharpen their data mining, knowledge discovery, and…

Continue

Added by Kirk Borne on August 30, 2015 at 2:09pm — No Comments

Data Literacy & Democratic Exercise

On September 2nd, 2015, President Peña of Mexico will give his 4th State of the Country address to 120 million Mexicans (who will actually watch is another matter entirely). Given these troubled economic times and our endemic problems as a country, such as drug trafficking, corruption, and violence, it will be a message worth observing and analyzing.

To this end, a few independent news outlets have taken it upon themselves to embark on a live 'fact-checking' event.…

Continue

Added by Jesus Ramos on August 28, 2015 at 7:52pm — 1 Comment

2016 The Year of the Zettabyte

Check out this infographic from XO Communications about 2016 being the year of the Zettabyte. 

XO…

Continue

Added by Sheldon Smith on August 28, 2015 at 10:00am — No Comments

Using Amazon Redshift's Interleaved Sort Keys for 35x Faster Queries

Introduction

Previously, we discussed the role of Amazon Redshift's sort keys and compared how both compound and interleaved keys work in theory. Throughout that post we used some dummy data and a set of Postgres queries in order to explore the…

Continue

Added by sasha blumenfeld on August 28, 2015 at 7:20am — No Comments

Building Web Data Products with R & Shiny

 Guest Blog by Jose Dianes at R-Data Science

The purpose of many data science projects is to end up with a model that can be used within an organisation to solve a particular problem. If this is our case, we need to determine the right representation of that model so it can be shared in the easiest, cheapest, and most effective way. Web data products are an ideal…

Continue

Added by William Vorhies on August 28, 2015 at 6:30am — 1 Comment

How do your statistics boss you around?

I’m a big fan of statistics. Other than being fun to play with and fun to illustrate, they serve a lot of important tasks for researchers. They can quickly identify which of 500 comparisons is statistically significant. They can offer data to show whether your brand users comprise 2 distinct groups of people or 7 distinct groups of people. They can offer data to show which price your consumers would refuse to pay.

But there are two ways to use statistics. The right way and the wrong…

Continue

Added by Annie Pettit on August 28, 2015 at 4:32am — 1 Comment

Predictive Analytics – a Soup Story

A simple metaphor for projects in predictive analytics

The analytical scene has recently been dominated by the prediction that we would soon experience an important shortage of analytical talent. As a response, academic programs and massive open online courses (MOOCs) have sprung up like mushrooms after the rain, all with the purpose of developing skills for the analyst or its more modern counterpart, the data scientist. However, in the …

Continue

Added by Geert Verstraeten on August 27, 2015 at 11:57pm — No Comments

Math: My Data Science Stimulus Package and its Guerrilla Analytics

 

Sometimes I don’t trust Data Science, probably because my duty of care is more pronounced on account of working mostly in Legal Analytics. You see as an Analytics Practitioner in the Legal field my Data Science methodology cannot afford to yield…

Continue

Added by Mkhuseli Mthukwane on August 27, 2015 at 7:35am — No Comments

A Visual Introduction to Machine Learning

A lot has been said about the value of data viz, but the folks at R2D3 have truly taken this to a whole new level by using very sophisticated but also very intuitive data viz techniques to teach the basics of machine learning.  I was really blown away by the way the step-by-step visualizations on this page lead the reader through all the intuitive steps to arrive at a pretty clear understanding of machine learning, in this case focusing on decision trees.

If you are an experienced…

Continue

Added by William Vorhies on August 27, 2015 at 7:17am — 2 Comments

Data scientist paid $500k can barely code!

This is not about attacking a guy - a friend of mine - who, at first glance, seems extremely overpaid, like any top executive. Indeed, the question is about whether data scientists should be coders (spending 50% to 100% of their time writing code) or not.

I believe the answer is negative. There are…

Continue

Added by L.V. on August 26, 2015 at 9:00pm — 15 Comments

3 Ways to Get Your Data Into Shape [Infographic]

As data continues to grow at unprecedented speeds, organizations must embrace a data-driven mind-set to stay competitive. With the influx of bigger data and new types of data, companies of all sizes are increasingly dependent on large sets of information to make better business decisions. 

Marketing departments can rely on data to discover up-sell and cross-sell opportunities and to improve customer relationships. According to …

Continue

Added by Larisa Bedgood on August 26, 2015 at 10:49am — 1 Comment

Weekly Digest, August 31

The full version is always published Monday. Starred articles are new additions or updated content, posted between Thursday and Sunday. The picture of the week is from the contribution marked with a +, where you will find the details.

Announcement

  • The right data can greatly accelerate your business, but how do you put your aspirations into action? How do you…
Continue

Added by Vincent Granville on August 26, 2015 at 9:30am — No Comments

Ten top languages for crunching Big Data

With an ever-growing number of businesses turning to Big Data and analytics to generate insights, there is a greater need than ever for people with the technical skills to apply analytics to real-world problems.

Computer programming is still at the core of the skillset needed to create algorithms that can crunch through whatever structured or unstructured data is thrown at them. Certain languages have proven themselves…

Continue

Added by Bernard Marr on August 26, 2015 at 8:00am — 2 Comments

5 Signs That You Are NOT a Data Scientist

Data Scientist is the rock star job title of the moment, and why not? There is a huge demand and a very small pool of qualified candidates. And those who get hired are making big six-figure salaries — who wouldn’t want in on that?

 

But just because you aspire to be a data scientist doesn’t mean you…

Continue

Added by Bernard Marr on August 26, 2015 at 8:00am — 5 Comments

Big Data Analytics and Business Intelligence for Better Customer Experience in Small Businesses

Companies are directing a lot of resources towards data mining and data analytics. Analysis of big data can help improve a business’s online marketing strategy, since…

Continue

Added by Jack Dowson on August 25, 2015 at 9:44pm — No Comments

What is Big Data, and why should you care?

Guest blog by R. Bhargav

What does “Big Data” mean?

The term “big data” is self-explanatory: a collection of extremely big data sets that normal computing techniques cannot process. The term not only refers to the data, but also to…

Continue

Added by William Vorhies on August 25, 2015 at 7:30am — 1 Comment

8 Online Classes That Will Make You Smarter About IT

For a long time, money was the deciding factor in whether you could get a quality education, and even more so in whether you could learn about IT. Thanks to the internet, this is no longer the case. These days, many websites are dedicated to offering free IT classes to anyone willing to learn. This may be why more people are developing an interest in IT than before. It is important that people should embrace the opportunities offered by online classes as a necessity and…

Continue

Added by Mia Morshead on August 25, 2015 at 5:07am — 1 Comment

© 2019 Data Science Central ®
