Subscribe to DSC Newsletter

All Blog Posts (7,312)

Key Takeaways: Pivotal’s Top 10 2015 Predictions

On Tuesday 12/16, I attended Pivotal’s Top 10 Data Science Predictions in 2015 webinar.

The webcast was ran by leaders from the Pivotal Data Science  team – Annika Jimenez, Kaushik Das and Hulya Farinas – who shared their insights on the key Data Science industry trends for the coming year. The webcast came off as a bit scripted, but one could tell that these three individuals have a passion for Data Science discipline and it’s future.

In this post, I’d like to take a…


Added by Anthony Dutra on December 18, 2014 at 6:56am — No Comments

Weekly Digest - December 22

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. Articles marked with a + have interesting visualizations.



Added by Vincent Granville on December 17, 2014 at 7:30pm — No Comments

Infographic: Data Science 2015 -- What's Hot & What's Not

CrowdFlower is excited to release our first “What’s Hot & What’s Not in Data Science” infographic. According to our team of data scientists, the forecast for 2015 includes data’s major impact on the Internet of Things, changes in the skills and structure of the data scientist role and heavy emphasis on finding rich data within big data.…CrowdFlower_Graphic_Whats_Hot2015


Added by Renette Youssef on December 17, 2014 at 8:23am — No Comments

Can Analysis be Open-Source?

Over the past year, as the head of analytics at a tech startup, I've had many conversations with analysts about what they want to learn from their data. Perhaps unsurprisingly, a lot of companies have similar questions—What drives retention? How do customers interact with products? How do we better understand sales pipelines? What's the lifetime value of a user?

These questions were familiar to us and we'd worked on many of them ourselves. To find answers in our own data, we wrote…


Added by Benn Stancil on December 17, 2014 at 7:00am — No Comments

The Future of Big Data is Wearables

Guest blog past by Rohit Yadav, from BRIDGEi2i Analytics Solution

The Net (Part 1)

The plot goes something like this – Sandra Bullock plays a computer expert Angela Benett, her life changes when she is sent a program with a crazy glitch to ‘de-bug’. Soon she finds out some vital government information on the disk, things gets nutty as fruitcake, her life becomes a nightmare with her records getting erased and she is given a new identity of some chick with a…


Added by Vincent Granville on December 16, 2014 at 7:30pm — 2 Comments

Rules for building a Data Product in IT organizations

In my consulting work in the Enterprise IT space, I am seeing a definite trend of growing interest in Data Product/Advanced Analytics Design and Development which is becoming increasingly mainstream. Even as I view this a positive, it comes with its own set of perils and pitfalls that will need to be avoided.  

Enterprise IT Application Development is often bureaucratic and involves multiple and redundant levels of management through the design, development and testing phases.…


Added by Mark Sharma on December 16, 2014 at 8:30am — No Comments

Data Visualization of Employee metrics at the top Tech companies

The top tech companies by market capitalization are IBM, HP , Oracle , Microsoft , Cisco , SAP , EMC , Apple , Amazon and Google

All of the top tech companies are selected based on their current market capitalization with the exception of Yahoo. The year 2014 is not included as part of this analysis.


Data: The source of this data is from the public financial records from


All the sales figures are normalized and reported in USD…


Added by Nilesh Jethwa on December 15, 2014 at 11:01am — 8 Comments

Best solution to a problem: data science versus statistical paradigm

The definition of 'best' depends on which school you follow. Data science and classic statistical science are at the opposite ends of the spectrum. So let's clarify what 'best solution' means in these two opposite contexts:

'Best', according to statistical science:

  • It usually means the global maximum of a mathematical optimization problem
  • The objective function involved is usually a maximum likelihood function, KS, c-statistics, or some function…

Added by Vincent Granville on December 14, 2014 at 8:30pm — 8 Comments

How-to use Bag of little bootstraps Methodology to Compute Error Bounds on Machine Learning Tasks

We all know that calculating error bounds on metrics derived from very large data sets has been problematic for a number of reasons. In more traditional statistics one can put a confidence interval or error bound on most metrics (e.g., mean), parameters (e.g., slope in a regression), or classifications (e.g., confusion matrix and the Kappa statistic).

For many machine learning applications, an error bound could be very important.…


Added by Anna Anisin on December 14, 2014 at 3:33pm — No Comments

Big Data: The Key Vocabulary Everyone Should Understand

Guest blog post by Bernard Marr, first published here.

The field of Big Data requires more clarity and I am a big fan of simple explanations. This is why I have attempted to provide simple explanations for some of the most important technologies and terms you will come across if you’re looking at getting into big…


Added by Vincent Granville on December 12, 2014 at 12:00pm — 6 Comments

10 data science predictions for 2015

These predictions were published by the International Institute for Analytics (IIA). They produced a nice infographics, featured below, and re-tweeted many times by various bloggers, using the hash tag #2015Analytics. Other interesting predictions include…


Added by Vincent Granville on December 12, 2014 at 10:30am — No Comments

Our iceberg is melting . Now where's that data scientist?

On the face of it, John Kotter’s seminal book “Our iceberg is melting” is a simple tale of a group of penguins who are scared about losing their home, their iceberg, and yes, even more scared of the changes that could entail. But through that simple story and their struggle for finding their new home, the story delivers a more powerful message that…


Added by Debleena Roy on December 10, 2014 at 8:30am — No Comments

Weekly Digest - December 15

The full version is always published Monday. Starred articles or sections are new additions or updated content, posted between Thursday and Sunday. Articles marked with a + have interesting visualizations.



Added by Vincent Granville on December 9, 2014 at 9:30pm — No Comments

Big Data, IOT and Security - OH MY!

While we aren’t exactly “following the yellow brick road” these days, you may be feeling a bit like Dorothy from the “Wizard of Oz” when it comes to these topics. No my friend, you aren’t in Kansas…


Added by Carla Gentry on December 8, 2014 at 6:30pm — No Comments

Data science without statistics is possible, even desirable

The purpose of this article is to clarify a few misconceptions about data and statistical science.

I will start with a controversial statement: data science barely uses statistical science and techniques. The truth is actually more nuanced, as explained below.

1. Data science heavily uses new statistical…


Added by Vincent Granville on December 8, 2014 at 5:00pm — 15 Comments

Don’t Judge a Tweet by its 140 Characters: How One App is Using Machine-Learning to Tackle Credibility on Twitter

When you use Twitter, how do you know when you are being presented with something credible instead of something totally bogus? The answer is, unless you spend a lot of time researching each tweet, you probably don’t. However, one thing is for certain, we rely on what we read on Twitter to be true.

Twitter is one of the fastest and most effective ways we disseminate news across our world. If this…


Added by Renette Youssef on December 8, 2014 at 4:00pm — No Comments

What is the future of Data visualization and Dashboard solutions?

This article is not about any futuristic "Iron Man style dashboard/data visualization product" where you are combing through holographic cubic chunks from your ultra fast Big Data pipeline.

From time to time I keep pondering on what could be the future and I am sure lot of us get this science fiction imagery where…


Added by Nilesh Jethwa on December 8, 2014 at 7:30am — 1 Comment

Quick Survey-Journey to become a Data Scientist

Hello Data Science,

Thanks for allowing me the opportunity to be a part of Data Science Central!

Recently, I have embarked on a journey to become a Data Scientist! In doing so, I have begun to write an article about my findings to help those interested in becoming a Data Scientist as well, but don’t know where to start.

One thing I would love to include in my article is the backgrounds, opinions and teachings of real Data Scientists. In order to capture this, I have put…


Added by Anthony Dutra on December 8, 2014 at 7:00am — No Comments

5 basic rules of data organization

Here I compare these 5 rules published in 1999, with the new 2014 version. Data has changed so much that the opposite rules are now followed. Yet many statisticians and big businesses still stick to the outdated rules.

These rules were initially published in the featured book (see picture) first published in 1999, when software (e.g. SPSS) could not adapt to…


Added by Vincent Granville on December 7, 2014 at 4:00pm — 1 Comment

History, Music, and the Depth of Data

Like many students about to finish their undergraduate degree, I decided to artificially inflate my grades by taking some "bird courses." These are not courses about birds. Other students assured me that the courses were designed to bolster my marks and to help me complete my program requirements. Considering the many bird courses available, I decided to take introductory music, which was essentially a history course focused on music. It required a lot of…


Added by Don Philip Faithful on December 6, 2014 at 8:52am — No Comments

Blog Topics by Tags

Monthly Archives













  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service