All Blog Posts (6,259)

Using Decision Trees in Evidence Based Medicine

In today's post, we explore the use of decision trees in evidence-based medicine. In 1996, David Sackett wrote that "Evidence-based medicine is the conscientious, explicit and judicious use of current best evidence in making decisions about the care of individual patients" [Source: Wikipedia].
 
For our analysis,…

Added by Venky Rao on September 22, 2012 at 5:45pm — 1 Comment

Is Data Scientist the Sexiest Job of the Century? | Harvard Business Review

Hey good lookin’. Yep, I’m talking to you, or at least the data scientists reading this. (The rest of you are incredibly good looking, intelligent, and clearly have good taste, as well.)

The Harvard Business Review has…

Added by Vincent Granville on September 21, 2012 at 2:42pm — 2 Comments

Predictive, Descriptive, Prescriptive Analytics

The goal of data analytics (big and small) is to get actionable insights that lead to smarter decisions and better business outcomes. How you architect business technologies and design data analytics processes to get those insights varies.

It is critical to design and build a data warehouse / business intelligence…

Added by Michael Walker on September 19, 2012 at 11:57am — No Comments

Numeric Measures for Association Rules

In today's post, we dive into Association Rules for Market Basket Analysis and discuss three numeric measures that should be considered before making a business decision based on associations observed in the data: (1) Support, (2) Confidence, and (3) Lift.

Association rules are typically written in the format:

Left hand side Implies Right hand…

Added by Venky Rao on September 15, 2012 at 7:03am — 6 Comments
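
The three measures named in the association rules teaser above can be sketched in a few lines of Python. This is a minimal illustration using the standard textbook definitions, not code from the original post: the basket data and the function names (count, support, confidence, lift) are hypothetical, chosen only for this example.

```python
from typing import Iterable, List, Set


def count(transactions: List[Set[str]], itemset: Iterable[str]) -> int:
    """Number of transactions that contain every item in itemset."""
    items = set(itemset)
    return sum(1 for t in transactions if items <= t)


def support(transactions: List[Set[str]], itemset: Iterable[str]) -> float:
    """Fraction of all transactions that contain the itemset."""
    return count(transactions, itemset) / len(transactions)


def confidence(transactions: List[Set[str]], lhs: Set[str], rhs: Set[str]) -> float:
    """Of the transactions containing lhs, the fraction that also contain rhs."""
    lhs_count = count(transactions, lhs)
    if lhs_count == 0:
        raise ValueError("left hand side never occurs in the data")
    return count(transactions, lhs | rhs) / lhs_count


def lift(transactions: List[Set[str]], lhs: Set[str], rhs: Set[str]) -> float:
    """Confidence of lhs -> rhs divided by the baseline support of rhs."""
    rhs_support = support(transactions, rhs)
    if rhs_support == 0:
        raise ValueError("right hand side never occurs in the data")
    return confidence(transactions, lhs, rhs) / rhs_support


# Hypothetical toy baskets, invented for illustration only.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk", "eggs"},
]

rule_lhs, rule_rhs = {"bread"}, {"milk"}
print("support   :", round(support(baskets, rule_lhs | rule_rhs), 4))
print("confidence:", round(confidence(baskets, rule_lhs, rule_rhs), 4))
print("lift      :", round(lift(baskets, rule_lhs, rule_rhs), 4))
```

A lift above 1 suggests the right hand side is more likely when the left hand side is present, a lift near 1 suggests the two are roughly independent, and a lift below 1 suggests the left hand side makes the right hand side less likely. In the toy data, milk is already in most baskets, so the rule "bread implies milk" has high confidence but a lift slightly below 1.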

Comparing SaaS Machine Learning Offerings.

There are various offerings available if you want to use machine learning in your analysis. Nick Wilson spent his internship at BigML comparing three SaaS Machine Learning Services (BigML, Prior Knowledge and Google Prediction API), with WEKA as a benchmark. He wrote a series of blog posts about his findings. In his final post he gives a summary of his work, with links to the different blog posts for details. He let me re-blog his summary here.

"Part 1 -…

Added by Jos Verwoerd on September 13, 2012 at 3:37am — No Comments

Modern BI Architecture & Analytical Ecosystems

The goal is to design and build a data warehouse / business intelligence (BI) architecture that provides a flexible, multi-faceted analytical ecosystem for each unique organization.

A traditional BI architecture has analytical processing first pass through a data warehouse.

In the new, modern BI architecture, data reaches users…

Added by Michael Walker on September 12, 2012 at 11:53am — No Comments

Understanding And Interpreting Gain And Lift Charts

Lift and gain charts are a useful way of visualizing how good a predictive model is. In SPSS, a typical gain chart appears as follows:

[Figure: a typical SPSS gain chart]

In today's post, we will attempt to understand the logic behind generating a gain chart and then discuss how gain and lift charts are interpreted.

To do this,…

Added by Venky Rao on September 11, 2012 at 2:54pm — 2 Comments
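
The logic behind a gain chart, as introduced in the teaser above, can be sketched as follows. This is a minimal illustration of the common textbook construction (sort cases by model score, then accumulate the share of positives captured), and it is not necessarily how SPSS builds its chart. The function name gain_lift_table, its parameters, and the sample data are assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def gain_lift_table(
    y_true: Sequence[int],
    scores: Sequence[float],
    n_bins: int = 10,
) -> List[Tuple[float, float, float]]:
    """Return one (share_of_population, cumulative_gain, lift) row per bin.

    y_true holds 1 for a positive case and 0 otherwise; scores holds the
    model's predicted score for each case (higher means more likely positive).
    """
    n = len(y_true)
    if n != len(scores):
        raise ValueError("y_true and scores must have the same length")
    if n < n_bins:
        raise ValueError("need at least one case per bin")
    total_pos = sum(y_true)
    if total_pos == 0:
        raise ValueError("need at least one positive case")

    # Rank cases from the highest score to the lowest.
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)

    rows = []
    cum_pos = 0
    start = 0
    for b in range(1, n_bins + 1):
        end = (b * n) // n_bins                 # last case (exclusive) in this bin
        cum_pos += sum(y_true[i] for i in order[start:end])
        start = end
        share = end / n                         # x-axis: share of cases contacted
        gain = cum_pos / total_pos              # y-axis of the gain chart
        rows.append((share, gain, gain / share))  # lift = gain vs. random targeting
    return rows


# Hypothetical outcomes and scores, invented for illustration only.
outcomes = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0]
model_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]

for share, gain, lift in gain_lift_table(outcomes, model_scores, n_bins=5):
    print(f"top {share:.0%} of cases -> gain {gain:.0%}, lift {lift:.2f}")
```

Plotting the gain column against the share column gives the gain chart, with the diagonal as the baseline for random selection. The lift column is the ratio of the gain curve to that diagonal, so it always ends at 1 once the whole population is included.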

How do I become a data scientist?

Data scientists are the new astronauts. Everyone wants to become one. And it is not difficult to understand the reason for this.

In this age of “Big data”, more and more businesses are relying on people who can make sense of the vast amounts of information generated around us – people who can use sophisticated tools and complex-sounding statistical techniques to derive insights from larger and larger mounds of data.

Businesses have started to understand the power of data. They…

Added by Gaurav Vohra on September 10, 2012 at 11:29pm — No Comments

Letter to All Bloggers, Consultants, Graduates and Job Seekers

This is about how to boost your analytic career and/or revenue by leveraging our professional network to the fullest extent.

We invite you to post blogs, or participate in forums (including answering questions asked by peers) on DataScienceCentral and…

Added by Vincent Granville on September 9, 2012 at 2:00pm — 3 Comments

Facebook's new privacy violation: what do you think?

This Saturday, I noticed that Facebook now displays a few new boxes on everyone's profile page (not just mine). The box that worries me most is the one that shows all the places where you've traveled and where you've lived, including your current location.

To compound the problem, the box in question clearly…

Added by Mirko Krivanek on September 9, 2012 at 1:00pm — 2 Comments

10 great data science / big data articles from influential news outlets

These are the articles that I enjoyed reading this week: 

Added by Vincent Granville on September 8, 2012 at 8:30pm — No Comments

Eight Levels of Analytics for Competitive Advantage

Copyright © SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Used with permission. 

 

Optimization answers the question: How do we do things better? What is the…

Added by Michael Walker on September 6, 2012 at 11:30am — 2 Comments

Did your business intelligence experts have the right tools for text analytics?

 

Companies are increasingly looking to take advantage of Big Data, especially textual information generated by users through web or desktop applications. Analysts who specialize in this subject believe that 70% of the information of interest to a business is nestled in Word documents, Excel files, email, etc. These data are not predefined in a model and cannot be stored neatly in relational tables. They most often occur in free form, but contain dates, numbers, key words,…

Added by Michel Bruley on September 2, 2012 at 10:57pm — No Comments

The Data Scientist Will Be Replaced By Tools | Forbes

Do you agree with this? I don't; I think this Forbes article is using a provocative title to get you to read it. While assembler programmers in the seventies were eventually replaced by compilers and programming language interpreters, I believe that real statisticians and data scientists can't be fully replaced by machines or software. When they are,…

Added by Vincent Granville on September 2, 2012 at 6:00pm — 6 Comments

Big Data Vendor Landscape

Companies, products, and technologies included in the Big Data Landscape:

-…

Added by Michael Walker on August 30, 2012 at 2:58pm — No Comments

Announcing Data Community DC!

I'm very pleased to announce the formation of Data Community DC, Inc., a new organization dedicated to the support of the Data Science, Statistics, Analytics and related communities in the Washington, DC area! Formed by the organizers of the rapidly-growing Data Science DC and R Users DC Meetups, and the nascent Data Business DC Meetup, DC2 will support those groups and help to create new Meetup groups and other events and services.

If you live or work in the DC Metro area, we'd love…

Added by Harlan Harris on August 28, 2012 at 2:20pm — No Comments

Romney Said to Use Secretive Data Mining | WallStreetJournal

WASHINGTON—Mitt Romney's success in raising hundreds of millions of dollars in the costliest presidential race ever can be traced in part to a secretive data-mining project that sifts through Americans' personal information—including their purchasing history and church attendance—to identify new and likely, wealthy donors, the Associated Press has learned.…

Added by Vincent Granville on August 24, 2012 at 7:30am — No Comments

Hadoop Technology Stack

The Hadoop stack includes more than a dozen components, or subprojects, that are complex to deploy and manage. Installation, configuration and production deployment at scale are challenging.

The main components…

Added by Michael Walker on August 22, 2012 at 9:40am — No Comments

Top 30 articles for week ending on August 19

Weekly digest from Data Science Central, Analytic Talent and Analytic Bridge:

  1. Why are clinical trials failing?
  2. How to optimize email campaigns? Part I…

Added by Vincent Granville on August 20, 2012 at 1:54pm — No Comments

© 2019 Data Science Central ®