William Vorhies's Blog (344)

Polyglot Persistence?

Summary:  Yes, it’s a real phrase, and it’s the secret to picking the right NoSQL…


Added by William Vorhies on May 11, 2015 at 7:04am — No Comments

Do You Really Need a Unicorn?

Summary:  The world of data science is splitting into two distinct camps, the start-up app world and the commercial world.  The good news is that almost all the opportunity lies in commercial predictive analytics where you can broadly specialize and still play with all the latest innovations.…


Added by William Vorhies on March 16, 2015 at 3:02pm — 4 Comments

Have You Heard About DRILL?

Summary:  Of all the current Apache projects, DRILL stands a good chance of being truly revolutionary.

Hive, Pig, Spark, YARN, ZooKeeper, Falcon, Flume, Nutch, Sqoop, Storm.  Unless…


Added by William Vorhies on February 23, 2015 at 11:14am — No Comments

Privacy, Personalization, and the IoT - Retail

Summary:  Thanks to the IoT (Internet of Things), an internet-like experience of recommendations and awareness of your preferences is coming to the brick-and-mortar store near you.

You’ve probably noticed the huge difference in the tone of the conversation between data scientists and the general public over the issue of privacy and…


Added by William Vorhies on November 10, 2014 at 10:48am — No Comments

How Many "V's" in Big Data? The Characteristics that Define Big Data

Summary:  We’ve scoured the literature to bring you a complete listing of possible definitions of Big Data, with the goal of being able to determine what’s a Big Data opportunity and what’s not.  Our conclusion is that Volume, Variety, and Velocity still make the best definitions, but none of these stands on its own in distinguishing Big Data from not-so-big data.  Understanding these characteristics will help you analyze whether an opportunity calls for a Big Data solution but…


Added by William Vorhies on October 31, 2014 at 2:00pm — No Comments

Prescriptive versus Predictive Analytics - A Distinction without a Difference?

Summary:  Is the addition of “Prescriptive” analytics to our nomenclature really worthwhile, or are we just confusing our customers?

I admit to being annoyed when this or that industry wag tries to coin a new term to describe some portion of the discipline we are already practicing.  Some of these folks I think are…


Added by William Vorhies on October 23, 2014 at 10:00am — 9 Comments

Quantifying the Value of a NoSQL Project

Summary:  If you’re making the decision to use NoSQL, how do you quantify the value of the investment?

If you are exploring NoSQL, once you have learned the basics there are two questions that will rapidly move to the top of your list of considerations.

  • What does it…

Added by William Vorhies on October 17, 2014 at 10:00am — No Comments

The Value of Accuracy in Predictive Analytics

This article was first posted in 2014, but the message bears repeating.  There is a lot being written about tools simple enough for the citizen data scientist to operate.  The unstated constraint is that if you don't have significant experience in data science, then these will always be "good enough" models.  The problem is that "good enough" models underachieve on both revenue and profit.  Very small increases in model fitness can translate into much larger increases in campaign ROI.  Business…


Added by William Vorhies on October 8, 2014 at 8:00am — 4 Comments

Optimizing Disease Management Programs Using Predictive Modeling

Summary:  Here’s an easy-to-understand example of how predictive analytics can reduce cost while increasing the efficacy of disease management programs.

Healthcare providers have made major breakthroughs over the last two decades by creating and implementing increasingly sophisticated disease management programs (DMPs).  At their core there are always two motives: improve the human condition by…


Added by William Vorhies on October 2, 2014 at 11:05am — No Comments

Lesson 9: Making Your Selection - Final Considerations


Added by William Vorhies on September 15, 2014 at 8:46am — No Comments

Lesson 8: Graph Databases

Summary:  Graph databases are your go-to choice when the relationships among the data items are key.

Up to about 1999 web search…


Added by William Vorhies on September 12, 2014 at 8:26am — No Comments

Lesson 7: Column Oriented Databases (aka Big Table or Wide Column)

Summary:  Column Oriented DBs excel at OLAP and are efficient at partial updates.

Many folks believe that Hadoop is the…


Added by William Vorhies on September 11, 2014 at 8:00am — 1 Comment

Lesson 6: Document Oriented Databases

Summary:  Unless you have special needs, Document Oriented DBs are your most likely default choice.

Second in popularity in the business world behind Key-Value-Stores are Document Oriented Databases. …


Added by William Vorhies on September 10, 2014 at 9:00am — No Comments

Lesson 5: Key Value Stores (AKA 'Tuple' Stores)

Summary:  Key Value Stores like Hadoop were here first and excel at fast storage of unstructured data.

In the beginning (about…


Added by William Vorhies on September 9, 2014 at 7:00am — No Comments

Lesson 4: Features Common to (Most) NoSQL/NewSQL Databases

Summary:  These are the features common to most NoSQL databases.  Be on the lookout for any fundamental differences.

Before we get to the specific pros and cons of the four NoSQL database types, there are some features and capabilities true of most of them that you should know.…


Added by William Vorhies on September 8, 2014 at 2:53pm — 2 Comments

Lesson 3: Open Source, Distribution, or Suite

Summary:  Which to pick?  Open Source, Distribution, or Big Data Suite.  Here are the factors you should consider.

Before we get to the…


Added by William Vorhies on September 5, 2014 at 8:30am — No Comments

9 Lessons: Picking the Right NoSQL Tools

Summary:  This blog series is designed to help you understand which NoSQL Big Data database is right for you.  It is addressed to business executives and managers who need a primer on how this decision should be made.

Starting a Big Data Initiative is…


Added by William Vorhies on September 3, 2014 at 8:31am — 2 Comments

Lesson 2: NoSQL Databases are Good for Everything – Except Maybe this One Thing

Summary:  In general, it is true that NoSQL databases can do everything that an RDBMS can do.  And almost always, when data is ‘big’, they can do it faster and cheaper.  There is one exception where you’ll need to pay close attention.

In a technical…


Added by William Vorhies on September 3, 2014 at 8:30am — 3 Comments

How to Become a Data Scientist


Added by William Vorhies on August 27, 2014 at 7:30am — 21 Comments
