All Blog Posts (6,484)

Hadoop Falcon and Data Lifecycle Management

Data management in the Hadoop ecosystem is still in the early stages of development. The goal…

Added by Michael Walker on July 30, 2013 at 12:13pm — No Comments

How to Avoid Political Blunders in Analytical Discussions

A while back I was running a data mining project for a customer and made a conversational blunder. In one of the meetings, I mentioned seeing one interesting relationship in the data: customers who purchased one particular product tended to buy and implement a second product at a later time. I did not realize that everyone in the room INTUITIVELY knew that there was absolutely no relationship between the two products. A big blunder. After the meeting, two friends told me that my standing in…

Added by Stephen Penn, DM, PMP on July 30, 2013 at 5:45am — 8 Comments

Choropleth in D3.js and Pandas (iPython Notebook)

There have been various attempts to integrate the D3.js visualization framework into iPython Notebook, in order to provide more visualization options than available with the standard Matplotlib. In my blog post today, I take one of the better integration attempts out there, port it from Windows to the Mac, and demonstrate:

1. Passing a Pandas Dataframe from iPython Notebook into the D3.js Javascript

2. Generating geo color maps in D3.js (not a built-in…

Added by Michael Malak on July 29, 2013 at 4:23am — No Comments

Turning visitors into sales: seduction vs. analytics

The context here is increasing the conversion rate, from website visitor to active, converting user, or from passive newsletter subscriber to a lead (a user who opens the newsletter, clicks on the links, and converts). Here we will discuss the newsletter conversion problem, although it applies to many different settings.…

Added by Mirko Krivanek on July 27, 2013 at 4:00pm — 3 Comments

We need your vote for Teradata Aster

as a Favorite New Product of 2013! (An External Award Acknowledgement)

Voting will take less than 2 minutes! Deadline is 11:59pm EDT August 9th.…

Added by Vincent Granville on July 27, 2013 at 9:17am — 1 Comment

Update about our data science competition

What seemed to be an intractable problem involving trillions of quadrillions of computations - far more than required to process all the data produced or collected on Earth since the beginning of time - has been reduced to something computationally feasible and possibly even quite simple. One applicant…

Added by Vincent Granville on July 22, 2013 at 2:00pm — 1 Comment

Human Factors - Unhappy Truckers

I have an interest in optimization models as well as analytics.  My son, who writes apps for the trucking industry, sent me a link to an article on how human factors may impact the "best" solution as opposed to the "optimal" solution to a class of problems.

So too our analysis may lead us in a certain direction but bring us to the wrong conclusion because important factors were not…

Added by christopher calvin on July 22, 2013 at 8:41am — No Comments

My first impression about the Microsoft Surface

I was given a Surface for Father's Day this year. I had an old iPad that I had used for several years, and I was curious to know whether you can use the Surface just like a Windows laptop. While it has great features, faster Internet, and much more, the answer is clearly no.…

Added by Vincent Granville on July 21, 2013 at 5:30pm — 5 Comments

DUI arrests decrease after state monopoly on liquor sales ends

This is another example where, if you lack analytic skills, you will jump to the wrong conclusions. This news article was published in MyNorthWest. It's about the new law that went into effect a year ago in WA, allowing grocery stores to sell hard liquor. Here we provide 16 reasons that…

Added by Vincent Granville on July 20, 2013 at 11:30am — 4 Comments

M2M PREDICTION ECONOMICS IN 3 INDUSTRIES

As the possibilities at the intersection of M2M+Big Data get unlocked, it is very important to examine the economics underlying the use cases.
So the core question becomes…

Added by derick.jose on July 20, 2013 at 6:37am — 4 Comments

Botnets in the cloud: the new generation of spammers

Big data and data science are not just for the good guys. If properly leveraged, they also give criminals a competitive advantage over their rivals and help them avoid detection.…

Added by Vincent Granville on July 17, 2013 at 10:00am — 1 Comment

Demand for Data Scientists and the Datification of Business

Source: EMC2 Survey.

You cannot improve and manage what you cannot…

Added by Michael Walker on July 16, 2013 at 1:00pm — 2 Comments

IBM 4690 Supermarket Application Transaction Data

Good day folks,

I'm conducting a data analytics project for a small supermarket and the IT guy sent me a file (.db0 extension) of their transaction details, but I'm unable to read it.  Apparently it comes from the IBM 4690 Supermarket Application controller.  I've done internet searches and nothing helpful comes up.  The guy that sent it to me said he has never had to convert or read it into CSV or any other file format, so he can't provide any assistance.  Can anyone help?…

Added by Bert Riley on July 16, 2013 at 6:42am — 2 Comments

Big Data on the Big Data Conversation: Tracking the NSA Story

By Nicholas Hartman, Director

Recent revelations regarding the National Security Agency's (NSA) extensive data interception and monitoring practices (aka PRISM) have brought a branch of "Big Data's" research into the broader public light. The basic premise of such work is that computer algorithms can study…

Added by Nicholas Hartman on July 15, 2013 at 8:51am — No Comments

8 M2M Big Data Use cases in Building Management Systems

As the Internet of Things explodes, the building management industry is ripe for disruption from sensor data. Here are 8 specific big data use cases at the intersection of M2M Analytics + Big Data + Building Management Systems. Please click http://blog.fluturasolutions.com/2013/07/8-m2m-use-cases-analytics-big-data.html for details about each use case…

Added by derick.jose on July 12, 2013 at 8:27pm — No Comments

Weekly digest - July 15

Featured Articles

Added by Vincent Granville on July 11, 2013 at 6:30pm — No Comments

Rapid Hadoop development with progressive testing

Debugging Hadoop jobs can be a huge pain.  The cycle time is slow, and error messages are often uninformative --- especially if you're using Hadoop streaming, or working on EMR.



I once found myself trying to debug a job that took a full six hours to fail.  It took more than a week -- a whole week! -- to find and fix the problem.  Of course, I was doing other things at the same time, but the need to constantly check up on the status of the job was a huge drain on my energy and…

Added by Abe Gong on July 10, 2013 at 10:47am — 1 Comment
