Subscribe to DSC Newsletter

All Blog Posts (6,589)

SparkGrams: compact in-spreadsheet histograms

My new blog post on what I coined as "sparkgrams".  Included is an implementation in YUI3 for custom website presentations of data, but I wish R and iPython Notebook had similar functionality.

http://technicaltidbit.blogspot.com/2013/06/histogram-thumbnails-inside-yui3-data.html

Added by Michael Malak on June 18, 2013 at 5:17am — No Comments

Are you a geek or a nerd?

Or something else. I could not resist to post this chart:

Source: http://slackprop.files.wordpress.com/2013/06/geeknerd-plot-01.png…

Continue

Added by Mirko Krivanek on June 17, 2013 at 7:37pm — 3 Comments

Data’s Beauty: In the Eye of the Beholder

How can organizations use data visualization, visual analytics, and visual data discovery to improve decision-making, collaboration, and operational execution? We present three key insights from the latest TDWI research.

Guest blog by …

Continue

Added by Vincent Granville on June 17, 2013 at 3:30pm — 1 Comment

Big Data & Associative Technology (the future)

Hello

Imagine being able to do anything your mind can imagine, well now it can..

The 6th Normal Form is not a TERM.. it is a GOAL.. the "holy Grail" if you will of data management and more importantly "Information" management.. Imagine being able to Store IDEAS as opposed to disconnected bits of data..



A brief "INCORRECT" comment on WIKI.. "A relvar R [table] is in sixth normal form (abbreviated…

Continue

Added by Jean Michel LeTennier on June 16, 2013 at 5:18am — 2 Comments

Hadoop and Perl

Interesting topic posted by InfiniteLoop on PerlMonks, back in 2008!

This having been a boring week, I was catching up on http://hadoop.apache.org. A brief Google search for Perl interface(s) to…
Continue

Added by Vincent Granville on June 15, 2013 at 3:30pm — No Comments

Big Data: Next Best Offer

Next-best offer refers to the use of predictive analytics solutions to identify the products or services your customers are most likely to be interested in for their next purchase.

 

Facing this topic I have made a personal research, and realize a synthesis, which has helped me to clarify some ideas. The attached presentation does not intend to be exhaustive on the subject, but could perhaps bring you some useful insights:…

Continue

Added by Michel Bruley on June 14, 2013 at 3:22am — No Comments

Privacy vs. Security and Data Science

"They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety." – Benjamin Franklin

Recent revelations concerning the…
Continue

Added by Michael Walker on June 12, 2013 at 3:30pm — No Comments

Weekly Digest - June 17

Featured Articles

Continue

Added by Vincent Granville on June 12, 2013 at 12:00pm — No Comments

Which is the best software to build a data mart

I have two databases on Microsoft SQL Server (daily business activities performed) and also on peachtree and on orange human resources software. I want to build a data warehouse with this databases available. My questions are:

i. Where can I integrate all these databases together

ii. After I integrate, how can I mine this data?

iii. What is the best software to use and mine this data?

iv. Can combining all these databases produce insight…

Continue

Added by Adetula Oluwabunmi on June 12, 2013 at 4:51am — No Comments

The NSA data privacy scandal: a different point of view

There's a lot of talk these days about how governments use all the data they can put their hands on, to monitor every individual in the world. Capabilities offered by big data storage and analytic processing are immense, when in the hands of professional, capable data scientists.  Last week the National Security Agency was under the spotlight, a month ago it was the IRS (Income Revenue Service) for a biased auditing …

Continue

Added by Mirko Krivanek on June 11, 2013 at 8:30pm — 10 Comments

Data storage “The suspended data grid”

Nowadays we hear a lot about big data, cloud, or the big data analytics on cloud. One of the underlying needs here is the data storage. It is stored as 0’s and 1’s in some datacenters. It isn’t cheap to maintain all of this data. Looking at the heat generated by these datacenters, it isn’t even environment friendly.

Can this data be really in the clouds? (The real clouds formed in the sky) :)

In India we have heard about Rishi’s who used to sit under a tree and meditate for…

Continue

Added by Shinto Paul on June 11, 2013 at 8:30pm — No Comments

Want to predict human behavior? Use these 6 lessons based on data from 10 million households

Guest post by . First published June 6, 2013, in Opower.

Over the past couple of years,…

Continue

Added by Vincent Granville on June 11, 2013 at 1:19pm — 2 Comments

Data Science and Perspectives

There are two major perspectives of Data Science we can look at:

- Consumer/User Perspective

- Data Scientist’s Perspective

This article explores these two areas to ponder upon in little more depth.

- Consumer/User Perspective (User will not like “noise”)

A single user/consumer might need some analysis to either start some study or make some decisions. This single user might be a CIO/CTO or perhaps we can also say that this single user is a group of decision…

Continue

Added by Atif Farid Mohammad on June 11, 2013 at 10:40am — No Comments

How to detect three types of hidden data, to eliminate opportunity costs

Your data is like Gruyere. It has holes. Big holes, sometimes the empty space occupies a bigger volume than the data itself - just like dark matter is more abundant than visible matter in the universe. This article is not about shallow or sparse data, but instead about data that you do not see, that you do not know even exist, and yet, data that contains better actionable nuggets than anything in your…

Continue

Added by Vincent Granville on June 10, 2013 at 1:00pm — 1 Comment

Data scientists dictate what we eat

Indirectly of course. There are other factors too, such as regulations which make it illegal to sell un-pasteurized milk, horse meat, foie gras, etc., but the biggest factor influencing what the average American eats is the margin the grocery store makes on the products it sells. This explains why you can't get redcurrants or passion fruits anymore, but you'll find plenty of high energy drinks and food rich in sugar…

Continue

Added by Mirko Krivanek on June 8, 2013 at 12:00pm — 2 Comments

Predictive Modeling is Useless!

If you’re a modeler, you might say, “who the heck is this guy telling me that my precious thing is useless?” Wait a minute. I will explain later. If you’re new to this, let me tell you what predictive modeling is: It is the power to predict the future. Like a prophecy, except its using data, lots of them. Sounds cool? Yes, but it’s useless. Sad? Me too. I’m also a modeler.



Let me tell you a story. Once upon a time, I was preaching in front of senior management on how we could get… Continue

Added by Eka Aulia on June 6, 2013 at 10:05pm — 1 Comment

Data Scientists Sometimes Fool Themselves

The easiest person in the world to fool is yourself. Data scientists sometimes fool themselves - in matters trivial and important. Thus, I strongly suggest that we acknowledge real or subconscious biases in ourselves, the data, the analysis and group think. It is prudent for data science teams to have…

Continue

Added by Michael Walker on June 6, 2013 at 12:11pm — No Comments

SAS Dominates Analytics Job Market?

Bob Muenchen's very useful work on this topic, SAS Dominates Analytics Job Market; R up 42% sent me back to some 2012 work we did at Statistics.com on the subject of what employers are looking for in the way of analytics skills.  First, our main results:

1.  Our numbers showed a much less SAS-dominant world:  1.92 SAS jobs for every R job.  Bob had found the ratio to…

Continue

Added by Peter Bruce on June 6, 2013 at 10:15am — 6 Comments

Join the Data Science Revolution with Alpine

Recently named by Business Insider as one of the Big Data startups to watch, Alpine Data Labs is experiencing massive growth and is currently on a hiring spree, looking for top talent. Alpine is hiring solution architects, software engineers, data scientists, marketing & sales pros, and technical support engineers. Send your resume and a note to [email protected]

Continue

Added by Vincent Granville on June 6, 2013 at 9:30am — 4 Comments

Blog Topics by Tags

Monthly Archives

2019

2018

2017

2016

2015

2014

2013

2012

2011

1999

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service