
November 2012 Blog Posts (28)

Big Data pioneers show the way

Not long ago, about 3 or 4 years, if you wanted to process a large amount of textual data or web logs, you had to mobilize large servers and implement substantial SQL programs that were slow to develop and slow to return results. Fortunately, requests were few and volumes were generally measured at most in terabytes. Now e-commerce and social media have grown considerably, and many companies see their customer relationships, and therefore their survival, entirely dependent on the…


Added by Michel Bruley on November 8, 2012 at 3:58am — No Comments

R + Hadoop = Data Analytics Heaven

 

Hadoop (MapReduce, where code is turned into map and reduce jobs, and Hadoop runs the jobs) is the best-known technology used for "Big Data" because it allows an organization to store huge quantities of data at very low…


Added by Michael Walker on November 7, 2012 at 3:57pm — No Comments
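
To make the map and reduce idea in the excerpt above concrete, here is a minimal sketch in plain Python of a word count split into map, shuffle, and reduce steps. It runs in a single process and does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) and the sample sentences are assumptions made for illustration, not code from the post.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input, invented for this sketch.
    sample = ["big data needs big storage", "hadoop stores big data"]
    print(word_count(sample))
```

In a real cluster the map calls would presumably run on many machines at once and the framework would handle the grouping step; the sketch only shows how the work divides into the two kinds of jobs.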

Retail Analytics - Practical Issues to be considered

Since the introduction of LPG (Liberalization, Privatization, and Globalization) policies, consumer behaviour has changed drastically, along with consumers' preferences and choices among the products available in the market. Consumers are demanding global products in local markets. From the consumer's point of view (40-60% of the economy's total consumers, or of the total population), a shopper with time available who finds more products in one place (all the different products under one roof) is ready to pay higher prices. Retail…


Added by Vijay Kumar on November 4, 2012 at 12:41am — No Comments

Feature selection for efficient modeling

Feature selection, also known as variable selection, feature reduction, attribute selection, or variable subset selection, is the technique of selecting a subset of relevant features for building robust learning models (Source: Wikipedia). Data mining problems may involve hundreds, or even thousands, of variables that can potentially be used as inputs. As a result, a great deal of time and effort may be spent examining which variables to include in the model. Feature selection allows us to…

Added by Venky Rao on November 3, 2012 at 10:03pm — 2 Comments
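
As one illustration of the feature selection idea described in the excerpt above, here is a minimal sketch of a simple filter approach: score each candidate variable by the absolute Pearson correlation with the target and keep the top k. This is only one of many possible techniques and is not necessarily the one the post goes on to describe; the function names (pearson, select_top_k) and the toy columns are assumptions made for this example.

```python
import math


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)


def select_top_k(features, target, k):
    """Return the names of the k features with the highest |correlation|.

    features maps a feature name to its list of values.
    """
    scores = {name: abs(pearson(col, target)) for name, col in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


if __name__ == "__main__":
    # Hypothetical toy data, invented for this sketch.
    target = [1, 2, 3, 4, 5]
    features = {
        "strong": [2, 4, 6, 8, 10],
        "weak": [3, 1, 4, 1, 5],
        "constant": [7, 7, 7, 7, 7],
    }
    print(select_top_k(features, target, 2))
```

A correlation filter like this only captures linear, one-variable-at-a-time relationships, so it is best read as a cheap first pass before heavier methods.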

"Common Mistakes in practice" of analytics

  1. Failing to apply common sense and practical knowledge when using analytical tools to solve the given problem.
  2. Results are heavily biased towards models (tools and techniques), while the importance of the problem itself and of its basic sampling techniques goes unrecognized.
  3. By ignoring root causes (i.e., proper investigation and deep analysis of the specific problem), analytics delivers only a halfway solution, which is not adequate.
  4. Analytical…

Added by Vijay Kumar on November 2, 2012 at 8:41pm — No Comments

5 Things You Need to Know about Hadoop

Big Data is a term used to describe very large amounts of aggregated data. But how do data miners manage all of this data? Hadoop is one of the popular tools that data analysts use to store and mine immense volumes of data.

Here are 5 Things a Data Analyst should know about Hadoop:

1. Hadoop utilizes parallel processing to store and process…


Added by Ben Gold on November 2, 2012 at 9:21am — No Comments

Data Science Central Weekly Digest

Sponsored Listings


Added by Vincent Granville on November 2, 2012 at 7:30am — No Comments

Yo-Yo Ma Social Scientist or Data Scientist

YO-YO MA – SOCIAL SCIENTIST

Stories

Added by Zach Piester on November 1, 2012 at 3:02pm — No Comments

