Subscribe to DSC Newsletter

Michael Walker's Blog Posts Tagged 'Hadoop' (8)

The Haboob Clouds Hadoops Future

Hadoop is an open source framework for storing massive amounts of data on clusters of commodity hardware.



Haboob is a dense dust storm that moves fast…

Continue

Added by Michael Walker on March 23, 2014 at 9:03am — 3 Comments

Hadoop 2 Helps Systems Integration



Apache Hadoop announced a beta release for Hadoop 2. The Hadoop-2.1.0-beta…

Continue

Added by Michael Walker on September 3, 2013 at 9:31pm — No Comments

Batch vs. Real Time Data Processing

Batch data processing is an efficient way of processing high volumes of data is where a group of transactions is collected over a period of time. Data is collected, entered, processed and then the batch results are…

Continue

Added by Michael Walker on August 13, 2013 at 2:30pm — 2 Comments

Hadoop Falcon and Data Lifecycle Management

Data management in the Hadoop ecosystem is still in the early stages of development. The goal…

Continue

Added by Michael Walker on July 30, 2013 at 12:13pm — No Comments

Accumulo - Sqrrl NoSQL Secure Database

A major concern for organizations building big data analytical ecosystems is data security. One flaw of Hadoop/MapReduce and many NoSQL databases is weak security.

.…

Continue

Added by Michael Walker on May 1, 2013 at 9:00am — No Comments

Spark, Shark and Mesos Data Analytics Stack

The Berkeley Data Analytics Stack (BDAS) is an open source, next-generation data analytics stack under development at the UC Berkeley AMPLab whose current components include …

Continue

Added by Michael Walker on February 27, 2013 at 10:08am — No Comments

R + Hadoop = Data Analytics Heaven

 

Hadoop (MapReduce where code is turned into map and reduce jobs, and Hadoop runs the jobs) is the most well known technology used for "Big Data" because it allows an organization to store huge quantities of data at very low…

Continue

Added by Michael Walker on November 7, 2012 at 3:57pm — No Comments

Hadoop Technology Stack

The Hadoop stack includes more than a dozen components, or subprojects, that are complex to deploy and manage. Installation, configuration and production deployment at scale is challenging.

The main components…

Continue

Added by Michael Walker on August 22, 2012 at 9:40am — No Comments

Videos

  • Add Videos
  • View All

© 2019   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service