Apache Hadoop is an open source application framework for storingand handling enormousscale of datasets on commodity hardwareclusters. Hadoop is an Apache topmost-level project being designed and utilized by a group of usersand contributors. It is authorized under the Apache License 2.0.
The Apache Hadoop framework contains following four modules:
Each module in Hadoop are outlined with a basicsuppositionthatfailures ofhardware (racks of machines or of individual machines) are normal and consequently ought to be naturally taken care of in software by the system. Apache Hadoop's HDFS and MapReduce modules initially builtseparately Google File System (GFS) papers and Google's MapReduce.
Beyond HDFS, MapReduceand YARN, the whole Apache Hadoop "framework" is currently normally considered to comprise of various related projects also: Apache Hive, Apache Pig, Apache HBase, and others.
For the end-clients, however MapReduce Java code is basic, any programming dialect can be utilized with "Hadoop Streaming” to execute the “map” and “reduce“parts of the client's project. Apache Hive and Apache Pig and among other related projects, uncover more elevated amount client interfaces like Pig latin and a SQL variation individually. The Hadoop Platform itself is generally composed in the Java programming language, with some code in C and order line utilities composed as shell-scripts.
For More Details :
Visit Here :
Posted 12 April 2021
© 2021 TechTarget, Inc.
Powered by
Badges | Report an Issue | Privacy Policy | Terms of Service
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Archives: 2008-2014 | 2015-2016 | 2017-2019 | Book 1 | Book 2 | More
Most popular articles