Hadoop – Introduction & features
Let us start with what Hadoop is and which of its features make it so popular.
Hadoop is an open-source software framework for distributed storage and distributed processing of extremely large data sets. Important features of Hadoop are:
Hadoop is an open-source project, which means its code can be modified to suit business requirements.
In Hadoop, data is highly available and…
Added by Sheetal Sharma on July 31, 2017 at 7:30pm — No Comments
There is a lot of hype around Big Data and its characteristics, most of which have been summed up in nine Vs: Volume, Velocity, Variety, Veracity, Validity, Volatility, Value, Variability, and Viscosity.
In a recently published white paper, the credit reference agency Experian proposes adding another “V” to the…
Added by Sheetal Sharma on July 20, 2017 at 8:00pm — No Comments
First we will look at what clustering in R is, then at the applications of clustering, clustering by similarity aggregation, use of the R amap package, implementation of hierarchical clustering in R, and examples of R clustering in various fields.
2. Introduction to Clustering in…
Added by Sheetal Sharma on July 19, 2017 at 9:00pm — No Comments