*This article was written by Mohammad Sajid.*


Statistical cluster analysis is an exploratory data analysis technique that groups heterogeneous objects (M.D.) into homogeneous groups. This article covers the basics of cluster analysis from a mathematical point of view.

Cluster analysis can be carried out by two methods:

- Hierarchical cluster analysis
- Non-hierarchical cluster analysis

**Hierarchical Cluster Analysis (HCA):**

- In HCA, the observation vectors (cases) are grouped together on the basis of their mutual distances.
- An HCA is usually visualised through a hierarchical tree called a dendrogram. This tree is a nested set of partitions represented by a tree diagram.
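
To make "mutual distance" concrete, here is a minimal sketch that builds the matrix of pairwise distances between cases. Euclidean distance and the toy data are assumptions for illustration; the article does not prescribe a particular distance measure.

```python
import math

def pairwise_distances(cases):
    """Return the symmetric matrix of Euclidean distances between cases."""
    n = len(cases)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(cases[i], cases[j])
            dist[i][j] = dist[j][i] = d  # distance is symmetric
    return dist

# Hypothetical toy data: four cases measured on two variables.
cases = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0), (0.0, 1.0)]
distances = pairwise_distances(cases)
```

Here `distances[0][3]` is the smallest off-diagonal entry, so cases 0 and 3 would be the first candidates for grouping.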

**Characteristics of HCA:**

- Sectioning a tree at a particular level produces a partition into **g** disjoint groups.
- If two groups are chosen from different partitions, then either the groups are disjoint or one group is entirely contained within the other.
- A numerical value is associated with each point of the tree where branches join. This value measures the distance or dissimilarity between the two merged clusters.
- Different distance measures give rise to different hierarchical cluster structures.
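
The first two characteristics can be illustrated by sectioning a small tree at two different levels. This is a sketch only: the merge history below is a hypothetical example, written as `(left members, right members, joining distance)` and assumed to be ordered by increasing distance.

```python
def cut_tree(n_cases, merges, height):
    """Apply every merge whose joining distance is <= height; return the partition."""
    groups = [{i} for i in range(n_cases)]  # each case starts as its own group
    for left, right, distance in merges:
        if distance > height:
            break  # merges are assumed sorted by increasing distance
        groups = [g for g in groups if g != set(left) and g != set(right)]
        groups.append(set(left) | set(right))
    return sorted(sorted(g) for g in groups)

# Hypothetical merge history for four cases.
merges = [([0], [1], 1.0), ([2], [3], 2.0), ([0, 1], [2, 3], 5.0)]

coarse = cut_tree(4, merges, height=3.0)  # g = 2 groups
fine = cut_tree(4, merges, height=1.5)    # g = 3 groups

# Every group of the finer partition lies entirely inside one coarser group.
nested = all(any(set(f) <= set(c) for c in coarse) for f in fine)
```

Cutting higher in the tree gives fewer, larger groups; cutting lower gives more, smaller ones, and the two partitions never overlap partially.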

**There are two approaches to HCA:**

- Agglomerative HCA
- Divisive HCA

**Agglomerative HCA:**

- Operates by successive merging of cases.
- Begin with n clusters, each containing a single case.
- At each stage, merge the two most similar groups to form a new cluster, thus reducing the number of clusters by one.
- Continue (as similarity decreases) until all subgroups are fused into one single cluster.
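
The merging steps above can be sketched in a few lines. The choice of single linkage (distance between the closest members of two clusters) and Euclidean distance is an assumption made for this example; other linkage rules would follow the same loop with a different distance between clusters.

```python
import math

def agglomerative(cases):
    """Single-linkage agglomerative clustering; returns the merge history."""
    clusters = [[i] for i in range(len(cases))]  # start: one cluster per case
    merges = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # single linkage: distance between the closest members
                d = min(math.dist(cases[i], cases[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), d))
        merged = clusters[a] + clusters[b]
        del clusters[b]  # delete b first (b > a) so index a stays valid
        del clusters[a]
        clusters.append(merged)  # each pass reduces the cluster count by one
    return merges

# Hypothetical toy data: two tight pairs that are far from each other.
cases = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 2.0)]
merges = agglomerative(cases)
```

For n cases the loop always performs n - 1 merges, and the recorded joining distances are the heights at which branches meet in the dendrogram.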

**Divisive HCA:**

- The divisive method operates by successive splitting of groups.
- It starts with a single group (i.e. one single cluster) containing all cases.
- The group is divided into two subgroups such that the objects in one subgroup are as far as possible from the objects in the other.
- Continue until there are 'n' groups, each containing a single case.
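
One possible way to carry out the splitting is sketched below. The splitting rule is an assumption chosen for simplicity, not a method named in the article: the group with the largest diameter is split, its two most distant cases act as seeds, and every other case joins the nearer seed.

```python
import math

def diameter(cases, group):
    """Largest distance between any two cases in the group."""
    return max(math.dist(cases[i], cases[j]) for i in group for j in group)

def split(cases, group):
    """Split one group in two, seeded by its two most distant cases."""
    seed_a, seed_b = max(
        ((i, j) for i in group for j in group if i < j),
        key=lambda pair: math.dist(cases[pair[0]], cases[pair[1]]),
    )
    left, right = [seed_a], [seed_b]
    for i in group:
        if i in (seed_a, seed_b):
            continue
        to_a = math.dist(cases[i], cases[seed_a])
        to_b = math.dist(cases[i], cases[seed_b])
        (left if to_a <= to_b else right).append(i)
    return left, right

def divisive(cases):
    """Return the sequence of partitions, from one group down to n singletons."""
    groups = [list(range(len(cases)))]
    history = [sorted(sorted(g) for g in groups)]
    while any(len(g) > 1 for g in groups):
        widest = max((g for g in groups if len(g) > 1),
                     key=lambda g: diameter(cases, g))
        groups.remove(widest)
        groups.extend(split(cases, widest))
        history.append(sorted(sorted(g) for g in groups))
    return history

# Hypothetical toy data: two tight pairs that are far from each other.
cases = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 2.0)]
history = divisive(cases)
```

Each pass adds exactly one group, so the history runs from a single cluster to n singleton groups in n - 1 splits.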


