What are some of the pros and cons of hierarchical clustering compared to K-Means?

Pros:

  • Do not have to specify the number of clusters before running the algorithm
  • Results are reproducible and not subject to randomness introduced by choice of initial centroids as in K-Means

Cons:

  • Requires computation of pairwise linkage matrix, which can be computationally expensive
  • Results can differ based on the linkage criteria used
  • Can be sensitive to noise in data

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad
Find out all the ways that you can
Contribute