What are some of the possible linkage types to use in order to form successive clusters?

  • Ward: The Ward linkage forms successive clusters by finding the clusters that minimize the increase in within-cluster variance at each step. It is the default option in sklearn’s hierarchical clustering API. 
  • Single: The next cluster is formed by merging the observation that has the shortest minimum distance to an existing cluster. 
  • Complete: The next cluster is formed by merging the observation that has the shortest maximum distance to an existing cluster. 
  • Average: The next cluster is formed by merging the observation with the shortest average distance to an existing cluster. 

Note: In divisive clustering, clusters are formed by dividing existing clusters that maximize the choice of linkage, essentially working inversely to what was described for each option. 

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad
Find out all the ways that you can
Contribute