What is Silhouette Score?

Silhouette Score compares the distance of observations to the centroids of the clusters they are assigned to against that to the centroids of other clusters in an algorithm like K-Means. It ranges between -1 and 1, where values close to -1 indicate that observations might have been assigned to the wrong cluster, where values close to 1 imply observations are close to the centroids of their own cluster but far from centroids of other clusters, which is indicative of compact clustering. Values closer to 0 indicate possible overlap between clusters, meaning it is ambiguous which cluster an observation should belong to. The average Silhouette score across all observations is reported to arrive at a global measure

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad