Silhouette Score compares the distance of observations to the centroids of the clusters they are assigned to against that to the centroids of other clusters in an algorithm like K-Means. It ranges between -1 and 1, where values close to -1 indicate that observations might have been assigned to the wrong cluster, where values close to 1 imply observations are close to the centroids of their own cluster but far from centroids of other clusters, which is indicative of compact clustering. Values closer to 0 indicate possible overlap between clusters, meaning it is ambiguous which cluster an observation should belong to. The average Silhouette score across all observations is reported to arrive at a global measure
What is Silhouette Score?
Help us improve this post by suggesting in comments below:
– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic
Partner Ad