What loss function does K-Means seek to minimize?

K-Means minimizes the total within-cluster sum of squares (WCSS), meaning that it penalizes observations that lie far away from the centroid of the cluster to which they are assigned more so than those close to the centroids. K-Means does not require a specific choice of distance metric, but the Euclidean distance is used most often. 

Author

Help us improve this post by suggesting in comments below:

– modifications to the text, and infographics
– video resources that offer clear explanations for this question
– code snippets and case studies relevant to this concept
– online blogs, and research publications that are a “must read” on this topic

Leave the first comment

Partner Ad
Find out all the ways that you can
Contribute