How a clustering algorithm in R can end up with negative silhouette values? AB

819 views Asked by At

We know that clustering methods in R assign observations to the closest medoids. Hence, it is supposed to be the closest cluster each observation can have. So, I wonder how it is possible to have negative values of silhouette , while we are supposedly assign each observation to the closest cluster and the formula in silhouette method cannot get negative?

Behnam.

1

There are 1 answers

4
Has QUIT--Anony-Mousse On

Two errors:

  1. most clustering algorithms do not use the medoid, only PAM does.

  2. the silhouette does not use the distance to the medoid, but the average distance to all cluster members. If the closest cluster is very wide, the average distance can be larger than the distance to the medoid. Consider a cluster with one point in the center, and all others on a sphere around it.