Affinity Propagation (sklearn) - strange behavior

Question

Affinity Propagation (sklearn) - strange behavior

489 views Asked by Baba At 14 June 2015 at 13:25

Trying to use affinity propagation for a simple clustering task:

from sklearn.cluster import AffinityPropagation
c = [[0], [0], [0], [0], [0], [0], [0], [0]]
af = AffinityPropagation (affinity = 'euclidean').fit (c)
print (af.labels_)

I get this strange result: [0 1 0 1 2 1 1 0]

I would expect to have all samples in the same cluster, like in this case:

c = [[0], [0], [0]]
af = AffinityPropagation (affinity = 'euclidean').fit (c)
print (af.labels_)

which indeed puts all samples in the same cluster: [0 0 0]

What am I missing?

Thanks

Original Q&A

There are 1 answers

**Andreus** · Answer 1 · 2015-06-15T15:56:47+00:00

I believe this is because your problem is essentially ill-posed (you pass lots of the same point to an algorithm which is trying to find similarity between different points). AffinityPropagation is doing matrix math under the hood, and your similarity matrix (which is all zeros) is nastily degenerate. In order to not error out, the implementation adds a small random matrix to the similarity matrix, preventing the algorithm from quitting when it encounters two of the same point.

TechQA.

Affinity Propagation (sklearn) - strange behavior

There are 1 answers

Related Questions in SCIKIT-LEARN

Related Questions in CLUSTER-ANALYSIS

Popular Questions

Popular Tags

Trending Questions