Handling empty components in EM implementation for GMM learning

Question

Handling empty components in EM implementation for GMM learning

398 views Asked by AruniRC At 29 November 2014 at 16:41

I am trying to implement learning a Gaussian Mixture Model using EM from scratch in MATLAB. The project requires some later modifications to the standard GMM model, which is why I am not using off-the-shelf implementations such as VLFeat or the Stats Toolbox. Rolling out an implementation would be a learning experience and be easily customizable later on.

Specifically, coding EM for a GMM with spherical covariances.

Handling empty clusters. I am having trouble handling the case when some components of the GMM are not assigned any data - they have zero or negligible posterior probability mass. This case arises when there are a large number of clusters defined. What is the standard way of handling this case?
Intuitively, I would select the component with highest covariance and assign half of its data to the empty component.

My question is: is there a standard and principled way of handling this in EM implementations (which I haven't managed to find via Google)?

Original Q&A

There are 1 answers

**Has QUIT--Anony-Mousse** · Answer 1 · 2014-12-14T12:14:40+00:00

Has QUIT--Anony-Mousse On 14 December 2014 at 12:14

Empty components in GMM should not arise.

Usually, you do soft assignments, so at least a tiny fraction of some object will remain in every component. This is why you need a convergence threshold for EM.

TechQA.

Handling empty components in EM implementation for GMM learning

There are 1 answers

Related Questions in MATLAB

Related Questions in CLUSTER-ANALYSIS

Related Questions in MIXTURE-MODEL

Popular Questions

Popular Tags

Trending Questions