Topics and LL/token in Mallet change every time

Question

Topics and LL/token in Mallet change every time

148 views Asked by AudioBubble At 13 November 2021 at 11:56

Why do I get different keywords and LL/token every time I run topic models in Mallet? Is it normal?

Please help. Thank you.

There are 1 answers

**David Mimno** · Answer 1 · 2021-11-13T21:39:12+00:00

Yes, this is normal and expected. Mallet implements a randomized algorithm. Finding the exact optimal best topic model for a collection is computationally intractable, but it's much easier to find one of countless "pretty good" solutions.

As an intuition, imagine shaking a box of sand. The smaller particles will sift towards one side, and the larger particles towards the other. That's way easier than trying to sort them by hand. You won't get the exact order, but each time you'll get one of a large number of equally good approximate sortings.

If you want to have a stronger guarantee of local optimality, add --num-icm-iterations 100 to switch from sampling to choosing the single best allocation for each token, given all the others.

TechQA.

Topics and LL/token in Mallet change every time

There are 1 answers

Related Questions in MALLET

Popular Questions

Trending Questions