Mallet topic modelling issue when training with large number of topics

253 views Asked by At

I am trying to build a topic model with 10000 topics on a dataset of 1M samples. After data is loaded I am getting this line:

bin/mallet: line 62: 17428 Killed $JAVA_COMMAND $CLASS $*.

This is the command I am running:

`bin/mallet train-topics \
  --input data.mallet \
  --output-model topics.model \
  --output-topic-keys topic-keys.txt \ 
  --topic-word-weights-file topic-word-weights.txt \
  --word-topic-counts-file word-topic-counts-file.txt \
  --output-doc-topics doc-topics.txt \
  --num-topics 10000 \
  --num-threads 28 \
  --num-iterations 2000 \
  --use-symmetric-alpha FALSE`

Any suggestion is appreciated.

0

There are 0 answers