When issuing the same query with match all query (* : *) I get different clusters and scores all the time. What could be the reason?
First try:
label: "В Минске"
score: 52.79549568196028
Second try:
label: "В Минске"
"score": 54.74385944060893
Third try:
label: "В Минске"
"score": 48.884082925408734
Document ids inside clusters are also different. Clusters themselves change: in one query response I get a cluster "тысячами евро", in the subsequent one it is gone, but new cluster appears: "Тысячами Долларов"
Is there some carrot parameter that could make clusters stable for a given query? Could it be desiredClusterCountBase ?
The Solr index is the same for all cases. Algorithm used: org.carrot2.clustering.lingo.LingoClusteringAlgorithm with StopWordLabelFilter.enabled=false and clustering.rows=1000.
It looks like I found the reason:
I don't know if this is a bug still, but removing all documents from the index and putting them back with the correct publication date has solved the issue.