I am using doc2vec to convert the top 100 tweets of my followers in vector representation (say v1.....v100). After that I am using the vector representation to do the K-Means clusters.
model = Doc2Vec(documents=t, size=100, alpha=.035, window=10, workers=4, min_count=2)
I can see that cluster 0 is dominated by some values (say v10, v12, v23, ....). My question is what does these v10, v12 ... etc represents. Can I deduce that these specific column clusters specific keywords of document.
Don't use the individual variables. They should be only analyzed together because of the way these embeddings are trained.
For a starter, find