I have read quite a few tutorials since morning . My problem involves finding the similarity between two documents. I am looking forward to use LSA in java for this purpose.
I understood the creation of the term-document matrix and then the SVD(Dimensionality gets reduced) is applied to it . 3 Matrices are obtained as results.This might sound stupid but i have been stuck with this for a quite a while . Now if i have to find the similarity between the two documents what do i have to do ?
After calculating the 3 matrices using SVD, you need to calculate the correlation between the vectors of the two documents you want to compare. you can use spearman's correlation. Another way is with using the cosine distance.
you will find more details at LSA, there is a full example with explanation.
you might search for some java libraries for LSA.