List Question
20 TechQA 2015-06-12T14:50:13.993000MinHashing vs SimHashing
1.2k views
Asked by cjauvin
How to calculate similarity of two texts with Jaccard similarity of two bag via MinHash?
643 views
Asked by Yuansheng liu
Mahout minhash org.apache.hadoop.io.LongWritable cannot be cast to org.apache.hadoop.io.Text
710 views
Asked by Osy
Relationship between (1) hash function, (2) length of signature and (3) jaccard similarity?
1.3k views
Asked by user3142979
LSH implementation for finding clusters
491 views
Asked by Samarth Shah
Is the number of rows always 1 in each band in the Spark implementation of MinHashLSH
872 views
Asked by zyxue
One-hot encoding minHashed genomes
135 views
Asked by C. John
Faster implementation of LSH (AND-OR)
2.2k views
Asked by Ramki
Clarification needed about min/sim hashing + LSH
1.1k views
Asked by cjauvin
Should we consider two sets to be similar if their rows contain the same hashes but in different order?
110 views
Asked by haky_nash
Why does my query using a MinHash analyzer fail to retrieve duplicates?
954 views
Asked by Davide Fiocco
Generate sparse vector for all the column values in spark dataframe
488 views
Asked by Tanmay Sinha
Transform a dataframe for the minHashLSH in spark
251 views
Asked by Galuoises
Minhashing on Strings with K-length
410 views
Asked by Afonso Guimaraes
Set distance as similarity metric for MinHashing algorithm
239 views
Asked by Maggie
Spark MinHashLSH Never Progresses
345 views
Asked by Charlie
making LSH implementation faster in C++11
322 views
Asked by SBDK8219
why is JaccardDistance always 0 for different docs from spark MinHashLSHModel approxSimilarityJoin
708 views
Asked by steve99
How do I find the k-nearest values in n-dimensional space?
653 views
Asked by Eyal