By the fuzzy match here , I mean to find the documents which have like 60-70% of word matches from the word list in query.
Eg :
>> #(Query string as passed by user)
>> query = i am searching for a document that is matched fuzzily with what i am giving here.
>> QueryParser("content", ix.schema).parse(query)
This query will look for documents with all the words but i want to find all those documents which contain at least 60% or more of the above words.
Since the count of words that I would be dealing with is large and I do not want programatically partitioning of this word set into different sets (for ORing).
This seems
Not implemented yet
in Whoosh (Checked28/05/2015
).However, in the documentation of
[whoosh.query.Or][1]
, there is a reference to aminmatch
argument:If we supposed
minmatch
is the minimal matched keywords so the solution whould be likeIn this case, you should ignore stop filtering or you should filter the stopwords from the query before calculating the length of query.