I have two dataframes of unequal length and would like to compare the similarity of strings in df2 with df1. Is it possible to apply Jaro-Winkler distance method to calculate the string similarity on two dataframes through map/lambda function.
df1
Behavioral disorders
Behçet disease
AV-Block
df2
Behavioral disorder
Behçet syndrome
The desired output is:
name_left name_right score
Behavioral disorders Behavioral disorder 0.933333
Behçet disease Behçet syndrome 0.865342
The scores mentioned above are hypothetical. Any help is highly appreciated
Assuming you want the max score and that the original columns in the input are "name":