Python recordlinkage identity

Question

Python recordlinkage identity

170 views Asked by Taiwotman At 20 September 2018 at 17:12

Similar issue as R recordlinkage identity but in python. The algorithm generates new identity that do no reflect the correct identity of the records that were matche. Assuming data duplication with a single dataframe.

PS: It seems to be okay in the data duplication example

Original Q&A

There are 1 answers

**Taiwotman** · Answer 1 · 2018-09-20T20:51:07+00:00

Taiwotman On 20 September 2018 at 20:51

The index column that is generated using pandas needs to be dropped and replaced by the preferred column in the dataframe to use as the identify column

Logic is

replace index column with identify column in dataframe

TechQA.

Python recordlinkage identity

There are 1 answers

Related Questions in PYTHON

Related Questions in RECORD-LINKAGE

Popular Questions

Popular Tags

Trending Questions