What is the method used by Pandas profiling tool to identify duplicates rows?

376 views Asked by At

I'm looking for the rationale about the method used by pandas profiling tool to identify duplicates rows (in a dataframe with multiple columns)? I couldn't find it in Pandas Profiling documentation.

1

There are 1 answers

0
loopy On

See model/summary line 571-575.

In other cases, can be simplified as sum(df.duplicated())