My data has the following shape:

    id  column1  column2
    a   x        1
    a   x        3
    a   y        3
    b   y        1
    b   y        2

I want to get the most repeated value of each column for each id, as well as its frequency percentage:

    id  column1  %      column2  %
    a   x        66.6   3        66.6
    b   y        100.0  N/A      N/A

A special case is when two or more values tie for the highest frequency: then I output N/A for both the value and its percentage.
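
To make the rule concrete, here is a minimal plain-Python sketch of the logic described above. It is only an illustration under my own assumptions: the helper name `most_repeated` and the use of `None` to stand for N/A are hypothetical choices, and the percentage is left unrounded.

```python
from collections import Counter

def most_repeated(values):
    """Return (value, percentage) of the most frequent item, or (None, None) on a tie."""
    top = Counter(values).most_common(2)
    # A tie for first place means there is no single most repeated value.
    if len(top) > 1 and top[0][1] == top[1][1]:
        return None, None
    value, count = top[0]
    return value, 100.0 * count / len(values)

rows = [("a", "x", 1), ("a", "x", 3), ("a", "y", 3), ("b", "y", 1), ("b", "y", 2)]

# Collect the values of each column per id.
groups = {}
for id_, c1, c2 in rows:
    cols = groups.setdefault(id_, {"column1": [], "column2": []})
    cols["column1"].append(c1)
    cols["column2"].append(c2)

# Apply the rule to every column of every id.
result = {
    id_: {name: most_repeated(vals) for name, vals in cols.items()}
    for id_, cols in groups.items()
}
```

For id `b`, `column2` has the values 1 and 2 once each, so it falls into the tie case and comes back as `(None, None)`.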
Right now my solution uses only Python dictionaries and lists. However, I am struggling to approach this from a DataFrame point of view.
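
For reference, a possible DataFrame-based starting point might look like the sketch below. It assumes pandas is the DataFrame library in question, and the helper name `top_value` and the `"column1 %"` / `"column2 %"` column labels are hypothetical. It relies on `Series.value_counts()` returning counts sorted in descending order, so a tie shows up as the first two counts being equal.

```python
import pandas as pd

df = pd.DataFrame({
    "id": ["a", "a", "a", "b", "b"],
    "column1": ["x", "x", "y", "y", "y"],
    "column2": [1, 3, 3, 1, 2],
})

def top_value(s):
    """Hypothetical helper: most frequent value and its percentage, or (None, None) on a tie."""
    counts = s.value_counts()  # sorted by count, highest first
    if len(counts) > 1 and counts.iloc[0] == counts.iloc[1]:
        return None, None
    return counts.index[0], 100.0 * counts.iloc[0] / counts.sum()

records = []
for id_, group in df.groupby("id"):
    row = {"id": id_}
    for col in ["column1", "column2"]:
        value, pct = top_value(group[col])
        row[col] = value
        row[col + " %"] = pct
    records.append(row)

# Ties end up as missing values (None/NaN) in the resulting frame.
result = pd.DataFrame(records).set_index("id")
```

This still loops over the groups in Python, so it is closer to a direct translation of the dictionary approach than to a fully vectorised solution.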