create a new column based on cumulative occurrences of a specific value in another column pandas

Question

create a new column based on cumulative occurrences of a specific value in another column pandas

526 views Asked by Paul At 20 April 2022 at 20:38

I want to count the number of occurrences of one specific value (string) in one column and write it down in another column cumulatively.

For example, counting the cumulative number of Y values here:

col_1  new_col
Y        1
Y        2
N        2
Y        3
N        3

I wrote this code but it gives me the final number instead of cumulative frequencies.

df['new_col'] = 0
df['new_col'] = df.loc[df.col_1 == 'Y'].count()

Original Q&A

There are 2 answers

G.G On 31 August 2023 at 15:03

df1.assign(new_col=df1.col_1.eq("Y").cumsum())

Output:

  col_1  new_col
0     Y        1
1     Y        2
2     N        2
3     Y        3
4     N        3

**mozway** · Accepted Answer · 2022-04-20T20:47:00+00:00

To count both values cumulatively you can use:

df['new_col'] = (df
                 .groupby('col_1')
                 .cumcount().add(1)
                 .cummax()
                 )

If you want to focus on 'Y':

df['new_col'] = (df
                 .groupby('col_1')
                 .cumcount().add(1)
                 .where(df['col_1'].eq('Y'))
                 .ffill()
                 .fillna(0, downcast='infer')
                 )

Output:

  col_1  new_col
0     Y        1
1     Y        2
2     N        2
3     Y        3
4     N        3

TechQA.

create a new column based on cumulative occurrences of a specific value in another column pandas

There are 2 answers

Related Questions in PANDAS

Related Questions in CUMULATIVE-FREQUENCY

Popular Questions

Popular Tags

Trending Questions