cumulative aggregate a polars list[struct[]]

I have to accomplish a complex dataframe conversion like this:

original_dataframe = pl.DataFrame({'index': ['A', 'B', 'C', 'D', 'E', 'F', 'G'], 'content': [{'key': 3, 'val': 20}, {'key': 4, 'val': 50}, {'key': 3, 'val': 8}, {'key': 5, 'val': 70}, {'key': 4, 'val': -60}, {'key': 2, 'val': 30}, {'key': 4, 'val': 5}]})┌───────┬───────────┐│ index ┆ content   ││ ---   ┆ ---       ││ str   ┆ struct[2] │╞═══════╪═══════════╡│ A     ┆ {3,20}    ││ B     ┆ {4,50}    ││ C     ┆ {3,8}     ││ D     ┆ {5,70}    ││ E     ┆ {4,-60}   ││ F     ┆ {2,30}    ││ G     ┆ {4,5}     │└───────┴───────────┘       ||       \/ ┌───────┬──────────────────────────┐│ index ┆ content                  ││ ---   ┆ ---                      ││ str   ┆ list[struct[2]]          │╞═══════╪══════════════════════════╡│ A     ┆ [{3,20}]                 ││ B     ┆ [{3,20}, {4,50}]         ││ C     ┆ [{3,28}, {4,50}]         ││ D     ┆ [{3,28}, {4,50}, {5,70}] ││ E     ┆ [{3,28}, {5,70}]         │└───────┴──────────────────────────┘

This conversion combines:

cumulative add struct into list row by row;
if it exists same struct 'key' field in the list, aggregate the two struct by sum struct 'val' field;
if the struct 'val' field<= 0 after aggregation, drop it in the list;
sort each list by struct 'key' field;
also drop struct if its 'val' field or 'key' field is null.

The conversion can be ugly down by use iter_rows() and to_list() to iterate dataframe rows with intermediate python data type list, dict. But this way is slow. How it can be solved just use polars functions for fast and elegant?

PS: Thanks @jqurious' reminder, there is an additional requirement, so I updated the question.

pl.DataFrame({'index': ['A', 'B', 'C', 'D', 'E', 'F'], 'content': [{'key': 3, 'val': 20}, {'key': 4, 'val': 50}, {'key': 3, 'val': 8}, {'key': 2, 'val': 30}, {'key': 4, 'val': -60}, {'key': 4, 'val': 5}]})┌───────┬───────────┐│ index ┆ content   ││ ---   ┆ ---       ││ str   ┆ struct[2] │╞═══════╪═══════════╡│ A     ┆ {3,20}    ││ B     ┆ {4,50}    ││ C     ┆ {3,8}     ││ D     ┆ {2,30}    ││ E     ┆ {4,-60}   ││ F     ┆ {4,5}     │└───────┴───────────┘        ||        \/ ┌───────┬──────────────────────────┐│ index ┆ content                  ││ ---   ┆ ---                      ││ str   ┆ list[struct[2]]          │╞═══════╪══════════════════════════╡│ A     ┆ [{3,20}]                 ││ B     ┆ [{3,20}, {4,50}]         ││ C     ┆ [{3,28}, {4,50}]         ││ D     ┆ [{2,30}, {3,28}, {4,50}] ││ E     ┆ [{2,30}, {3,28}]         ││ F     ┆ [{2,30}, {3,28}, {4,5}]  │└───────┴──────────────────────────┘

the updated requirement is:

if the struct 'val' field<= 0 after cumulative sum, drop it at the corresponding row's list immediately; and if the struct 'key' field appears again in the following rows with struct 'val' field> 0, it should be cumulative aggregate again;

cumulative aggregate a polars list[struct[]]

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...