I have been struggling to implement the following function.
I would like to first apply a groupBy on customer_name and then, for each group, calculate the Pearson correlation coefficient between price and units. The final DataFrame should have two columns: customer_name and corr_coeff. I would like to use pyspark.ml.stat.Correlation to calculate the coefficient. Could you please help me figure out the code? Here is an example.
```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("CustomFunctionExample").getOrCreate()

# Sample data (replace with your actual data)
data = [
    ("2021-01-06", "a1", "b1", 8.0, 8.0),
    ("2021-03-13", "a1", "b1", 1.0, 0.0),
    ("2021-06-20", "a1", "b5", 2.0, 0.0),
    ("2021-10-27", "a1", "b5", 8.0, 8.0),
    ("2021-01-06", "a1", "b2", 2.0, 2.0),
    ("2021-03-13", "a2", "b2", 9.0, 9.0),
    ("2021-06-06", "a2", "b4", 3.0, 3.0),
    ("2021-10-06", "a2", "b4", 8.0, 8.0),
]
schema = ["date", "customer_name", "upc", "price", "units"]
df = spark.createDataFrame(data, schema)
```

I am expecting a PySpark DataFrame with customer_name and corr_coeff columns.
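To pin down the expected output, here is a plain-Python cross-check of the per-group Pearson coefficients for the sample data above (no Spark required). Note that, as far as I can tell, pyspark.ml.stat.Correlation.corr computes a correlation matrix over a single vector column of the whole DataFrame, so it does not plug into groupBy directly; a per-group alternative such as the built-in pyspark.sql.functions.corr aggregate may end up being simpler. This sketch only establishes the numbers the final DataFrame should contain:

```python
from collections import defaultdict
from math import sqrt

# Same sample rows as above: (date, customer_name, upc, price, units)
data = [
    ("2021-01-06", "a1", "b1", 8.0, 8.0),
    ("2021-03-13", "a1", "b1", 1.0, 0.0),
    ("2021-06-20", "a1", "b5", 2.0, 0.0),
    ("2021-10-27", "a1", "b5", 8.0, 8.0),
    ("2021-01-06", "a1", "b2", 2.0, 2.0),
    ("2021-03-13", "a2", "b2", 9.0, 9.0),
    ("2021-06-06", "a2", "b4", 3.0, 3.0),
    ("2021-10-06", "a2", "b4", 8.0, 8.0),
]

# Group the (price, units) pairs by customer_name
groups = defaultdict(list)
for _, customer, _, price, units in data:
    groups[customer].append((price, units))

def pearson(pairs):
    """Pearson correlation coefficient of a list of (x, y) pairs."""
    n = len(pairs)
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in pairs)
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

corr = {customer: pearson(pairs) for customer, pairs in groups.items()}
print(corr)  # a2 is exactly 1.0 (price == units); a1 ≈ 0.9849
```

So whatever grouped-correlation approach is used, the resulting DataFrame should contain rows roughly like ("a1", 0.9849) and ("a2", 1.0).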