I have a fairly simple function that runs quite fast in parallel using Numba, and I would like to know whether I can run it on my Apple M3 Max's GPU. However, I have never written GPU code before (I have always worked on Macs), so I am a little lost.
I have included a small use case:
```python
import numpy as np
import numba as nb

N = 15   # size of a and b
K = 127  # size of the result
L = 943  # size of the operator

a = np.random.standard_normal(size=N)
b = np.random.standard_normal(size=N)

operator = np.zeros(shape=(L, 4), dtype=np.int64)
operator[:, 0] = np.random.randint(size=L, low=0, high=N)
operator[:, 1] = np.random.randint(size=L, low=0, high=N)
operator[:, 2] = np.random.randint(size=L, low=0, high=K)
operator[:, 3] = np.random.randint(size=L, low=1, high=10)

@nb.njit(parallel=True)
def shuffle_mul(a: np.ndarray, b: np.ndarray, operator: np.ndarray) -> np.ndarray:
    res = np.zeros(shape=K, dtype=a.dtype)
    for n in nb.prange(len(operator)):
        i, j, k, count = operator[n]
        res[k] += count * a[i] * b[j]
    return res

shuffle_mul(a, b, operator=operator)  # warm-up
%timeit shuffle_mul(a, b, operator=operator)
# 173 µs ± 41.3 µs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
```

Question 1: Is it even theoretically possible/interesting to run such a function on a GPU? Is it a problem that multiple threads might read `a` and `b`, or add to `res`, at the same time?
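For context, I believe the whole loop boils down to a single scatter-add: `a` and `b` are only ever read, and the only contended writes are the accumulations into `res[k]`. My own plain-NumPy reformulation (not part of the Numba code above, just to make the access pattern explicit):

```python
import numpy as np

def shuffle_mul_np(a: np.ndarray, b: np.ndarray,
                   operator: np.ndarray, K: int) -> np.ndarray:
    """Same computation as shuffle_mul, written as one scatter-add."""
    i, j, k, count = operator[:, 0], operator[:, 1], operator[:, 2], operator[:, 3]
    res = np.zeros(K, dtype=a.dtype)
    # np.add.at is an unbuffered scatter-add: repeated k indices accumulate
    # correctly instead of overwriting each other.
    np.add.at(res, k, count * a[i] * b[j])
    return res
```

If that reformulation is right, then the GPU question is really "how do I do a scatter-add on the GPU", which I understand is a well-known pattern (usually handled with atomic adds or segmented reductions).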
Question 2: What would be the best approach, and which library should I use: metalcompute, JAX, PyTorch, ...?
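In case it helps, here is an untested sketch of what I imagine the PyTorch version would look like. I am assuming that `Tensor.index_add_` (which accumulates over repeated indices) is supported on the `"mps"` backend and would run the scatter-add on the Apple GPU; I have not verified the performance:

```python
import torch

def shuffle_mul_torch(a: torch.Tensor, b: torch.Tensor,
                      operator: torch.Tensor, K: int) -> torch.Tensor:
    """Sketch of shuffle_mul as a PyTorch scatter-add (assumption, untimed)."""
    i, j, k, count = operator.T  # unpack the four columns
    res = torch.zeros(K, dtype=a.dtype, device=a.device)
    # index_add_ accumulates into res[k], handling repeated k indices.
    res.index_add_(0, k, count * a[i] * b[j])
    return res

# On an Apple-silicon Mac I would move everything to the GPU first,
# e.g. device = "mps" if torch.backends.mps.is_available() else "cpu".
```

Is that roughly the right way to express it, or would a hand-written Metal kernel (via metalcompute) be worth the trouble at these sizes?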
Thanks a lot!