That's not accurate at all. Someone writing for the rocm platform will be writin... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		jacoblambda on Dec 18, 2023 \| parent \| context \| favorite \| on: AMD's CDNA 3 Compute Architecture That's not accurate at all. Someone writing for the rocm platform will be writing HIP and then HIPCC will compile that down into the actual runtime targeting the given architecture. HIP is pretty platform agnostic so very little device specific optimization tends to go into idiomatic HIP.

treprinum on Dec 18, 2023 [–]

There are many low-level details in CUDA to get it working fast on a given GPU architecture that compilers don't address at all.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact