
That's not accurate at all.

Someone writing for the ROCm platform will be writing HIP, and hipcc will then compile that down to code targeting the given architecture. HIP is pretty platform-agnostic, so very little device-specific optimization tends to go into idiomatic HIP.



Getting CUDA code to run fast on a given GPU architecture involves many low-level details that compilers don't address at all.
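
To make that concrete, here is a minimal sketch of one such detail: a warp-level sum that uses register shuffles instead of shared memory. The names `warpReduceSum` and `sumKernel` are made up for illustration and aren't taken from any particular codebase. The hard-coded 32 and the full lane mask assume NVIDIA's 32-lane warp. AMD's GCN and CDNA parts use 64-wide wavefronts, so a loop like this would likely need rethinking rather than just recompiling.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: sums one value per lane across a warp using
// register shuffles. ASSUMPTION: a warp has exactly 32 lanes.
__inline__ __device__ float warpReduceSum(float val) {
    // Halve the shuffle distance each step: 16, 8, 4, 2, 1.
    for (int offset = 16; offset > 0; offset >>= 1) {
        val += __shfl_down_sync(0xffffffffu, val, offset);
    }
    return val;  // only lane 0 holds the full warp total
}

// Hypothetical kernel: each warp reduces its 32 inputs, then lane 0
// adds the partial sum to a single global accumulator.
__global__ void sumKernel(const float* in, float* out, int n) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    // Out-of-range threads contribute 0 but still take part in the
    // shuffle, so the full-warp mask above stays valid.
    float v = (idx < n) ? in[idx] : 0.0f;
    v = warpReduceSum(v);
    if ((threadIdx.x & 31) == 0) {  // lane 0 of each 32-lane warp
        atomicAdd(out, v);
    }
}

int main() {
    const int n = 1024;
    float *in = nullptr, *out = nullptr;
    cudaMallocManaged(&in, n * sizeof(float));
    cudaMallocManaged(&out, sizeof(float));
    for (int i = 0; i < n; ++i) in[i] = 1.0f;
    *out = 0.0f;

    // Block size is a multiple of 32 so every warp is fully populated.
    sumKernel<<<(n + 255) / 256, 256>>>(in, out, n);
    cudaDeviceSynchronize();

    printf("sum = %f\n", *out);
    cudaFree(in);
    cudaFree(out);
    return 0;
}
```

None of this is something the compiler works out for you: the shuffle width, the mask, and the choice of registers over shared memory are all decisions written by hand with a specific architecture in mind.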



