Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
rurban
10 months ago
|
parent
|
context
|
favorite
| on:
FP8 is ~100 tflops faster when the kernel name has...
That's strange because the cutlass docs explicitly does NOT mention fp8 support. So it looks like it can be used nevertheless with fp8 by using the name hack.
mlazos
10 months ago
[–]
It supports e5m2 and e4m3 right in the doc linked.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: