Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's very hard for simultaneous good audio generation with video generation (simultaneous generation is necessary to maintain lip sync). Veo 3 et al also have flat monochannel audio, but not as bad as these Sora 2 demos.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: