Which is the best one, so far? Eleven labs? OpenAI apparently has something in t...

smusamashah · on Sept 25, 2023

Google Soundstorm had the best demo so far. It takes few seconds of original audio and continues it with the same voices. Just hearing those examples you wont figure out where original finished and generated one started.

narrationbox · on Sept 26, 2023

Yeah, neural codecs are pretty amazing. The most incredible part is that they can do compression well across the temporal domain, something which has been non-trivial.

airstrike · on Sept 25, 2023

I'm also curious. A review of what's state-of-the-art today would be a great idea for a blog post. Just don't post it on medium.com please

yzydserd · on Sept 25, 2023

Still Azure Speech in my experience.

radicalriddler · on Sept 25, 2023

ElevenLabs is better quality wise, but it's vastly more expensive. Azure Speech hits a really good price:quality ratio.