
I thought this was fantastic! I'm surprised more people aren't commenting on it. Is there a reason I'm not aware of?

To the author: what happens to my voice after I upload it? And what is your plan moving forward? I'm too far out in left field to understand how you build a business around and monetize an open source product like this, even though I found it fun to play around with.



Thanks! There is a model that turns the voice sample into an embedding, and that embedding is what determines the generated voice. Unlike the STT and TTS models, we won't be releasing the weights of this voice cloning model, but we will provide it over an API so that we can do verification and prevent abuse.

edit: Ah yes, and we do not store the voice sample on our servers. Only the voice embedding is cached, for 24 hours.
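
For anyone wondering what "don't store the sample, cache the embedding for 24 hours" could look like in practice, here is a rough sketch in Python. It is only an illustration under my own assumptions, not the actual service: `embed_voice`, `EmbeddingCache`, the random lookup key, and the in-memory dict are all hypothetical.

```python
import hashlib
import secrets
import time

TTL_SECONDS = 24 * 60 * 60  # 24-hour lifetime, as stated in the reply above


def embed_voice(sample: bytes) -> list[float]:
    """Hypothetical stand-in for the unreleased voice-embedding model."""
    digest = hashlib.sha256(sample).digest()
    return [b / 255 for b in digest[:8]]


class EmbeddingCache:
    """Keeps embeddings in memory for a fixed time; never keeps the raw audio."""

    def __init__(self, ttl=TTL_SECONDS, clock=time.monotonic):
        self._ttl = ttl
        self._clock = clock
        self._entries = {}  # key -> (expires_at, embedding)

    def put_sample(self, sample: bytes) -> str:
        # Only the derived embedding is kept; the sample is dropped on return.
        embedding = embed_voice(sample)
        key = secrets.token_hex(16)  # assumed: an opaque handle for the client
        self._entries[key] = (self._clock() + self._ttl, embedding)
        return key

    def get(self, key: str):
        entry = self._entries.get(key)
        if entry is None:
            return None
        expires_at, embedding = entry
        if self._clock() >= expires_at:
            # Expired: evict lazily on lookup.
            del self._entries[key]
            return None
        return embedding
```

A client would call `put_sample` once, hold on to the returned key, and pass it with later synthesis requests until the entry expires. A real deployment would more likely use a shared cache with built-in expiry than a per-process dict.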



