Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I remembered about a paper that sheds light on this issue. An embedding can store/recover exactly a short sentence:

> a multi step method that iteratively corrects and re embeds text is able to recover 92% of 32-token text inputs exactly

https://arxiv.org/abs/2310.06816

So it's probably 1 sentence == 1 embedding



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: