Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

"I really want to match items like these, but it does not work" is just a fine tuning problem.


Yes, in a sense that if you have infinite appropriate dataset and compute. No, in a sense what is practically achievable.


You don't need infinite data. You need ~100k samples. It's also not particularly expensive.


Dude, you are talking total nonsense. What does 100k samples even mean. We have not even established what task we are talking about and you already know that you need that many samples. Not to be offensive, but you seem like the type of guy who believe in these magical properties.


… you established the task earlier? Item X and Item Y should be colocated in embedding space.

This is what people using embedding models for recsys are doing. It’s not rocket science and it doesn’t require “infinite data”.

By 100k samples I mean 100k samples that provide relevance feedback. 100k positive pairs.

I’m working on these kinds of problems with actual customers. Not really sure where the hostility is coming from.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: