In addition to the choices for how to chunk (i.e. defining chunk size, chunk bou... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		crosen99 on March 28, 2023 \| parent \| context \| favorite \| on: Launch HN: Metal (YC W23) – Embeddings as a Servic... In addition to the choices for how to chunk (i.e. defining chunk size, chunk boundaries, chunk overlap, etc.), there's also the question of what actually gets returned once finding the chunks that match. For example, perhaps I have a document with 100 1-page sections where each section is broken into roughly 5 chunks. I may get optimal performance in my RAG application not by retrieving the top K chunks from the index, but rather by returning the top K sections fom the document, where sections might be scored based on the number and scores of child chunks. It also might be useful to incorporate section summaries, etc., in the retrieval process.

jxodwyer1 on March 28, 2023 [–]

This is great, and that makes a ton of sense! Would you want to define + experiment with these various configurations yourself explicitly, or would you expect a system to determine this automatically? I like the concept of rolling-up chunk scores!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact