byt3bl33d3r 26 days ago | on: Trying out Gemini 3 Pro with audio transcription a...

I’ve been meaning to create & publish a structured extraction benchmark for a while. Using LLMs to extract info/entities/connections from large amounts of unstructured data is also a huge boon to AI-assisted reporting and has a number of cybersecurity applications. Gemini 2.5 was pretty good, but I have yet to see an LLM that can do this reliably, accurately, and consistently.

simonw 26 days ago

This would be extremely useful. I think this is one of the most commercially valuable uses of these kinds of models, and having more solid independent benchmarks would be great.
