What's silly about it? It can accurately identify when the concept is injected v... | Hacker News

hackinthebochs 8 days ago | on: Signs of introspection in large language models

What's silly about it? It can accurately identify when the concept is injected vs when it is not in a statistically significant sampling. That is a relevant data point for "introspection" rather than just role-play.

XenophileJKO 8 days ago

I think what clinched it for me is that they said they had 0 false positives. That is pretty significant.
