
A system outputting correct facts tells you nothing about that system's ability to prove the correctness of facts. You cannot assert that property of a system by treating it as a black box. If you are able to treat LLMs as a white box and prove correctness of their internal states, you should tell that to some very important people; that insight is worth a lot of money.


As usual, my argument brought all the people out of the woodwork who have some obsession with a tangential argument. Sorry to touch your tangent, bud.


> LLMs not being able to detect correctness is just demonstrably false if you play around with LLM agents a bit.

How is pointing out that this method of determining correctness is incapable of doing so merely tangential?


Correctness and proven correctness are different things. I suspect you're a big Rocq Prover fan.
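
To make the distinction concrete: passing spot checks and carrying a proof are different artifacts. A minimal Lean sketch, with a hypothetical `double` function standing in for the system under discussion:

    -- Toy stand-in for the system whose correctness is in question.
    def double (n : Nat) : Nat := n + n

    -- Black-box observation: spot-checking outputs on particular inputs.
    -- Passing checks say nothing about inputs we never tried.
    #eval double 2   -- 4
    #eval double 21  -- 42

    -- Proven correctness: a theorem quantified over every input, which
    -- requires opening the definition rather than sampling outputs.
    theorem double_eq_two_mul (n : Nat) : double n = 2 * n := by
      unfold double
      omega

The #eval lines are the black-box view: each one is evidence about exactly one input. The theorem is the white-box view: it holds for all n, and it only goes through because the prover can see inside the definition.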


