Models hold more information than they can extract in a single step, but chain-of-thought (CoT) reasoning can find a key with which to look that information up, or synthesise it by applying learned generalisations.