> but with a good harness they are able to achieve things with SotA that I could...

windexh8er · 2026-05-28T19:32:51 1779996771

I think this is a big component, but also context. A large factor in any model being able to handle complexity comes down to context length.

I think multiple SLMs driven by an orchestration frameworks (harness or otherwise) will ultimately displace LLMs. Right now we're in the era of diminishing returns with respect to LLM gains. Moving the needle percentages doesn't excite as many people anymore and with "reasoning" capabilities there's no reason why small distributed models can't be run more efficiently, especially if/when we start to see gains in modularized context management solutions.

coderenegade · 2026-05-30T08:13:40 1780128820

It's hard to know for sure. There are good information theoretic reasons to suspect that general models will always be better than smaller expert models, but maybe a MoE can claw some performance back, albeit with redundant computation. The properties of conditional entropy, for instance, always favor more generality. This assumes that the harness isn't a factor, or is at least equivalent across different models.

mswphd · 2026-05-28T20:11:28 1779999088

sure, but high-quality harnesses require less gpu compute/VRAM, and plausibly can be used locally by most users.