Aye, there’s the kicker. The correct configuration of hardware resources to run ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		foundry27 11 months ago \| parent \| context \| favorite \| on: How I run LLMs locally Aye, there’s the kicker. The correct configuration of hardware resources to run and multiplex large models is just as much of a trade secret as model weights themselves when it comes to non-hobbyist usage, and I wouldn’t be surprised if optimal setups are in many ways deliberately obfuscated or hidden to keep a competitive advantage Edit: outside the HPC community specifically, I mean

codybontecou 11 months ago [–]

The economic barrier to entry probably has a lot to do with it. I'd happily dig into this problem and share my findings but it's simply too expensive for a hobbyist that isn't specialized in it.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact