Once you can run a GPT-5-level LLM locally on a device, it's over. All this mighty infrastructure will be no more impressive than a top-of-the-line 2013 Mac Pro is in 2025. I think we're 10 years away from that.
I doubt it. Newer state-of-the-art models might be a little better, but not enough to justify the average person or employee paying $1,000/month.
If you can get a GPT-5-level AI, locally and privately, for just the cost of electricity, why would you bother with anything else? If it can't do something, you'd just outsource that one prompt to a cloud-based AI.
By 2035, the vast majority of your prompts will pass through a local LLM first, and you'll only rarely need to touch an agent API. So what does that mean for the AI industry?
Consumer devices are already available that offer 128 GB of memory specifically marketed for AI use. I think server-side AI will still exist for IoT devices, but I agree, 10 years seems like a pretty reasonable timeline to be able to buy an RTX 5080-sized card with 1 TB of memory, with the ability to pair it with another one for 2 TB. For local, non-distributed use, GPUs are already more than capable of 20+ tokens/s; we're mostly waiting on 512 GB devices to drop in price and on "free" LLMs to get better.
My guess is that Nvidia is limiting memory size on consumer cards to avoid cannibalizing its commercial/industrial sales. I see no reason why a 5060 or 5070 couldn't come with 64/128/512 GB of memory other than an intentional decision not to support those sizes. I don't need a 5090, since ~20-40 tokens/s is plenty for a 1-4 user household system (rough math in the sketch below).
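For anyone curious where those tokens/s figures come from: single-user decoding is mostly memory-bandwidth-bound, so the ceiling is roughly memory bandwidth divided by the bytes of weights streamed per token. Here's a back-of-envelope sketch; the bandwidth figures and the 70B/4-bit example are ballpark assumptions, not benchmarks of any specific card.

```python
# Back-of-envelope: memory-bandwidth-bound decode speed for a local LLM.
# All hardware numbers below are rough assumptions, not measurements.

def decode_tokens_per_sec(params_billion: float, bits_per_weight: int,
                          mem_bandwidth_gb_s: float) -> float:
    """Upper bound on tokens/s when decoding is memory-bound:
    each generated token streams the full weight set from memory once."""
    weight_gb = params_billion * bits_per_weight / 8  # GB of weights
    return mem_bandwidth_gb_s / weight_gb

# Example: a ~70B model quantized to 4 bits is ~35 GB of weights.
# Assumed bandwidths: high-end discrete GPU ~1800 GB/s,
# 128 GB unified-memory "AI" box ~250 GB/s.
for name, bw in [("discrete GPU (~1800 GB/s)", 1800),
                 ("128 GB unified-memory box (~250 GB/s)", 250)]:
    print(f"{name}: ~{decode_tokens_per_sec(70, 4, bw):.0f} tok/s ceiling")
```

By that rough math a single high-bandwidth card already clears the ~20-40 tokens/s bar for a 70B-class model; the real constraint is fitting the weights in memory at all, which is why 512 GB or 1 TB of capacity matters more here than extra compute.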