Here is a charitable perspective on what's happening: - Nvidia has too much cash...

moogly · 2025-10-31T18:17:13 1761934633

> Nvidia has too much cash because of massive profits and has nowhere to reinvest them internally.

Here's an idea: they could make actual GPUs used for games affordable again, and not have Jensen Huang lie on stage about their performance to justify their astronomical prices. Sure, companies might want to buy them for ML/AI and crash the market again but I'm sure a company of their caliber could solve that if they _really_ wanted to.

Lord-Jobo · 2025-10-31T18:26:05 1761935165

I also just don’t understand, as someone with no business experience, how they aren’t just pouring all of that money into enhancing their production capacity. That’s very clearly their bottleneck here.

Yes, I’m certain they are spending an astronomical amount on that already, but why not more? Surely paying more money for construction of more facilities still nets gain even if you run into diminishing returns?

Instead they set up this whacko tax laundering scheme? Just seems like more corporate pocket filling to me, an idiot with no business knowledge.

lukev · 2025-10-31T19:04:01 1761937441

The bottleneck is TSMC, who also make chips for almost every other hardware vendor.

TSMC is indeed increasing their production capability as fast as possible, but it's not easy... chip foundries are extremely expensive, complex, and take serious expertise to operate.

brookst · 2025-10-31T19:05:28 1761937528

It’s called seeding the market. If they can accelerate the growth of potential customers, it will be more profitable than just increasing production to serve existing customers.

Think of exponential growth — would you rather increase the base or the exponent?

sleight42 · 2025-10-31T18:51:58 1761936718

Hedging their bets against a potential sudden downturn in consumption of their product, e.g., an AI bubble exploding? If they invest heavily in production capacity only to find that there is not commensurate consumption, then they'll have lost badly.

mvdtnz · 2025-10-31T21:51:13 1761947473

> as someone with no business experience, how they aren’t just pouring all of that money into enhancing their production capacity. That’s very clearly their bottleneck here.

They know this madness can't go on forever. The last thing they need is to be left with billions of dollars of unused capacity when the bottom falls out of this very stupid bubble.

creer · 2025-11-01T01:00:19 1761958819

> They know this madness can't go on forever.

If they thought that, would they invest massively in the companies that own all these GPUs? They would not. They would invest in anything ELSE they could think of.

cudgy · 2025-11-01T12:22:53 1761999773

But doing that would spook investors and possibly pop the bubble, so they look for ways to mitigate risk and fuel the bubble at the same time.

creer · 2025-11-03T19:23:01 1762197781

That might be the case if nvidia commonly invested in its customers - if they suddenly stopped doing that - but I don't think that was the case. On the contrary, these investments surprised because they were NOT the usual.

rangestransform · 2025-11-01T01:10:11 1761959411

Consumers should already be glad that R&D for their products is being subsidized by OpenAI and Google

HDThoreaun · 2025-10-31T18:22:17 1761934937

Why would they want to do that? The only sector that matter to nvidia is datacenter, its where 90%+ of their profits are. Making their consumer sector even less profitable just seems like a waste of time

moogly · 2025-10-31T21:36:24 1761946584

How about positive mindshare? Regular people not growing up absolutely hating nvidia's guts and only begrudgingly buying their products. Also ensuring that a pretty big industry won't die from becoming too expensive.

Plus, diversification is good for when the bubble inevitably bursts.

But that's long-term thinking and we can't have that. People give Huang credit for having had a long-term vision on AI, but it feels like he definitely has blinders on right now.

creer · 2025-11-01T00:54:09 1761958449

> How about positive mindshare?

Does anyone who can afford an nvidia card actually buy something else? Yes some people hate nvidia, but it's not like they have a hard time selling cards.

senordevnyc · 2025-10-31T22:11:12 1761948672

The consumer gaming card market is minuscule in comparison to their primary market now, to the point where worrying about diversifying there probably doesn’t make sense. Nor does it really matter whether consumer gamers hate them. That is likely to have zero effect on their core customer now.

rangestransform · 2025-11-01T21:57:17 1762034237

CUDA is where it is today because some years ago, people in grad school screwed around with CUDA on their personal gaming PCs

Eisenstein · 2025-10-31T22:33:51 1761950031

Underestimating compounding and secondary effects, especially while rationalizing the abandonment of their core market and capability is one of the most famous ways that big companies provide evidence of their downward spiral. I can feel the MBA energy from here.

senordevnyc · 2025-10-31T22:52:09 1761951129

Can you name any companies that suffered by switching focus away from one market where they dominate in order to also dominate a market that is 10x the size of the first market already, and growing faster?

Eisenstein · 2025-11-01T00:26:48 1761956808

Every specific situation is different, but the pattern I mentioned is easy to find. Here are three examples: RCA, GE, HP.

Eisenstein · 2025-10-31T14:29:23 1761920963

Your conclusion about training being the cost factor that will eventually align with profitability in the inference phases relies on training new models not being an endless arms race.

vmg12 · 2025-10-31T14:37:08 1761921428

If the inference is profitable and training new models is actually an endless arms race that's actually the best outcome for nvidia specifically.

Eisenstein · 2025-10-31T15:01:35 1761922895

Only in the short term.

MangoToupe · 2025-10-31T18:56:55 1761937015

I'm just confused why people think token-based computing is going to be in such demand in the future. It's such a tiny slice of problems worth solving.

thorncorona · 2025-10-31T18:59:16 1761937156

It's like how every big co these days is ML. It will transition to LLMs as well.

Just give it a few years.

brookst · 2025-10-31T19:07:19 1761937639

Yep. Same vibes as “ha ha who needs internet connected appliances” (pretty much all appliances are internet connected now). And the apocryphal “there is a worldwide market for maybe 5 computers”.

baobabKoodaa · 2025-10-31T22:28:00 1761949680

No-one "needs" or even wants appliances to be connected to the internet. You claim that "pretty much all" appliances are internet connected, while almost none of the appliances in my house are.

MangoToupe · 2025-11-01T02:57:35 1761965855

I mean, who DOES need internet connected appliances? Are you implying these token-based techs will be shoved down our throats regardless of demand?

mikewarot · 2025-10-31T23:58:39 1761955119

>This is all dependent on token economics being or becoming profitable

What is the actual value of a token? If it were generated by a human expert in a given field? This should set an upper limit for now.

creer · 2025-11-01T00:57:09 1761958629

> - Nvidia also benefits from growth due to the equity they own.

aka this might be nvidia's next pivot. Contrary to gaming cards, the AI GPUs are productive assets. If nvidia feels that they will be very productive, then it makes sense that they invest broadly in the companies that are likely to make these profits. To share in the rewards while selling more GPUs.

tqian · 2025-10-31T23:34:29 1761953669

If models are profitable once trained, isn't it weird that chatgpt and Claude have $200 tiers that still have usage limits?

pols45 · 2025-10-31T14:35:11 1761921311

Yup. Not just Nvidia. Just look at the quarterly results reported by Amazon, Google, Meta, Microsoft and Apple. Each one is reporting revenues never before seen in history. If you make 100 Billion a quarter you have to spend it on something.

These guys are running hyper optimized cash extraction mega machines. There is no comparison to previous bubbles, cause so no such companies ever existed in the past.

solarwindy · 2025-10-31T14:43:54 1761921834

100 billion a quarter is Alphabet, right? Given how much click fraud there is, and that every org and business under the sun is held to ransom to feature on the SERP for their own name even — it’s tempting to say Google’s become a private tax on everything.

daedrdev · 2025-10-31T16:56:38 1761929798

No, Apple also has 100 billion dollars in revenue despite floundering AI and running a very hardware dependent business.

creer · 2025-11-01T01:10:41 1761959441

> Given how much click fraud there is [etc]

It's easy for the techies to see the problems. But advertising results have been very measurable for a very long time by now. Larger advertisers can leave the details to their techies and still be very clear as to their advertising's productivity post-cost of doing business.

dingaling · 2025-10-31T20:25:33 1761942333

What's shocking is the gulf between those companies and corporate 'normality'.

Eastern Airways, a UK airline, has just gone bust due to accumulated debts of £26 million. That's not even a rounding error for Google, yet was enough to put a 47-year-old company into bankruptcy and its staff out of work.

I think the only historical parallel to this disparity was the era of the East India Company.

trollbridge · 2025-10-31T14:39:39 1761921579

Odd how they are simultaneously having large layoffs even as reporting record revenues.

The question is where the profits are.

wmeredith · 2025-10-31T14:51:01 1761922261

Amazon - 14,000 layoffs; significant

Microsoft - 14,000 (multiple rounds); significant

Meta - 600 layoffs; insignificant for company size

Google - "Several hundred layoffs"; insignificant for a company size

Apple - No layoffs

Source: https://techcrunch.com/2025/10/24/tech-layoffs-2025-list/

ponector · 2025-10-31T20:23:57 1761942237

Also every one of them has hundreds of thousands of external contractors which are not reported anywhere.

And offshoring is also a huge cost-cutting effort everywhere.

_DeadFred_ · 2025-10-31T19:47:02 1761940022

Google's Youtube unit is doing soft layoffs.

https://news.ycombinator.com/item?id=45766368

jcheng · 2025-10-31T15:01:03 1761922863

The layoffs at Amazon and Microsoft are not due to lack of profits. They’re massively profitable right now.

https://www.macrotrends.net/stocks/charts/MSFT/microsoft/ebi...

https://www.macrotrends.net/stocks/charts/AMZN/amazon/ebitda

h2zizzle · 2025-10-31T15:29:34 1761924574

They're "massively profitable" because they're laying off large portions of a major cost center - labor - and backloading uncoming data center construction costs. As those come due, and labor needs rise again, that profit disappears.

bdangubic · 2025-11-01T03:21:45 1761967305

yes, these companies have a track record of disappearing profits :)

h2zizzle · 2025-11-01T05:53:01 1761976381

They have a track record of cornering a market and abusing their position, and also still somehow not being able to balance expenses and revenues to turn a profit that pleases shareholders. You get to decide if that's a problem with the company or the shareholders, I guess.

skywhopper · 2025-10-31T14:55:50 1761922550

So many such profitable companies are the best possible evidence for the need for drastic antitrust intervention. The lack of competition and regulation is leading to a massive drain on every other sector.

marbro · 2025-10-31T17:32:10 1761931930

This bubble is caused by excess competition. There are 4 large companies who believe that a large new market is being created so each is investing large amounts without any evidence that there will be a single winner that dominates the future market. None of these companies has anything remotely resembling a monopoly except for Amazon in online retail.

griffzhowl · 2025-10-31T19:12:10 1761937930

Google: search, chrome, youtube

Microsoft: desktop software

Meta: social media

Maybe on some technical definitions of "monopoly" these aren't monopolies, but nothing remotely resembling a monopoly? come on maan

zigzag312 · 2025-11-01T09:20:51 1761988851

Monopolies being bad for free markets is a simplification. Substantial control over the market would be more accurate description of the issue.

tigershark · 2025-10-31T17:47:43 1761932863

How much more was worth USD at the beginning of the year?

belter · 2025-10-31T14:30:52 1761921052

> Everything seems to indicate that once the models are trained, they are extremely profitable

Some data would reinforce your case. Do you have it?

Here is my data point: "You Have No Idea How Screwed OpenAI Actually Is" - https://wlockett.medium.com/you-have-no-idea-how-screwed-ope...

trollbridge · 2025-10-31T14:40:53 1761921653

Right. As far as I can tell, OpenAI, Grok, etc sell me tokens at a loss. But I am having a hard time figuring out how to turn tokens into money (i.e. increased productivity). I can justify $40-$200 per developer per month on tokens but not more than that.

koolba · 2025-10-31T14:54:18 1761922458

There’s about 5M software devs in the US so even at $1000/year/person spend, that’s only $5B of revenue to go around. Theres plenty of other uses cases but focusing on pure tech usage, it’s hard to see how the net present value of that equates to multiple trillions of dollars across the ecosystem.

treis · 2025-10-31T15:36:17 1761924977

It's the first new way of interacting with computers since the iPhone. It's going to be massively valuable and OpenAI is essentially guaranteed to be one of the players.

trollbridge · 2025-11-04T04:51:20 1762231880

In 2006, I foresaw that the smartphone was going to exist and be significant. But at the time I carried a Sharp Zaurus which just identified me as a gigantic nerd, and the clearest company to invest in was... HP/Compaq for the iPAQ, in terms of the most forward-looking device.

Then Apple came out with the iPhone but it was not clear they were going to be the market leader. They were certainly not the first mover.

With OpenAI, I see even less of a moat. Someone else comes along, makes a better foundational model, and they've got a gigantic advantage. What does OpenAI have?

_aavaa_ · 2025-10-31T17:02:36 1761930156

Why is their product not palm? Or windows mobile?

treis · 2025-10-31T17:30:25 1761931825

It's not windows mobile because OpenAI was first and is the clear leader in the market. Windows mobile was late to the party and missed their window.

Palm is closer but it's a different world. It's established that Internet advertising companies are worth trillions. It's only in retrospect that what Palm could have been is obvious.

Barring something very unexpected OpenAI is coming out on top. They're prepaying for a good 5-10 years of compute. That means their inference and training for that time are "free" because they've been paid for. They're going to be able to bury their competition in money or buy them out.

_aavaa_ · 2025-10-31T18:01:28 1761933688

Windows mobile by the time it looked like the iPhone was late to the party. But windows had been releasing a mobile os for a long time before that. Microsoft was first, they just didn’t make as good of a product as Apple despite their money.

OpenAI is also first, but it is absolutely not a given that they are the Apple in this situation. Microsoft too had money to bury the competition, they even staged a fake funeral when they shipped windows phone 7.

> Barring something very unexpected

Like the release of an iPhone?

treis · 2025-10-31T19:47:05 1761940025

Yep. It would have to be something that dramatic to render all the technology and infrastructure OpenAI has obsolete. But if it's anything like massive data training on a huge number of GPUs then OpenAI is one of the winners.

red-iron-pine · 2025-10-31T20:19:51 1761941991

I'm waiting for my Google Glass smart glasses to be useful for anything other then annihilating the privacy of everyone around me

Blackberry was a big deal for a while, too

HDThoreaun · 2025-10-31T18:28:06 1761935286

> Theres plenty of other uses cases

This is where the money is. Anthropic just released claude for excel. If it replaces half of the spreadsheet pushers in the country theyre looking at massive revenue. They just started with coding because theres so much training data and the employees know a lot about coding

schwarzrules · 2025-10-31T15:00:01 1761922801

I'm not trying to be annoying, but surely if you'd justify spending $200/developer/month, you could afford $250/month...

The reason I wonder about that is because that also seems to be the dynamic with all these deals and valuations. Surely if OpenAI would pay $30 billion on data centers, they could pay $40 billion, right? I'm not exactly sure where the price escalations actually top out.

h2zizzle · 2025-10-31T15:31:44 1761924704

No? That's a 25% expense increase. You just ate the margins on my product/service, and then some.

yugioh3 · 2025-11-01T06:48:33 1761979713

The equivalent of an additional $50 uber ride to the airport once a month can tank your business?

h2zizzle · 2025-11-14T19:48:23 1763149703

Per seat. And it's not $50 more thane baseline, it's $250. The $200 may have been justifiable where the $250 is not.

simianwords · 2025-10-31T14:50:47 1761922247

why would they sell you at a loss when they have been decreasing prices by 2x every year or so for the last 3 years? people wanted to purchase the product at price "X" in 2023 and now the same product costs X costs 10 times less over the years.. do you think they were always selling at a loss?

simianwords · 2025-10-31T14:42:05 1761921725

Inference cost has been going down for a while now. At what point do you think it will be profitable? When cost goes down by 2x? 5x?

logicprog · 2025-10-31T19:58:56 1761940736

I can't read your hyperbolically titled paywalled medium post, so idk if it has data I'm not aware of or is just rehashing the same stats about OpenAI & co currently losing money (mostly due to training and free users) but here's a non paywalled blog post that I personally found convincing: https://www.snellman.net/blog/archive/2025-06-02-llms-are-ch...

ragingregard · 2025-10-31T21:25:05 1761945905

The above article is not convincing at all.

Nothing on infra costs, hardware throughput + capacity (accounting for hidden tokens) & depreciation, just a blind faith that pricing by providers "covers all costs and more". Naive estimate of 1000 tokens per search using some simplistic queries, exactly the kind of usage you don't need or want an LLM for. LLMs excel in complex queries with complex and long output. Doesn't account at all for chain-of-thought (hidden tokens) that count as output tokens by the providers but are not present in the output (surprise).

Completely skips the fact the vast majority of paid LLM users use fixed subscription pricing precisely because the API pay-per-use version would be multiples more expensive and therefore not economical.

Moving on.

logicprog · 2025-11-07T12:57:56 1762520276

> Nothing on infra costs, hardware throughput + capacity (accounting for hidden tokens) & depreciation

That's because it's coming at things from the other end: since we can't be sure exactly what companies are doing, we're just going to look at the actual market incentives and pricing available and try to work backwards from there. And to be fair, it also cites, for instance, deepseek's paper where they talk about what their power foot margins are on inference.

> just a blind faith that pricing by providers "covers all costs and more".

It's not blind faith. I think they make a really good argument for why the pricing by providers almost certainly does cover all the costs and more. Again, including citing white papers by some of those providers.

> Naive estimate of 1000 tokens per search using some simplistic queries, exactly the kind of usage you don't need or want an LLM for.

Those token estimates were for comparing to search pricing to establish whether — relative to other things on the market — LLMs were expensive, so obviously they wanted to choose something where the domain is similar to search. That wasn't for determining whether inference was profitable or not in itself, and has absolutely no bearing on that.

> Doesn't account at all for chain-of-thought (hidden tokens) that count as output tokens by the providers but are not present in the output (surprise).

Most open-source providers provide thinking tokens in the output. Just separated by some tokens so that UI and agent software can separate it out if they want to. I believe the number of thinking tokens that Claude and GPT-5 use can be known as well: https://www.augmentcode.com/blog/developers-are-choosing-old... typically, chain of thought tokens are also factored into API pricing in terms of what tokens you're charged for. So I have no idea what this point is supposed to mean.

> Completely skips the fact the vast majority of paid LLM users use fixed subscription pricing precisely because the API pay-per-use version would be multiples more expensive and therefore not economical.

That doesn't mean that selling inference by subscription isn't profitable either! This is a common misunderstanding of how subscriptions work. With these AI inference subscriptions, your usage is capped to ensure that the company doesn't lose too much money on you. And then the goal is with the subscriptions that most people who have a subscription will end up on average using less inference than they paid for in order to pay for those who use more so that it will equal out. And that's assuming that the upper limit on the subscription usage is actually more costly than the subscription being paid itself, and that's a pretty big assumption.

If you want something that factors in subscriptions and also does the sort of first principles analysis you want, this is a good article:

https://martinalderson.com/posts/are-openai-and-anthropic-re...

And in my opinion it seems pretty clear that basically everyone who does any kind of analysis whether black box or first principles on this comes to the conclusion that you can very easily make money on inference. The only people coming to any other conclusion are those that just look at the finances of U.S. AI companies and draw conclusions on that without doing any kind of more detailed breakdown — exactly like the article you linked me, which now I have finally been able to read, thanks to someone posting the archive link, which isn't actually making any kind of case about the subscription or unit economics of token inference whatsoever, but is instead just basing its case on the massive overinvestment of specifically open AI into gigantic hyperscale data centers, which is unrelated to the specific economics of AI itself.

bwfan123 · 2025-10-31T14:42:26 1761921746

This is behind a paywall. Is there a free link you can share ?

belter · 2025-10-31T14:51:44 1761922304

Would love to, and its normally what I do, but archive.is is currently down. At least here from the outer belt.

khimaros · 2025-10-31T19:53:09 1761940389

https://archive.is/fnzB4

PeterStuer · 2025-10-31T19:06:27 1761937587

It is'nt just nvidia though.