Here is a charitable perspective on what's happening:
- Nvidia has too much cash because of massive profits and has nowhere to reinvest them internally.
- Nvidia instead invests in other companies that use their gpus by providing them deals that must be spent on nvidia products.
- This accelerates the growth of these companies, drives further lock in to nvidia's platform, and gives nvidia an equity stake in these companies.
- Since growth for these companies is accelerated, future revenue will be brought forward for nvidia and since these investments must be spent on nvidia gpus it drives further lock in to their platform.
- Nvidia also benefits from growth due to the equity they own.
This is all dependent on token economics being or becoming profitable. Everything seems to indicate that once the models are trained, they are extremely profitable and that training is the big money drain. If these models become massively profitable (or at least break even) then I don't see how this doesn't benefit Nvidia massively.
> Nvidia has too much cash because of massive profits and has nowhere to reinvest them internally.
Here's an idea: they could make actual GPUs used for games affordable again, and not have Jensen Huang lie on stage about their performance to justify their astronomical prices. Sure, companies might want to buy them for ML/AI and crash the market again but I'm sure a company of their caliber could solve that if they _really_ wanted to.
I also just don’t understand, as someone with no business experience, how they aren’t just pouring all of that money into enhancing their production capacity. That’s very clearly their bottleneck here.
Yes, I’m certain they are spending an astronomical amount on that already, but why not more? Surely paying more money for construction of more facilities still nets gain even if you run into diminishing returns?
Instead they set up this whacko tax laundering scheme? Just seems like more corporate pocket filling to me, an idiot with no business knowledge.
The bottleneck is TSMC, who also make chips for almost every other hardware vendor.
TSMC is indeed increasing their production capability as fast as possible, but it's not easy... chip foundries are extremely expensive, complex, and take serious expertise to operate.
It’s called seeding the market. If they can accelerate the growth of potential customers, it will be more profitable than just increasing production to serve existing customers.
Think of exponential growth — would you rather increase the base or the exponent?
Hedging their bets against a potential sudden downturn in consumption of their product, e.g., an AI bubble exploding? If they invest heavily in production capacity only to find that there is not commensurate consumption, then they'll have lost badly.
> as someone with no business experience, how they aren’t just pouring all of that money into enhancing their production capacity. That’s very clearly their bottleneck here.
They know this madness can't go on forever. The last thing they need is to be left with billions of dollars of unused capacity when the bottom falls out of this very stupid bubble.
If they thought that, would they invest massively in the companies that own all these GPUs? They would not. They would invest in anything ELSE they could think of.
That might be the case if nvidia commonly invested in its customers - if they suddenly stopped doing that - but I don't think that was the case. On the contrary, these investments surprised because they were NOT the usual.
Why would they want to do that? The only sector that matter to nvidia is datacenter, its where 90%+ of their profits are. Making their consumer sector even less profitable just seems like a waste of time
How about positive mindshare? Regular people not growing up absolutely hating nvidia's guts and only begrudgingly buying their products. Also ensuring that a pretty big industry won't die from becoming too expensive.
Plus, diversification is good for when the bubble inevitably bursts.
But that's long-term thinking and we can't have that. People give Huang credit for having had a long-term vision on AI, but it feels like he definitely has blinders on right now.
Does anyone who can afford an nvidia card actually buy something else? Yes some people hate nvidia, but it's not like they have a hard time selling cards.
The consumer gaming card market is minuscule in comparison to their primary market now, to the point where worrying about diversifying there probably doesn’t make sense. Nor does it really matter whether consumer gamers hate them. That is likely to have zero effect on their core customer now.
Underestimating compounding and secondary effects, especially while rationalizing the abandonment of their core market and capability is one of the most famous ways that big companies provide evidence of their downward spiral. I can feel the MBA energy from here.
Can you name any companies that suffered by switching focus away from one market where they dominate in order to also dominate a market that is 10x the size of the first market already, and growing faster?
Your conclusion about training being the cost factor that will eventually align with profitability in the inference phases relies on training new models not being an endless arms race.
I'm just confused why people think token-based computing is going to be in such demand in the future. It's such a tiny slice of problems worth solving.
Yep. Same vibes as “ha ha who needs internet connected appliances” (pretty much all appliances are internet connected now). And the apocryphal “there is a worldwide market for maybe 5 computers”.
No-one "needs" or even wants appliances to be connected to the internet. You claim that "pretty much all" appliances are internet connected, while almost none of the appliances in my house are.
> - Nvidia also benefits from growth due to the equity they own.
aka this might be nvidia's next pivot. Contrary to gaming cards, the AI GPUs are productive assets. If nvidia feels that they will be very productive, then it makes sense that they invest broadly in the companies that are likely to make these profits. To share in the rewards while selling more GPUs.
Yup. Not just Nvidia. Just look at the quarterly results reported by Amazon, Google, Meta, Microsoft and Apple. Each one is reporting revenues never before seen in history. If you make 100 Billion a quarter you have to spend it on something.
These guys are running hyper optimized cash extraction mega machines. There is no comparison to previous bubbles, cause so no such companies ever existed in the past.
100 billion a quarter is Alphabet, right? Given how much click fraud there is, and that every org and business under the sun is held to ransom to feature on the SERP for their own name even — it’s tempting to say Google’s become a private tax on everything.
It's easy for the techies to see the problems. But advertising results have been very measurable for a very long time by now. Larger advertisers can leave the details to their techies and still be very clear as to their advertising's productivity post-cost of doing business.
What's shocking is the gulf between those companies and corporate 'normality'.
Eastern Airways, a UK airline, has just gone bust due to accumulated debts of £26 million. That's not even a rounding error for Google, yet was enough to put a 47-year-old company into bankruptcy and its staff out of work.
I think the only historical parallel to this disparity was the era of the East India Company.
They're "massively profitable" because they're laying off large portions of a major cost center - labor - and backloading uncoming data center construction costs. As those come due, and labor needs rise again, that profit disappears.
They have a track record of cornering a market and abusing their position, and also still somehow not being able to balance expenses and revenues to turn a profit that pleases shareholders. You get to decide if that's a problem with the company or the shareholders, I guess.
So many such profitable companies are the best possible evidence for the need for drastic antitrust intervention. The lack of competition and regulation is leading to a massive drain on every other sector.
This bubble is caused by excess competition. There are 4 large companies who believe that a large new market is being created so each is investing large amounts without any evidence that there will be a single winner that dominates the future market. None of these companies has anything remotely resembling a monopoly except for Amazon in online retail.
Right. As far as I can tell, OpenAI, Grok, etc sell me tokens at a loss.
But I am having a hard time figuring out how to turn tokens into money (i.e. increased productivity). I can justify $40-$200 per developer per month on tokens but not more than that.
There’s about 5M software devs in the US so even at $1000/year/person spend, that’s only $5B of revenue to go around. Theres plenty of other uses cases but focusing on pure tech usage, it’s hard to see how the net present value of that equates to multiple trillions of dollars across the ecosystem.
It's the first new way of interacting with computers since the iPhone. It's going to be massively valuable and OpenAI is essentially guaranteed to be one of the players.
In 2006, I foresaw that the smartphone was going to exist and be significant. But at the time I carried a Sharp Zaurus which just identified me as a gigantic nerd, and the clearest company to invest in was... HP/Compaq for the iPAQ, in terms of the most forward-looking device.
Then Apple came out with the iPhone but it was not clear they were going to be the market leader. They were certainly not the first mover.
With OpenAI, I see even less of a moat. Someone else comes along, makes a better foundational model, and they've got a gigantic advantage. What does OpenAI have?
It's not windows mobile because OpenAI was first and is the clear leader in the market. Windows mobile was late to the party and missed their window.
Palm is closer but it's a different world. It's established that Internet advertising companies are worth trillions. It's only in retrospect that what Palm could have been is obvious.
Barring something very unexpected OpenAI is coming out on top. They're prepaying for a good 5-10 years of compute. That means their inference and training for that time are "free" because they've been paid for. They're going to be able to bury their competition in money or buy them out.
Windows mobile by the time it looked like the iPhone was late to the party. But windows had been releasing a mobile os for a long time before that. Microsoft was first, they just didn’t make as good of a product as Apple despite their money.
OpenAI is also first, but it is absolutely not a given that they are the Apple in this situation. Microsoft too had money to bury the competition, they even staged a fake funeral when they shipped windows phone 7.
Yep. It would have to be something that dramatic to render all the technology and infrastructure OpenAI has obsolete. But if it's anything like massive data training on a huge number of GPUs then OpenAI is one of the winners.
This is where the money is. Anthropic just released claude for excel. If it replaces half of the spreadsheet pushers in the country theyre looking at massive revenue. They just started with coding because theres so much training data and the employees know a lot about coding
I'm not trying to be annoying, but surely if you'd justify spending $200/developer/month, you could afford $250/month...
The reason I wonder about that is because that also seems to be the dynamic with all these deals and valuations. Surely if OpenAI would pay $30 billion on data centers, they could pay $40 billion, right? I'm not exactly sure where the price escalations actually top out.
why would they sell you at a loss when they have been decreasing prices by 2x every year or so for the last 3 years? people wanted to purchase the product at price "X" in 2023 and now the same product costs X costs 10 times less over the years.. do you think they were always selling at a loss?
I can't read your hyperbolically titled paywalled medium post, so idk if it has data I'm not aware of or is just rehashing the same stats about OpenAI & co currently losing money (mostly due to training and free users) but here's a non paywalled blog post that I personally found convincing: https://www.snellman.net/blog/archive/2025-06-02-llms-are-ch...
Nothing on infra costs, hardware throughput + capacity (accounting for hidden tokens) & depreciation, just a blind faith that pricing by providers "covers all costs and more". Naive estimate of 1000 tokens per search using some simplistic queries, exactly the kind of usage you don't need or want an LLM for. LLMs excel in complex queries with complex and long output. Doesn't account at all for chain-of-thought (hidden tokens) that count as output tokens by the providers but are not present in the output (surprise).
Completely skips the fact the vast majority of paid LLM users use fixed subscription pricing precisely because the API pay-per-use version would be multiples more expensive and therefore not economical.
> Nothing on infra costs, hardware throughput + capacity (accounting for hidden tokens) & depreciation
That's because it's coming at things from the other end: since we can't be sure exactly what companies are doing, we're just going to look at the actual market incentives and pricing available and try to work backwards from there. And to be fair, it also cites, for instance, deepseek's paper where they talk about what their power foot margins are on inference.
> just a blind faith that pricing by providers "covers all costs and more".
It's not blind faith. I think they make a really good argument for why the pricing by providers almost certainly does cover all the costs and more. Again, including citing white papers by some of those providers.
> Naive estimate of 1000 tokens per search using some simplistic queries, exactly the kind of usage you don't need or want an LLM for.
Those token estimates were for comparing to search pricing to establish whether — relative to other things on the market — LLMs were expensive, so obviously they wanted to choose something where the domain is similar to search. That wasn't for determining whether inference was profitable or not in itself, and has absolutely no bearing on that.
> Doesn't account at all for chain-of-thought (hidden tokens) that count as output tokens by the providers but are not present in the output (surprise).
Most open-source providers provide thinking tokens in the output. Just separated by some tokens so that UI and agent software can separate it out if they want to. I believe the number of thinking tokens that Claude and GPT-5 use can be known as well: https://www.augmentcode.com/blog/developers-are-choosing-old... typically, chain of thought tokens are also factored into API pricing in terms of what tokens you're charged for. So I have no idea what this point is supposed to mean.
> Completely skips the fact the vast majority of paid LLM users use fixed subscription pricing precisely because the API pay-per-use version would be multiples more expensive and therefore not economical.
That doesn't mean that selling inference by subscription isn't profitable either! This is a common misunderstanding of how subscriptions work. With these AI inference subscriptions, your usage is capped to ensure that the company doesn't lose too much money on you. And then the goal is with the subscriptions that most people who have a subscription will end up on average using less inference than they paid for in order to pay for those who use more so that it will equal out. And that's assuming that the upper limit on the subscription usage is actually more costly than the subscription being paid itself, and that's a pretty big assumption.
If you want something that factors in subscriptions and also does the sort of first principles analysis you want, this is a good article:
And in my opinion it seems pretty clear that basically everyone who does any kind of analysis whether black box or first principles on this comes to the conclusion that you can very easily make money on inference. The only people coming to any other conclusion are those that just look at the finances of U.S. AI companies and draw conclusions on that without doing any kind of more detailed breakdown — exactly like the article you linked me, which now I have finally been able to read, thanks to someone posting the archive link, which isn't actually making any kind of case about the subscription or unit economics of token inference whatsoever, but is instead just basing its case on the massive overinvestment of specifically open AI into gigantic hyperscale data centers, which is unrelated to the specific economics of AI itself.
- Nvidia has too much cash because of massive profits and has nowhere to reinvest them internally.
- Nvidia instead invests in other companies that use their gpus by providing them deals that must be spent on nvidia products.
- This accelerates the growth of these companies, drives further lock in to nvidia's platform, and gives nvidia an equity stake in these companies.
- Since growth for these companies is accelerated, future revenue will be brought forward for nvidia and since these investments must be spent on nvidia gpus it drives further lock in to their platform.
- Nvidia also benefits from growth due to the equity they own.
This is all dependent on token economics being or becoming profitable. Everything seems to indicate that once the models are trained, they are extremely profitable and that training is the big money drain. If these models become massively profitable (or at least break even) then I don't see how this doesn't benefit Nvidia massively.