Hacker Newsnew | past | comments | ask | show | jobs | submit | trjordan's commentslogin

Totally. Every "we're losing our craft" article has the same gloomy shape. That's enough of a bummer, but they also argue against themselves halfway through.

This one, for instance:

> But exactly which details are deemed “unimportant” is a very consequential and sometimes subjective decision. And eventually, the details always leak through.

Right, so you're saying this new technology will still reward deep technical understanding, because there's no way around it. I agree. Why is the whole tone of this thing "AI is making my craft a cheap commodity?"

Websites are largely better, technically, than they were 10 years ago. They're more full-featured, they're faster, SSL/a11y/responsiveness are stronger defaults. Content mills / SEO / news sites are a separate, terrible failure mode of ads and corporate incentives. That's not React's fault!


A craftsman's pride is an industrialist's nightmare! Software has been transitioning from a craft into an industrial process for the last two decades or so, and the software craftsmen of all stripes understandably do not like this!

> Software has been transitioning from a craft into an industrial process for the last two decades or so

This seems like a good insight and it feels true to me as well.

My guess is the absolute number of people who treat it like a "craft" is higher than 20 years ago, but as a fraction of all developers it has shrunk dramatically.


I've been meaning to write down my thoughts about software explicitly not being a craft for many years now and life keeps getting in the way. It's a direct response to the Etsy engineering blog, "Code As Craft". I agree that there are more code craftsmen in general than before, but by percentage there's way more software engineers. Engineering best practices to me are in many ways about robbing coding and software from the mystique of craftsmanship and turning it into a repeatable industrial process that isn't inhumane per se but doesn't depend on any particular person to make it work.

Ya it's definitely been an ongoing process. LLMs have just accelerated it.

I am not joking when I say that software craftsmen lost the war when tabs vs spaces was obviated as a point of contention by CI enforced formatting and linting around broader community standards.

>Websites are largely better, technically, than they were 10 years ago. They're more full-featured, they're faster, SSL/a11y/responsiveness are stronger defaults.

This is the opposite of my experience. I find websites take much more time to load, are designed to require many more actions and interaction time to navigate, often break and are replaced by a blank page if any error occurs, use huge numbers of ad/tracking requests and JS, and are filled with accessibility-standard-violating unnecessary JS animations.


> Websites are largely better, technically, than they were 10 years ago.

That is not remotely the case. All software, not just websites, is a lot worse than it was 10 years ago. Bloated, slow, buggy messes that resulted from the industry hiring a bunch of people who just wanted to do the bare minimum and make fat stacks, rather than hiring people who actually care about good engineering.


It's just not what I wanted. I got into software because I liked coding, deep technical understanding only excited me because it would help me code better. I don't want to get too "woe is me", there are far worse things in the world than having a vaguely unsatisfying job, but there are life choices I would have made differently had I known coding would be automated in 2026.

You can still code all you like, youre just not going to get paid for it.

Sure, but I've got other hobbies which better satisfy my itch for making things. Doesn't really solve my problem.

> Websites are largely better, technically, than they were 10 years ago.

Holy shit, no, they are not. What world do you live in?


A lot of them weren't even up 10 years ago. It's not hard to be better than that.

It feels like we're far past the point of where having AI do more faster is helpful.

It's telling that they used "rewrite Bun in Rust" as the proof point here. It's cool! But the vast majority of software engineering doesn't start with tens of thousands of tests, where making them pass is the whole job.

In my experience, AI still drifts from what I meant it to do on anything bigger than building a widget. My time is spent suspiciously reviewing output for changes the agent snuck in, or invariants it broke. I talked with a friend recently where the agent broke the test harness badly enough that none of the tests mattered for 3 weeks. They did pass, though, so CI never complained.

There's something at the intersection of context engineering, managing that sloppy pile of markdown plans, and good old fashioning system understanding that's the real bottleneck.


"In my experience, AI still drifts from what I meant it to do on anything bigger than building a widget."

I've had code bases with tens of thousands of lines of code built from scratch that I hand-reviewed every line of and worked with the AI to improve, and haven't had this issue. I feel like a significant part of this is due to an involved /plan stage -- going back and forth on building out a plan for what you want the AI to do involves surfacing the assumptions that you would have called drift if you asked them to implement it directly from your prompt.

Once the plan has been refined and is what I want it to be, getting it to implement everything in TDD style has for the most part given me 100% working code, as I wanted it to be, without issues. It definitely helps that I'm a principal-level engineer with extensive architectural experience -- but if you're able to tell the AI in detail what you want, have it ask questions for clarifications, and read through a plan before getting it implemented, and have a solid testing plus manual qa process (automated by chrome devtools mcp) in place, I've find that you can one-shot complex features, rewrites, and even not-insignificant applications that would have taken days to write by hand in a few hours.


Depends - using Sonnet here and generally it should be as you say: plan would produce the result.

Still Claude will sneak things in - in my recent plan, for example I had defined, per acceptance criteria what colours the statuses should be: green for live, blue for sold, grey for anything else; it changed this to: green for live, orange for in progress, blue for sold, red in demolition, etc. When pressed why did it to this, it was unable to explain why. This is with a plan where AC were explicitly provided from the task in Given/When/Then format and were to be adhered to strictly. I've caught this within planning, but I shouldn't need to be doing this.

Even in standard prompts where I tell it "Change this label from X to Y", it ended reordering the tabs unrelated to ask. Again I was not able for it to explain why - it was so abrupt. And it was in fresh context, without any pollution on what I expect it to do.

I also noticed a different behaviour regarding skill; today and yesterday it would not be following skill guidance at all ie: skill writing skill - I'd have to explicitly tell it to test skills after writing them, when this is a behaviour expected by default. Similarly with other skills - knowing that it should have done something per skill guidelines and it not doing it at all. This is new behaviour that I've not seen a week ago.


There are certainly domains where AI is not so effective, but at this point I would agree that at least in terms of web development if you can't get effective results from agents at this point it is a skill issue. That skill can be learned, if you recognize that learning is part of the solution. I do think prior experience in product design, specifications & business analysis as well as engineering leadership are all extremely helpful. Its about putting the agent in a box so small that it really can't screw up; but its also about being able to review design and code rigorously - to see around corners and anticipate possible weaknesses etc. There is really nothing I have to do when working with an agent that I haven't already been doing for decades but it seems to me that a lot of developers have never found a single bug while reviewing someone else's code.

They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down.

This means we're going to need $1t+ per year in spending, per year, on tokens. 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.


Here are a few thoughts:

- The publicly available information about how inference costs compare to training costs is conflicted. EEs involved in datacenters talk about power usage spikes during training runs as if they were a major factor in the designs, but academic papers discussing cost-optimal scaling confidently treat inference-time compute as a major factor.

- On the side of the balance indicating that training is more compute-intensive after amortization than inference is that Chinese providers, constrained primarily by access to compute, have nearly unlimited token availability at a lower price than US providers (inference), but poorer model capabilities (training). That would make sense only if US providers are inflating inference costs by 20-30x due to amortized training costs that overseas providers were not able to take on (there are other factors too).

- If training >> inference, they're in a prisoner's dilemma that far exceeds the ordinary zero-marginals model of competition between firms (due to its huge discrete stepwise nature). On the other hand, if inference>>training, the high-level analysis popularized by certain thought leaders, that it's like a utility, would be true. You'd tend to count this as a vote for inference>>training, but the CEOs saying it at least have a huge incentive to agree because the alternative, the prisoner's dilemma, would stop investment very fast.

- The only voice in the story that I just told you to have anything to do with fact (as opposed to high-level analysis and ivory tower armchair management of a secretive business) were the rumors from facilities engineers. That shows you the state of our understanding...

- If we don't even know the ratio between amortized capital expenses and operational costs, outside investor analysis is impossible. It doesn't matter how finely they divide the accounting buckets for office ferns and indoor ferns if the single biggest part of their business is obscured for trade secret reasons.


I'm about to leave a shallow comment, but I am a bit skeptical of the supposed drop in inference costs. If AI labs saw a lot of potential there, they'd surely be bragging about it non-stop? So the fact that publicly available information is conflicted is probably a sign that at the very least, the numbers aren't amazing.

Yes I know there's no evidence and this is lazy reasoning. But there's probably a bit of truth to this line of thought.


Why on earth would AI labs be bragging about how little the product they sell actually costs them to make? You don't want to do anything that reduces it's perceived value to the user, that might make them less willing to pay for it.

Also, inference costs are bound to go way down with more optimized architectures. GPUs are fundamentally not great at inference. No platform where the weights are streamed from a large pool of memory is. If the models ever quiet down, there will be massive step changes in cost/token, energy/token and tokens/second, as models are etched into silicon ala https://chatjimmy.ai/


A couple of years ago Altman was saying the price of AI compute is going to drop 90% year over year or something like that, so I don't think they're nervous about talking about lowering their costs. They probably just haven't been able to lower their costs.

You have to keep in mind that about 99% of their announcements are targeted towards investors (their most important revenue source..), so they're not going to be afraid to mention metrics that make the business look better.


Jevons paradox. Cheaper tokens does not mean we will spend less.

Cheaper tokens means the company's margins increase, which would be valuable for investors to hear

The main limit to my token spend right now is that I'm running out of hours in a day.

Ah yes, Sam “Not Consistently Candid” Altman

Oh, is that the guy that sold Loopt by claiming it had hundreds of thousands of users and it turned out to have 500 DAU after his exit?

Yep, the very same scammer. Wonder if he's lying about OpenAI too? Maybe about a person blowing a metal instrument?

he lied. he's good at that.

Why would any company brag about their margins ? Yet they do, to attract investors.

The key AI labs are not public companies, they are at liberty to brag about their margins to potential investors in private.

And investors will leak such claims quickly enough that this reasoning cannot plausibly hide big secrets.

It's not a big secret. If you just do the math yourself, it's easy to compute that inference doesn't cost all that much. People just see all the capital investment going around and all the new data centers being built, see that it's spent on "AI", put two and two together and get a three, or "clearly serving AI requests costs an arm and a leg".

The 1 they were missing is that AI requires both training and inference, and training is by far the expensive part. And that in principle you can stop training at any point and keep using the models as they are. (But that means that if other companies keep improving their models, you'll be left behind...)

In contrast, inference is fairly cheap and all the providers have great margins on it. Eventually either investment in training stops having commensurate impact on model quality, and people stop doing that and instead concentrate on making inference faster and even more efficient. Or if that doesn't happen, things will get very weird very quickly.


The market already shows where it will go.

If you want frontier model you will pay more for inference to essentially fund the expensive training.

If you don’t need frontier model you will get dirt cheap inference, which eventually will approach the cost of electricity spent per token.


This is technically correct, but practically false.

They can't stop training as then the AI's knowledge will become out-of-date very quickly. Their knowledge stops the day you stop training.


Yes it seems that this discussion that has sparked such controversy involves an already well defined concept in business.

Net margin versus gross margin.

Net shows profitability after extracting all expenses while gross only extracts the cost of the goods sold. Putting the model training costs into a one time fixed expense provides a much better gross margin.

This is known as COGS reclassification or classification shifting and is a common tactic to mislead investors.

This is why analysts look at Free Cash Flow Margin.

WorldCom and MicroStrategy did this before the Dotcom Bubble imploded.


> If you just do the math yourself, it's easy to compute that inference doesn't cost all that much.

Show us your work, then. If it's so easy to do, this should be a trivial request to accommodate, no?


Just look at large open weights models being served by inference providers.

Kimi 2.6 is a 1 trillion total / 32B active parameter model that's something comparable to Sonnet. Sonnet's API pricing is $5 in, $15 out per million tokens. Deepinfra serves Kimi at $0.75 in, $3.50 out, and about the same at openrouter. So you're looking at a 4-7x multiple that Anthropic is charging compared to market rates that any plebe can get with a credit card.


I'm not sure just how good that looks for Anthropic/OpenAI.

4-7x isn't a tiny markup, but how does that compare to high-margin internet businesses like AdSense? Meta and Google do hundreds of billions in ad revenue a year, and after taking out the publisher's portion (60-80% per some searching), I wonder what the ratio of the remaining tens-of-billions is against the compute cost and headcount required to run it.

And how much room for maintaining or improving that margin do they have if the cheap competitors also continue getting better? Is there a "good enough" point where the easier inference tasks are all moving to vendors massively undercutting them, and then they don't have the volume necessary to justify spending on further cutting-edge development?


> Kimi 2.6 is a 1 trillion total / 32B active parameter model that's something comparable to Sonnet.

No it's not. On some rigged paper maybe. Some such benchmarks say all models group together, which they clearly do not.

> Sonnet's API pricing is $5 in, $15 out per million tokens. Deepinfra serves Kimi at $0.75 in, $3.50 out, and about the same at openrouter. So you're looking at a 4-7x multiple that Anthropic is charging compared to market rates that any plebe can get with a credit card.

That's not saying much. You can get "cloud" at AWS and you can get a VPS. There is likely a 10x difference. It's not "same". Whilst AWS costs more they also don't have 7x margins similarly.


I’m wary of “has not been leaked in a way that was picked up in public news” as proof or disproof of anything.

this is changing soon

Not really, how much of a public company are you when 5% of your capital is public ?

That doesn't matter for the legal requirements.

The short and only kind of wrong version is:

In the US, companies are not allowed to unfairly privilege some investors over others by giving them access to secret information that would let them judge the future prospects of the company. (Except in all the ways they can, but these usually involve some kinds of insider trading rules.) Private companies can handle giving out secrets to investors by literally writing and memo and mailing it to all their investors, if they want to give out some secrets to one of them.

Public companies cannot do that, even if they knew who all their investors were, but must instead consider every member of the public a potential investor, even if they don't already own the stock. Because of this, when public companies want to reveal material information about their future prospects, they must reveal it to everyone.


The percentage is irrelevant for this discussion. As soon as you’re public, you need to report detailed financial numbers.

Plus, you have to do real GAAP accounting, not their made up metrics.

Besides the legal requirement, the reason these companies go public is often to provide liquidity for early investors or employees. So they do want to have as good of a margin story that they can, at least in terms of unit margin.

That's changing with this administration though. Reduced reporting cycles reduce transparency.

It won't impact the disclosure of key business details because it doesn't reduce the level of disclosure needed in the S-1 or the 10-K.

This is an interesting anomaly in the US. In the civilised world all corporations have to file public accounts, as the price for their limited liability. The detail and audit requirements depend on the size, turnover, staff numbers etc. This is because the shareholders are not the only stakeholder. The companies creditors, for instance, who are exposed to the limited liability have a right to see what they are lending to.

To answer the sibling comment, all of these public accounts follow local GAAP or IFRS.

The US still astounds me with its willingness to allow corporations to rip people off!


Creditors in the US can make visibility into financials a requirement for financing if they want. Protecting creditors isn’t a good argument for public reporting.

What about potential employees, can they look? The local community that consents to let the company build and operate in their town? How does that help, if they don't follow have to follow GAAP anyway?

Why are those things relevant to either employees or a town?

Most of the US is at-will so the financial health of the company is unlikely to be the reason you’ll suddenly lose a job.

Same for a town, if you’re structuring a deal that has counterparty risk then you mitigate the risk. If an employer is just leasing some office space in your town, why in the world would you ever even think you had the need to look at their financials?


What are the arguments against public reporting?

As a consumer you are often sending deposits or even the full cost of goods to companies some time before you receive those goods (in effect you become a creditor). You are also dependent upon some of those companies for service and repairs. It seems reasonable that you can check the finances of a company you are creating a business relationship with, I know in the past I've checked company statements.

You are unlikely to have significant enough sway to force that kind of disclosure. Small businesses as consumers have less legal protection and are similarly unlikely to be able to make disclosure a precondition of a deal.


So what. As a customer you can insist on seeing audited financial statements as a condition of purchasing, or purchase from another vendor, or do without. No problem.

Or, in the real world, running a limited liability company could come with some sensible reporting requirements?

Why? And what's sensible about it?

Isn't there a limit on the public markets where if a company has less than a certain percentage of its ownership traded publicly then it is no longer a public company and therefore de-listed?

I remember hearing about a guy trying to squeeze out short sellers of his own company but ended up effectively taking his company private because he bought out like 95% of all the shares.

I wonder how that aligns to these small releases of stock for the public.


There is no legal minimum free float requirement before deregistration in US, however, different exchanges have different rules

Essentially, a stock has to stay above 1$ per share, have a minimum market cap of $15m, minimum 400 shareholders and "adequate" liquidity If it meets those 4 criteria, it's essentially not at risk of deregistration


Growing companies don't brag about their margins, they brag about their growth and revenue. Margin talk is for when you're a mature company squeezing out every bit of profitability you can - if anything it would be a negative sign to be worrying about your margins when you're supposed to still be growing and innovating.

I mean, did anyone expect them to not have margins? Why keep it secret?

> Why on earth would AI labs be bragging about how little the product they sell actually costs them to make? You don't want to do anything that reduces it's perceived value to the user, that might make them less willing to pay for it.

Wouldn't they be bragging about it to investors? It feels like something that would matter a lot to them, and at least OpenAI kinda feels desperate to find them.

There's also the small question about whether a drop in inference cost would actually change anything about profitability, when training seems to get exponentially more expensive.


Because companies that want to go public need to look profitable or potentially profitable. And before they go public they have to release real, actual, legally demonstrable numbers for their costs and revenue anyway.

When they will actually file to go public, their numbers will be intensely scrutinized. That's all that global headlines will be talking about for weeks on end. Why would they create forward expectations before it's necessary?

Of course they don't want to create forward expectations in a volatile macro environment, with the public listing being 6 months out.


Because the most important thing for any pure play AI company right now is to prove they are a viable company. And sure they have proved they can make billions, but also that they can lose billions more. They are going to need even more money and to prove to the next round of investors at an even higher valuation that they are a viable business they need to show not that they can generate revenue, but that they can one day turn a healthy profit. And that is the trillion dollar question.

I doubt having to replace every single chip in your data center every time you release a new model will bring down costs.

Went to that URL asked one question - "how is this different from other AI" and it took 598/6144 tokens, not sure what that means.

Not super clear from the site itself, but this LLM is running on specialized silicon implementing just it. So has super low energy use and blazing speed.

See https://taalas.com/products/

Edit: updated link


Incredible increase over Nvidia! Need to read more.. Thanks!

Because they can think more than one quarter into the future? Why on earth would someone adopt something into their core workflow that was fantastically unprofitable? Uncertainty and business don’t mix. Most people aren’t hype-eating bacteria that only care about maximizing their next paycheck.

One reason is that all the code you write with this goes in your private git. If using AI no longer is possible because of cost, you can still profit a lot from what you did with it before.

For consultants? Sure. What percentage of contractors are consultants? And is that better than going with something in your stack that’s sustainable even if it’s not totally optimal? I’d wager most would say no.

Regardless of profitability there will always be multiple good LLM vendors as well as open-source alternatives (slightly worse but still pretty good). If one vendor fails then it's easy to switch your core workflow to a competitor.

On an individual basis for coding? Sure. If you’re a significant business with agents that do more nuanced work, which is the only kind of customer that will let any of these companies pay back those trillions of dollars as quickly as they need to to stay alive, these are not fungible services.

I wonder if inference costs will go down...

or will it be like microsoft office, where the software bloats to use/fill current hardware?

(and in this case bloats might mean better thinking or pulling in more data)


If inference costs drop 90% or whatever, that would be a massive write-off of hardware even before they gave any returns for it?! Given Chinese and others are snapping at the heels and would also benefit from such reduction in cost.

> Why on earth would AI labs be bragging about how little the product they sell actually costs them to make?

Investor confidence. They have a bit of a need for cash (also an interesting part of the profitability discussion of course).

> Also, inference costs are bound to go way down with more optimized architectures

I agree. Jimmy is incredible, I wonder what non-toy use cases they have. Surely they’ll come out with updated chips soon.

That said, I was apparently a bit over-excited for Groq and Cerebras. I thought they’d quickly dethrone Nvidia for inference, but not so far. Even the GPT spark trial isn’t seeming to go far.


Inference has traditionally been far less expensive than training. One public example is the fact that hobbyists can run StableDiffusion ($600k training costs[1]) on their personal computers.

Speaking to your point, inference being dramatically less costly than training would not be seen as a delta from the norm. The model of providing inference for anything near the operational costs (like a utility would), would the delta from the norm if it were true.

[1] https://x.com/emostaque/status/1563870674111832066


The difference between training and inference is 1) one have to keep intermediate results for backward pass in training and 2) computation for training double because of the backward pass.

Training is also done over batches, which increase memory requirements by several orders of magnitude. This is why training needs costly compute.

One of the ways out of this unfortunate situation is to use something like Stochastic Average Gradient Descent [1]. Examples there are mostly concerned with regularized logistic regression, which makes problem more or less convex. Neural networks are inherently non-convex. Still, maybe some ideas from there can be utilized in the context of neural networks, like use of estimated Lipshitz constant to derive curvature and appropriate learning step.

  [1] https://www.cs.ubc.ca/~schmidtm/Courses/540-W19/L12.pdf

So one way to think about it is roughly,

Training is inference + backwards pass (~2x inference cost) + activations (vram overhead) + optimizer (vram overhead) + gradients (vram overhead).


Multiply "inference + backwards pass (~2x inference cost) + activations (vram overhead)" by batch size (thousands) to get to the actual RAM and compute cost. Optimizer like ADAM adds only two or three model-sized overhead.

And last, but not least, you need only one hidden layer kept in RAM for inference, but you need all of them (61 for Deepseek models) kept in RAM for computing gradient for one sample.


Microbatch size is a hyperparameter, it can be set to 1 and work just as effectively. With gradient accumulation it's equivalent even. Large batch sizes are used to increase parallelism, and sometimes to reduce variance in the loss signal (at the cost of increased bias).

Batch size is frequently limited by compute bottlenecks well before memory.


And of course you do all of this for every object in your training set, which is going to be larger than the total number of uses for any individual user.

Does it matter what is the difference in size of needed inputs for inference vs. training?

It's all got much more complex than that in recent years. Training now involves large amounts of inference for RL rollouts and similar. You can't disentangle them computationally like that. "Inference" is just the word used to mean serving customer traffic now, and "training" means creating the model you serve.

That is an estimate of the relative cost of one training step, but you have to multiply it by the number of training steps, an unknown quantity.

I think in your StableDiffusion example, a lot more than $600k will have been spend on electricity alone for inference (on those personal computers you mention). So inference is more expensive then training.

For equal capability tokens, there has been about a 10x drop in cost every 6 months.

We are still chasing the best because the best is moving rapidly, but it’s a simple thought experiment to work out what the cost to serve an 8B model from 2 years ago is in a world of 2T models.

Note: parameter counts are illustrative. Concretely, qwen3.6 27B delivers opus 4.5 capability at 1/27th the cost on openrouter. Single chip llama3 8b performance can exceed 17k tokens/sec.


8B models would be consider obsolete in the world of 2T models, at least if we're talking about the competitiveness of OpenAI/Anthropic. The only reason why they are valued so highly is their supposed dominance at the top end.

The main story of agent use cases is in enterprise so far. An enterprise will only pay for a model capable of handling the task and no more. Most enterprise's see no need to hire PhDs as factory line workers.

Coding is an interesting case as [1] the pace of progress has been absurd and [2] it's hard to put an upper bound on required capability. However hard to put a bound on and will are different, it's quite possible that the average engineer will cease to see the benefit of rapid progress - or that their employer will be satisfied with lower tier models.

How smart of a model do you need to build a high quality CRUD app for internal users? Or build a scalable web service?


yes, which is why the revenue growth story is not looking so great for Anthropic/OpenAI, when open-source alternatives are not far behind with much lower costs.

> For equal capability tokens, there has been about a 10x drop in cost every 6 months

Is this still happening? Opus 4.5 was six months ago, can you get its capabilities for 1/10 cost now? Are we on track to get the same for 4.6 in a couple months?


Pretty much, Kimi K2.6 is opus 4.6 quality for coding. If you include discounts due to more efficient input caching it is around 1/10th of opus4.6.

https://openrouter.ai/moonshotai/kimi-k2.6

The march of cost efficiency moves on.


Why haven’t I heard of this? Is it available in IDEs like Cursor?

> I am a bit skeptical of the supposed drop in inference costs. If AI labs saw a lot of potential there, they'd surely be bragging about it non-stop?

Unless to the grandparent commenter’s point they’re using it to obscure their large prisoner’s dilemma (training) cost?


> If AI labs saw a lot of potential there, they'd surely be bragging about it non-stop?

Google seems to pretty regularly post about how their TPU and algorithm advancements have been decreasing energy costs for both inference and training.


What other companies brag about lowered costs? Isn’t that just a complicated way of asking customers to demand lower prices?

Small alternative potential future changes that alter this analysis:

* At some point model capability reaches diminishing returns. Then inference >> training in the future but training >> inference now. It’s not a prisoner’s dilemma but a land grab to solidify market position and be one of the 2-3 firms left standing as dominant in the space. The model companies aren’t super sticky yet but they’re working on it.

* even if training remains >> inference, it’s possible to have multiple price points like they do today. If you need the most capable model you’ll be paying exponentially more per token to supplement the training cost even though the serving cost is marginal because most people will be satisfied with cheaper / less capable models for most tasks.

I buy that inference is a dropping line item while training is a growing one. There’s all sorts of things on the horizon that’ll be order of magnitudes improvements, from startups burning models into ASICs to get order of magnitudes more performance to alternate architectures like diffusion transformers that have orders of magnitude structural optimizations. It’s inevitable that it’ll come down even further from where we are. It’s possible model training also will go down but I’ve not seen any compelling research suggesting major “easy” reductions here.


The issue is that most tasks do not require frontier-level intelligence, but companies like OAI can really only profit off of the frontier. Capabilities from a year or two ago are so outdated that even OpenAI gives it away for free and there are many other models biting at their heels. In other words they are spending huge amounts of money to cash in on a depreciating asset.

So one possible future is that frontier-level training becomes so expensive and the use cases so sparse that it simply isn’t viable to keep going bigger.


Once the land grab is over, the market will consolidate and the winners will absorb the losers. Then the few winners will be the only ones with real capital to train frontier models and will have true pricing power. Similar to how social media companies or the gig-economy benefits from network effects, AI companies will benefit from having the lion's share of paying customers (that also constantly feed in more data to train the models on).

We have GPU costs, power costs, and how many token/s models can generate on those GPUs. It’s possible to figure out the marginal cost based on this. The current estimate is about $0.40 per million tokens for gpt4 equivalent model. Sonnet 4 is $15 per million tokens, so they are charging high margins on inference. The issue is how large of a margin is needed to recover their costs before the GPUs age out, and how high of a margin can be charged before it’s not economically viable.

https://www.gpunex.com/blog/ai-inference-economics-2026/


That seems way off to me.

I skimmed the article, but couldn’t spot any details on their estimates. They mention 70b+ params as being large in several places. But we’ve had several 100b+ param models that trail Sonnet.


Why would power spikes from training runs imply training>>inference? The cost of a training run scales with energy, whereas power is energy per unit time. All that tells you is that they're speeding up their training run so it will take less time overall (probably chasing some first-mover advantage, where they're out with a given model before their closest competitors), whereas they obviously can't do that for inference (which is a steady flow of requests over time).

Yes the huge discrete stepwise training spend is critical.

Maybe investors will realise that "the only winning move is not to play".

And so we are left with (as was) frontier models getting more and more out of date as whoever their post bankruptcy custodians are tries to eek pennies on the dollar for inference on their decaying property. Perhaps along with local and/or highly specialized models still feeding on the after-glow of the huge amount of training that was (and is no longer) done.

The next AI winter is going to be deep, savage, and long.


Bankruptcies? The winners will gobble up the losers and the few remaining players will have pricing power. Don't be naive thinking that OpenAI or Anthropic can possibly go bankrupt. There will always be someone happy to buy them up for a nice price. Yes, the market will have to go through a consolidation phase though.

> frontier models getting more and more out of date

Why are they getting out of date? Is it because we have new content from the internet that the older models did not have? Or are we simply trying to increase the size of the training data? In other words not more up-todate in terms of time the content was created vs. wanting to use bigger training-input-sets?


lack of new content from the internet will make them go out of date. Not just facts and figures, but (for example) new programming languages/techniques.

I see makes sense. Then it it kinda says that quality of the model is the topicality of its inputs I assume.

I don't see how it would be possible for inference costs to dominate training costs, even after amortization.

Training involves multiple passes over the entire training dataset, ideally in large batches where you can perform inference on as many samples as possible simultaneously and then perform backpropagation to adjust the model weights (which is about as expensive as inference).

Let's consider the size of the dataset we're dealing with here. The dataset likely consists of practically every piece of digitized text they can get their hands on (including that extracted from audio and video). We know Google has digitized a large portion of the books in existence as part of their "search book contents" feature and we have no reason to believe they're not using it alongside their cache of 90+% of the internet to train their models. We're talking about 100s of millions of books each with an average of 100,000s of tokens. The internet has 10s to 100s of billions of pages on it with who knows how many tokens on average. This is a huge dataset that we've got to go through hundreds of times.

Second, let's consider the effect of batching and how it sets requirements for our hardware. We know that larger batch sizes converge faster, are more stable, and produce better models. So if you want a good model you need large batch sizes. This means that you need machines several orders of magnitude more powerful than you use for inference. From what I heard Google uses clusters of 100s of the their TPUs all located in a single rack for training. These clusters are organized in a customized computing architecture to maximize memory locality between cores (really critical for efficient back-propagation). Further, you can't use reduced precision weights for training like you can for inference, so there are no shortcuts.

Finally, the initial training stage is followed by reinforcement learning stages - this is key development in how AI models have improved in the past year. This may mean going through a curated set of traces (either synthetic or captured from users) and adjusting the weights based on experienced outcome.

Overall there's so many orders of magnitude more work and more hardware requirements for training that I find it improbable that inference dominates. The number of "inference" steps in training is freaking ridiculous and includes such factors as the "number of words ever written".


It's been a while since I saw a detailed paper on a high end training run, but extrapolating from what I remember, it seems those training runs are in the 10s of trillions of tokens. This already accounts for potentially sampling tokens multiple times during the training run.

That seems like a large number, until you realize that OpenAI claims to have almost a billion weekly users. And OpenRouter shows many models at over a trillion tokens per week.

So in pure token terms, I'd say it is in fact extremely plausible that inference dominates, at least for the popular models.


Not saying you're wrong, but I'll note why inference might dominate despite everything you mentioned.

A given model is trained once but applied N times. A large enough N will dominate training, no matter how complex and costly it was.

But how long is a model useful for? How often will labs need to train new models? Time will tell.


This statement is well known to be incorrect for at least a year.

Great points. - At the end of the day those are still private companies (albeit huge ones), so we can only speculate about the state of their private financial situation. Once they will decide it's the right time to IPO, they'll publish all their financials and we'll start to have a clearer picture. - Later, each company will slightly specialize and have a different go-to-market strategy, which will allow us to understand on a deeper level what works in the market and what doesn't (think about how Facebook, Instagram and TikTok are all huge universal social media platforms, but, each with a different target audience and different user base). - Finally, the market will go through a consolidation phase in which winners will gobble up the losers and then the incumbents will have a real moat (against new-comers) and real pricing power on their user base.

> If we don't even know the ratio between amortized capital expenses and operational costs, outside investor analysis is impossible.

And yet we surely need this data for the IPO? Or are they relying on rule changes on the indexes to force ETFs to buy shares?


The IPOs are months away, potentially 6 months or more. We're in a volatile macro environment. AI companies have all the incentives to not create higher expectations regarding their financial situation a long time before the IPO. Obviously at IPO they will have to disclose their full financial situation.

The market is super hyped anyway for their IPOs. If they raise investors expectations now and things change until the IPO, investors will be disappointed. It's a lose-lose proposition.

The smart play for any company is to keep their cards close to their chest until close to the IPO time.


I work for a tiny little company ($150MM annual rev with 9% net) and we are already looking at dropping $100k on hardware to run local models because, for us, they're "good enough."

Our estimated spend for AIaaS would exceed that cost in less than a year.

In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.


Yeah, that's the part that just seems to be wildly under-discussed to me.

If open source models are ~3-6 months behind SOTA, and ~opus4.6 capabilities are good-enough for product market fit, do the frontier labs have half a decade to catch up on their prior burn?

AI cost ballooning faster than companies can afford is becoming a very common topic in my circles right now. The era of "I'll pay infinitely more for marginal gains" is over from what I can tell.


> If open source models are ~3-6 months behind SOTA, and ~opus4.6 capabilities are good-enough for product market fit, do the frontier labs have half a decade to catch up on their prior burn?

They know they do not and that’s why they’re all trying to IPO right now, so they can pass the bag to consumer investors


More correlation, if more correlation was needed:

1- SpaceX + Tesla + xAI merger / IPO while Musk was vocal against IPO for about a decade

2- Warren Buffett cash at record highs

Someone got to be exit liquidity


The printing press was good enough for product market fit back in the 1700's. But now it isn't.

Last year's AI models will be the same. Do you want to spend 3 hours prompting free AI to fix your code or 1 hour prompting AI you paid $20 for?


That's only if these AI companies can keep improving their model performance faster than open source options can keep up. I don't think performance will keep scaling with more training data, and even if it does they've likely already used the entire history of content created by humans for training. Everything points towards diminishing returns in an increasingly crowded space of competitors, there's no other reason for these companies to be rushing to an IPO if they felt secure in their market positioning.

Open source models that you can run locally are much more than 3 to 6 months behind. 6 months was the November inflection for Claude. No open source model is as good as Claude Opus 4.6.

It depends what you mean by locally. I don't foresee running a model on my laptop anytime soon to power a coding agent. Far more likely is an infra team at my company operating an open source model on cloud infrastructure. When they're already paying $1000 / month / dev, it starts to pencil pretty quickly.

Is there any open model as good as opus 4.6 at any price?

How many problems require Opus-4.6-level performance? The "I'll accept nothing but the very best model for any task" thinking is perplexing to me.

People got a lot done before Opus 4.6. In 6 months, would you be dissatisfied by Opus-4.6-level open-weight models, just because Opus 4.8 will be out?


Not OP but I've been thinking about this a lot (like everyone ha) and I think my answer is, yes?

I hope there's a "good enough" point but I don't think we're there yet. Like for me hardware got good enough several years ago. But while opus 4.7 is really good compared to everything else, it's not so good that I would use it at a discount over whatever is available in a few months. The improvement in quality, speed, and daily frustration is worth it to me... Spoken as someone whose employer is footing the bill, so take that with a grain of salt.

I want to run my own local models, but I don't think that's feasible without lots of frustration until a few generations of frontier models are so good that they're almost indistinguishable for common tasks. Kind of like how MacBook pros have been for a while.


While I can imagine that I'd want to use Opus 4.8 over 4.6 for a fair number of things (at least if they can avoid further speed regressions), I also have noticed that certain types of failures seem to be systemic. Bigger context has been helpful for bootstrapping, but still doesn't fix problems of getting stuck on the wrong things - you can toss more things in the blender, but you don't necessarily know which way it'll slice them up in advance, or which things from them it'll latch onto. And output still seems to get into "blindered" states where important details get dropped - even though it'll agree very quickly when you point that out. As long as we're in that sort of "spit something out in local targeted manner, and then do a revision loop until tests are green" style of execution, bigger models haven't shown me the ability to really avoid finding non-optimal / subtly-broken outputs for complex problems.

Using Cursor to hop between models, I've found Opus to be generally better at really tricky debugging than GPT 5.5 or earlier models, but not reliably better at execution because of these things. I'm not sure Composer 2.5 is quite there yet for the execution side, but it's getting pretty close to those other ones, such that I'm definitely still in a "debug and plan with slow, execute with faster ones" operating model for working on hard shit.


Why should I need to talk to Opus 4.7 when my day-to-day task is about programming in Java and Python? I don't need my model to know about biology or chemistry. If I need those capabilities (for someone who is working as software engineer in chemical industry), I will talk to Opus 4.7 for planning and then fan-out work for cheaper coding models. I think we will soon start to see specialized highly effective English language only programming models. I don't need my coding model to know about literature, art, philosphy, ethics, etc.

If there were a coding model as good as opus that didn't know multiple languages, biology, etc, I would happily use it. But I'm not aware of one - are you?

It actually seems somewhat difficult to train such a model since "all the text on the Internet" is easier to provide in bulk than a highly curated set.


Well language detection isn't all that hard in the scheme of things (especially now), but maybe having only training on English makes models less effective programmers. It would be interesting to see that as an experiment.

I would think that the surrounding chemical "knowledge" could be useful in the context of programming in that industry. Have you ever found it to draw links and conclusions between what you're doing in computer science and the chemistry it's in the middle of?

I would use Opus 4.7 for the planning stage where chemical knowledge is required then delegate to smaller English-Programming-Only-Opus to do the actual coding.

I'm very happy to have multiple sessions open (and do) and switch between fast and slow models, and if there were a batch mode in codex or Claude code I would use it. (Just like I sometimes use codex fast mode)

But at the moment, I can't imagine why I wouldn't be spending the majority of my time with the best models. I'm spending a lot of time with them! Reducing the number of back-and-forths is extremely valuable to me.

I expect in two months I will still want to spend >80% of my time prompting the best models, and that's true if I were spending my own money on hobby projects, too.


Something that's under appreciated right now is when designing systems and proposing solutions, my colleagues and I do a lot of brainstorming with llms. The core architectures have come out of that, but the best pieces of that architecture are still coming from humans.

These are ideas that simplify the design, reduce future work and tie together the entire system. If in two months I can arrive at ideas of that quality with normal brainstorming with llms that will be extremely valuable


As long as the improvement is vastly more valuable in my time than the added cost I will always use the best model. I think it depends on your situation and tasks what makes sense.

    would you be dissatisfied by Opus-4.6-level open-weight 
    models, just because Opus 4.8 will be out?
Well, I see what you mean, but two big concepts...

1A. Models get stale pretty quickly w.r.t. new developments that occur past their cutoff date. "But you can just keep them current by linking them to never documentation, etc!" Well, no, you sorta can't -- at least not in perpetuity. Those search results fill up your context window real quick. So that gets unsustainable real quick.

1B. Even when your context has plenty of free space, the results you get from "here's a link to the documentation for this new framework that released after your cutoff date" absolutely pales to the results you get from knowledge that is fully baked into the trained model as opposed to your context window. For one thing, that documentation link you pasted into your context might link to... a dozen code examples. Whereas if that was baked into the model itself, the model might have been trained on many thousands of examples in Github etc.

2. It's also a reality that most professional engineers have to keep up with their peers and competitors. We can maybe say it shouldn't be that way, but it is. So if $SOME_NEW_MODEL is significantly better than 4.6... and my peers and or competitors are using it, then yeah I might but really feeling the need to match them. And I'm not even necessarily talking about some kind of cutthroat dog-eat-dog stack-ranked workplace.

These limitations aren't relevant for all use cases or careers but they're hiiiiiiiighly relevant for professional software engineering.


I image that'd be handled via a fairly regular minor bit of additional fine tuning to update them with new information rather than polluting the context space.

It seems that the cutoff date for all models is stuck at some point before AI generated content started being pervasive.

that's the nice thing about open weights, you can always retrain them with the latest documentation, no need to fill your context

Kimi 2.6 probably. Needs over 300GB of GPU memory to run (1TB for for full capabilities) so either a 4x A100 or 8x A6000 would do it.

A $50k - 100k rig could do it and an entire company would be able to use it a full speed.


No, but the big open models are on the level of Sonnet 4.6, which is very good for most problems.

The people who are claiming Opus level capability does not have sufficiently complex problems to see the difference.


And neither side brings any evidence ...

For coding don't think so, but they are very close. I code with sonnet mostly because I think opus is just useful if you fail to dissect problems adequately, but anyway.

Kimi is close for example regarding SWE bench for code. For reasoning there are open models that surpass opus by quite a margin already.


> that you can run locally

That's doing a lot of work here.

The future I see isn't most companies buying hundreds of thousands in hardware to run models, it's them adding a line item to their AWS bill. Inference costs on the larger hosted open source models are dramatically lower than the frontier labs API pricing.


The future I'm seeing is AI coprocessors running inference locally in most devices that today have a CPU. Just look at how powerful your mobile phone has become compared to your desktop computer 15 years ago and compared to a main frame 30 years ago.

The days of requiring a data center to run anything resembling opus 4.6 are already counted. (But the industry will fight hard to get people to keep paying the Claude tax.)


I'm already running a google TPU over USB on an otherwise very cheap board to do local computer vision on a front-door camera since I wanted to get away from Ring and other cloud services for that use case.

And yeah, that may be the ~decade world, but we're in the mainframe era of the frontier models. It's going to be more economical for basically any consumer, and most businesses, to pay someone else to host models for quite a while.


A gaming PC can already host models that perfectly serve casual users who just want recipes, todo tracking, picture identification, etc. E.g. Qwen 3.6 35b which will run on a $650 GPU at 75 t/s (Nvidia 1660 ti 16GB).

Said model will also run as a tool-calling coding model excellently (it's no Opus, but for a thing that once set up is just the cost of energy, it's incredible). It can type faster than you can, probably 10x faster, so with guidance it'll make you faster. And it's free.

It's here. If folks want ChatGPT without a subscription, they can have it today on their computer. The only money to be made is in the high end models doing "serious business" work spanning 1M+ token contexts and massive uncertainty. Everything else is already set to be eaten by today's local models.


The problem with models like Qwen 3.6 35B (which really is an excellent model) is that my expectations of what a model can do have gone SO high now.

Here's a prompt I just ran against Claude Opus 4.7:

> Use python3 to experiment with whether the SQLite3 authorizer mechanism can be used to detect an INSERT OR REPLACE based just on running an explain query without examining the SQL string itself

Opus nailed it: https://claude.ai/share/c4212606-3fee-4b7c-bc97-505e0348ccac

I tried the same thing against qwen/qwen3.5-35b-a3b running locally in lmstudio, with the Pi coding agent. At first it looked like it was going to do great! And then it fell apart over the course of several tool calls: https://gisthost.github.io/?8ae2f842df619fb7fd8f1ccd82fe41c7

I'm used to GPT-5.5 and Opus 4.7 handling that kind of prompt without any problems at all.


Something is definitely going wrong with your Qwen setup, in the link you posted it starts and ends with a compaction step due to a 4k token context limit. Qwen 35b supports I think up to 200k+ context limit (though I run only with 128k), that seems to be a major source of the problem.

Good call, I need to check if LM Studio is misconfigured.

This worked for me with qwen3.6-36b-a3b even at a q4 quant. I ran pi in a docker container and it had to figure out how to install python as well. I used the same initial prompt you had without any additional. You talked about Qwen 3.6, but then said you tried Qwen 3.5 in lmstudio. Not sure if you meant Qwen 3.6. I ran with llama.cpp llama-server with the recommended settings from unsloth.

I'm not an expert in SQLLite so I can't say if this is 100% correct, but it seemed directionally similar to the conclusion from claude.

  ### TL;DR
  
  - Authorizer + EXPLAIN:  No — authorizer only sees SQLITE_INSERT, not VDBE opcodes
  - EXPLAIN opcode analysis alone:  Yes — Delete opcode at position 10 is the unique signature of INSERT OR REPLACE / REPLACE
I can't help but think the not-so-distant future will see language models expected on commodity personal computing devices.

OK that's a very good answer! Do you mind sharing the transcript?

Sure I cleaned up the jsonl session file a little here: https://pastebin.com/PL9EPn9Y

I tried it a second time, and it spent a lot of time trying to figure out some authorization issue, so definitely not a slam dunk. I might run it a few more times for science. But while this is a new model it's also quite lightweight, and as hardware adapts and improves it seems inevitable that for many use-cases a packaged language model running locally will do the trick.


So one of the prominent LLM advocates known for testing every model shared a prompt intended to exhibit Opus 4.7 capabilities, and Qwen 3.6 sorted it out okay? Interesting.

Not saying they're equivalent, local models still decohere much quicker as the context grows in my experience. But... Interesting.


Thats when your build a better Ralph loop around your llm for it to converge to an answer and not rely on 1 shots

> a thing that once set up is just the cost of energy

I don't think we can discount this, frankly. Newer electronics are energy efficient, but older devices are more energy-intensive, and unless configured well, a gaming PC can easily use a few dollars a month in electricity, so now you're approaching subscription territory. A subscription comes with no upfront cost, higher reliability, no wasted space in your home, mobile apps, etc. (and less privacy).


Curious why you went for a custom solution. I am aware of at least one company that seems to ship devices with local computer vision (Reolink).

My experience over the past decade has been being subsequently burned by being reliant on one provider's ecosystem after another. This is great until Reolink starts doing something shady to pad the bottom line and then it's on to the next.

I wanted the ability to run whatever cameras on a VLAN and own the stack.


I'm guessing that they are using Fargate which is an OSS NVR. It supports a little addon USB stick you can buy for about $30 that will run common computer vision tasks for object detection. Stuff that we've been able to do with WebAssembly and Canvas for a long time now.

> But the industry will fight hard to get people to keep paying the Claude tax.

I bet this will ironically be couched in "safety" reasons or regulation to get anti-AI folks on board, even if it favors the large incumbents.


Counted but not yet numbered?

Even when run on datacenters, it would be like current day webhosting. It is hyper competitive and it will be a race to the bottom. There is money to be made but not as much as investors hope. There will be datacenters in random countries like Kazakhstan because some oligarchs have found a free energy glitch (like with bitcoin mining).

Magical thinking. I guess if your phone is going to have 128gb of dddr5 then sure. You people fundamentally don't understand the memory requirements for running inference. Your cute local models seem good enough because you have no standards and anything an LLM produces seems like magic to you.

> Magical thinking. I guess if your phone is going to have 128gb of dddr5 then sure.

Why would it not? The typical new phone today has 16gb of RAM. 20 years ago that was somewhere around 32mb. Factor 512. It's not hard to see that we'll get there rather soon, especially if there is an application that provides demand.

> You people fundamentally don't understand the memory requirements for running inference.

You seem to be overlooking how fast things change in this industry, especially if tons o money can be made as a consequence.

> Your cute local models seem good enough because you have no standards and anything an LLM produces seems like magic to you.

Please don't generalize. I'm an expressed AI skeptic and have to deal with the bad consequences of AI slop every day. But you can't deny that there are enough applicationn areas where people have use cases and those will be much easier if things don't need a few round trips to a data center that sucks all the electricity and water out of neighboring communities.


Eh, you're off by an order of magnitude or so on both ends.

The iPhone 17 has like 8 gb, the Pixel 10 12.

The original iPhone was 128mb, and the iPhone 6 from 2016-2018 was around 1gb; that puts the iPhone at around 8x RAM per decade, and puts us at 128gb in our pockets at around 2036 or so.

(Incidentally, the big news in phone RAM is that a lot of new phones are dropping back to 4gb because of RAM shortages.)


> it's them adding a line item to their AWS bill

That's the future Amazon sees too. We just had a week long session with the AWS team and they pushed that to us multiple times.


Buying "hundreds of thousands in hardware" sounds like a lot but many companies - especially software companies - already do that if they have 100+ employees.

Running software in the cloud gives you certain reliability and scaling advantages that would be very hard to replicate locally. Running some code agents in the cloud vs local hardware, if the local hardware gets "good enough," breaks the other way - offline usage, alone, would be hugely valuable to many people and companies.

It'd be very interesting to see where various players would decide to make a call "local is good enough" though. Buying the hardware isn't a small bet, if it's not something that ends up as part of your standard computer.


Many business tasks do not need the latest frontier models. I have a production system running since early GPT-4o. It now runs with GPT-5.2, not for improvements, but because it is cheaper. I could invest in switching to a local model, I tried and it works well enough, but api costs for this task are so low, it barely scratches $30/month. So I am using the local machine for other things and leave the inference on OpenAI, for now.

But one will be in few months. And then you have choice of paying say $100k for hardware and pay just power cost (or pay someone to do that for you), or pay way, way more for your team to have access to marginal improvement.

And 5% worse model for 10% of the price of the bleeding edge will be worth it for majority of people


This project argues that with appropriate harness, the performance gap between frontier and much smaller open weight models shrinks dramatically: https://github.com/antoinezambelli/forge. I haven't kicked the tires yet.

I've been doing my work with OpenCode Go, with Kimi2.6. It is not as good as Claude Opus, but it's good enough to get the job done, and I never run out of tokens.

I keep hearing about this "inflection", but it feels extremely exaggerated to me. And yes, I was using it at the time. It got incrementally better, it wasn't that amazing.

I think the bigger shift was harnesses and the two ended up somewhat commingled in people's minds.

Claude code was a lot of people's introduction to using coding agents that could do a lot more than copy-pasting from a chatbot or autocomplete.


The tool usage + skills got markedly better and so did the thinking cohesion. Add 1m context windows and it was a very noticeable shift.

Opus 4.6 quality for local inference would be revolutionary.


1m context is garbage

It's just a metric. If it can find a needle in a 1Mtok haystack, then it's likely good at coding within a 200Ktok context (or whatever, insert your number here, I'm just trying to make a point)

Opus 4.6 is a February model. Every time this subject comes up it seems like people post intentionally misleading things and move the goalposts.

The goalpost we've been bludgeoned with over and over again is that, in particular, Everything Changed in November 2025. That GPT 5.2 and Claude 4.5 were the inflection point. That is actually 6 months ago. And DeepSeek 4 is already there.

> run locally

You can't run DeepSeek locally on consumer hardware[1], but you can on enterprise hardware, and enterprise spend is the subject of this conversation -- and even if you aren't self-hosting, it doesn't matter, because you can just get your inference from one of the the many companies serving DeepSeek, who trivially undercut the pricing of OpenAI/Anthropic because they didn't have to spend hundreds of billions on training frontier from scratch but instead only invest in supporting inference, which is already profitable.

[1] Since this misconception comes up all the time, I'll go ahead and pre-empt it: no, training a 32b parameter model on outputs from DeepSeek and running that locally is not "running DeepSeek", despite the hundreds of stupid articles and Youtube videos making that idiotic claim that they're running it on a 5090.


> You can't run DeepSeek locally on consumer hardware

Maybe not DeepSeek v4 Pro, but I've run DeepSeek v4 Flash on my 128GB MacBook Pro using antirez's carefully quantized https://github.com/antirez/ds4 and it's impressive.


Oh sure, yeah, that's nothing to sneeze at either. I think unqualified "DeepSeek" should generally refer to the main model, though, especially in the context of GPT5.2-grade quality.

> You can't run DeepSeek locally on consumer hardware

I'd qualify that by writing that you can't run it with ordinary, real-time speed and throughput. If all you care about is slow and high-latency inference, there's no reason why that shouldn't be feasible even on the cheapest miniPC around, as long as it can literally store the model weights and keep around the (rather small) context.


To be relevant to this discussion, models running on reasonably-priced local hardware do not have to be as good as the best.

They just have to be useful enough that companies don't need the best.

They are.


Deepseek v4 pro is damn close to Claude 4.6, and whilst you'll pay quite a lot for a rig able to run it, it is open source.

Kimi is better.

There's still a lot of room for the best models to get better at coding .

Your argument rests on the "for marginal gains" part but it's really not clear that the gains are marginal in the foreseeable future.


This is totally valid and I don't agree with the downvotes you're getting. Someone coming out with a 10x improvement is possible and would change the game immediately. The thing is, we really have been seeing marginal gains with shifting leaders in who's got the "best" since GPT3, and at least as a user of these tools that pace has been slowing, not accelerating. Subjectively it feels like we're in the back half of an S-curve.

We're 3.5 years into this current AI wave, and a lot of the valuations have been predicated on what you're arguing here -- that essentially should one of the labs make an order-of-magnitude improvement or hit escape velocity on recursive self-improvement they'd become the most powerful economic chokepoint in history.

The reality has been that given access to compute + capital all of the labs can stay pretty competitive with each other. Someone does a bit better on coding, someone else does a bit better on tool calling, and then they swap after each spending another $100bn.

The market looks like a commodity market where the commodity is intelligence, not a winner-take-all market with massive margins. Plenty of people get rich in oil and airlines, but they notably don't tend to be the innovators long term, they tend to be the operators. Obviously if the machines become sentient tomorrow, turn on their masters, and hit world-dominating intelligence, that assessment changes, but after several years of that narrative while objective reality looks quite different I think the more sober voices are starting to gain a foothold.


I agree with most of what you're saying, but I think the point I was trying to make wasn't as high-flying as you and others understood it.

I'd pay a premium for even just a model that's 20% better, no ASI required, and I think a lot of people would. I wouldn't call that marginal, if it means I'm getting frustrated on 20% fewer tasks.

A recurring pattern that I've seen in myself and others is to at first be very impressed by a new model's coding capabilities, and then desensitize quickly and start being frustrated by the shortcomings.


> I'd pay a premium for even just a model that's 20% better

The point I'm making is that I think we're rapidly hitting levels where corporate buyers aren't willing to pay multiple-times-more for marginal gains, and I expect that to become more the case over time, not less. You, and a small % of other power users in the market might tolerate a $400/month pro-supreme-plan for access to Mythos or whatever, but I don't think that's going to scale up in quite the same ways we've seen so far.

Even a year ago paying multiples times more for a 50% gain was very sensible for a lot of workflows. But if we're getting to "good enough" for things like coding, justifying to your CTO/CFO why the org should go from spending $1m/year to $5m/year for a 10% higher hit-rate on one-shot prompts from the engineers is a much tougher sell.


What? The gains between gpt4->5 seems to be marginal. No phd level discoveries here

The leap from GPT-4 to GPT-5.5 has been astounding in my opinion. There is no way GPT-4 could run a coding agent harness like Codex at even a fraction of the quality that GPT-5.5 does.

I don’t think that’s exactly indicative of GPT-5.5 being an astoundingly more intelligent model, however. An alternate interpretation is that GPT-5.5 was trained on tool usage/harness patterns and has been optimized for this use case.

I remember that even when GPT-4 was king, the Gorilla paper showed that Llama 7B could be fine-tuned to outperform GPT-4 on tool calling.

On domains that don’t involve agentic tool calling*, I haven’t found the frontier to have advanced that much.

Edit: I should broaden this to domains that naturally lend themselves to RLVR training. Models are drastically better at math now.


None of this matters in the product: it either is capable of agentic loop workflows or it isn’t. A 10% improvement in probability of single task success makes or breaks the use case.

For me any of the codex models run circles around the non codex models for codex usage.

I'm not sure why you're so obsessed with the non-codex versions


Open source models, especially qwen are pretty dang good. But its not opus 4.6, the evals dont tell the full story. I question the assumption open source models are 3-6 months out.

Its not just about the quality of output, but you also can finetune them to proprietary needs, if the skillsets are their internally, to make them better without governance risks. So being SOTA doesn't matter as much, since generalized tasks are not what matter most to companies, its the specialization relative to business need or internal datasets.

To make an extreme comparison, desktop Linux was originally supposed to happen in 1999.

Maybe I misspoke by saying open source.

The larger point I'm making is I think models are rapidly becoming commoditized. There is probably a small market long term that's willing to pay 10x for 10% marginal gains, but the majority of the buyers in the market will be economic and we're likely to have a lot of folks willing to spend 1/10 the cost for 90% of the performance, and plenty of companies that haven't raised hundreds of billions-trillions who can provide that.

A lot of the frontier labs valuations has been based on an assumption that 1-2 companies would get break-away intelligence that basically made them economic chokepoints indefinitely into the future. The reality that's becoming increasingly clear is that model quality is a pretty linear function of (cash burned - ability to copy other's homework) and the economics are starting to look a lot more like airlines than online advertising.


Lets go one step further.

The economics of airlines are such that they generally earn a return on capital less than cost of capital.

I think this is exactly where we are heading and OAI-Anthropic are the concordes.


Not OP, but it is a known fact that the cumulative profits of the airlines industry (in US) over it's history has been basically 0. We can say that essentially airlines are in business to support other businesses. I believe this is what OP might've been referring to.

If only the AI era was born in ZIRP.

Better now than ZIRP for me - at least people are asking timid questions about the unit economics and how long the runway is _early_ while also spending absolutely insane amounts of money on this bet. During ZIRP, these companies would have turned down any investor asking questions. Less contagion when rates aren't zero hopefully? :grimace:

The size of the AI bubble and the IOUs being passed around like a hot potato already dwarfs the real estate bubble preceding the 2007 crash.

If we still were in the ZIRP era, busting the bubble would certainly kill off the world's economy for good simply due to its size.


You have to think about why open models are behind. Exfiltration is a big part of it. So you could change the Nash equilibrium by increasing your security, or other multilateral approaches.

For give my naiveté, but who pays for the training of these models?

> ...we are already looking at dropping $100k on hardware to run local models...

Just think how much further that $100K would have gone if the hardware market wasn't so screwed-up.

Anecdote: I priced-out adding 1TB of RAM to a four node cluster a couple months ago. The cluster was purchased in fall of 2024 w/ 4 nodes, each with 256GB RAM. The nodes cost just over $14K apiece back in 2024 (entire box, not just the RAM).

Dell wanted >$90K a couple months ago to add 256GB to each node.


> Dell wanted >$90K a couple months ago to add 256GB to each node.

RAM is expensive, but not THAT expensive. I just bought 128Gb for about $5k for our build cluster (it's not even for AI, sigh). Even if you need larger-sized DIMM sticks, it's still going to be in the vicinity of ~15k tops.


It was crazy. I found the part on the open market for a lot less but the edict from the Customer was to buy from Dell to keep the support entitlement intact. That inflated the price to an astronomical level to be sure.

I haven't had problems w/ Dell support and 3rd party memory, personally, but given the machines' application I understood the concern.


I get the impression the hive mind hasn't come to terms with the point that a model is optimised for certain tasks. It's like having someone ask you "is that a good hammer?". Good for what? There are claw hammers, sledgehammers, ball-peen hammers, club hammers, mallets, .... Yes, in a pinch, they can all bang in nails, but you wouldn't choose a dead blow hammer for that if you had a choice.

The Gemini Flash is very good at searches. Just about any low end model can toss out a poem. All the higher end models (open source and otherwise) seem to be able to churn out code that passes tests. The smaller, "less capable" ones are much faster at it, which means in the hands of a skilled practitioner are the best choice for that task. But they rapidly fall apart where there isn't a hard source of truth (like a good test suite) to grind against. Because of that you have to use a bigger model for bug finding. In that task the open source models tend to fail on larger code bases, where something like Opus still shines. I gather Mythos is an absolute monster, and unparalleled, and unavailable. I'm sure one of the reasons for that is it's so expensive to run.

Or to put it another way - you don't use a 100 tonne crane to pick up the shopping. And ... the smaller models will happily run on in-house hardware. You may not do it today because of the current DRAM price and integrated NPUs have just started shipping, but in 5 years time models will be running on your phone.


Yes exactly, we will have specialized models soon. These will be trained with plugin architecture with a core reasoning model asking plugin models to do stuff on its behalf. I don't need chinese or russian knowledge in my workflow.

Yes 100% this. A lot of people keep talking about how OpenAI and Anthropic will need to raise their prices. What is less discussed is how they CAN'T raise their prices because competition exists, and sure it's not SOTA, but it's literally an order of magnitude cheaper in many cases and the drive to figure out how to make it work well enough is going on right now (and will only intensify when the SOTA models raise their price).

It's a given that the SOTA models need to raise their prices. It's also a given that they can't. The more they raise the more customers will move to their competition.

So what happens next? Well I think it will suck horribly if you can't move off of SOTA sooner or later, because the Big Two are going to lose customers, and therefore have to raise prices on the locked in customers even more than these projections suggest.

Beyond that if you're looking to start a business, figure out how to use cheap models in new scenarios. Build software which does that and license it. This is kind of contrary to the idea that you shouldn't over optimize for deficiencies in the models that will likely go away in the next generation - for instance a lot of problems were solved when context windows got way bigger. So it's a thin line to walk but I think it's there because a lot of orgs are using Claude today for pretty basic tasks.

The dev who's addicted to SOTA models honestly is going to have to settle for less or get totally screwed. Most applications within business from what I see aside from complex research do not require SOTA. They summarize, they classify, they transform, and doing that accurately has been cheap for a while.


I'd qualify your point that Anthropic and OpenAI can't raise prices, that is as of right now. Once the industry will go through a phase of consolidation and the bigger players will have some moat around their product, they'll have more pricing power.

Your last point is common sense in my opinion, I agree with it. At the end of the day most employees are (by definition) of average intelligence and most businesses are average in complexity. Thus, it is logical that average tools (AI models) should do the job for most people and most businesses.


On prem AI makes sense for more than just the cost. More control, IP, model improvements you can keep, data privacy to name a few. People will realize that AI is not like compute the moment they get their own knowledge sold back at a premium.

What are the advantages to on-prem for a company that's already in the cloud and trusts it with their IP? That company can just rent GPU instances from the cloud if they want to train/fine-tune their own models and keep avoiding CapEx.

> People will realize that AI is not like compute the moment they get their own knowledge sold back at a premium.

But what if your competitors sell their knowledge to AI companies?

Then you're still screwed.


I don't quite understand, what would 100K buy you?

AFAIK you would get about ~5 concurrent users, with a max context window of ~128K tokens on the larger models.

This wouldn't be good enough for coding -- are you guys thinking of using it for something else?


By my calculations 100k could get you 18 5090's + compute to host them, or 18 96gb Mac mini's. You can get a lot of context window and users out of that setup.

Gigabyte 4x AMD Instinct MI300A rack server (512GB GPU RAM total)

Roughly equivalent to 4x H200's for less than half the price.

Vaguely around 60k tokens per second...


Do you think this will be a trend for larger companies as well?

The decadal move to all-cloud-all-the-time killed off in-house hardware teams while the C-suite chased their OpEx dreams.

It would be interesting if we come full circle on this.


I doubt it. Companies that have moved to the cloud are already trusting the cloud with their IP. You can rent time on a high end Nvidia system from various clouds. OpEx means there's no write down in three/five years as that system goes out of date so it would only make sense if the performance/$ is there, or the company is highly protective of their IP and doesn't trust the cloud, at which point they're not on the cloud anyway.

Agree. You have these tipping points when a model is good enough to do some task. Yes, a better model will further improve your capabilities but the unlock is at a certain intelligence level. We see this also with humans. People with very low intelligence can't learn to read. Once you cross a certain threshold of intelligence you can learn to read. More intelligence doesn't really help you in the task of reading. A person with an IQ of 160 is not substantially better in reading than someone with an IQ of 85. If your IQ is 50, you might not be able to learn to read at all.

Have you considered that a smarter person will understand what they have read better?

Depends on the task and the writing though doesn't it?

There's not that much depth in a lot of 'everyday' writing. For many tasks that means that you don't need to be hyperintelligent - reading a recipe or a shopping list, reading a newspaper article, etc.


I configured a dual DGX Spark cluster, and it's certainly "good enough" for my agentic and coding needs.

what models are you using on that? My experiences with apple hardware have convinced me that it is not really good enough for coding locally.

DeepSeek v4 Flash, various quantised versions of Kimi K2.6, MiniMax 2.7, Qwen 3.5 “full sized, with a dual spark setup you can fit some decent setups on here

My single spark has me running Qwen 3.6 27B and antirez’s specially quantised DeepSeek v4 Flash (which is shockingly impressive)


Kimi K2.6 does not run well on 256GB.

Have you tried it? It would be slow for sure, but the main limitation AIUI would actually be storing the context in RAM - models like Kimi and GLM have high demands there which limit your ability to get meaningful aggregate throughput via large batches.

No need to try really. 1100b weights with 256GB RAM that‘s less than 1.8 bits per weight if you want a little bit of context.

How is that supposed to give good results?


True, I might be thinking of some of the communities four-Spark clusters for it; it’s already int4 right?

Yeah, the default quants are 595GB. Even four Sparks would require a quant lower than 4bit

It isn’t the models, it’s the closed api and the tooling associated with it. It’s driving me crazy how not-talked-about this is.

You can point both Codex and Claude Code at a local model and they'll work just fine. Codex even explicitly supports that as a feature! [1]

With a nice UI on top, for the desktop app too: [2]

[1]: https://developers.openai.com/codex/config-advanced#custom-m...

[2]: https://docs.ollama.com/integrations/codex-app


As in the coding harnesses?

If I could leverage the same closed api VSCode uses, the entire moat is drained.

What you call harnesses I call… bullshit?


Minimax M2.7.

I’m curious: are you spending on beefy developer machines, or some kind of shared local inference server? Would be interested to know more if it’s the latter.

I am aware of at least a handful of companies doing the latter. I don’t work for them and cannot speak to their setup.

> In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.

I was going to say - the models are just going to keep growing at a pace exceeding the pace of hardware pricing/availability

But then I realised that, far more likely, there will be a plateau reached (again) where nobody is seeing gain, and at that point hardware will catch up


I see even smaller companies ask for that. Less about cost, European companies have a deep mistrust of US companies at the moment. I see companies a tenth of your companies size ask about local models.

It might be possible that in a few years someone will be able to engineer a reasonably priced machine to run today's frontier models (hint, your price is an order of magnitude off). However, they won't be able to run the frontier models that will exist in a few years.

That’s exactly where the market is heading and it’s going to have to reckon with this fact

My guess is there’s gonna be some legislation or something “you can’t share anything over this level of complexity” and I think that that’s what a lot of that mythos rattling was all about


> In a few years, there will be hardware capable of running frontier models good enough for most things at accessible prices for even tiny companies.

What makes you so confident about this prediction? Hardware costs haven't exactly been cratering recently.


.> Hardware costs haven't exactly been cratering recently.

No, but local models have been booming in performance/quality improvements. The RAM shortage won't last forever (more supply will come online when if demand doesn't diminish), and then the math would be pretty easy.


> there will be hardware capable of running frontier models

The current frontier? Sure. The frontier then? No - obviously that frontier is going to keep consuming available datacenter compute capacity, which will be better


My much larger company has got people already using various models through Bedrock because the Claude and OpenAI limits are too harsh and it's too expensive.

What about using DeepSeek API? Practically free.

same, but you need more then 100k of hw to run something like kimi k2.6 for a bigger team. on the other hand there is a ds4 flash that you can run on a macbook with 128gb ram. an that one is perfectly usable for a lot of tasks.

https://github.com/antirez/ds4


I think the quote came out to $107k. 4 AMD MI300A's. Around 60k tokens per second, 512GB of GPU memory.

https://www.gigabyte.com/Enterprise/GPU-Server/G383-R80-AAP1


What models? Last I tried different local modals there was a pretty big difference from frontier.

You people are delusional. How many times a day am I going to read this fiction of "good enough in a few years for most things".

There are physical limits to how much you can compress data and how much is needed for a capable model. If by hardware capable for running SOTA you mean a 7 figure investment for a company, than sure. But how come these companies didnt do the same thing for cloud? There's been this option for self hosting infrastructure for a decade but companies don't use it, they pay AWS.


Eh, one question. Where do you intend to buy the hardware if datacenters take over the market?

I was in college in the late 1990s/early 2000s and I distinctly remember an econometrics professor state the following:

"As cable TV and Pay Per View came out, there were studies done about how many movies people would watch if given unlimited access to films. The results were bandied about as proof that we should build out all this infrastructure to support this line of business. When the data was further analyzed by statisticians etc, it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible."

I feel like we are in a similar boat here where some people are assuming:

- EVERYONE is going to be using max tokens

- tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc


>I feel like we are in a similar boat here where some people are assuming: >- EVERYONE is going to be using max tokens >- tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc

I feel like the reverse assumption is being made, that the current model looks like IBM doubling down on Mainframes soon to become cheap enough to deploy everywhere, when the real action is that the costs coming down represents cheaper hardware or more efficient software, and that a big chunk of "cheaper" AI will be eaten by smaller products deployed by individuals. Whatever the Personal Computer of AI looks like is going to be more disruptive than just an API endpoint you can fling tokens at.

We already see this with things like chrome auto installing an LLM.

You cant tell me with complete certainty that theres a moat here for the people spending 1 trillion + on this infra.

>When the data was further analyzed by statisticians etc, it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible.

I also think this applies to people suggesting that companies will sack workers for AI, when the costs of replacing everything someone does in a day is more expensive in terms of tokens (likely even at a reduced price) than just hiring a bloke.


> it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible."

I realized it long ago: one needs output to make meaning. Input can only be the cherry on a cake in one's life. That, actually, makes FIRE or Fat FIRE not so sustainable unless one has other hobbies.


> it turned out that people claimed they were going to watch films 10-12 hours a day, every day of the week. Impossible.

And what happened? How many hours per day/week are people spending watching now?


What people: I'm sure some people are watching 10-12 hours per day - in places like nursing homes or hospitals. I know a reasonable number of people who watch a film nearly every day: 2-3 hours. Most people watch something every few days - often a tuesday night movie night for the family (or something like that). There are some who never watch anything. I don't know what the statistics are on this.

My friends in day care tell me the kids hate "movie day" because movies are all the get at home and they are sick of them - they want to play all day. (but I'm not sure if this is representative of anything other than the types of people who put their kids in that particular daycare)


I've encountered plenty of people who have the TV on all the time when they are at home (and awake). That's from when they get back from work right up til when they go to sleep. So that could easily be five hours. TikTok and YouTube have eaten into that but are much the same thing.

Binge watching is common as is sports.


> they were going to watch films 10-12 hours a day, every day of the week. Impossible.

A lot of these LLM demand scaling scenarios make broad "up and to the right" assumptions about things which in practice have finite limits. Only some percentage of knowledge work benefits from acceleration, optimization or other improvements, and even then the amount of economic gain is capped.


But isn't it wonderful that they did?

It's vaguely disturbing that people "watch" films 10-12 hours a day. Many of them are using it as a radio, for background noise, without really caring what the program is beyond vague genre, tuning in and out without particular regard to the plot… and yet we have all the cost of transmitting high-resolution video point-to-point.

Surely we could just put better stuff on the radio, and accomplish most of the same goals for a far lower price?


My Dad was in the hospital, and just wanted to watch the Pirates play. The TV was filled with apps, some of them free to watch, others demanding a subscription and log in once you selected something.

None of them had the Pirates game.

I was thinking how the transistor radio was a far superior experience for this use case. Just tune to the channel broadcasting the game.


You mean the station that the MLB regulatory captured into not broadcasting when the local team was playing?

Radio has not gone anywhere you know? There is of course podcasts, but for instance Radio France has amazing music services like FIP: https://www.radiofrance.fr/fip

Then there’s NTS, BBC… Ypu can listen to them from online service, but at least in Europe there’s amazing national FM broadcastimg services.

TV is just bad radio with flickerimg lights.


Who has the time to watch films 10-12 hours a day?

I think the comment put forward that as an incorrect assumption that was made prior to the cable build-out.


Which is now an actual way that people use streaming services.

The quote in the original comment assesses the survey responses as "impossible". A good-faith reading of the comment is that the professor was not talking about a handful of respondents.

Nobody is doubting that there are some people who watch films 10–12 hours a day, every day of the week.


https://www.nplusonemag.com/issue-49/essays/casual-viewing/ and its HN comments (https://news.ycombinator.com/item?id=42529756) argue that it's more than just a handful of respondents.

> - EVERYONE is going to be using max tokens

anthropic already hunts down OpenClaw users for using too much on their plan.

I'll give different example: When LED lights started to be more popular, the power usage didn't drop by the amount of power saved

>- tokens will NEVER get cheaper due to improvements in hardware, software, design, market forces etc etc

Well, first, improvements in computing stalled or even rolled back just purely because price of everything compute shot up cos of AI and that will NOT be fixed for a while and ESPECIALLY if AI usage will continue to increase

Second, the token per model might go down in time but better models have more expensive tokens, so we quickly get into spot when:

* price increase in token might not be worth marginal improvement next, better model brings

* more and more models are passing "good enough for the task" threshold so for less and less companies there is any economic sense to pay for the "best" instead of paying deepseek or some other company to run "previous gen" models


The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build. The more of the latter they can take on, the fewer knowledge workers are needed at all. So rather than 5% of every knowledge worker's salary going into tokens, 100% of the knowledge worker's total employment cost goes into tokens and you get a 20x productivity boost as a theoretical minimum across those tasks.

That's the game. There's a view you could take of this that this is just a growing of the pie: with those cost dynamics a lot more "small businesses" get a vast amount of leverage, so the overall economy grows without replacing the knowledge workers. I'm not sure I trust the MBA class to have that view.


>The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build

I would argue that that's been the case for quite some time before AI. As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade with their very high numbers of very talented and highly-compensated engineers? The issue with most big tech companies are leadership, strategy, and product direction. I'm not saying that they don't make any profits, just that they probably aren't "building [the right thing]".

AI for product development and management would be far more impactful than automating rote coding tasks / building React UIs that mirror API structures IMO.


> AI for product development and management would be far more impactful than automating rote coding tasks [...]

Yeah, if this stuff actually worked that well already, OpenAI et al. would just run AI CEOs and engineers. Why get some other company to pay you at all when you can automate every other company out of existence and take all the money they make?

The fact of the matter is that while the tech has some uses, it sure as hell isn't a full scale replacement and you almost always actually have to massage the input into LLMs to get anything decent back out in practice. Some CEOs and managers can learn to do this, of course, and some already are... but that quickly turns into a second full time job. A "programmer" is still needed. The job might change from mostly hand-writing C++/JS/Python to prompt engineering + some manual coding to fix all the stupid fuck-ups that the bots can't solve themselves, but you still need someone to actually prompt the bot.

When that changes, it won't just be engineers losing work; there will be no reason to even have a human CEO any more.


> When that changes, it won't just be engineers losing work; there will be no reason to even have a human CEO any more.

The human race isn’t ready for that world IMHO. The only reason there is a middle class is because people have leverage in the form of their labor. When that becomes worthless … the people who own stuff and make their living from doing so won’t hesitate to get rid of everyone else - whom are now worthless to them.


Humans will revolt and mutiny on the ai ceo so fast its not even funny


Imagine being deluded enough that you, a CEO, can continue to go buy groceries and drink at a cafe relaxing in that future

I don't know, if you've ever tried to build something at companies of that scale you run into incredibly boring problems "what data table do I need for X" and "who is the right person to reach out to for Y" and "they aren't answering me I guess I'll have to escalate"

I don't think there is any shortage of great ideas at these companies, they are just extremely bloated. And I don't think its something like indecision or bad PMs, it's "we have a finite amount of time and resources so we need to be conservative but also not too conservative"

If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick.

It changes the cost/benefit calculus of the entire business. I think you are exactly right in that: PMs/leadership are by their nature orchestration machines. Other roles are as well, but I think PM's are at a particular advantage here in that it will be quite awhile I would expect before core product decisions and creativity can be delegated to an AI, but not quite awhile until virtually everything that they're blocked on (legal approvals, POCs, wire frames, etc etc etc) will become less and less of a blocker


>If you have AI systems that can simply build out POCs in days, backtest on real data, show reliable results and numbers, you get a suite of product options you were never able to get before. If you have coding agents that can speed up implementation, you can build more stuff and choose the things that stick.

I'll also add this: within a large organization, you often need to interact with many different codebases owned by many different teams. Agents have made it much easier to wrangle by having the ability to deploy one to scope out your web of dependencies to learn about what would be needed for feature X, and how that integration can happen.

We've been doing far more away team work simply because it makes things move faster. It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work.

It genuinely is helping things move faster inside large organizations. Or at least, it is for us, particularly since we're getting organizational prioritization to actually build the scaffolding to make those agents more effective at search.


> It's easier to convince a team to sign off/review something than it is to get them to commit to the planning and eventual work.

1000x yes: you have touched on what I think is the single biggest factor here, that is the humongous value of POCs. they are gnarly to build without agents, and so we used to have to get everyone on board so we didn't get screwed in performance reviews, which was monumental task because that means convincing very busy PMs who have a lot on their plate and dont want to take risks on things they don't understand, and now it's like "can we scale this out" and you have a very nicely formatted proposal and POC. It de-risks things very quickly


Pieces of concept and other prototypes have always been cheap (see hackatons). The main issue is that as soon as you’re touching customer data or modifying process they’ve paid you for, you have to be really careful. No one wants to be responsible for an outage that cost the company its biggest customer.

Yes, but at scaled companies, where building a simple POC hooked into real systems is nowhere close to easy. To the point where it means that you might as well just do the full thing. That's where the enterprise spend and the impact is.

Isn’t that a matter of configuration management? Or do you want to alter the real systems as well?

historically it's been a matter of an absolutely horrific amount of Kafka-esque problems.

Say I want to build a feature in a product.

- DS has to do a deep dive (need buy in) to opportunity size and derisk with data. That DS has to work with other DS (people may have left or moved teams) to figure out how to get the right data and figure out what the difference is between 10 different tables that have overlapping but inconsistent data. - Eng has to build up an actual simple demo (need buy in) - Design has to make it not hideous (need buy in) - Legal has to review what you're doing; POCs should involve real data where possible because otherwise no one will trust it, even if its just for user analysis on existing products

This plus about 6 internal system bugs for custom tools that are flaky and who's team has long been re-orged or laid off, 8 people who won't answer you, 2 PTO's for the stakeholders, 6 weekly meetings

no one did POCs, they just had ideas and tried to get PM's to put it on the roadmap so if it fell through at least it was bought into


Legal approvals won’t be in that category.

You still want someone whose ass is on the line if they get it wrong.


Absolutely but you want to package it to them nicely and efficiently. The biggest blocker is legal and everyone else speak two completely different languages and we often don’t know what’s important to flag and legal doesn’t know enough to ask all the right questions. Also, many things can be templated, and in an industry where regulations and precedents change so quickly, agents are at the very least a good tool to flag issues (e.g. we were approved to use data X for Y but now decision Z negates this). The propagation of this information is not very effective now and legal review at tech companies, while absolutely essential, is somehow a worse experience than going to the DMV when it’s crowded.

Yes, that exists at the wider business level. No question. I think what needs to get asked is are we talking about a bottleneck within the business as a whole, or a bottleneck within the scope of the knowledge work in question. Within software delivery there's a very clear shift when it's suddenly trivial to drop a 100kLoC plausible-looking PR into code review within an afternoon. Producing working code with a whole bunch of tests which make a very clear assertion that it does, in fact, work has had (if you're going that way) all the human-scale thinking time taken out of it, down to a rounding error. It still needs to be checked by a human, which was previously assumed to be a comparatively quick task in comparison to producing the thing. At least, it does where I am, and I don't think that's a silly position today at all.

If they can crack that latter review/spec-check/assurance step, checking that what was built was what was demanded of the problem such that we don't have humans in the loop at that step either, then the bottleneck moves again. Then I think it moves to requirements capture and to product development, but that might depend on the industry.


Trusting CodeRabbit for sign-off is "just" a small matter of configuration.

And convincing your org that such a thing is safe to do, which is decidedly not.

> As an example, what innovative amazing world-changing products have Google or Meta launched in the past decade

Kubernetes is at 11 years ago, and is huge enough to be included there. The Google Pixel was just under 10 years ago. So... not nothing haha


Numbers I see put Pixel at less than 5% of iPhone sales. Not nothing, but world-changing? I doubt the world would look significantly different had Google not done Pixel.

>I would argue that that's been the case for quite some time before AI.

I would agree but it's really minimized the building. More and more time is being spent on pre-coding work.


If you really think this you simply have no theory of mind for this stuff. There are tons of immensely successful products in the ad space that both of those companies have launched. They don't need to innovate in the product or technology space (doing so certainly makes a big difference in having more placement for ad real estate), but to suggest there have been no real innovations (specifically engineering specific innovations) related to ad tech would be completely ridiculous to suggest. You don't need to change the world to get rich, just look at wall street where major innovations have been made in the pricing models of fixed income securities.

Second to this are countless other areas that have a major impact on the companies bottom line that are entirely engineering driven, especially at google given they are a cloud provider and have meaningfully grown the workspace business and launched waymo in this time.


Google's internally developed and sometimes even launched plenty of innovative new products in the past decade. Stadia, Fuchsia, federated learning, and the whole transformer architecture that underlies this AI boom are good examples.

The problem is they get killed by some other executive who is afraid of their department looking bad by comparison.

I think this is fairly illustrative of the challenges in AI becoming as impactful as the Internet. The bottleneck is not making things. There are plenty of people who are really good at making things and can easily be 10x or 100x as productive as the average corporate worker. YCombinator was founded on that premise - small teams of founders and early employees could be orders of magnitudes more productive than the 1000s of corporate employees at their competitors.

The bottleneck is on bringing your product to market. If your innovative new product is built within a corporate environment, it'll get killed unless the executive you work under can get a promotion out of it, and you'll be denied all sorts of help with approvals, launch process, PR, marketing, branding, etc. If it's a startup, they'll try to shut you out with exclusive distribution deals, legal threats, lobbying efforts to change the legal environment, PR campaigns, FUD, etc.

The Internet was revolutionary because it let millions of people bring products to market without asking permission. Instead of having to bid for retail shelf space among dozens of entrenched competitors that all had sweetheart deals with the retailer, you could just put up a website and sell it to anyone across the globe. Instead of following hundreds of regulations that governed existing commerce, you could just launch something and sort it out later. AI doesn't really have that property - if anything, it makes things more centralized, with more gatekeepers, and so seems more likely to destroy economic value than add to it.


Google does not follow through in the long run on many of their pet me-too follow projects, however they do not stray away from their core remit making their real customers happy the ones who buy the ads…

Obviously that includes whatever needs to be done to hoover in data from their marks and Meta also does the same thing without fail and both are really good at it. But outside their remit not so much.


What I think is happening is that the scale of thing you can hope to build at a below-corporate scale should radically grow. Corporate environments should suffer for this, being that inefficient.

> YCombinator was founded on that premise - small teams of founders and early employees could be orders of magnitudes more productive than the 1000s of corporate employees at their competitors.

I think this is still true, but the theory is:

1. You don't need YC-type funding to do YC-type business any more; 2. You don't need to scale the business past those small teams any more, you just buy more tokens.

For clarity YC still obviously has a place as an incubator, mentoring, and networking function. I just think that what was previously the inevitable conclusion that you have to hire all the people the second you hit PMF to keep up with scaling the business as you scale sales is no longer inevitable. If you didn't want to go that way before AI, you were a "lifestyle business" and not worth investing in. As more and more knowledge functions get capably implemented by AI, it's the preferred position: humans are vastly more expensive than tokens, so you want them doing the stuff the AI still can't do.

I don't think this necessarily translates to mass unemployment. I think it translates to masses of smaller businesses that are radically more efficient because the handoffs between business functions are tool calls, not emails to someone who doesn't want to help.

> The Internet was revolutionary because it let millions of people bring products to market without asking permission.

Think about it this way: if I am a small business owner but I think it makes sense to do something that previously only a team in a corporate environment could do but is now within the reach of AI, not only can I do it now, but I also don't have to ask anyone for permission! Who wins between the corporation and the small business in that scenario?

> AI doesn't really have that property - if anything, it makes things more centralized, with more gatekeepers, and so seems more likely to destroy economic value than add to it.

I think this will turn out to be backwards. I can see a version of this where the number of things you can do without needing to turn to a gatekeeper for help increases to the extent that the balance completely inverts.

The vast majority of businesses are small, and AI can give them tools which previously required corporate scale to make sense, without the inefficient hand-offs between busy, political humans. Which is also something that the internet did! Getting an advert in front of a national market pre-internet was Hard but sometimes you had to do it because your target market was "all Canadians who buy toothpaste" or whatever and that meant saturation-bombing the physical environment with physical billboard ads, posters, flyers, and so on. So you only did it if you were P&G-scale. Now you, personally, can do it, trivially, for better or worse.


I dunno if the employees were ever really needed for scale. WhatsApp famously had 300M users and 13 employees at the time of acquisition; Instagram was something like 50M users and 55 employees. If you know what you're doing software scales basically infinitely, and the employees are there to make the software just slightly more tailored to specific user populations (and because upward career mobility for managers involves having more headcount). Yeah, building a revenue model takes people, but Valve employs only about 400 people and makes billions, as do various quant hedge funds like DE Shaw or RenTech.

The insta/whatsapp/plentyoffish model works if you're very lucky with both product-market fit and the technical constraints of the product itself. If you have something that technically scales extremely cleanly, it basically sells itself, and it doesn't need feature churn to retain or gain users, you're golden. I do think more businesses could do with checking whether they do in fact have that lottery ticket before hitting the scale button; there aren't that many examples around.

> Valve

Arguably a monopoly. They've got a product that sells itself with very low infra overheads for the income.

> Hedge funds

Very different model. I don't think the same intuitions apply.


Most of the people they have on staff are there to support their real customers, and I don’t mean the marks out in Internet land Google and Meta‘s real customers are the people placing ads and giving them money, most of the staff is dedicated towards servicing them, that again is where most of the money goes to support their real customers.

Google & Meta are illustrative of late-stage capitalism -- it's all about distribution, not innovation. Their job is (mostly) to just acquire the products that have passed the gauntlet, then scale up their monetization through their distribution-focused machine. The same dynamic plays out in virtually every industry (not just tech).

You'll find that most internal "innovation" teams are just lip service. In most cases, the "mothership" will be incapable of reproducing true innovation -- from a statistical perspective, culture perspective (mega corps are anti-scrappy; internal politics), and motivation perspective (startups aren't 9-to-5). It's much easier to have big M&A budgets, a VC arm, and some handwavvy internal innovation group.

Every now and again, you'll get real innovations (Waymo, transistors, GUIs), but even those have a spotty track record of commercialization when created internally.


The one I'd point out for that list is Kodak and the digital camera.

This is the same argument that has been historically made for outsourcing developers. Get 20 more devs for the cost of 1 dev in the US.

I suspect that AI will fail to pan out to the same extent for the same reason why outsourcing hasn't fully panned out (even though every company tries it after getting big enough).

The problems that will come up will be and always have been ongoing maintenance. AI is great at writing new code without a brain behind it, but once you get to the point where you need to refactor code, you start really needing someone with coding experience to guide the AI or veto it's mistakes.

I don't think that's really fixable even with a lot better AI. It's not something that ultimately comes out of the likes of github data.

I'm not saying that AI isn't going to make things better, btw, I just don't think we'll see a 20x improvement. Probably more like 1.5 or 2x.


Outsourcing of knowledge workers didn't work out because at large enough scales, the geographic arbitrage disappeared. Companies mostly always got what they paid for.

The determinant of success was only whether the task needed American-tier labor or could make do with sub-American quality labor.


I am not sure this feels right. I agree that the US currently has leading minds in terms of tech, but I am not sure it is too big of difference with the EU knowledge workers. EU is still a lot cheaper then US in terms of wages you would need to pay.

EU workers themselves get a lot less, but the EU is expensive because of 1) the huge payroll tax (45% in France) and 2) the challenges with hiring and firing mean you are carrying people that aren’t contributing.

EU doesn't have the scale. There are probably more SWEs in one little town in the Bay Area than entire European countries.

Also, many European devs simply move to the US. Immigration is the primary means by which the arbitrage disappears.


Sure is an interesting thought. None of this is sarcasm: why do US companies deal with the time zone differences and language barriers they won’t need to bother with so much by outsourcing to say, Ireland?

The mechanism is often that they'll actually outsource to someone like Accenture, who have teams everywhere, and whose contract managers will try to get their cheapest viable team onto the contract to maximise their margin. If the buyer can't judge the quality of what they're buying, or doesn't know why the resulting hand-offs, delays, mistakes and rework will cost them more than keeping everything in-house ever would have, they're going to have a bad time.

Er, US companies do outsource to Ireland.

Basically every big tech has large offices and employ a lot of people there.

The limitation is that Ireland is a relatively small country, and most Irish developers are already employed (which is why Ireland end up being one of the main destinations for tech workers being hired from abroad).


Ireland isn't that much cheaper than, say, Oklahoma. And the cultural differences with Ireland are not a lot smaller than those with India or the Philippines or what have you, once you try to actually start working together.

(Yes, all the good developers from Oklahoma move out, but the same is true of Ireland)


That's certainly part of it. But the other part that I've heard time and time again is that in order for outsourcing to be successful you basically needed an american engineer in the mix hand holding everything, clarifying requirements, and vetoing bad code.

That part of dev work, the requirements gathering, attention to details, clarifying requirements, is something AI also struggles with. A lot of companies basically waste time and money on outsourced devs because without a clear path forward they effectively will sit and do nothing, waiting for a prompt.


I would not agree on that point. It really depends on company's structure. I mean it also depends with people that makes the team. I would say there are a lot of unknowns but I would certainly not generalize.

How I find your argument is that one distinguished engineer from US could do the same with the use of AI.

I worked with both and I know great and bad engineers from both sides. Only thing is that US has a bigger pool of great engineers.


I think the mechanism here isn't that American engineers are magic. It's that you need that contextual knowledge really close to where the work is actually being done, so that the turnaround for questions, blockages, clarifications, "we've got no work to do", quality level-setting and so on is on the scale of minutes, not time-zones.

It doesn’t have to be an American but it does have to be a direct employee of the company ideally working in the same time zone as management and the people defining the requirements.

Outsourcing of knowledge workers is still ramping up. The issue in the past was the skills were few and far between internationally. Facilities were also not built. That has changed now in a lot of fields. New campuses have been built in places like Bangalore and Hyderabad, even Singapore. The skills are there now, the training is decent, and you can see that the hiring is going on for very skilled titles in these cities. It is a different animal than just 10 years ago in this.

The “American tier” labor of course is distributed across the world and the top performers in every nation find ways to get paid at something approaching American salary levels. Look at all the international FAANG offices paying high salaries, in purchase pricing parity terms.

> I suspect that AI will fail to pan out to the same extent for the same reason why outsourcing hasn't fully panned out

My mental model for that is that outsourcing fails where the work is being done organisationally far from the knowledge needed to do it. We know that's true of teams inside organisations, there's been a lot of research on how distance in the organisational tree negatively impacts productivity. Outsourcing is a pathological worst-case of that.

The promise (promise! We're not there yet!) of AI is that I can have a cross-functional team on my laptop. Organisational distance is zero. Where previously the outsourced team has to wait for the time zones to roll round so I can answer their blocking question when I get to my email STRICTLY AFTER I have had my coffee, now it's a prompt in a chat window with a button I can click to make a choice in 5 seconds. Delay is gone, cost of delay is gone.

> The problems that will come up will be and always have been ongoing maintenance. AI is great at writing new code without a brain behind it, but once you get to the point where you need to refactor code, you start really needing someone with coding experience to guide the AI or veto it's mistakes.

Oh, absolutely. That's a minefield. Today. It will be, right up until it isn't. There are ways to set up agents and projects right now that make a dramatic difference to how this part of the picture plays out, but those will sink into the harnesses as time goes on.

But also the big problem with maintenance and outsourced teams tends to be the commercial structure around the contract. You get a Build team, who Build the Thing and then: no more features for you, anything you want to add past the original spec costs extra. They hand over to the Run And Maintain team, who get to fix all the bugs that the Build team left but without the knowledge gained from building the thing, but are scaled and located to be absolutely as cheap as the supplier can get away with so probably don't have the skill, inclination, motivation, or permission to take on any restructuring to make the bug fixing easier and they're on the wrong end of the globe so there's a 24-hour latency on any queries. It's a terrible way to set teams up, but it looks good on paper.

Again, that's peculiar to outsourcing and completely goes away if I have the same team that built the thing own the thing long-term. That's true if it's humans or AI!

> I don't think that's really fixable even with a lot better AI. It's not something that ultimately comes out of the likes of github data.

No, it's a harness problem. You need to start from a maintainable point and keep standards in place. It'll take work to get the harnesses there and it's not ubiquitous. You might also need better models, but I've already personally seen big differences in outcomes between projects that took certain steps and others that didn't; it's nothing revolutionary, mostly stuff that works for humans also works for AIs but you need to know to ask for it.

> I'm not saying that AI isn't going to make things better, btw, I just don't think we'll see a 20x improvement. Probably more like 1.5 or 2x.

I think people radically underestimate the cost of delay. I don't know if 20x is realistic for the AI itself, but I think it's not impossible once the inefficiencies of having to go to other humans is factored in.


> mostly stuff that works for humans also works for AIs but you need to know to ask for it

I'm most curious about this sentence. What have you noticed about the similarities? I'm getting really good at asking for confidence levels, tests and pushing back, but I'm curious what you found


If you don't start with a clear file structure and code architecture appropriate to your problem domain, the agent won't fill in the gaps and will give you mush that it will eventually fail to navigate effectively. So you need to be explicit and say (e.g.) "this project is structured as a Ports And Adapters project, one class per file. Adapters go in src/adapters, core domain in src/core/app, ports in src/core/ports, entry points in src/main" in AGENTS.md. Picking something like that rather than free-handing the structure also helps humans enormously, that's why those ideas exist. I've found that doing something like that helps the agents to stop the code from creeping into mush over time, and actually gives them a reason to self-correct drift if it happens. I tend to prefer something off the shelf like Hexagonal Architecture because the chances are good that the agent already knows about it and has a lot of background on it, but has no reason to pick it amongst all the alternatives unless prompted.

Tests are non-negotiable. I've found it helps to be very explicit about Red-Green-Refactor loops so you don't get it trying to one-shot more than it should, alongside git pre-commit hooks that won't let failing tests accidentally get committed. Again, this helps humans.


Thanks for that

I'm starting my first coding project with Claude Code and your comment gave me some helpful topics to study


Outsourcing also fails because it’s a pathological case of adverse selection. The businesses that outsource projects are ones who are organisationally incapable of managing those projects well internally. But, that inability extends to their oversight of outsourcing shops as well.

End result is that many outsourcing firms are borderline fraudulent in the way they treat their customers.


Who pays for that value, and from what, if all knowledge workers lose their jobs?

It sounds like the economy would largely reduce to the small minority class of independently wealthy people.


The more time I spend using agent tools the less I worry about knowledge worker job loss.

It takes a skilled knowledge worker to use these things.


Yes, but I do worry about junior knowledge worker job loss. These models are very good (and getting better) at the vast dark matter of "donkey work" that happens in knowledge-based industries -- work typically done by junior devs / analysts / lawyers / consultants, paralegals, admin assistants, customer success / support, etc. -- and those roles comprise the bulk of the workforce.

And worse, these are the tasks that help the junior people eventually grow into the skilled knowledge workers required to operate models, so there's a pipeline problem too.


I do too, but I think it currently has a lot more to do with the quasi-recession we've been in since the end of ZIRP and AI is a better excuse to stop training juniors than telling investors it's belt tightening, just like layoffs.

I'm already seeing tech execs/hiring managers getting very frustrated at the lack of new-senior-engineers to hire. The market will correct for this in time.


Curious if you can share any backing information from your last statement? As a senior engineer (well, that's my job title anyway), I find it encouraging.

This doesn't break it down by experience, and I can't find specific data on that, but the recent spike in demand for engineers + subsequent drop in unemployment this year is well documented [1].

The demand for senior+ engineers has remained steadier through this downturn from my anecdotal observations, with new grads being by far the most negatively affected, but even that seems to both be shifting from talking to people a handful of years younger than me + CS enrollment has already precipitously declined [2] as the narrative that programming is dead because of AI has spread rapidly.

All that leads me to think it's going to be a junk-show over the next decade for people trying to hire as the pipeline was destroyed.

1: https://www.citadelsecurities.com/news-and-insights/2026-glo... 2: https://www.washingtonpost.com/technology/2026/04/13/compute...


We'll get around to training job specific models or the equivalent. Thats just lower on the value chain for now.

Sure. I was challenging the parent on how the “game” they are positing would play out.

See https://news.ycombinator.com/item?id=48300427 for an alternative take. I don't think either direction is inevitable, yet.

To follow on from that comment, if the growth in breadth of capacity of AI leads to a decrease in the risk of running a smaller business, which I don't think is an unreasonable prediction, then it's not inevitable people do lose their jobs. Employers get smaller, higher-leverage, and more plentiful.


There were no knowledge workers in the middle ages.

Back then people were mostly farmers, but we already automated that job away.

Not completely, but compared to the middle ages we 50x'd their output. Which is a great illustration what it means to make a job 50 times more productive. We went from 80-90% of the population being required to barely make enough food for everyone to survive, to 4% of the population producing such an abundance that consuming too much food has become a systemic health issue


At the mere cost of destroying soil, and polluting water and the atmosphere in only 200 years! Possibly this will also play out well, and there is a huge market of... maybe social media influencer economy to pick up those being automated out of their previous work... or rather identity, as actually much like in the middle ages, the modern world also makes the profession largely the identity of the individual.

I'm pretty skeptical on the outcomes and the costs also (natural and social as well), but possibly we can have 50x or even more software in the end! The phrase will be truer than ever:

> Software is eating the world!


Maybe ironically, but software and robotics should allow us to scale regenerative agriculture in a way that doesn't leave everyone in poverty. We already have lasers mounted to trailers doing precise weeding instead of broad herbicide usage.

https://www.agtechmarket.net/news/laserweeding (random web search, I don't vouch for this site, it just looks legit at a glance)

Next innovation could be to scale succession planting, which keeps the ground from being exposed in between crops and lets you transition from nitrogen fixers to users quicker, getting more food out per acre while reducing fertilizer usage. But you can't do that with current harvesters and human labor is too valuable to spend on this.

Also take broccoli harvesting, typically you get a few big heads, then it keeps producing smaller heads, but it's not economical to harvest the smaller heads with human labor. Robotic harvesting lets the same plant produce more food per acre and uses the energy needed for new plants instead to keep producing food.


Masses will be unemployed, due to robots displacing them, but human labor will also be too costly. We won't be able to afford a person shepherding, but we will need to produce "meat" (substitutes) in plants, or in inhumane animal-jail, and we'll need robot-weedkiller lasers to produce the feedstock instead of letting animals graze... and we'll give the food produced this way to people on UBI...

This is where this is going, the whole industrialism is totally self-serving, and for every problem its answer is digging deeper in the rabbit hole, and creating 2 more problems in addition to solving the initial problem only half-way.

I don't want to say what you are suggesting is not possibly useful, I just want to emphasize how stuff works out in reality, in addition to doing some nice stuff like what you called out (the halfway solution to the problems). All we get is more alienation and humans getting depressed and feeling a lack of purpose... but somehow we cannot afford to pay fair prices for the agricultural work, and pay fair prices for the food, and not overproduce and overpollute... and the same thing is happening in every aspect of the human condition, not only food production, which is the most basic and ancient activity we have been doing.


Cattle grazing is helpful for fields left fallow, but succession planting is far superior in so many other ways. You can mix plants which repel particular pests with those susceptible to them (and other beneficial strategies), topsoil is grown instead of depleted, flowers are present for wide range of season so bees naturally thrive with food always available, you don't need a significant generator of greenhouse gas running around (cows), and it gets more vegetables per acre so it would be good if vegetables were cheaper because we don't eat enough of them.

I have done succession planting in my home garden, but it's definitely not worth the time investment for the food alone. But it's real neat to see your aphid problem disappear as the nasturtiums pop up without any pesticides needed. You can even feed the world with it, if most everyone wanted to be farmers... (as opposed to some Organic practices which is the same mass farming but the pesticides are "naturally-derived")


Not all tilled land is suitable for planting, as many are drained swamps. In my home this is a huge problem, as the water circle has been distrupted so much that we are on the way to becoming a desert with climate change compounding on top the the more than a century long draining megaproject nobody is willing to undo due to short term economic interests. Fuck the next generations, I guess...

There are other grazing animals in addition to cows btw, as temperate marshland forest can be grazed by pigs, for example. Also sheep, and goat exist for example.

Vegetable production is nice, but we don't need to work all land. We don't need to produce so much food that we cannot use up meaningfully, just waste or take to places to create overpopulation there also. Reckless industrialism and capitalism are the core barriers to sustainability. (the former includes socialist planned-economical models also, we've seen our part of that also)


out of edit time: suitable for sustainable planting was what I wanted to express

Farming has been destroying soil and polluting water for thousands of years. The Tigris & Euphrates used to be crazy fertile, now it's desert. Yes, the destruction has accelerated but farms now feed 8 billion people.

We need to go faster! We need more people, and more machines, so we can go even faster!

This also leads to a kind of a singularity.


There definitely were what could be considered knowledge workers in the (high) middle ages, it just wasn't the majority of work like today. The knowledge workers then were just a tiny, elite faction, mostly employed by the church or directly by nobility. Kindgoms were still big bureaucracies and needed scribes, theologians, academics, lawyers.

Relatively few anyway. Monks (who wrote and edited books and managed libraries, and also taught), artists and musicians, bookkeepers/treasury/exchequer, scribes/chancery (who were the administrators of the kingdoms), and bankers all existed in European "middle ages". But a significantly smaller part of economy/society compared to "western world" now, yes.

Are you sure? Any functional organization requires keepers to oil the machine. First the government. The best examples were the chinese empire, the catholic church, and the various kingdoms. Or do you think that everyone was either fighting or farming? Stewardship is knowledge work. Bookkeeping is another.

There wasn’t 20x value to pay for in the middle ages either.

> Who pays for that value, and from what, if all knowledge workers lose their jobs?

They do not care unless these companies can get a bailout.

UBI only exists for companies that are too big to fail. Case in point, 2008 and SVB when there was too much money on the line.

One of the AI companies attempted to guarantee themselves a way for the government to bail them out if they were close to defaulting on the debt from the data center build out.


SVB didn't get bailed out, their investors and creditors were wiped out. You could argue depositors were bailed out -- as they took the undue risk of keeping more than $250k in a single bank (though as part of a requirement for getting a loan from SVB, you had to have your operating accounts with them. So lots of companies had no choice, as SVB was one of the few banks that would lend to them).

Arguably, the main impact of securing SVB depositors above the $250k limit is that it prevented thousands of people from being laid off that week, as their employers wouldn't have had the money to make payroll the following Wednesday.


Thank you for saying this. Continuing to point at SVB as a bailout is annoying. They were not bailed out. Anyone with deposits in an accredited bank should be made whole - always. Without trusted banking we have no economy.

> Anyone with deposits in an accredited bank should be made whole - always

Sure, but is that the case now? Is everyone made whole when a bank fails and they have more deposits than the insurance limits? Or only when it's the well-connected / too-big-to-fail?

Looks like the answer is no: https://www.wsj.com/finance/banking/a-small-banks-failure-le...

So I don't think it's unreasonable to describe SVB as a bailout. Not for the investors, but for the depositors. Has anything changed to reduce the moral hazard / make it less likely to recur?


So we all now know that a bailout DID occur with the SVB depositors who had all their money in the bank and most deposits were over the FDIC insurance limit. The FDIC insurance rules somehow didn't apply here because there was too much money at risk. (And too big to fail).

But if there was a bank failure at a regionally smaller bank with a regular customer or startup depositing the same amount of money over the insurance limit, their money is gone.

Just like Intel got a "bailout" from investment as chosen by the US government, AI will eventually have a very similar story.


> Sure, but is that the case now?

Pretty much and has been for awhile.

https://nyulawreview.org/wp-content/uploads/2025/05/100-NYU-...

In early 2023, within the span of two months, the United States experienced three out of the four largest commercial bank failures in U.S. history, as Signature Bank, Silicon Valley Bank, and First Republic Bank all toppled.1 Yet, despite these banks having roughly $300 billion in uninsured deposits at the time of their failures2 and despite the failures costing the Deposit Insurance Fund (DIF) of the Federal Deposit Insurance Corporation (FDIC) an estimated $38 billion, uninsured depositors took no losses in any of the failures.3 While these results were striking, they were far from unusual. Since 2008, uninsured depositors have experienced losses in only 6% of total U.S. bank failures.

...

Formally, the United States caps deposit insurance at $250,000 per account,6 but, in reality, the post-2008 financial system comes close to providing de facto total deposit insurance covering all amounts in all accounts.


> UBI only exists for companies

What's the U stand for in UBI?


Producing a thing has always been cheap since personal computers existed. From mail-order software companies' times to SaaS times, producing a sellable MVP was an initial cost that is relatively small compared to the later cost of expansion and maintenance. Marketing and selling was and still is the hardest part.

Why do you think of knowledge workers as a fungible commodity?

What makes you think the people who used to build (or would have built) software will switch into the industry of "knowing that the thing was the right thing to build", as opposed to something cooler like surgery, city planning or experimental physics? The roles within a tech company are not the only jobs in the world.


> Why do you think of knowledge workers as a fungible commodity?

I don't.

> What makes you think the people who used to build (or would have built) software will switch into the industry of "knowing that the thing was the right thing to build", as opposed to something cooler like surgery, city planning or experimental physics?

Because it's probably already part of the job. It's a change of emphasis, not a change of career. Your boss can already ask you to do it. If you're producing code, you're probably also reviewing code, checking it matches the acceptance criteria, testing it, sanity checking that it was the right code to have been written, today.


> The bottleneck has moved from producing a thing that works to knowing that the thing was the right thing to build

“There’s more capital than good ideas to fund” has been a complaint from the likes of A16z & other VCs for a long time now. It’s why we ended up with stuff like NFTs getting funded.


If knowledge workers get laid off in mass, you can expect political curbs on AI adoption.

That’s very unimpressive return on investment compared to what was promised.

I have nothing riding on any specific multiple. Choose your own adventure there.

YEPPP... and I'm kind of shocked at how many people can't do simple math.

Let's put it context. Google's annual revenue seems to be north of $400B. So if OpenAI suddenly had Google's revenue, it would still be insufficient to recover their investment.

and it's a ticking time bomb because $1T in servers, CPUs, GPUs and memory is going to be worth $200B in 5 years. You can say they can keep using what they've got. Sure. But they're also not going to stop spending on new hardware. And the competitor that comes along in 5 years and spends $1T doing the exact same thing is going to have a huge advantage.

OpenAI at this point reminds me very much of the Russ Henneman pre-money hype cycle.


It's actually worse than that. It's not just financial depreciation or that the existing hardware becomes obsolete due to being less powerful than new hardware but also that hardware being run all the time at high load actually has a limited lifetime of a few years so it will physically break...

I agree but it's even worse than that.

Data centers come down to performance-per-Watt. Electricity accounts for 20-30% of a data center's operating cost [1]. I don't know the exact breakdown but the GPU part of that is probably the majority given how power hungry GPUs are. The B200 is upwards of 1200 Watts [2]. The B200 is rated at ~4.5PFLOPS of dense FP8. So you're getting 3.75PFLOPS/W. We don't know what the next generation will look like. The A200 (Hopper architecture card that preceded the B200) had ~4PFLOPS apparently but also lower power consumption. Obviously this changes depending on whether you're looking at dense or spare and FP8 vs INT8 vs INT4 vs FP4, etc so we're just using FP8 as a yardstick.

Imagine a fictional B200 successor, the T200 that has 8PFLOPS of dense FP8 at 1000 Watts. Well then a DC built on that where the T200 will likely cost similar to what the B200 does now, you'll get nearly double PPW so the same size DC and same electricity load is going to be like 2 of your old DCs in operating costs. That's a big deal when you've laid out a trillion dollars.

[1]: https://iaeimagazine.org/electrical-fundamentals/how-much-el...

[2]: https://www.trgdatacenters.com/resource/h200-power-consumpti...


How could extremely capable artificial brains ever pay for themselves?

Prices are not going to stay where they are.

You have either never seen a tech cycle, or need to be reminded of that. The pressure to buy more expensive plans is already starting to form.


This should be the top comment. Also, I think its not that many people, including our Simon here, are not good at math. Its more like, some of them seem to be incentivised to not be cough, cough, "good at math". How else will the hype sell?

I thought my post was pretty free of hype. I said that this new revenue "Maybe even enough to start covering their costs!"

that statement is pretty high on hype relative to the actual financials though

See what you get for saying things with subtlety instead of hype these days... sigh.

Well, your title certainly was not, in any case!

I mean, a company that loses money on every widget they sell might technically have found "product-market fit." :)

It seems quite possible to me that developer tooling is going to end up being the biggest win from LLMs because there is a product-market fit -- and also quite possible that OpenAI and/or Anthropic end up getting bought for pennies on the dollar because their burn rate is unsustainable. AI may end up being this generation's "dark fiber."


My title was actually intended as a subtle burn at those companies.

What are you doing demanding a trillion dollar valuation for your IPO if you haven't even achieved product-market fit yet!


At a certain point, I genuinely feel like the best way this hype is being sold is by making people genuinely believe in it.

and in that sense, if Anthropic and OpenAI are able to create the projection that they can-be profitable despite finances seeming bubbly at best, I think that what happens is that these companies spew so much amount of content that people like Simon get into it too.

There is a deeper problem of people falling into AI psychosis too, in general, I am not sure if Simon has fallen into it or not

I think that the greatest point which can be made here is to not offload your thinking to others and to think about the situation yourself. Sounds familiar (looks like we are all off-loading our thinking itself to machines)

Side-note: As humans, we have a tendency to quickly judge or make quick decisions which stems from our times foraging and scavenging in jungles.

Another Side-note: at a certain point, I am unsure of how much to think about AI or not, certainly discussions about it that were happening 2 years ago weren't helpful in contexts that they are used now (well not in any way or form that a person discussing and getting into the weeds of AI 2 years ago is better than a person just getting into it say 2-3 months ago)

With the industry (moving so fast) [but that doesn't mean that you can't catch up with it, I feel like the fast word has made people think that they are falling behind which is imo wrong i suppose]*, It is basically unsure to me of any FOMO or anything if you aren't using AI already, I find this notion naive.

People might be making strong opinions (AI psychosis) and skills on the tools available at the moment the same done 2 years ago. We don't quite know about the tech as these are still black-boxes and how they progress and what these "AI skills" might survive or not in future. Heck, we aren't even sure if these tools might survive or not or wouldn't be made magnitudes more expensive simply to break even as they are given to us for the first time at percentages of the price.

I don't know if I should form (strong) opinions yet and also a question of its worth so much thinking efforts in the first place, probably just gonna do my own thing (the way I want to) which includes learning C at the moment. because learning is fun.


I didn't exactly say that they were about to become wildly successful companies. I suggested that they had "found product-market fit" - not too impressive for more than a decade of work - and that their revenue may even be "enough to start covering their costs".

Firstly thanks for responding and I wish you to have a nice day. your suggestions have value and I appreciate you writing the article. Perhaps enterprise businesses do end up becoming the fat and meat of the AI industry.

My question which I wish to ask: What would happen to these AI companies if they turn out to be anything but wildly successful companies, both to the investors who have already invested in it and to those who might be investing indirectly into it in the near-future (passive investors, retirement funds)

I would love to hear your thoughts on it!

Thanks and have a nice day :-D


> What would happen to these AI companies if they turn out to be anything but wildly successful companies

I'm not nearly enough of an economist / finance person to answer that credibly, but I expect they'll go bust, and a lot of people will lose their shirts.

... and the model weights will be sold to other companies who will then run them at a profit, and eventually figure out an economically sustainable way to train new ones.

The 1800s railway booms are a good comparison here - a lot of companies went bust, a lot of investors lost money, and we still ended up with railways.

If the AI companies all go bust we're going to have a lot of spare data center capacity!


> If the AI companies all go bust we're going to have a lot of spare data center capacity!

I can be wrong I usually am but an AI DC != compute DC or that it might decrease the prices of servers substantially because of it. (well not exactly, I hope you read my whole message so that I am able to better explain what I am saying.). AI DC's try to optimize for one thing: running GPU's for immense scalability and flexibility (0 to numbers>=large_number).

Currently, its actually way worse, the server providers are some of the worst impacted by the industry at the moment because each server requires ram and ram is well... increasing in its price exponentially. It's really a tough time to be a provider at this time (in certain respects) directly because of AI.

It is unclear to me if spare DC capacity will have any meaningful impact to it. I don't think that atleast within compute (and not GPU/AI DC), that space was too large of a problem.

Fun fact but one of the largest providers (BuyVM) had its datacenter price from where they colo'd increase because of the immense demand at the moment for spots in datacenters by many tens of thousands of dollars that they did the first price hike in at this point at decades! The situation is this dire :-(

Ram prices might come falling down and DC's might get cheaper but they can only get cheaper to limit, they still need to for example DC security employees

and I wish to suggest that if anything, investors might wish to re-coup their losses within the AI loss, they might want to make up with what little they might have (ahem DC)

For example, if you wish to want to take at an even more egregious example of what I am suggesting, there are many new york LLC's who would much rather leave the properties that they own empty rather than decreasing the price of what it costs (which they have set to some egregious amounts). I think that for them, somehow the math ends up working out in the end somehow, so there might be something more to it.

I wish I was optimist but I don't believe that the gains in spare data center capacity are worth even a fraction of fraction of the damage if AI were to go bust as you suggested with trillions of dollars vanished.

So, with the data I have at the moment, I am unable to suggest that compute would be cheaper. Heck, it was cheaper before AI and compute prices have never been something that people worry about because there are sometimes 10x cheaper options than AWS,GCP,Azure with things like Hetzner/OVH and others (yes its not a 1:1 situation but still its a 95% overlap and for all intents and purposes, great)

I can see a potential where GPU compute can get cheaper, oh boy, its so much more expensive than compute but I feel like GPU's aside from AI might still have a much more limited niche than generic CPU.

The issue wasn't ever the pricing. Simon, I own 7$/yr vps's which run my websites fine because they are written in golang. I doubt it can get cheaper than it. (You can get a 3$/yr vps if that is what you are interested with using Nat VPS + cf tunnels)

I would once again appreciate to hear your thoughts on it. The only thing I realistically see is if Ram producers ramp up their productions and create a ram price glut in the next few years, but imo the prices would even out over the long term.

I have seen the point of spare DC capacity being raised up multiple times but I finally ended up writing a message which hopefully captures the nuance, but once again, I don't know the future about it.

Waiting for your reply and have a nice day Simon (& other readers) and thanks for reading if you did, I appreciate it :-D


I think we are in agreement that if the bubble bursts a lot of people will lose a lot of money. I don't have a strong opinion on the data centers, my main point is that I don't think AI "just goes away" if the bubble bursts, which seems to be something that a lot of people assume.

Yes, I think we are in agreement too.

> my main point is that I don't think AI "just goes away" if the bubble bursts, which seems to be something that a lot of people assume.

i agree with your main point because the cat is out of the box with open source models and others in general. I don't particularly know the extent of what they would be used for.

The technology is still novel so people are trying out too many things with AI, I don't particularly like it being spearheaded into each and every thing but perhaps we are just in experimentation face and seeing what sticks and doesn't. Either way, I disagree with when people treat it more than a tool or is "the tool"

I do also think that the "cat is out of the box" and you are right that it isn't going to go away, particularly with open source models.

I think that there are some use cases where AI might make sense (prototyping or building things for yourself when all you want is the end result or thing which would be too time-consuming/complex to be built and thus wouldn't be worth making in first place for its use-cases)

So overall, yes I think that we are overall in an agreement. I do still believe though that learning the strong foundations.

But all in all I agree with you yes that AI might not be going anywhere, it can certainly have its benefits and hopefully the world uses it in beneficial way rather than negatively, glad we could come to an agreement. Have a nice day Simon.


I will also tell you, as someone who works at a company that's trying to remain profitable, that token spend has caught the eyes of finance and much like cloud spend they've already started applying pressure to control costs. This May my team is protected to use 30% fewer tokens than we did in April - this was by intention. I suspect we'll drop more in June.

I expect in the future, when these AI companies stop subsidizing costs, the idea of spinning up 20 agents to work on some brain fart idea that you throw out after looking closer will come to an end. It'll be seen like assigning developers on work that hasn't been properly planned for or reviewed.

Can't wait till June, when finance gives the team the choice: everyone gets double tokens if you choose to fire somebody.

Oh we already had that with a RIF earlier in the year.

It might be time to start interacting with agents using grug speak only


> They've got, ballpark, $5t to $10t

What are you basing this on? For reference, Anthropic raised ~$70 billion in total and OpenAI ~$190 billion. Why do they need to make 20-40x that?


All the planned infrastructure commitments. At least for OpenAI I think they're supposed to spend $300+bn in the next few years.

I still don't understand why that means they need to make 5-10 trillion over the next 5 years.

I think the original argument is too limited in its scope. The wider AI market, which is primarily fueled by OpenAI, Anthropic, Google and the large frontier labs (are there any other in the West, except for these 3?) is spending how much... $900bn this year in DC buildouts? After the spent $500bn last year and they're probably planning to spend just as much the next few years if things go remotely their way.

So yeah, I wouldn't be shocked that in the 2023 - 2033 timespan total AI investment worldwide will be around $5tn, maybe even going towards $10tn.

All that money will have to be repaid, and it will have to be repaid 10x, otherwise heads will roll.

The enshittification we've seen so far is nothing compared to what's coming.


OK so let's say build out this year is $900b. Depreciated over, IDK, 6 years (mix of 3 year GPUs and 20 year buildings)? That's $150B a year, but you want the investment to be profitable, so let's call it $250B a year.

That seems... Pretty reasonable? Like Anthropic is at $45B annual revenue, let's say they enter next year with $100B annual revenue? Let's say they have 30% gross margins (no idea), so $70B goes to data center owners/operators. That's one company doing roughly 1/3rd of what's required to pay the investments off. And you have Ant+OAI+GDM+Internal AI at GOOGL/META/etc.+all the servers for open models.

I'm sure there's a world where you can paint a picture that requires $5-10T but that would require capex increasing significantly NEXT year. And the cloud companies won't do that unless revenue keeps growing.


> Let's say they have 30% gross margins (no idea)

There's a ton of speculation that all major labs are losing tons of money, so I that 30% gross margin sounds more like -30% to -130% gross margin.


> They've got, ballpark, $5t to $10t to make back in the next 5 years

OpenAI's spending commitment is in the ~1T range for the next 5 years, and Anthropic is ~300B.

If they continue to show strong growth, they likely need to be at 100-300B in revenue/yr to support their yearly payments + financing, not 1T.


I thought Anthropic and OpenAI's combined CapEx has been <100B?

source: https://isaiprofitable.com/


That site needs Apple on the list. ;-)

Why? All their money is going to Apple Silicon and the five ecosystems, so far in Apples entire history, the largest acquisition has only been $3 billion dollars, OpenAI is currently getting nothing and they gave Google a measly $1 billion refund per year for the use of Gemini.

If John Ternus wants to spend some money, spend it on bringing memory in house. Apple has the money and the engineering talent to do so, have it fab/made onshore in partnership with TSMC.

Do it Apple because you have to not because you want to the Chinese probably will be taking over the memory industry, worldwide, by taking advantage of the greed from three memory companies and their AI overlords.


That's the point. To show how they _haven't_ lost billions on this.

Maybe so far, but they've committed to well over a trillion in future capex.

And there's the indirect capex that their revenues will pay for indirectly, like in the case of oracle

Here's the question - does that future spending already appear on partners' balance sheets

> That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing.

We all have our own observations and mine don’t significantly diverge. But that’s bottom up. At this point shouldn’t we be seeing it top down?

If we are beyond potential and into significant productivity gains, why isn’t that showing up for the customers?

Why didn’t delta airlines get significantly more operationally efficient in the last 3 months due to the introduction of better software?

This is a genuine question, I am seeing a disconnect.


Anecdotally, my take on this is that biggest value lever is strategy and alignment, not implementation. The typical company is dozens of little vectors pointed in different directions, and they cancel each other out. Scaling up the magnitude of each is still net zero.

I was recently consulting at org where two separate engineering teams were all in on two different, incompatible deployment platforms and using AI to accelerate adoption of each.

Management was mystified why their engineering leads kept telling them they couldn’t deploy a complete implementation of their solution.


> Why didn’t delta airlines get significantly more operationally efficient in the last 3 months due to the introduction of better software?

The coding agents got good in November. Most individual engineers didn't fully clock this until January/February. This means that companies didn't really figure it out until March/April.

Assuming companies like Delta have adopted coding agents (which would be pretty fast) it still takes months from adopting a new tool to the code results of that tool rolling out to production.

I expect (and would hope) Delta's software development culture is very conservative. Since nobody can confidently tell Delta "here are proven practices for using this tech to produce high quality, more secure code" yet it would be surprising if they were blasting full-steam ahead.

I expect that even companies that got on board with coding agents in January will only just be starting to ship user-facing features that benefited from those new tools. Shipping software takes a long time, no matter how much faster the "typing the code in" bit gets!


>The coding agents got good in November.

Maybe irrelevant to your point, but I'd argue they were really good already in May if one used the right workflow (planning etc.). They've become better, but they're not saving me significantly more time now than they did 12 months ago.


> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens.

They are assuming ~10% global GDP growth instead of ~3%. You probably don't need the same %s if the pie grows a ton.

I'm highly skeptical we get that growth, but if you aren't, it makes it easier to digest.


I mean this case with AI-productivity fires itself back when we talk about GDP.

The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently.

A third effect also comes into play that once all this starts to happen, common people, who are generally living paycheck to paycheck, will now start to hesitate towards making any long term investment, housing included. And that indirectly will end up impacting financial and banking sector, which will then impact existing savings, bonds yields and retirement funds, and the recession-like cycle starts.

This productivity increase only makes sense if it is capped to a very small number.. like 20% max. Beyond that, who these companies will even be selling to?

Am I overthinking all this?


> The more AI causes productivity increases, the less and less number of workers will be needed.

That only holds if companies have a fixed need for "productivity" which is met by their current employees, such that their employees becoming more productive means they need less of them.

Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

But generally yes, the biggest open question about all of this is how the impact will play out on the economy, job opportunities etc. I've not seen anyone come close to a confident prediction about how this will play out.


> Every company I've ever worked for has wanted to achieve way more than they are able to get done with current resources.

I mean sure. Every company wants an infinite addressable market. But that doesn't mean it exists.

It might not be possible to sell 10x the software we sell today. It might not even be possible to sell 2x


It's hard to imagine how making insurance sales cheaper for the brokers, churning out astrology apps faster, AI boyfriend bots or running ad campaigns with fewer and lower paid designers is going to drive 10% GDP growth in developed and middle income countries, that's the sort of figures you see when very poor countries finish rolling out electrification, sanitation and transportation.

>The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

>Net effect of this productivity increase: less consumption by the masses, even though you may be producing more good and much more efficiently.

Big tech companies can't even create login flows and account recovery flows that work for everyone yet. There are countless stories of folks losing access to business Instagram accounts that get hacked, Google support from a human to fix a problem that is outside of their help articles is non-existent, etc etc. There's still so much "low-hanging fruit" IMO that isn't particularly fun or exciting to fix, but ask your average non-tech friend or family member what they think of the Facebook + Instagram security settings pages / sites / desktop-only settings.

Who is going to pay for all of these subscriptions that will power this GDP increase when average purchasing power of those outside of the top ~10% of earners is decreasing YoY? We're headed toward food and water shortages next to sprawling datacenters, not shared societal prosperity and a healthy middle class.


First of all, common people are not living paycheck to paycheck in the sense that they're at risk of not having money[0]. This is corporate content marketing that has entered the collective memory of people, not anything close to reality.

Secondarily, reducing the cost of making a thing doesn't always mean you get less of a thing. For me, certainly, what happened is that I write way more software than I originally did. When we built compilers, the amount of human engineering effort required to do things plunged, but the amount of software engineering jobs didn't go down.

This is as bad as models will ever be. That part is true. And it's entirely possible we go foom. But it's also possible we don't, and then it depends on where the asymptote lands.

0: https://www.slowboring.com/p/this-economic-myth-needs-to-go-...


Respectfully, that is truly ignorant. The vast majority of humans do not have any savings and would be in big trouble if regular income ceased. No paycheck no food. It’s wage slavery and it’s pervasive.

>Am I overthinking all this?

Nope, if AI were to realise the hype, you have to take into account macroeconomics. Usually this isn't a problem for most businesses

>The more AI causes productivity increases, the less and less number of workers will be needed. This will heat up the job market even more and bring salaries down.

People also underestimate that the reason why companies are so excited about AI isn't to increase productivity, its to fire workers and crack down on worker rights. They won't lay people off because AI means they don't need as many people to get the job done, they'll fire everyone while doing a much shittier job, because they hate having to abide by worker's rights and pay people


> The more AI causes productivity increases, the less and less number of workers will be needed.

Why does this have to be the case with AI but it didn't have to be (and wasn't) the case with the steam engine, electricity, the automobile, or the computer & internet?

Certainly, AI could be different.

It's curious to me why the vast majority of people on here think it must be different.


Because the previous revolutions only automated a small subset of jobs, it didn't automate manual work.

Some people take the view that AI could make knowledge work largely irrelevant. Any niche humans could carve for themselves would only live long enough to generate training data for the AI to automate.


> The more AI causes productivity increases, the less and less number of workers will be needed

This might not necessarily be true. Increased efficiency creates induced demand to the point where more workers are needed. Because the new capabilities unlock more value to extract and the economy rushes in to get it. The steam engine is a huge example of this

I dont exactly know what new value genAI will unlock but i think its more likely than not


And yet the job everyone loves to hate, the humble "burger flipper", continues to resist automation yet command minimum wage labor rates. This future of either being a CEO of a company consisting primarily of AI agents building some monthly subscription-based solution to some trivial digital chores OR manual labor that isn't [yet] fiscally viable to automate seems quite bleak. We'd also need a ton of robot technicians and manufacturing that the US has neither the educational and training institutions to support nor the will of the population to fill. Given the ongoing war on immigration, visas, and foreign-made hardware, if this continues, good luck.

This would be a Bladerunner future Pope Leo XIV warned against (https://news.ycombinator.com/item?id=48265206), though in different words.

1. Global IT spend is $6T per year

2. Where does this $5T number come from? If they make $4T in revenue over the next 5 years instead, what happens?


Hey, I wrote this down one time. I estimated way higher yearly revenue required, to be adversarial. And you can keep the "cost per unit AI work" a parameter and play with the results.

But the point is that if people are willing to delegate part of their salary (e.g., buy consumer products), vs requiring employers to pay for the tokens, then it's quite possibly a net win. Something like "I pay a largeish fee every month to make my own job much easier", similarly to how we buy a car to make commuting easier.

https://jodavaho.io/posts/ai-jobpocolypse.html


The fuck is going on with HN that a comment making up completely fantastical numbers is on top?

Perhaps it's not only CEOs who are delusional.

You are making the assumption that the models are only used / paid for by 2.5% of the population (your knowledge workers value). There will be new value created by these models which people are happy to pay for which simply did not exist at all before. It is also naive to say that the hyperscalers are going to be expecting a return on this in 5 years, it will be entirely propped up by investments / IPOs as has been the case with any tech company for decades now to reach scale. The hyperscalers are currently spending ~650b combined annually, which they have the cash for and can sell in future compute instantly.

I'm sorry, what the feck does "value creation" mean here? I live in a place where people are so, insanely squeezed from every angle. Wages are stagnant, prices rocketing. Where is the money to pay for this value going to come from?

No one I know feels richer than they did a decade back. I've not been able to meaningfully put up my prices for a decade. People are tired and stressed and scared, particularly scared of a technology everyone keeps telling them will make them redundant.

There is no rising tide lifting all boats, just most of us drowning whilst a few whizz past in their yachts.

I honestly hope these guys faceplant ASAP. Couldn't happen to a nicer bunch of people.


Feelings aren’t fact. A lot of data shows the doomerism is not reflected in the actual numbers and much of it has to do with rapid inflation and continued vibes.

Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike. Most social mobility has been the middle class moving into the upper middle class, not moving to the lower class.

The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses.

Value creation is growth. If it didn’t exist the S&P would still be 42.55$.


> Consumption has risen, inflation adjusted wages have risen for blue collar and white collar alike.

My wages haven't risen for nearly 5 years, while inflation has occurred over the past 5 years. Why the blanket statements?

> The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses.

Are you suggesting a "housing crisis," in your words, wouldn't impact consumption? I'm watching my spending (and living like a child in his parent's house, except it's not my parent and I have to pay for it) in the hopes that in about a decade, I'll have saved up enough of a down payment for a home somewhere in my state that I could actually afford the mortgage on the remaining amount. There are plenty of things I'd potentially spend money on but won't as long as I feel like I'm economically stuck and have a chance in hell of saving my way out of it. So this feeling translates to fact.

If you think my personal experience is just an anecdote and doesn't count because it's not being told through the lens of large-scale numbers, fine. But I really agree with the person you replied to that you're gonna have to be a whole lot more specific than "value creation" if you want people to spend money on your AI products "in this economy," whether it's because they're actually strapped for cash or just pretending like you seem to think they are.


> The main thing holding people back is the housing crisis. This is orthogonal to the value creation of businesses.

This feels wholly at odds with saying most social mobility is upwards. So most of the social movement is into a class where a home and vacations are a given, but we also have a growing class of people who can't afford a home? Per BLS, average real wages are down 0.3% YoY https://www.bls.gov/news.release/realer.nr0.htm .

> Value creation is growth. If it didn’t exist the S&P would still be 42.55$.

This reductively assumes "value creation" is the only effect on the S&P pricing. You'll note a ton of graphs correlate with it, e.g. https://tradingeconomics.com/united-states/inflation-cpi is the US inflation rate, which also tracks the S&P pricing. Ie if a company is worth $100 a year ago and inflation was 4%, I'd expect to pay $104 for their stock with 0 value creation whatsoever.


Home ownership is not the definition of economic class.

The s&p has vastly outstripped inflation, this isn’t even an argument. It’s a very bizarre and uninformed opinion to say “inflation is correlated with s&p value”.

In economics, you deduct the inflation rate from growth to get the real rate of return.

I wonder why so many people with such little understanding of financial markets make comments like these.

https://www.multpl.com/inflation-adjusted-s-p-500


> Home ownership is not the definition of economic class.

It was at one point, so if you're saying it doesn't now then your "social movement" is really just goalpost moving. I'm sure the upper class does grow if you just declare that the boundaries of it are now lower. We're all millionaires if we just redefine millionaire to mean "not currently overdrafted".

> In economics, you deduct the inflation rate from growth to get the real rate of return.

Which neglects the impact of inflation on the principal. If I put $100k into the S&P in 2016 I'd have 303k now (303% value). Cumulative inflation during that period was 38.8%, so your metric says I'm at 164% rate of return. $303k today is only worth $218k in 2016 dollars though, so I'm really only up 118% in absolute value.

That's also only using the CPI, and neglecting asset inflation which impacts stock prices outside of actual value creation. More money floating around means more gets parked in index funds, regardless of whether the company is actually doing anything better.


The upper decile of income earners account for more than half of all consumption in the US. Household balance sheets and wealth have never looked stronger, again when you account for all the appreciated stocks and properties owned by the upper quartile. True incomes for the lowest decile rose significantly for the first time since 1970 in 2022 and then sort of stayed flat again. Sure, statistically significantly, not "significantly" as in personally meaningful after figuring in rising consumer costs. There is a narrative where you can see all this as hugely positive but this is also largely a "vibes" based narrative. I don't know why you'd expect most people to care about what the "vibes" are like for the best off in society, that's a bit removed from their daily concerns.

> Feelings aren’t fact... much of it has to do with rapid inflation and "continued vibes".

Oh the lost irony.


Is it ironic? Or did you just read the comment incorrectly?

Sounds like internet sentiment and not research data.

It's kind of become socially taboo to not be suffering "in this economy", but on paper it's hard to see weakness in places that there isn't always weakness. As long as the 65-95% are doing well, there isn't going to be a collapse.


The most recent U Michigan 'Survey of Consumer Sentiment', which is THE authorative source in the US, shows consumer sentiment at the lowest levels since the survey started in 1977

From the U Michigan page: https://www.sca.isr.umich.edu/

or from the FED https://fred.stlouisfed.org/series/UMCSENT


Sentiment is just vibes though.

Everyone hates Walmart but they still do gangbusters numbers.

So people may be hating, but they sure as hell are also still spending

https://fred.stlouisfed.org/series/PCEC96


A literal example is that I can use AI to file my taxes instead of spending a weekend and hundreds of dollars to have an accountant do it for me. It costs me like $5. that 245$ delta is the value of that output to me, as long as I am confident it is correct.

Seems to be a thing in the US to need specialised software, an accountant or AI to file taxes.

In most of Europe individuals at least don't need any of that. I'm in France and it's just a connection to a government run website to enter a few figures, takes less than an hour most of it is already pre-entered (salary etc), the main thing to add manually is charitable donations.

If you're running a business then yes an accountant can be good (or be required depending on the legal form of the business) but not for individuals.


In the US you don't need software or an accountant either. People have been convinced it is hard, but the reality is taxes are basic math for most people. It isn't hard to copy the few figures from a couple forms and add them up, the look things up in a chart. It is tedious, but not hard. That said, I use the software anyway because once in a while I make a human mistake and those are annoying when the IRS catches latter.

There are a few people who have a complex situation who need an accountant. Most do not, but since they have never looked at what is really involved they don't realize what is going on.


Taxes are one of those things that seem difficult and people reach for tooling or expertise without trying initially without, but are pretty easy to do yourself just filling out the forms.

It really depends your situation. If you’re entrepreneurial, self employed, others things become messy fast, and you cannot just know what you need from filling forms

Luckily IRS makes this information available for you:

https://www.irs.gov/businesses/small-businesses-self-employe...


Part of the value of paying an accountant is that you can get representation in case you are audited. Though I guess you did say you were confident it is correct.

This also sounds like rich people problems. Vast majority of people with a W-2 take the standard deduction.

I think that to sum things up, we will have to wait until we can evaluate the cost of the mistakes. You could be lucky but you could also end up with a very negative output value in the longer time frame.

I did my taxes this year too with 5.5 and 3.1

Otherwise normally costs around $800 to do, because I have a small business too.


> as long as I am confident it is correct

Are you? Does it cost you extra (time or money) to be?


Yes, and they were accepted. A year or two ago I would have been less confident but now almost UX is happy to cite sources.

Not speaking to the wisdom of filing taxes using LLMs, but just FYI (assuming US here) taxes being accepted doesn't mean they were correct. It just means the IRS hasn't found anything major wrong (e.g. SSN used on multiple returns). Even being approved isn't a guarantee, an audit could come later.

Even if an audit never comes they could be incorrect.

Audits often find incorrect in favor of the person audited as well. If you are not audited that is a bad thing as you have no idea if things are wrong, and it could be costing you a lot of they are wrong.

For sure.

Thats the thing; the "increase in productivity" isn't being felt by the general public, the end user. If your "increase in productivity" just means more money being shifted around at the corporate level then it is meaningless.

> There will be new value created by these models which people are happy to pay for which simply did not exist at all before.

True, but I think the GP's point was that what consumers will pay won't be nearly as profitable as what enterprises will pay to increase the output of their developers and knowledge workers. ChatGPT is currently the overwhelming leader in consumer AI usage but only ~5% pay $20/mo.

As a recently retired serial tech founder, I'm now one of those consumers. I use AI webchat daily for general search, Q&A and even to write little automation scripts for myself, yet I haven't paid anyone anything for AI yet. Even after being heavily restricted and performance nerfed to hell in recent months, free webchat AI is still fine for everything I do, and I'm not remotely price sensitive.

Even as AI compute costs fall over time, I doubt serving ads against AI webchat to consumers will generate the kind of high-margin, sustainable growth VCs get excited about. It's so undifferentiated I bounce around between all four leading providers because there's virtually no moat locking casual consumers to any chatbot beyond a single question thread. I guess if it had a nearly infinite context window seamlessly integrated across all sessions, that might be somewhat sticky for some consumers but it could also get creepy for some others - and it would devour gobs of the scarcest resource in AI. Beyond Maslow's Hierarchy of Needs, the mobile phone is the largest revenue, long-term mass consumer product ever but I just got a new flagship phone from a top-tier provider for $30/mo over 3 yrs. IMHO, even an all-you-can-eat, infinite context window, next-gen Mythos couldn't reach and sustain mobile phone levels of global consumer adoption at ~$20/mo. Unlike professional developers and knowledge workers, consumers don't have any "job to be done" big enough for an LLM to command that much of their zero-sum discretionary spend.


What are the non-tech people in your life using AI for? $20/month, next to Starbucks and avocado toast, is discretionary. Maybe the novelty will wear off and non-tech consumers will leave it in droves, but everyone declared they'd leave YouTube if they started playing ads, but YouTube doesn't seem to have noticed.

> What are the non-tech people in your life using AI for?

Mostly asking random questions they used to search Google for.

> next to Starbucks and avocado toast, is discretionary.

Sure, but your description implies highly affluent, urban professionals in western nations. I was talking about getting several billion global mass-market consumers to all keep paying ~$20/mo. Mass consumer adoption of mobile phones worldwide is currently >5.8 billion or >70% of humans alive. Only ~50M people are paying $20/mo for an LLM and I suspect many of them are not pure consumers but actually knowledge workers that AI vendors are losing money on and will eventually force into higher tier plans just like the $200/mo developers they're currently losing money on. These heavily subsidized loss-leader offers are all going away post-IPO.

Personally, I know maybe a dozen people who pay $20/mo for an LLM but only two of them are really 'pure consumers' who don't use it for knowledge work. Both of them are multi-millionaires and neither has had a job in ten years. One is retired like me and the other is so wealthy she has a Netjets credit card and has new cars delivered like some people order shoes. Everyone else I know paying $20/mo is a professional who uses the LLM for a lot of office or knowledge work and writes it off as a business expense - examples include a couple of attorneys who are senior partners in a law office they own, a solo architect, and a dentist who owns his own practice.

At $20/mo, AI vendors are probably losing money on most of my professional friends because they use it pretty heavily all day. They're only making money on the two multi-millionaires who both use it so infrequently they could easily be using free chatbots instead but are so rich they could lose $10,000 in their couch cushions and not notice. While they are profitable at $20/mo, they aren't exactly "typical consumers" that there are billion more of. I expect AI vendors will find ways to force my lawyer, architect and dentist friends to switch to higher priced plans soon because they're really knowledge workers abusing a consumer tier plan into unprofitability.


100%, a driving factor will likely be how good we can make models that are so small they use almost no compute. Until then it is a race for adoption and moat-building (or screwing people over?) once you have users

> a driving factor will likely be how good we can make models that are so small they use almost no compute.

That will certainly help but it doesn't move the fundamental limit because resource efficiency is a cost driver not a demand driver - and my argument is against the thesis that lying beyond professional devs and knowledge workers, there's an untapped trillion dollar industry serving LLMs to mass global consumers.

Using Simon's cost estimates, I agree that halving the current $1,000 - $1,200/mo MSRP to profitably serve frontier inference to professional developers and knowledge workers (PD&K) will help Vendor A steal share from Vendor B or C. It will also increase LLM sales penetration into the segments of the global PD&K TAM which can't afford ~$1K/mo for every seat. A fair chunk of the PD&K workers in many SMEs aren't included in today's ~$1K/mo per seat license pool, especially in 2nd and 3rd world geos. When the price falls to $500 and $250 most will but that's still just saturating the existing PD&K TAM - not pushing into mass consumers.

While the PD&K TAM is big, justifying Trillion+ dollar capex spend requires believing the TAM is much more than PD&K and eventually grows into converting a couple billion non-PD&K consumers into ~$20/mo subscribers. I don't buy it for two reasons:

1) The Comps: There are vanishingly few examples of long-term, mass consumer adoption of a discretionary technology at that scale. Mobile phones at ~$15 to $30/mo are the obvious one but LLMs are nowhere near being that valuable to the average plumber in Des Moines, baker in Jakarta or retired nurse in Hamburg. Pondering it, I just imagined forcing any of those people to choose between their mobile phone and an LLM chatbot. Sure, some who are flush with cash might choose both but for most consumers in the world ~$20/mo is big enough they'd have to pick one and ~zero percent would choose the LLM over their phone. After mobile phones, the second comp for discretionary tech spend I thought about was XBox and Playstation monthly gaming subscriptions but combined they have less than 90M paying subscribers and the ARR is just under $10/mo. As an industry, "Big LLM" is spending well over a trillion dollars every five years. XBox and PS ARR doesn't even cover paying the interest on that capital, much less the 3 to 5x returns hedge fund investors are betting on.

2) The Alternative: It's useful to doubt my own intuitions and one counter to my skepticism is to assume "But LLMs aren't finished yet, they're going to get much better." How much better could an LLM which can be profitable at ~$20/mo get than Claude Mythos in the next five years? Instead of debating future unknowables with myself, I've found it's better to just imagine the most perfect future product I can that's still realistically plausible. So, let's imagine we're willing to spend a million dollars a month to very unprofitably deploy a prototype to test the consumer demand for "Tomorrow's Awesomest $20/mo LLM" today. So we gather a few hundred super smart, broadly knowledgeable intellectuals together at one top-tier university research library, where they'll have access to every commercial database and unlimited Claude Mythos 2.0 and ChatGPT 6.0. Since our experimental budget is $1M/mo we can afford to add in several Nobel prize and Fields Medal winners too. They'll work together manually reviewing and improving not only every LLM answer but also our test user's prompts - and of course our test chatbot will have human-level real-time speech recognition and vision (via Zoom and screen-sharing with actual genius-level humans), making this truly a test of the "smartest, most accurate, best consumer chatbot" we can imagine.

Now, let's run the test by having one thousand mass consumers try it out and see how many Des Moines plumbers, Jakarta bakers and Hamburg retired nurses we can convert to a 1 year @ $20/mo subscription for our $1M/mo ultimate chatbot simulation. Playing this thought experiment out in a bunch of ways, I find some percentage of outliers, iconoclasts and closet intellectuals would go for it but... the vast majority just don't find it enough better than "free" chatbot alternatives AT&T includes with their phone subscription or Samsung bundles with Galaxy phones - despite only being ChatGPT 5.4-level. It turns out, most plumbers, bakers and ex-nurses don't have a compelling "job to be done" in their daily lives that even an MoE panel of actual Nobel and Field's medalists with ivy league professors can make enough more valuable than an inferior but free-to-me chatbot, in the judgement of our Des Moines plumber. While the world's smartest chatbot is nice, when it comes time to pay, he prefers having one additional premium football match on TV and a six pack of cold beers every month.


I'm having a hard time understanding this huge post that doesn't talk about enterprise users. I'm convinced that the consumer isn't going to be coming up with enough money to justify AI valuations... but doesn't this just mean that we expect the money to come from large enterprise users?

A recent post here said AI spend could be "20% of every software developer's salary"... and that seemed plausible based on productivity improvements. That's not about a phone bill.


> doesn't this just mean that we expect the money to come from large enterprise users?

Yes, if you agree that not enough consumers will value "smart paid chatbots" over dumb free chatbots at ~$20/mo, then, as you say, the money has to come from enterprise developers and knowledge workers (PD&K). The big problem with that is the numbers don't work. There aren't enough PD&K workers that are highly-paid enough to justify $12,000 to $16,000 a year to cover the astronomical spending run rates.

Without recapitulating all the various scenarios AI CEOs like to hand-wave, my take-away is the scenarios which show today's frontier AI vendors earning the returns they've promised investors to get those trillions of dollars they're spending ALL require truly extraordinary deltas far above current historical actuals. Whereas ALL the scenarios which I find more plausibly realistic, even if still quite bullish, never even get near the required ballpark.

I've tried to make various scenarios fit but inevitably, when the PD&K demand side starts getting implausibly inflated, I start pushing the cost side down to keep the adoption rates remotely plausible. For example, assuming things like the number of PD&K workers will grow at 2X the highest rates ever seen before or that the percentage of the tasks knowledge workers actually do at sufficient frequency to really matter and which are also "LLM improvable" is implausibly vast.

It gets challenging because one quickly hits finite limits on the increase in tangible economic value that an LLM can possibly deliver on many common knowledge work tasks like drafting an email or a product proposal. Even if we assume next year's frontier LLM is so good literally ZERO slop is even possible so that our knowledge worker doesn't ever have to review anything - turning a 20 minute task into 2 minutes, or even 200 milliseconds, yields finite economic value to the enterprise. Even if you go extreme and suppose pretty crazy stuff like 18 months from now LLMs have eliminated 50% of PD&K workers, that messes with the spreadsheet assumptions because now the there are half as many $16,000/yr seats so the LLM license prices have to double again just to stay even.

At the edges it quickly gets nuts. I even tried assuming that in five years LLMs eliminate 100% of non-managerial PD&K jobs. All actual work is done by Super-AGI LLMs. Even if we assume such amazing intelligence can be profitably sold in five years for only 4x the price of today's far dumber LLMs, that Super AGI still costs $50,000/yr per human replaced. Even that assumption only cuts the blended labor cost of non-managerial roles roughly in half (depending on industry and geo). In the end, no matter how much labor cost LLMs enable companies to cut, it still only reduces overhead. Lower costs might allow Vendor A to steal some of Vendor B's customers but it doesn't increase the total TAM or demand of the entire sector both vendors serve.

Once you're out of plausible labor savings, one has to move to assuming that within five years Super-AGI LLMs working unsupervised will be making and validating fundamental scientific breakthroughs, then reducing those breakthroughs to engineering practice enabling new technologies which create entire new markets. Then they'll create whole new companies with maybe only 1/10th the humans and profitably grow those new markets. It quickly starts to feel like bad sci-fi instead of plausible, near-term financial scenario planning. Is stuff that extreme and unprecedented actually possible? Sure, incredibly unlikely and unprecedented things DO sometimes happen but things so far out of the distribution are very rare. But when making Sam and Dario's scenarios actually 'math out' starts requiring more than one "Black Swan" unlikely, historically unprecedented, earth-shaking event. And then goes on to need whole flocks of Black Swans, it just isn't credible. That said, I do believe that some parts of these scenarios may happen in 10 or 20+ years. It's just that the investment theses trillions worth of our 401K's are sunk into assume five or six year amortization and financial payback.


But will they pay big actors running top end models for that? You don't need latest openai or anthropic model to go thru your mails, get summary of the some products from web, or to do your to-do list.

The AI might very well be used by noticeable % of population daily, but that doesn't mean they will be paying trillion dollars to the leading US AI companies


> There will be new value created by these models which people are happy to pay for which simply did not exist at all before

What sort of new value, and why will people pay for it from someone else rather than prompting for it themselves?


> Most people I know cite +20%-40% velocity

Seems roughly right, that does seem to be about the boost in the most well-suited cases where you essentially know exactly how to solve the problem, the problem won't change much, and it's truly a matter of just churning out the implementation.

In that case precisely prompting, doing the review & nudge loop, can be a pretty nice (nice, still not game changing) speed boost over literally typing out the code to match the design in your head.

The less optimistic view though is that most things you build aren't like that. Even if they seem like it first. These things get booked as a nice speed boost, but you'll only find out much later they weren't.

A confounding factor is that it seems like many people not in the detail of building software do seem to think of most to all things are like that, even before AI assisted coding. Not much need to say more - see the entire history of the 'agile' movement for evidence of this.

And because most things aren't like that, I actually struggle to see fundamentally how more than 20-40% will ever be achieved (short of the ever-present deus ex machina of AGI argument), simply because the generation is already really good for these types of things. So since things like this aren't going to increase in overall proportion of things to be done, I don't see where the overall extra gains come from by models improving at this point.


Bigger than that, they have to contend with open weight local inference. Open weight models right now haven't caught up to the frontier models of right now, but they're as good as the frontier models of not too long ago. If open weight models reach a certain point, then frontier model providers are going to struggle to make anything selling tokens, because eventually people will realize they don't need Mythos for everything.

That assuming once they start squeezing people won't just go to deepseek or other cheaper competition

> That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

And most research shows people far over-estimating their own gains. Once companies start counting the actual (and not just reported) gains, the AI budgets will be more limited as people realize it's an useful and versatile additon but not replacement for most types of work

> We're not there yet. This is still the upswing of the hype cycle, and unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

Upswing of the hype cycle while growth of tech itself is flattening, both coz of techs innate issues (which might or might not be solved, but some papers claim they are unsolvable with current approach) and just the fact the spike in growth caused so high economy cost that it put brakes on itself.

T


There's a lot of workslop pumping the numbers. People can generate a 300 page PDF in a tiny fraction of the time it would have taken, but now the report is full of mistakes and fluff, and the stuff that would have been learned and caught in the process of making the report is now not happening.

and the recipient pulls that into LLM and generate summary.

It's lossy compression for thoughts at this point


Also, not all developers work on software products. The vast majority of developers work supporting software solutions as part of a much bigger business model, such as infrastructure, industry, healthcare and services. Many of these are complex organizations. So, unless you get to turn every employee into a 10x employee, the 10X coder along won’t necessarily make a 10X productivity contribution. What’s likely going to happen is the 10X coder will start to slow down or adding more (unnecessary) complexity to avoid having to sit and wait on overhead, for other areas of the business which are not easily automated away to AI to catch up. As a developer I can finish my project in June instead of December, but what if the customer is still not ready for integration until December? what do I do?

Yeah, claiming “product-market fit” on coding assistants for this multi-trillion dollar capital expenditure seems premature. Anthropic will post one and only one quarter of “operating profit” (aka losses after taxes and debt obligations) on the back of free-for-all spending by enterprise and engineer tokenmaxxing, neither of which will last. The investment was commensurate to a world-eating AGI, and if all that comes out of it is coding agents and slightly better enterprise software, I don’t think that makes up for the money spent.

The real question is: can you incentivize a non-tokenmaxxing Uber to spend the same amount on AI as they were when tokenmaxxing, just with fewer tokens and higher per-token costs? Even with plateauing improvement in frontier models? I think the answer may be yes.

And part of my reasoning for this is: the only system capable of actually fixing bugs in vibe-created code is an LLM. If we humans couldn't write it without assistance, we certainly won't be able to debug it without assistance. So there's a real stickiness here.

We're signing pacts with demons - we have to, if we want to outcompete the other warlocks - and those pacts are written in the very size of our codebases.


Those are rookie numbers. We are going to blow past $1t per year in spending in no time. As a developer for 29 years, I couldn't go back to coding by hand. For better or worse, AI will be woven into the fabric of life in no time.

If people figure out how to run agents on-prem (already becoming feasible for both agentic tasks and coding on consumer hardware like Mac Studio 128GB+ or DGX Spark with some models) these companies will be in deep trouble.

Privacy is also a huge issue.


I agree in principle with the math. But I believe that in reality if revenues don't show up quickly, then lenders will just restructure the debt and defer the payback period. Similar to SF commercial real-estate; many buildings should've come due during the depressed covid market, but lenders (banks) were willing to delay payment until the market picked up again.

The scale of these investments put the lenders at substantial risk, so the lenders will do anything to make it work. If the current lenders will be damaged by extended payback periods, they can simply sell the debt to someone else who won't be.


Anthropic raised less than 100B up to now and as of March has 30B ARR. Why does it have to make back 2.5T to 5T ?

> They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down.

Depreciation and write-offs are about accounting models. Hardware will still be running after five years and still be making money. They may not be as efficient as the new hardware, but they will still be making real money even though they are valued at $0 in the books.


GPUs are driven really hard plus they use up a ton of energy and water, they cost a ton to run.

So you've got that market. Let's call it the demand BY knowledge workers to do the work. You've also got:

2. The companies themselves buying tokens for operations to make the work more efficent. e.g. Salesforce agent or Microsoft Office agent or random saas inventory agent. (and if you say those will go away (which I don't believe), it's even more bullish. The tokens just go to someone vibe coding XYZ, which is EVEN MORE than if you were to buy saas because it's SaaS product x Companies that built it instead of just one)

3. The companies SELLING tokens. This is also new markets like schools and small business (e.g. the local gas station buying an inventory tool)

4. The consumers "buying" (I put in quotes because it can be subsidised but the company) through chatgpt, strava, instagram/netflix recommendation, etc.

Local models still take compute, and while it may be cheaper, it is the same argument of on prem vs cloud. No one operates on prem unless you HAVE to for regulatory. Margins will come down and you just spin up a GCP/OpenAI/Anthropic agent.

It may be "cheaper" but rationally its better to pay someone to manage it. Thats why Hetzner only had $367M in revneue (a lot but tiny compared to managed services)


One factor to consider , the base will not remain the same over the next 5 yearts.

Every generation of developer tooling that increase of absolute code throughput creates a new class of developers (and users).

Always been the case since first compilers, through eras of frameworks to today, and the skill level needed to be one has dropped. In mid/late 80s only Master / Doctorate level Comp Sci professional could write any applications. It dropped to undergrad and just Information Technology engineers and comp sci theory became mostly optional and dropped further to any college level educated with some training and has been trending below with no/low code tools like retool pre 2022, that was before agent codegen services such as v0/replit and so on.

The next generation developers will not produce applications and architecture as previous generations did, just as we most of us here don't produce the level of quality that pg did when building this platform[1] , but as long as the user can find value it doesn't matter as countless enterprise applications of middling quality already prove today.

All this to say the 200M/30M numbers will not remain the same is the thesis for these businesses, will it change by large enough at a fast enough pace to justify the capex, I don't think so either. However web 1 then 2.0 , saas and mobile revolutions were pretty quick with new class of users and developers so not completely unrealistic .

[1] While HN is a heavy outlier with its custom lang lisp implementation, there are any number of examples from previous eras that are more moderate in choices but written with solid architecture with skill levels would be hard to find in today's generation founders.


Putting some more numbers out there (some of the links are broken, but numbers look about right):

https://github.com/danielmiessler/Substrate/blob/main/Data/K...

Knowledge worker compensation is 35 - 50 trillion a year globally (6 - 12T in the US alone.) That's a huge TAM. It's still close but 5T over 5 years seems doable.

>... unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

The way we make ICs 10x productive is not just making each of them individually more productive, but by removing the coordination overhead of large organizations, because overhead scales super-linearly with the size of the org. And orgs will shrink automatically as AI-assisted ICs take ownership of larger and larger scopes of work, leaving much more budget for tokens.

I went into this in a bit more detail along with some made-up numbers here: https://news.ycombinator.com/item?id=48040999


> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

Just realized something: if one worries about losing jobs to AI, token's high unit cost is good news. To say the least, high cost would delay the displacement, if any, right?

In the meantime, someone shared the below on X. I guess the moral of the story is that "good enough" does not just displace software engineers, but also models.

   > I Went From $3,000/Month on Claude to $5/Week on DeepSeek

   > And honestly?80% of my work is identical.

   > For the past two months, I was burning $3-5K monthly on Claude Code. Every idea from design to development to testing - full end-to-end automation, even simulating users to test my products and provide feedback.

   > Extremely token-intensive. But Claude's caching sucked, making it insanely expensive.

   > Then I discovered DeepSeek V4.

At some point, if we reach stability on the models, we'll start getting silicon optimized for individual models. They are optimizing for time to market, not efficiency right now. I don't know how much it will move the needle on the cost math, but at this scale any improvement has a crazy multiplier.

But that means going back to "80% profit margin" Jensen and further digging your capex hole. The benefits would have to pay not only for current capex but also past capex.

But by then, I will be able to go one line down in my dropdown menu to switch to a newer LLM provider who doesn't have to amortize those past capex.


Yes, the whole thing feels dot comish. I’m betting there will be a few (maybe only 1-2, like what happened to search) winners, and everyone else is in for a bad time. It’s the same dynamic too: the winners are going to win so big that everyone wants to get their money in for a chance at a piece of the pie.

I could see such productivity gains being possible, if only because the current tooling around LLMs is terrible. The fact that we have 30 blog pieces per day making the front page of Hacker News about someone’s convoluted system to guide LLM output to something reasonable is absurd. There needs to be standardization in tooling, and it needs to be open source. Then, and only then IMO, will we see huge productivity gains.

But, at that point I think the big players’ moats will have dried up. Local models will probably be sufficient for 99% of daily office worker tasks.

So I disagree with TFA’s premise. I think this fear is probably shared amongst the LLM giants, and they’re still hoping that neural network transformers are somehow the path to AGI (probably not, imo).


This is why 'agents' are the solution for these companies. Token spending goes through the roof. As long as a human is in the loop needing to read or review at human speed, that's a ceiling on how many tokens per user they can generate.

> ... against the actual work their company cares about doing. [...] stuff that matters

This is a key point. Some engineers are having fun doing e.g. greenfield stuff with AI that they never would have had time for otherwise. Whether the company cares about that is another question.

It's related to Goodhart’s Law. If AI token usage is a target, then you're going to get a lot of token usage, but it's not likely to correlate well to improved business outcomes.


> 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

When you break it down like that it seems reasonable. I'm spending about $5k/mo on tokens, seems more and more normal.


> That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

20-40% sounds about right for me, today. Maybe 40-60% on a good day. But a lot of the reason it's not higher comes from harness gaps and org processes that haven't caught up.

All of that will get fixed with time.


Depreciation starts on day 1 and most likely they IMHO dont have 5 years. They dodged the deepseek bullet but who knows what is out there that will make all of this investment essentially worthless?

What value do the big model makers provide other than having a head start on gathering up humanity’s IP to train their proprietary models?

What’s their moat? Is it hoping for regulatory capture where scraping is made illegal the day after they finally finish scraping all human language?

It’s like OpenAI dammed the Colorado, and Anthropic dammed the Hudson, and now they’re both trying to sell us bottled water subscriptions at $100 a month. I don’t know how well the dam part of the analogy holds up, but the water part feels strong. Compiling models based on humanity’s written output feels like something no corporation should own.


Is it possible that you are narrowly sizing the opportunity? While PMF does not always mean that early pioneers will be the leaders, I think the market itself goes beyond knowledge workers and developers. Agents, robots, drones etc will all use LLM or some world model.

I am rather more concerned about competition from CHINA. With how Huawei (2000 -> 2020) crushed every other telecom company and went from nobody to the most revered leader in 20 years, and with the depth of leadership in manufacturing and work culture, if China surpasses USA in AI, all US companies lose.


> 5% of every knowledge workers salary to go into tokens

In general, I don't think you can reason from the existence of potentially stranded investments back to revenue projections.

And when you frame this as percentage of salaries, that's a sneaky implication that this is only about reducing salaries and headcount, and not about adding capability, or doing things you couldn't do before, or making fewer mistakes, or capturing more revenue, or expanding margins, or competing more effectively.

That said, 5% of knowledge worker comp actually seems very low to me, given the capabilities, and considering the percentage of "knowledge work" that is absolute bullshit.

Two weeks ago I received an email from my HOA saying I'd been billed for a service I never asked for. So I replied to the email saying they'd made a mistake. There are now more than 30 messages in the thread, involving at least 8 "knowledge workers" at the property management company all passing the buck, and the problem is no closer to resolution.

An agent could wipe out all 8 of those bullshit jobs and solve my simple problem in five minutes instead of two weeks. Think of how many hundreds of thousands people are doing this nonsense just in the property management industry alone.

5% is nothing.


I don't think the unit economics are too terrible. Expensive, but not impossible.

200m knowledge workers in US and EU. Total salary around $15T/year.

$1T/year in token spending is about $5k/year per person. A big number, but not totally mad. That's the low end for office space per person for example. Probably close to the existing SaaS spend per person for a lot of roles.

We are still early in the deployment cycle for these tools so I would expect them to get better and also cheaper too.


This assumes that we won't need new hardware in ~2 years. I find that unlikely. So they have to make back what they got up until now PLUS the running upgrade/development costs. So what will it be in 5 years? $20t? $30t? It's all getting a bit outlandish.

What I'm often hearing though is the equivalent of "gg ez" when I bring that up. I don't understand how this will at any point blitz scale to profitability. As far as I know they don't have positive cash flow, no one has a moat and I don't think they will push out engineers.


Here is a serious question.. Can we sell into the hype cycle and on the way down with this: https://safebots.ai/costs.html

I asked claude to generate a frontend and it made the same template. Same san serif and serif fonts together. Same colors. Same typography. Same layout and animations even. It’s wild how similar it is. No not similar it’s the same damn thing.

I’ve seen the same dashboard for a dozen custom web applications now, including a couple I had it make for me.

It really does have a particular lane for each chore, and it’s reproducible.


Yep and when you see it in the wild it stands out like a sore thumb, absolutely no thought into a bit of a unique design or branding.

I have a few live websites built using LLMs and they will just go for default generic templates and colours if there's no vision.


It produces the "most average" web design unless you really prompt your way out, isn't it? If you don't care enough to prompt, Claude does not care to be individual.

Technically from claude's POV, it's one individual copied millions of times. All claudes are clones.

I don’t think these numbers are accurate? It seems to ignore the fact that the models have cache for ongoing sessions, which means you (normally) aren’t actually sending all those tokens on every request… you only need to if you go too long between requests.

The most often cited figure for knowledge workers seems to be 1B, an order of magnitude difference to your assumption.

Also, according to https://isaiprofitable.com/ total industry spend is also an order of magniture less than what your assumption is.

So in your model 0.2% of knowledge worker salaries instead of 5%, IF all the AI players win the investing gamble and do infact make back their money.


> 20% if you're a developer. That's a _huge_ shift. Most people I know cite +20%-40% velocity with these tools, against the actual work their company cares about doing. +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

Of course it will. The value of an employee is a multiple of what they get paid.

If you pay an employee $500k and they make $2M for your company (like Meta), then of course a 20% increase for the salary is justified if the velocity is increased 20% as well.


The difference between what the employer makes per employee and what they spend in compensation doesn't matter. If the increase in productivity isn't greater than the increase in cost, there isn't a reason to pay for AI over hiring more developers.

Imagine an employer with 10 employees paying $500k per employee and making $2M per employee in revenue (to use your numbers). They could hire two more employees and spend an extra $1M (+20%), but make an extra $4M in revenue (+20%). Alternatively, they could buy all ten employees a $100k AI subscription, for a total of $1M extra spending (+20%) but an extra $4M in revenue (+20%). You'll notice both scenarios are identical, so an employer optimizing for profit would have no reason to prefer one over the other.


There’s a lot relationship and culture management overhead involved when adding 2 more people to a 10 person company. I think any business leader would take the productivity speed up from buying a tool over hiring more people and integrating personalities/habits/viewpoints to an existing established culture any day of the week.

You're basically positing that the real cost of a 20% headcount increase is higher and/or the productivity gain is is lower than 20%. That isn't an unreasonable claim, but it's basically rejecting the premise here. You might just as well object to the premise that you can buy a 20% speedup by spending an extra 20% on tokens.

To get that revenue and adoption they have to vastly increase their infrastructure spending. If they are currently losing in even the 200/month plans how is it sustainable?

> 200m knowledge workers in the world, 30m developers

Your scope is too narrow. The companies target more than white-collar jobs. And $1t is around 0.5% of the world economy.


I don't think the maths works like that. They have raised ~$200bn so far and need to make that back. Saying they need to make $5 to $10tn isn't really real. They might need that to meet some extravagant Altman projections but not to justify what they have actually spent.

> 200m knowledge workers in the world, 30m developers. We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

This is where the napkin math is breaking down in a big way. There is absolutely no reason to assume this will only impact "knowledge workers". Farmers use computers. Farmers will use AI.


AI for what? None of the AI a farmer could or would use would be any more meaningful that light chatbot usage or already existing computer vision/gps

And around 400k H-2A workers. Humanoid robots... Who works on them I wonder.

The kind of farm that would use AI is already 99% machinery and automation.

Not to mention the competition: chinese open-weight models and open-source harnesses. Qwen3.6-(27B and 35B) have proven to be worthy and capable of running locally. I am confident more SMEs would look into this as a solution given the ballooning costs of API usage. You get a decent setup with an RTX 6000 Pro.

"5% of every knowledge workers salary to go into tokens. 20% if you're a developer"

Not unreasonable. I'm a hardware developer, and my employer spends ~10% of my salary on software tools. Add hardware tools and their maintenance and it's more like 30%.


Given what costs are and availability of parts, that 5 year write down is not in practice going to be the case. Maybe tax wise perhaps but especially for big fancy expensive multi million dollar 100-500kW racks these things are going to stick around for a while, I think.

Plus, at some point there are less tokens because local models being optimised and can work with protected information. For enterprises that want an AI with a knowledge base of internal documents, this becomes more interesting by the day.

Also hardware will be obsolete or dead in 5 years, and warrantys are 3 years from Nvidia. Ask crypto miners how these kind of hardware economics work. Numbers have to keep going up all around. Its a fundamentally broken business model unless prices increase 10x

That’s on the order of 1% to %2 of global GDP per year just to pay for their hardware commitments.

My hope is that hardware improvements (better node densities every 2-3 years, better designs, etc) will pick up the majority of the savings for these companies in the future, assuming LLM performance starts to taper off with diminishing returns.

There is also the EV (expected value) of developing AGI. Even if you personally believe the probability is low within the lifetime of either of these companies, the value would still be extraordinarily high, enough to forgive a $5T or so miscalculation here or there.

I don't think AGI was ever a serious endeavour, just something the labs talked up to grab attention.

I am willing to bet a Twix we'll look back on that stuff in 2 years with a lot of embarrassment


The high-risk side of that bet would need to win more like a lifetime supply of Twix. But in a post-scarcity nirvana, everyone already has that. So sure, you're on at even money. See you in two years.

Theres no reason to believe, based on recent trends, that AI would lead us to a post-scarcity world, even if it could do all of our jobs better and cheaper.

I'll wager a hypersled of my Twix against your next three rations of gruel. But I think I'm done betting after this one.

Only if running AGI makes economic sense. We actually have no idea if that’s the case. We don’t even have a definition for AGI

Somehow Uber and WeWork survived the same kind of grand projections that they never met.

uber sure....but how did wework survive? they are a smoldering husk of a failed company looted by its founder

The company’s gone but the assets just got sold to other commercial real estate firms.

Uber was basically only ever software to help people use their own cars so a very small part of their valuation was physical stuff to upkeep, it was just deals and obligations they had.

Not sure how it shakes out for Anthropic and OpenAI. There’s a lot of physical capacity that needs to be built out and can depreciate. But there’s also a lot of network effects and dependencies being built in with enterprise users.

I don’t know how swappable the tooling is either. I think over the long term the UI, model training and documentation, and infrastructure are going to end up being run by different parties and I’m not sure which leg of that chain ends up in a position to skim most of the profit off. My guess is that Apple and Google end up raking in all the money since they control the OS and app stores while the rest of the stack gets driven down to being generic commodities. At least where mass market consumer adoption is concerned.


I'm sitting in one right now and don't see any smoldering...

They literally went bankrupt and wiped out the original shareholders.

I guess I'm just not clued into your exotic definition of "survived" if continuing to function doesn't qualify. I tend to go by the dictionary definition.

Chapter 11 is not Chapter 7. Businesses survive chapter 11 bankruptcies all the time. For example, WeWork.


It's snowing right now therefore climate change is fake

lmao. I'm sitting in Hiroshima and nothing is burning

I don't think Uber was doing $1 trillion in infrastructure spend.

The difference is that they had room to charge more of their customers and pay less to their workers. The AI industry doesn't have both sides to play at this point. Training and inference are getting more expensive and if you take on the high prices now you're just floating yourself further downstream from profitability long term (which does not look viable for any of them currently).

WeWork absolutely did not survive

uber doesn't own trillion in cars

Funny you should mention Uber. What was it their COO said recently about the AI costs?

I quoted exactly what they said in my piece, under the heading "The AI-failure stories around this are pretty thin": https://simonwillison.net/2026/May/27/product-market-fit/#th...

> But then you sometimes go and talk to your senior engineering leaders and you’re saying, OK, how many projects that were on the cutting room floor got moved above the line because of the productivity gains because 25% of our code commits were via Claude Code last quarter?

> That link is not there yet, right? I think maybe implicitly there’s more that is getting shipped. But it’s very hard to draw a line between one of those stats and, OK, now we’re actually producing like 25% more useful consumer features, right? And that line is hard to draw.

That's pretty weak sauce. I don't think that justifies the headlines that came out of it, personally.


? What are you talking about mate? The man all but says "this shit does not work for us". It iss layered in that careful, sanitised corporate shit-sandwich communication approach, where you take a nice piece of shit and layer it in between two slices of avocado so its sweeter to swallow for the "consumer" of your message.

He also said in that article that what prompted the discussion was the public statement by the Uber CTO that he had already burnt through his organisations yearly AI-budget in April. Please stop this shilling mate, and trying to hide the overall perspective between this or that word.


Did you read my piece? I covered the Uber CTO thing too: https://simonwillison.net/2026/May/27/product-market-fit/#th...

> The most discussed has been Uber, based on this report where CTO Praveen Neppalli Naga indicated that Uber had “maxed out its full year AI budget just a few months into 2026”, mostly thanks to Claude Code.

> Given that Claude Code only got really good in November it’s entirely unsurprising to me that a budget set in 2025 may have failed to predict demand for that tool in 2026!


somehow the invisible hand of the market is also blind af

Makes sense if you think about it: if all photons pass through you (invisible) then you can't capture them to get info (blind).

We're going to reach a point where these companies stop asking for money and start mandating it. They've got a vice grip around the nuts of many governments and loads of companies have gone all in on investing in these slop heaps.

At some point, companies are going to start removing basic features. Governments and essential services are going to make people go through chatbots to get basic service. They're going to require AI to validate stuff that's already automated and working fine. Google search? That'll be all AI (and I guess they're already rolling it out). Dentist appointment? Going to need to do it through some AI app that requires an account and tokens "for a better patient experience". Verifying your ID when buying alcohol? Going to need AI to scan it and take 90 seconds to determine whether it's real. And it'll say you're an 7 year old farm worker in rural Botswana, so you can't get alcohol. And they're going to milk money at every level of this.


> They've got, ballpark, $5t to $10t to make back in the next 5 years, or the hardware buildouts will start getting written down.

I find it disappointing that a completely wrong statement like this ends up the top comment on HN.

It is wrong in both the math, the logic about public markets and understanding accounting.

> $5t to $10t to make back in the next 5 years

I don't know where this number comes from, but it has gone unchallenged.

OpenAI and Anthropic combined have raised around $100B. This is an investment so isn't something the have to "pay back" from earnings - instead investors expect to make that back from the share price being higher than what they paid for it.

> or the hardware buildouts will start getting written down.

The hardware buildouts get written down anyway!! That is a good thing for investors because as the value gets written down they can book a tax loss. ANd it turns out that generally agreed depreciation schedule for GPUs (used to be 3 years, now 5 years by places like Coreweave) is still too conservative since GPU rental prices for 5 year old chips are higher now than when they were new (!!)

All of this makes the rest of the math in the comment incorrect by at least an order of magnitude and under some scenarios possibly 2 orders of magnitude!

That's not a small error!


consider cloud spending vs on-prem before the great cloud migrations. people are spending a lot more for cloud services now.

I hear conflicting things about finances, some have a different opinion, that it won't be written down so long as more funding comes in and revenue keeps increasing. it isn't like how you take mortgage or business loan, it isn't even a loan it's an investment funded by loans. So long as the investment is still promising, what are they going to do? destroy its value by calling in trillion dollar loans?


Let's skip to the part where they put the taxpayer on the hook for a bailout as an industry since they integrated everywhere with big promises

lol I’m spending max $50/month right now on a couple light subscriptions and my velocity is insane right now (full stack mobile app development) I’m leaning into it hard while these cheap plans still exist and building out a big platform that I can easily generate new apps from. Hoping by the time the rug pulls I can just go back to hand cobbling these apps together from the modules I’ve pumped out and never even consider giving these companies a massive portion of my monthly income

Also, with announcements of replacing developers with AI and consequent job losses, who is going to use the tokens? AI using its own tokens to produce code?

I just don’t understand how people are getting negative value out of AI or even only 20% productivity boost. I can only conclude that people don’t know how to use agents.

Are you mostly creating new things or integrating with complex, undocumented, untestable systems?

Mostly brownfield systems in Java, Elixir and TS. I use OpenSpec in explore mode and point the agent to all the different repositories (when not working in a monorepo) to identify changes. Once done, i switch to propose mode and spend at least 15 minutes there iterating over the plan until I'm satisfied with the TDD approach (agents need tests to verify their work). Then apply and review. This also auto generates docs etc.

I mean, it doesn’t really matter if it caused by people failing to use the agents well or not. You cannot assume everybody to use the technology the best way possible

> +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

I'm increasingly realizing this math is wrong, because LLM use is really sticky.

If Anthropic 100x'd prices tomorrow for their best model, so some companies offered 50% salary to keep 100% of your AI usage:

a) There are programmers who would take this deal. They've gotten to the point of doing what feels like even less than 50% of the work, developers were already pretty well paid, so they'll take it.

b) There are companies that'd offer this deal. Even if the only people who are taking this deal are not the best engineers, and the AI output is not the greatest, I think the last 6 or so years have seen a lot of companies realize capitalism is not as competitive as it seems.

They're not worried about putting out a worse product because... frankly, what else are you going to do? CF lay a bunch of people off, support gets awful: well you're probably not building a new Cloudflare in the next few years.

In the meantime the AI will get incrementally better, their market share will grow, and you won't be able to compete without taking the same faustian bargain.

-

Maybe I was just naive but it's making me realize how much we take for granted in the world. Both the quality and relative value of things don't have to go up over time. Quality can go down while prices go up, and nothing will really stop it. Competition should stop it, but competition is really slow and can be interfered with. And as prices go up competition gets really hard.


> unless we figure out how to make developers 2x, 5x, 10x as productive on stuff that matters, this isn't going to play out well.

Simple - you make them work 2x, 5x, or 10x more hours.


There are not enough hours to do that

Now try to take back llms from developers and see what happens.

If, by some miracle, all LLMs ceased working right this second, any developer who would no longer be productive should not have been a developer in the first place.

You don’t need a miracle, if Anthropic API is down due to technical issues you don’t have software development anymore. It’s insane how much we are delegating to 3rd parties. It’s not like having cloudflare down where your users cannot access your services. The AI tools used to investigate prod issues stop working, developers stop working. The AI support system that allowed the company to get rid of their support team stops working. In addition to all the issues that causes to customer facing products based on AI. The sales team cannot work anymore.

It’s like the industry is willingly introducing a common external risk to everything


True, but they will not want to work for you anymore, they'll want to work for company that provides it.

I'd happily work for a company that paid me the money they would have spent on LLMs.

They don't want to spend this money.

They will eventually have to

They're going to start seeing real bill on Monday from Microsoft/Github.

Limiting token quotas would be fine. Encourage developers to use efficient models, plan the work first, and to not burn thousands of GPU hours on waste.

It's much like when developers would waste tons of money on AWS spinning up massive test VMs and leaving them running without care. Until the finance people cracked down on it.


we all know it is impossible goal to make. surely AI will be even more useful in the future, but as long as china exists and continue to undercut the price, the goal will be never meet.

> We're talking about a world where you need 5% of every knowledge workers salary to go into tokens. 20% if you're a developer.

with that much money, the companies can easily buy their own hardware and hosting free public models, no need for those expensive subscriptions.


Author seems strangely unwilling to distinguish usage from profitable product market fit. And from his own numbers:

Anthropic Max: $100/month

OpenAI Pro: $100/month

Total paid: $200/month

API equivalent usage: $2,180.16 in 30 days

So paid only 9.17% of API-priced value a 90.83% discount, or about $10.90 of API priced usage for every $1 paid...

That proves heavy usage but not sustainable unit economics.

Anthropic reported numbers point the same way:

Q2 revenue: $10.9B

Adjusted operating profit: $559M

Margin: 5.1%

SpaceX compute: $1.25B/month = $3.75B/quarter

So one compute supplier alone equals 34.4% of quarterly revenue and 6.7x quarterly adjusted operating profit.

Its difficult for the blogger to understand something when its incentives depend on not understanding it...


My point with the $2,180.16 thing is that the price for consumers like myself is heavily discounted... but the price for enterprise companies is not discounted.

My usage is therefore a useful indicator of quite how much those enterprise companies may be spending on tokens, given the new pricing scheme.

If enterprise companies were still getting the same discounts that I get myself I would not have written this article.

(I had to dig into your margin figure - looks like you calculated 5.1% as 559000000 / 10900000000 * 100 but that $559M "adjusted operating profit" figure includes training costs, where usually when we talk about margin on inference we're not including those since those costs are fixed, margin calculations make more sense against the variable costs of serving a token.)


When you have to train a new model every few months to stay competitive, discounting that cost is rather dubious.

They key difference here is that training costs are fixed. If you train a model for $100m dollars, how much of that training fee should you allocate to each token that the model serves?

It's impossible to know, because you don't know how many tokens total will be served by that model until you retire it at some point in the future.

So you can't say "1,000,000 tokens costs $X in inference and $Y in training" because $Y is not possible to correctly calculate.

So, if you want to have a productive conversation about "margin on inference", it's sensible to look at the cost of serving the tokens independently of the cost of training the underlying model.


> 200m knowledge workers in the world, 30m developers

1 in 6 knowledge worker is a developer ! Surely that’s too high thou explains the job market


There's a lot more things that are going to be built that weren't built before as well.

there are many paths towards ROI and ruin. but towards ROI:

+ LLM-powered robotics, autonomous, IoT, smart manufacturing

+ LLM-powered biotech, healthcare, genetic engineering, medicine

+ Recursive model improvement

+ Multiply the # of devs (software truly eats world)

+ Exponential increases in model performance / cost decrease (algorithms, power, infra, chips, architectures, etc.)


> +20% speed for +20% spend isn't going to motivate a trillion dollars a year in spending.

Except that if your company go 20% faster than the others companies, you win market shares. But then, everyone will use the same tools and companies will be at even speed, but the tool will stay.

Now...if the market is saturated, it's useless to try to do things faster. Cheaper yes, but not faster.


Pretty much all major tech companies today are horribly bloated and mostly metastasizing instead of innovating. I'm not sure how 20% increased productivity will help in any way with that. If anything, it might accelerate enshittification and turn potential customers off even more.

It's worth noting that if each developer is 20% more productive with AI (let's take that as a premise and not dispute it), then it makes sense to go even further and reduce human headcount by more since the communication overhead of having 25% fewer developers is in and of itself a force multiplier.

tldr; 10 developers with 20% more 'productivity' can be replaced by 7.5 ideal developers and more like 6 or 7 developers due to the benefits of simply requiring less organizational communication.

I still think the ideal team size is unchanged however and that's 7-10 people. Note that teams aren't necessarily the same as direct reports. A CEO for instance has a certain number of reports and a leadership 'team' but they're not a team in the traditional sense since they are more about making good decisions and collaborating on specific things but mostly about leading their own orgs that have vastly different skillsets from eachother.


> We're not there yet.

And that's not considering that capitalism is going to do what it does best: if they really found a way to be profitable, competitors are going to fight them on pricing. Anthropic, OpenAI, Google, etcetera 's margins are a competitors' opportunities.

It's not as if there weren't chinese models nearly SOTA. Don't know where the french (Mistral) are but they may try to get in the game if there's a way to be profitable (not that France or the EU for that matter are relevant in anything tech or had any tech company besides ASML and SAP in the Top 100 but who knows).


One thing I genuinely don't understand is these companies are constantly taking in incredibly large amounts of investments, so presumably they're giving up large chunks of equity or these are loans that need to be paid back or they're committing to spending obligations they're very unlikely to be able to meet.

So besides the insane hardware buildouts you're correctly mentioning, I don't understand how anyone that invests in these companies is supposed to make their money back in any sort of reasonable timeframe?

The cynical part of me is looking at what happened to the NASDAQ rules recently where essentially index funds are going to be forced to buy SpaceX shares much earlier than they previously would have (ie, before the price has a chance to reach it's real valuation). Which, um, I'm guessing these stocks are going to drop pretty hard when people start looking at the financials of these companies.

My suspicion is that the point of these IPOs is essentially to dump the bill on the unwilling public by forcing various institutions to buy it (ie, your 401k or pension is buying this shit), and maybe their investors can squeeze some money out of this before the stocks reach an equilibrium that's probably like 1/10th of what they're "valued" at.


> make developers 2x, 5x, 10x as productive on stuff that matters

What does this even mean? Is this about speed of development? Is this about headcount? LoC? How are coding agents contributing to productivity in places like GitHub, Shopify or Meta? I mean companies that already have an established product. I really wanna understand this because I'm not seeing that GitHub's product suddenly became so much better than it was 2 years ago, so where's all that productivity going?


The productivity is going into perverse incentives[1], e.g. we have improved (by which I mean "increased") token use. More PRs every day. More lines of code. All things we knew were shit-brained metrics a decade ago (obviously except token use).

We've also increased how much our coworkers need to read, or deal with. You can get an AI to make any point you want, so you can ignore the 5 humans raising alarms due to the 1 clanker you made say what you want to hear.

All numbers going up.

There are obviously people producing additional true value with it, probably, but that's almost certainly scarce.

[1]: https://en.wikipedia.org/wiki/Perverse_incentive


People have been arguing about how to measure developer productivity since time immemorial. But the bottomline is that if products and features are hitting the market faster than they used to, developers are doing more with less. It's what we're seeing in my workplace.

Productivity is measured in the number of AI-generated Twitter posts developers can make about their AI-generated startups

Your severely underestimating the idea that people are just not going to use developers for certain things in the future

For example I don’t anticipate somebody making a living off of making website ever again

Somebody with absolutely no technical experience who needs a website for their business can now make one with almost no money whatsoever.

That’s good enough for their business. and the code can be totally shit and it does not matter because it’s meeting their business objectives. I am seeing this in the wild and I’m paying money to companies that have these types of websites and because it doesn’t matter I don’t need for the website to work perfectly on all my devices all I need to be able to do is pay them through the website which is what they need me to do and our transaction is done.

Don’t forget ultimately the people who pay technologists right now are primarily advertisers

work on hard problems is going to continue to be some tiny fraction percentage of the software engineering discipline

just expect a total bloodbath because the goal isn’t developer productivity the goal is that “I don’t need to pay somebody $200,000 a year to build a website authoring tool like WordPress.”


Why would a small business use a coding agent to build a custom website when they could use something like Squarespace or Shopify with prebuilt templates that mean they have to know even less than if they were to use some kind of chat UI?

Cause it’s still easier and cheaper apparently

This is the most recent example I found last week for a local barber:

https://news.ycombinator.com/item?id=48166050

They seem to be using Manus: https://manus.im/

And my other assumption is that it immediately integrates with IG/Facebook which is where they do a lot of their marketing

I see no reason that trend is going to slow, especially if you can go to meta to manage your entire business marketing.

Regular people running business just want fast cheaps and good enough.


I hear this and I keep wondering what I’m missing. My productivity has shot through the roof over the last year as a result of having these tools. I’ve been able to unlock projects that I’ve wanted to do for years.

...does anyone have a guess as to the total amount of money spent on software developer salaries each year? What percentage of that would the AI companies need to capture to be profitable?

(I'm not trying to imply that LLMs can replace software engineers, it's just an interesting comparison. If nothing else, I suspect that if the cost of development goes down, demand for custom software will go up.)


^ For what it's worth, Claude estimates $1.3–1.5 trillion.

https://claude.ai/share/8a3de813-677e-4a75-9b7f-1785495c2569

Honestly doesn't seem great for the AI companies.


Source on 200 million knowledge workers worldwide? My understanding is that it's just above 1 billion. I dont think a billion subscriptions at $1000/yr is out of the question but it might take a decade to get roiling

You're suggesting that 1 in 8 people worldwide, including every one from infants and the elderly, are knowledge workers. Are you sure that's what you mean?

I'm not even sure that 1 in 8 people I know would qualify as a knowledge worker, let alone a knowledge worker that might profoundly benefit from on-the-horizon AI. And I'm in a highly skewed population.


I think the underestimation is how many people want a personal knowledge worker in their pocket, and are willing to pay ~$65/mo for it.

Personally, I've only encountered any of those people on line, and almost exclusively here on HN.

Most people I've met -- and again, in a pretty darn skewed sample globally -- see $65/mo as a lot of money to spend on technology of any kind and can't think of anything much they need from "a personal knowledge worker in their pocket". I don't know a single person in real life who remains excited about AI at all, and only a few software engineers who feel it'd be worth that much.

Everybody seems to be mostly confident with the "knowledge productivity" in their personal and professional life and a pretty skittish about spending in today's economy. Most would be excited about a magic new robot that affordably saved them from unwanted physical labor and drudgery, but nobody needs much real help making appointments or filling out forms or whatever.

That's not to say I won't be proved wrong some day, with some further innovations in AI products, but global-scale demand isn't waiting for anything that's been released so far.


I've yet to meet a person that fits that description IRL. Admittedly I don't live in the valley but I do work in tech. The only place I see that demand is on hacker news (and I imagine twitter - I'm not on it).

The competitors of $65/mo subscriptions are the free models and services that are good enough. It will only get worse as open models or free tiers catch up. For most people, they just use whatever that's free

Apple TV, Netflix, BritBox and PBS add up to about $45 a month. Most people are gonna judge AI up against what they’re already paying for and the AI model makers simply don’t have a good enough product.

There’s only two things useful to the average person something to help them translate and something to help them write in everyday life.

Something else that might be useful would be local single purpose AI agents who’s remit is to help you with one specific task, but I don’t think that’s what the people building those expensive data centers want to sell to the market.


> Apple TV, Netflix, BritBox and PBS add up to about $45 a month. Most people are gonna judge AI up against what they’re already paying for and the AI model makers simply don’t have a good enough product.

What's the actual TAM for premium tv subscriptions though? Unlike free models which are keep improving, you can't get premium tv for free. Also they are actually competing with the total subscription price a person is willing to pay for a year.

> There’s only two things useful to the average person something to help them translate and something to help them write in everyday life.

Exactly, free models are good enough for these tasks.

I just don't see how b2c is a large enough market to ask average Joe to cough up hundreds per year when there are free stuff everywhere. B2b is another story


Well around 40% of people work. I dont think its crazy to say around a third of jobs are knowledge jobs, but what do I know

85% of the world population lives outside of developed nations.

27% of the world's workforce is in agriculture (contrast to the US where it is 1-2%). 15% in manufacturing.

A lot of people work in "services" (especially in high income nations, where it's roughly three quarters) and some of those are knowledge workers... but a huge number of them are nail technicians or hairdressers or bartenders (etc etc).


A billion subs at 1k a year????

I see a lot of out of touch takes here but this might take the cake


A billion? Really? At 200M you’re already including a lot of people that stretch the definition of knowledge worker.

> At 200M you’re already including a lot of people that stretch the definition of knowledge worker.

How do you know this? Im certainly open to recalibrating my numbers which is why I asked for the source


What's your source, because it looks wildly out of proportion compared to numbers we have now.

Here's a source from 2019 that says: "By 2023, the number of knowledge workers in the world will increase to 1.14 billion, with more than four-fifths of that growth coming from the emerging world."

https://www.gartner.com/en/newsroom/press-releases/09-24-201...


Thank you for validating my point.

> "...with more than four-fifths of that growth coming from the emerging world."

If anyone thinks this is a part of the global TAM that's got $1000 a month to blow, well then I've got a stable of flying unicorns to sell you.


To add an actual source to this thread, a brief paper by researchers at the International Labour Organization (ILO) states that for knowledge workers globally "... there are between 644 and 997 million jobs, which represents between 19.6 per cent and 30.4 per cent of global employment respectively." [1]

[1]: Berg, Janine and Gmyrek, Pawel, Automation Hits the Knowledge Worker: ChatGPT and the Future of Work (April 21, 2023). UN Multi-Stakeholder Forum on Science, Technology and Innovation for the SDGs (STI Forum) 2023, Available at SSRN: https://ssrn.com/abstract=4458221


Globally, sure. The assumption here is all users are on the same economic footing, they are not. Only about a 1/3rd (at most) of that count can afford $1000+ monthly, and even then that is wildly out of line with what most will.

I googled "number of knowledge workers worldwide" and read the top results. If you read it as I was confident in a billion I apologize, Im just trying to get an accurate count. What numbers do you have now and where did you find them?

That's not the TAM of 1B knowledge workers globally. If that were the case many industries would have a 2-3x target market.

To simplify break that 1B up into 3 levels of purchasing:

1) High-tier (US, Western EU, ANZ, Japan, South Korea, Singapore, UAE, etc) - 200-250M knowledge workers.

2) Mid-tier (Eastern EU, Latin America, urban China, India tech sector, etc) - 300-400M

3) Low-tier (Rest of the world) - 300-400M

Low-tier users are mostly free tier or heavily subsidized pricing.

Mid-tier are going to account for USD sub-$100 tiers. Probably averaging less than $50/seat.

High-tier are who you are assuming is the 1B. Users are not equal in that knowledge worker count, so there aren't 1B knowledge workers to charge money.

And when you consider Low-tier users a majority of those are free users which need to be subsidized by the High-tier users. So either free tiers get much more restrictive or the providers lose additional training data. A bulk of Low-tier users cost money and provide little to no revenue.

Edit: And think about Mid-tier and Low-tier for 5 seconds. Why would they pay Anthropic or OAI when they get get 100x+ inference from DeepSeek or Xiaomi? Mid-tier may be the only area that is willing to spend money on a US provider, but I would wager significantly on the fact that users in the Low-tier almost universally do not care.


Thank you. So with these numbers it seems like half a billion subscriptions at $500/yr is on the table. Obviously theres going to be competition in this market and self hosting cheap models may become the dominant use case. Assuming the labs are able to get most of the market though, the market size is something like a quarter trillion a year within the next decade. It's hard for me to imagine the whole sector failing if that happens.

I do think free accounts are going to end pretty soon, and some of the workers in your tier 3 will pay, but even without them this seems like a pretty healthy market size. I also wouldnt be surprised if mid tier workers are able to afford the $1000/yr vs $500. I use yearly rates because I find it easier to compare them to GDP/salary numbers


I mean, sure. Assume all you want but to guess that the entirety of High-tier plus almost all of the Mid-Tier will spend, on average $500 per annum is bonkers.

I believe we've started to see the top of what individuals and businesses are willing to pay for the current model capabilities. We are nowhere near AGI and models are really only providing significant value in niche markets currently (programming and cybersecurity). And just like SaaS the enterprise has the option to buy hardware and leverage their own models at will which can potentially offset costs and TAM as well. I have talked to a number of large financial corporations in the last 6 months and most have internal initiatives. The same applies in the healthcare vertical.

$250B per annum with AI? That's 20% of global software spend now. Sure, that's possible but that assumes current market prices hold. What if inference ends up normalizing between DeepSeek/Xiaomi & Anthropic/OAI? There's 50% of your revenue and with current costs for inference and training in the US at astronomical levels the US AI industry could also very well be setup to implode overnight.

Lastly I don't believe free can go away anytime soon because it can't. As soon as Anthropic and OAI remove that option those users will move to whatever is. For most of those users it's not a luxury to choose, it is the only option.

The financial engineering occuring right now is something I don't doubt will be text book lessons of the future. We've seen it before and I believe Peter Sorkin when he says that we will see a crash of this bubble, it's just a matter of how catastrophic it ends up being.


I agree that a lot has to go right for the AI labs for it to work out for them, I just dont think it's already over like the top comment in this thread seems to. MSFT makes $55 billion on their office products, the AI labs can use a similar strategy I think. I think AI assistants will be more indispensable than office products within a decade. Hard for me to imagine doing office work without an assistant a few years from now, but maybe I need a better imagination.

Yeah, just looked into this. Knowledge workers is a big group and probably much larger than you think it is.

Basically if you're not doing manual labor, it's probably knowledge work.

Roughly 1/3rd of the working population.

Some data tucked in here: https://gist.github.com/danielmiessler/2dc039762a202b083753b...


A lot of those ‘edge cases’ in the definition of “knowledge worker” are probably the stuff that’s most likely to have significant parts of the work augmented or replaced by AI agents. Like, call-centers are almost certainly going to get turned over in a big way. It’s not like the median tier-1 support operator just reading off a script is much better than an LLM anyway.

"Next 5y" doesn't apply to AI factories

At least they're not going to make us watch ads.

Why 5 years? What happens in year 6?

One quick question. Did tax payer money fund these data centers? If so, how does that money translate to their profit and a return for the people whose work paid for the resources?

Or did we just get scammed?


Doesn't matter, it will be pushed and forced down people's throats because someone invisible thinks it's the new way forward. And for that you need more money for NVDA and the like, and now people have to be made cultists in order to let the money flow in.

Same happened and happens in gaming. The gamers "invested" into NVDA by eating all the bullshit about ray tracing and the like. And they kept buying all the crappy 1000$+ gpus because youtubers said that the extra 1000 dollars worth those +15 fps plus the ray tracing....


I assume the bet is that as you swap humans for machines, this pays for itself. Swap entire devs and teams and frankly, managers, and you make up a lot of 5%’s fast.

If it works. And I’m not sure who is going to buy the stuff the machines produce, but shrug. Presumably some bots click ads for NFT’s that other bots generate.


> $5t to $10t to make back in the next 5 years

Wait what? They spent 2 order of magnitude less on hardware.


From the verge: https://archive.is/kU4Zg

> Gartner forecasts that large AI companies would need to earn cumulatively close to $7 trillion in AI-driven revenue through 2029, which is close to $2 trillion per year by the end of the period. In order to achieve “historic returns,” the providers would need to earn nearly $8.2 trillion in the same period.


Those numbers don't even track even in the same sentence. If it is $2T/year by the end of 2029, it would be something < $6T cumulative in 3 years.

“Through” 2029 is a bit more than three and a half years. The $2T are likely the yearly average of the $7T in that period.

The numbers are made up political correctness anyway.

Everyone's agency is 100% captured by belief in Wall Street. Too few <50 have any meaningful labor skills to blink.

We'll continue to have consent manufactured via media platforms and in 3 years no one will bat an eye at these companies being worth $12 trillion as Altman and Musk climb two ladders holding a "mission accomplished" banner.


I understand some startup deciding to take a punt on "this will all work out financially if our new product demonstrably boosts productivity of large sectors of the economy by a breathtaking factor that's incredibly rarely ever happened before in history: 2x. Sometimes a plucky group of people take a risk, it pays off. If it doesn't work, the company fails.

What I do not understand is: large sectors of the economy all simultaneously taking this punt, with the necessary productivity boost, as you say, far more like: 2x, 5x, 10x


if you ignore all catastrophic mistakes, these numbers are true

they need to make 5t-10t back, but not necessarily through selling tokens. as we can see, the frontier labs are making vertically integrated products. their revenue is no longer strictly tied to inference.

imo if your developers arent at least 2x as productive, then something is being done wrong on the employees part and/or the organization's. cli tools are ridiculously powerful provided you were an actual developer before using AI.

Maybe it's just me being (trigger warning from me providing an honest self assessment) very intelligent + a generalist, but i went from only full stack webdev and .NET to being able to implement an end-to-end LLM training pipeline (data curation, tokenizer, pretrain, sft, DPO - using ~$100 in cloud compute to train a class-competitive 1B STEM model)...and a full economic financial modeling and quant analysis application that pulls up to date economic, economic, news, stock data from the entire world and uses Dagster to orchestrate tech ical indicators and fundamentals and signals... and i did these things for learning and for fun. i built my own sublime text and obsidian replacement. i built my own reddit/twitter/hackernews/substack/news aggregator. i built countless other useful tools and utilities for me personally and for work I build more that empowers multiple departments.

Ive built 2 browser games, one already released to great reviews and 100k+ hours played. Ive built a tool on top of claude code that does ~60% of my job. Ive run data analysis on company financials for forecasting that have been refined and are producing very accurate predictions. Ive built competitive analysis tools and trackers.

All of this in 3 years. The projects are all clean, documented, with great code practices and modularity. A purist would surely consider some of the code slop. But it all works completely and fills real needs.

This is a huge shift. Anyone not realizing it yet is just simply behind the curve. I would not have accomplished 1/10 of this without AI coding. I went from copying code into and out of browser chats for 2 years before getting on the CLI train, and it is absolutely ridiculous the ROI you get from subscriptions to Claude or Codex.


It's going to be a typical saturation curve. A lot of upfront tokens spent on things that have stockpiled over the years, and then the derivative on token spend trends to zero as the users run out of immediate things to try. Sure there will be ongoing maintenance and experiments, but it wont be nearly as close as the initial inrush.

The "+20% velocity" framing misses what's actually shifting.

This is never going to materialise. It’s dead in under 2 years.

The market is shrinking and saturated already and it’s not because of AI gains but geopolitical instability and supply chain issues, some of which are caused by AI spending and stupid ass PE firms refocusing on AI supply chains.

Only our pensions and futures burning.


What do you mean by the market is shrinking?

Literally revenue is collapsing in most sectors. Technology purchasing is declining. Service models are failing to turn a reasonable ROI.

People stopped buying shit.


Wait do you have any numbers to back this up? Every number that I've seen contradicts this. Most sectors have positive revenue growth, even non tech sectors. Technology purchasing is increasing in every bucket (software, IT services, devices, communications, and of course DCs). Retail and food-service sales are up MoM and YoY. Personal consumption is up 0.2% in real terms. I assume by service models you're just talking about AI? I actually may agree with you but this is clearly not true for long if it is true today.

I'm reminded of that [terrifying in hind-sight] Newt Gingrich interview in which he was more concerned about his constituents feelings about things getting worse than any silly statistics provided by government agencies.

https://www.youtube.com/watch?v=xnhJWusyj4I


It's consolidating into fewer, higher value assets. Over 40% of the S&P500 is in companies that are heavily (potentially over) invested in AI.

tech companies have grown disproportionately to other industries, but that says nothing about the growth in other industries

- S&P has a Q1 2026 blended revenue growth of 11.3% according to FactSet - most sectors are growing, not just tech


This is ... not new at all?

App-bundling apps existed. Apple rejected them.

Low-code apps existed. Terminals existed. Apple rejected them.

LLM apps exist. Apple allows them, because they render text, pictures, and video, but they don't run arbitrary code.

Running arbitrary code is flatly forbidden, because users can't reason about them. I see absolutely no evidence that software is moving away from versions, any more than it was when apps could first search the internet, render recommendations, or deliver messages.



I see no reason those emulators wouldn't fall under the Blunded App rule, they "execute code" (interpret instructions).

It's just Apple enforcing their rules when they see fit...


Yes, but: writing code always teaches you something.

I've worked at founder-sized startups and $xxb dollar public companies. I've never read a product spec, a pitch deck, or a PRD that describes a solution that, if implemented in the way described, would solve the problem. Building the thing teaches you how it should behave.

Software is a complex, interactive medium. Iterating in the code, with people who understand the problem and care to see it solved, is the only way I've seen valuable products get created. Meetings and diagrams help, but it's not until you write some working software that you know whether you have something.


As they say (they being me), no design survives implementation.


Color me unsurprised.

Anthropic ran a weeks-long roadshow on how powerful Mythos is. They pointed to the danger, their controls, the capabilities, and practically begged the world to be scared of it.

Simultaneously, the current US regime realized there was a way to demand fealty from the AI labs. If they're so dangerous, don't we need to see them first? That will cost you, obviously. Standard extortion from the government, at this moment in time.

The labs get their marketing; the white house gets its pseudo-bribe. I hope nobody involved is confused about how we ended up here.


Yeah, I saw several instances of important folks taking the Anthropic promotional campaign too seriously and this is what they got in return. I'd say internally people are cursing whoever's idea that was because clearly scaring people backfired.


I would wager they're cheering, because this builds the moat they don't otherwise have. Want to do business in America? Get government-approved. Can't afford the regulatory fees, or your government won't let you submit to foreign programs? Good luck!


Yes, this has been a steady play from the start. From the skynet fears, to the safety fears, now the it's to powerful fears. All of these have been a play to get the government to lock out any smaller or foreign competitors and build a moat where there otherwise would be none.


What extortion are you claiming?

Are you claiming there will be a fee?


> What extortion are you claiming?

Universities: https://www.npr.org/2026/01/29/nx-s1-5559293/trump-settlemen...

Companies: https://news.bloomberglaw.com/esg/extortionary-intel-stake-s...

Law firms: https://www.lawfaremedia.org/article/the-law-firms--deals-wi...

Media: https://www.nytimes.com/2025/07/02/business/media/paramount-...

Why would AI companies be any different?

> Are you claiming there will be a fee?

I'd be more concerned with "your model can't be too woke" regulatory scenarios.


> I'd be more concerned with "your model can't be too woke" regulatory scenarios.

Honestly that's exactly where my mind went. We already see the current administration trying to censor free speech (e.g. Jimmy Kimmel, blocking/restricting press access to the White House unless you are pro-Trump).

I'm afraid of the potential to move in the direction of what we see in China where queries to LLMs referencing things like Tianenmen Square are censored (at best).


I'm not american but it seems like americans are MORE free to speak their minds now than before in terms of being banned/silenced by dominating online platforms


Not sure what you're trying to say here, but I get the sense you don't have enough information as to what's going on here (nothing wrong with that, I've been trying to tune out myself).

Regardless: "...an increasing number of travelers report being questioned about legally protected online speech when crossing the border."

https://www.nytimes.com/2025/12/31/travel/airport-border-pho...


I am saying it "seems like americans are MORE free to speak their minds now than before in terms of being banned/silenced by dominating online platforms".

Previously you could not post about certain subjects around covid, ivermectin, anything not in line with woke ideology, political scandals. Accounts would be banned. Communities were shut down on Reddit. High profile figures were banned from Twitter. Now you can post whatever you like on most of these platforms. That's not just me who thinks that by the way, it's a widely discussed thing. Zuckerberg even cited government pressure around content suppression and censorship. Do you disagree?


Got it. I'm now picking up on your point given the implied political bias that your comment hints at. I don't want to resurrect this long-dead debate as I take it you are not pro-science.


It sounds like you've brought baggage to the conversation. HN is not the place for that.


> High profile figures were banned from Twitter.

One of Musk's first acts as owner was to do that. https://x.com/elonjet (and the journalists reporting on it: https://en.wikipedia.org/wiki/December_2022_Twitter_suspensi...)

(After previously explicitly promising not to, citing… freedom of speech! https://x.com/elonmusk/status/1589414958508691456)

Bunch more: https://en.wikipedia.org/wiki/Suspensions_on_X#2023

> anything not in line with woke ideology

How did so many Republican politicians survive on there?!

> Now you can post whatever you like on most of these platforms.

As long as it isn't the word "cisgender". https://www.vanityfair.com/news/2023/06/elon-musk-free-speec...

Or a link to Mastodon. https://www.bbc.com/news/technology-63999452

> Zuckerberg even cited government pressure around content suppression and censorship.

To curry favor with the new admin, sure.

> Accounts would be banned. Communities were shut down on Reddit.

This happens regularly, today. https://www.reddit.com/r/BannedSubs/


Even with these examples, my comment still stands. I guess it could show how bad it was before!


The government literally removed research into trans-fats because of their transphobia. Much free speech


> I'm afraid of the potential to move in the direction of what we see in China where queries to LLMs referencing things like Tianenmen Square are censored (at best).

We are already there.

"Canva admits its AI tool removed 'Palestine' from designs: https://gizmodo.com/canva-admits-its-ai-tool-removed-palesti...


Or your model is not "woke enough"


I have very little patience for bothsidesism at this point.


Your side is not the only side.


Obviously.

"Bothsidesism" posits that the two sides are broadly similar. The last few years have debunked that concept pretty conclusively.


It also posits that there are only two sides.


In the US, that is functionally the case, and likely to remain that way.


Yes but OP made a good point (model censorship going too far in the name of "woke"ness) and you shut them down.


Yes, because it's disingenuous bullshit.

As it was with "campus protests violate free speech!" from the folks who immediately turned around and banned voluntary diversity programs at universities.

As it was with "Twitter bans violate free speech" from the folks who bought it and banned @elonjet and the word cisgender.


Are people not allowed to suggest models may be too censored? Is that idea censored?


Am I not allowed to suggest it's disingenuous bullshit to pull the "both sides" thing?


That's right. It was uncalled for. I see no evidence OP was making a bad faith argument, but you assumed that right away.


Because it's incredibly frustrating to see the government remove women and men of colour from government websites, deleting climate data, and sending out violent mobs to round people up while people sit around saying their main worry is that regulatory bodies will move to make things "too woke." There's no "woke" equivalent to the insanity being acted out by the US administration.


why? Most regulations (ADA, affirmative action, etc.) fall into the "not woke enough" category of model regulation. Current administration aside, complaints of this sort are more likely. It’s absurd, really, to believe there would be a regulation governing a model being too woke; regulation itself is woke


> Most regulations (ADA, affirmative action, etc.) fall into the "not woke enough" category of model regulation

For sake of argument, let’s assume this is true. Those rules are still structured as laws, with boundaries and legal recourse. The precedent being set, that the President gets “voluntary” deference from private companies, is un-American and will be abused by the left.


I don't think I'm smart/intellectual enough to respond to this

or i just don't understand what you're saying


The ADA isn't about wokeness. It's about being able to live in society with a disability.


Limiting a business's ability to exist because they can't afford to accommodate a small percentage of the population is 100% about wokeness


Well, we've just now defined safety rules, health codes, paying taxes, and the like as "woke".

You'd be calling the First Amendment woke if we proposed it now.


"Woke" is just a dog whistle. It's used by anti-intellectuals on the right to signal their allegiance to whatever their dear leader says, and is used to say "I am triggered by this idea" regardless of what the idea is. Anything can be woke to these people, up to and including the 2nd Amendment.


it is actually perfect, you hear or read “woke” you can immediately turn around and know the level of intelligence in front of you requires immediate extraction from any further proceedings :)


The terrifying thing is that we can't just ignore it anymore, though. These people are wielding the power of the world at a time of global crises.


it is the people that are buying the BS though… those wielding power will say whatever the F they want to rile up the people but end of the day it is the people buying this.

the outmost funny thing to me glancing at this from the sidelines (I stopped following news and am generally largely disconnected from “politics”) is that people that are buying the “woke” garbage are not realizing they voted for the most woke there is. entire current administration is DEI hires (this is literal, every single person working is DEI hire), cancel culture is raging, if they could they would cancel 50% of the population (this is especially funny to me, shows you how “small” people are), you got billions of dollars literally being stolen in broad daylight while bitching about nancy pelosi, you got two president’s sons pillaging the country while bitching about hunter’s laptop… so fascinating to watch america slowly disintegrate into what has become


Watching from the sidelines isn't disconnecting from politics, it's supporting the status quo. Please reconsider your position.


Taxes? How?

I agree with the rest, sure.

Health codes and safety rules are woke, yes. I would have thought that as given. Debates over where you draw the line are absolutely a matter of wokeness.

The way freedom of expression is regulated today is generally woke. The WPFI is insanely woke.


> Taxes? How?

They, at times, "[limit] a business's ability to exist because they can't afford to accommodate a small percentage of the population".

> Health codes and safety rules are woke, yes.

I take it you never read the parable of the boy who cried wolf.


getting the vibe that english isn't your first language, and don't feel like arguing theough a language barrier

i don't begin to understand either of these points

fwiw i'm woke, happy to be woke, encourage wokeness


> getting the vibe that english isn't your first language

Well, that's a first for me.

Alternate theory: https://news.ycombinator.com/item?id=48023653


Every business has a threshold tax rate beyond which it ceases to be viable. Herego, for every tax rate there are businesses right below that threshold that are taxed out of existence.


> Debates over where you draw the line are absolutely a matter of wokeness.

This is offensive in how trite it is


ADA preceeded wokeness by at least 2 decades


Nah. Worse is better.

The reason agents work is because they have access to stuff by default. The whole world is context engineering at this point, and this proposal is to intermediate the context with a bespoke access layer. I put the bare minimum into getting my dev instance into a state where I can develop, because doing stuff (and these days: getting my agent to do stuff) is the goal.

This makes slightly more sense if you're building a SaaS and trying to get others to give you access to their code, their documents, and the rest so you can run agents against it. But the easiest, most powerful way is to just hook the agents up to the place that's already set up.


They are building exactly what you described and this is their architectural solution to ensuring their YOLO agents do not nuke their customers code/documents/databases by sandboxing everything in the workspace — the git checkout the agent is working on, plus whatever's needed to run commands against it (compilers, package managers, etc.).


> figuring out if the company can afford this level of productivity at scale

This is the thing that boggles my mind. They spent their budget. They have 4 months of data. What do they have to show for it?

I'm not a hater; I'm not a luddite. I have a $200 Max plan and I use it.

But are you saying that Uber made this tool available, urged everybody to use it, and is confused about what happens when it worked? It's one thing if they decide AI isn't productive enough to be worth the cost.

Are they out of ideas on what to build next, or something?


The personal max and teams plan actually are an amazing bargain compared to the API PAYG cost you get with Enterprise. I guess they really need their Enterprise features though, otherwise they could just tell users to expense a $200 max sub. Enterprises gonna Enterprise.


Entreprise gets you the written agreement that the data you send to Claude will never be used for model training


It also gives the plan admins the ability to surveil in automated fashion what the company employees are prompting.


If I explicitly turn this off in Claude’s settings isn’t it the same thing?


Can they get 5000 people to do the same correctly on every reinstall and enforce it ?.

Small individual tasks become complex at scale , and that is why these enterprise contracts sell


You can set these policies at an org level in ChatGPT business and Claude team


- Claude Team plan is limited to 150 now[1]. You can mix between Premium (equivalent of Max 5x) and standard (Pro like)

- ChatGPT business is not user count limited (AFAIK) however business plan only supports the equivalent of Plus there is no Pro equivalent.

For larger organizations it will still come down to usage billing . The ChatGPT plan is quite limiting for most users and the only option is usage/credit billing for more consumption. Claude has the more generous usage in Team but limited to 150 users.

[1] it was only 75 till late Feb.


> What do they have to show for it?

My guess is nothing you can see right now, since it likely takes a lot longer for any substantial external-facing changes to roll out broadly. Internally I'm sure several features have moved faster. I've noticed this at Salesforce where it certainly seems like things that would have taken a few weeks take a few days now. This doesn't translate directly to more money, just more potential to make money.


What I don't understand is there are really good controls for spend, why on earth didn't they put caps on?

Or ask engineers to justify the spend?

Why should we spend that many tokens, what will that get us in return?

If this was AWS we'd all be pointing and going "Ahhhh you twats, didn't you look at your monthly spend?"


> I'm not a hater; I'm not a luddite. I have a $200 Max plan and I use it.

I'm glad to see we've reached the point of AI discourse at which anything that might be construed as criticism must be prefixed by "I'm also part of the cult, I'm not a non-believer, but" to avoid being dismissed as a heretic.


Since AI has become a partisan political football it makes sense


> Are they out of ideas on what to build next, or something?

Well, what is there for Uber to build next? They have their ride hailing platform. It works. They have adapted it for other kinds of delivery (food, groceries, "anything that fits in a car") What else is there in the "someone driving a car" space for them?


Car Fleets, professional freight logistics.

There is a lot of things to do in some driving a vehicle space . The other obvious business (that they exited) is self driving of course .


It's an honest surprise that this isn't spun as "internal AI efficiency gains." They want the efficiency, of course there's AI component, but they're not pre-claiming victory. Neat.

It's worth remembering that there's an _actual_ underlying economic problem here. Interest rates are up. AI spending is expensive. A dollar invested in a company needs to do _more_ than it did 5 years ago, relative to sitting in treasury bills. And Meta isn't delivering on that right now.

But IMHO: that's no excuse. This is admitting defeat, deciding to push the share price higher while they give up. Meta has the user data, the AI ambitions, the distribution, and the brand.

They could do anything, and the world is re-inventing itself. They're ... laying off people, maximizing profits, and giving up.

Cowards.


Layoffs are a very normal thing for businesses to do.

There is nothing "cowardly" about it.

Would you rather them never hire them in the first place?


> Layoffs are a very normal thing for businesses to do.

Didn't used to be, except in extreme circumstances. Was seen as a really bad sign.

To the extent there's "science" on this, it's a lot less clear than you might think that a policy of reaching eagerly for the layoff-button is long-term beneficial to companies, i.e. there's a good chance it's a cultural fad, you do it because "that's what's expected" and perhaps investors get skittish if you don't, for the circular reason that... that's what's expected.


People generally complain about the interview process being bloated while also not giving a good signal - is it then not better to hire people for a while, see if they perform and then letting them go again? Though perhaps in Meta's case they hire a lot while also having cumbersome interviews, I don't know. I just feel like there are perhaps some benefit in being quick to hire and fire.


What people dislike is the boom-bust cycle inherent to all levels of a market economy. During some years, these companies suck people up like a vacuum -- that can be bad if you're on the inside and all of a sudden the culture goes out the window, or if you're expected to onboard 3-4 people at the same time, or you end up with a reorg every quarter. Then, on the other end of the spectrum, companies shut down (non-backfill) hiring entirely and layoff huge percentages of the company, with no guarantee that you'll be safe just because you're doing a good job.

Human lives do not work like this. If you're getting married, if you have an unexpected hospital expense, if you want to buy a house -- these are not things that "market cycles" will plan around, but you have to.

Being quick to hire or fire is not the problem. Massive overhiring and massive layoffs are.


I don’t think the previous poster is saying all layoffs are “cowardly”, but pointing out that these ones are.

I think they have a point. Facebook is making money. Tech is in a very dynamic phase, right now. This is a moment of huge opportunity for them, and one that won’t necessarily be as large in the future.

To be contracting right now, rather than making a play, seems like a lack of leadership.


Not saying you are wrong, but you could argue they made their big move with the Metaverse. Then again with those crazy AI contracts to ML people.

Maybe Meta missed on those big plays and now there’s too much pressure to make another.

I don’t know if I believe that, but worth considering


Pressure from whom? Mark Zuckerberg personally controls the majority of voting shares. He can do whatever he wants.


yeah, these big layoffs don't add up to me right now.

if you're making money and you feel that these are good employees, why not take them off the core products and ship them to some other ambituous R&D proejct?

making core products leaner is probably a good, but surely there's some other big moonshot you'd like to take?


I'd say that a 10% culling of their workforce when they should be going all in on is not "very normal".

I don't think that those 10% of their workforce were keeping them back, to the contrary, now a big part of the remaining 90% will start wondering (if they hadn't already done so) when they'll be next, that is instead of focusing their minds on this AI-race thing.


Agreed. What happens when every company lays off 10, 20, 40% of their staff? AI Agents don't pay taxes and dont participate in a meaningful amount of the consumer economy.


AI isn't contributing to the layoffs though


It absolutely is. The funding for it, not the product itself.


> Would you rather them never hire them in the first place?

If it's not sustainable? Yes. They shouldn't have hired them in the first place then. Such a major round of firing (the second one in only a few months) shows a completely failing leadership.

I'm glad in Europe companies are much more conservative with hiring and firing. Because it's much harder to let employees go and there's strings attached.

Don't forget when you fire an employee you're giving them a lot of stress about their livelihood, you're externalising a lot to society. Internalise the profits, externalise the problems. Typical.

I'm so glad I don't live in the US and that things don't work like that here.


There's also a reason why there are no innovative companies in Europe. If you make it hard to fire someone you make it hard to hire someone.

Companies won't spin up risky projects if they can't spin them down. This is why Europe continues to fall behind the US and China.

Accepting the mediocrity is abdicating the leadership of the world to China. If you like that, good for you. But I doubt the low-growth, low-innovation world of Europe will make the next iPhone, AI, or chip.

Oh, and Europe can only do this stuff because of the USA military, by the way.


You heard it here first, Zuck and his peers are brave generals in the battle against the Y...Chinese Peril and we are all...cannon fodder, I guess.


Is Meta innovative?

They make products, sure, but output isn't the same as innovation.


You have brain poisoning from reading too much slop online.

>But I doubt the low-growth, low-innovation world of Europe will make the next iPhone, AI, or chip. >chip

Do you realize that the cutting edge in chip technology is a Dutch company


That obtained the cutting edge technology by buying an American company that had been founded to productize technology developed at an American defense laboratory based on a Japanese researcher’s work


You are forgetting 20 years and billions of dollars developing, in collaboration with research institutes like IMEC and funding from chipmakers like Intel, Samsung, and TSMC.

But it doesn’t fit your ideological narrative of how innovation functions so…


I am not the person you originally replied to. I have no ideological motivation here. I am merely pointing out ASML did not invent EUV, nor did they fund its initial development or the first decade or so of its productization. ASML employs plenty of scientists and engineers who did important work getting EUV to market, but your characterization implied that ASML single-handedly introduced a step-function increase in semiconductor fabrication technology from their labs in the Netherlands, and that is a misleading impression to give. It’s belied by the fact that ASML can’t even choose their own customers without approval from the U.S. government.


I believe that it's a bit more complicated than that especially if we look at the contributions of IMEC.

But irregardless I can hand you the point that you are making and then say that yours is a very tight standard that would not pass most of what passes for innovation in Silicon Valley.

The point I'm trying to make for the initial poster is that they are confusing "technological innovation" for money making. And yes you don't have a money printing machine in the EU, but you have A LOT of technological innovation that eventually goes to market through SV.


Where is the Silicon Valley of Europe?

I'll wait for your answer.


I think it’s to their credit that they don’t have one and instead got cern. A bunch of shitty crud apps made by mediocre rent seekers that got rich on tax avoidance, gov money + research, and low interest rates vs actual ground breaking research that benefits humanity. Silicon Valley is probably the worst thing that happened to humanity between the 2008 crash and Covid. People have been figuring it out but not after they have already given these scammers permission to inspect their wallets.


Wow. Can't tell if this is a parody or just very, very uninformed. Either way, good day. Hope you understand, one day, how technology has helped millions of people in many different ways.


100% agree with this. I think where it really went wrong is around 2005 with Google, Facebook etc. Adtech, engagement-driven social media.


Please use more mature language on this forum and engage in an adult way. People can disagree without being brain poisoned, kid.


No idea what Dutch company you’re alluding to. FWIW my guess for cutting edge chip tech is TSMC.


ASML


It's a 30-something day old account. An obvious troll. Just don't engage.


I'm tired of this American exceptionalism. Success is not only about money. It's about making a positive impact on the world for everyone. This is where the big tech companies deeply fail.

America poisons the world with pollution (eg pulling out of Paris Agreement), misinformation, promoting discord as 'engagement', unnecessary military engagements and screwing up the rest of the world.

And really, abdicating 'leadership' was already done by America by voting for Trump.


Billions and billions of people willingly use these products. Are you saying they use them because they are bad for them and they don't like them and don't value them?


No they use them because they are addictive, intentionally. But they reinforce negative emotional content because it is more engaging. This is what causes all the polarisation in society, what gave us Brexit and Trump. See for example the book Careless People, it describes how Facebook set up a huge consulting operation with the Trump campaign to get him into office.

But they are bad for society, it's not for no reason that a lot of countries are trying to ban them for the younger generations now, similar to smoking.


Exiting low performers is one thing, but using layoffs as tool to put pressure on your workforce to extract more labor and keep them busy is a toxic culture.


Toxic = green brokerage accounts for those in charge


It would also be green for everyone else's brokerage account.




A few% pump is only a salary replacement for the poly-millionaires. Not for "everyone else."


its not “normal” when companies have 10s of Billion in net profit per quarter

Axing low/negative ROI product lines, sure. But recently these cuts have been across-the-board and in product lines that are net profitable and have strong technical product roadmaps. Moreover they are firing longer tenured (expensive) engineers

I understand they’re managing a transition to a capital intensive strategy but the whole era reeks of stock price focused financial engineering and these large companies flexing oligopoly power in the face of their customers and the labor that builds their technology.


> Would you rather them never hire them in the first place?

It does seem like a lot of people would prefer this, they way they react to every layoff announcement.


It would be better because it would create a more diverse work space where multiple employers complete for employees, instead of one company playing musical chairs with people


Having fewer jobs available would result in more employers? I'm not sure how.


A few companies get almost all investments. They start a lot of projects fast and close the ones that don’t work

If companies stuck to fewer projects, money would be invested in other companies focusing on specific products, you get a lot of companies and not the market concentration you got today (which is responsible according to few economists to a lot of the us labor market dysfunctions it is currently experiencing)


Reducing your workforce always means you either made a strategic mistake, your bottom line is hurting, your growth is stagnating or you hired McKinsey (lol) not a good sign for company health and always bad for morale.


Literally not true. Some bets just don't work. If a company tries to enter some new market and fails, they may use a layoff.


The strategic mistake is that they don’t have any other good ideas to deploy these folks toward. A company of this size and financial condition in technology with exceptional leadership should not be out of good ideas.


I mean, no company ever has solved that problem soooo


Well Apple seems to be able to largely avoid these staffing whiplash problems…

I mainly call them problems because hugely scaling your org up and down on a whim is extremely inefficient when your recruiting and onboarding costs are high. Surely it’s more wise to repurpose the people you already have unless you have no time horizon on appropriate new areas of R&D.


Sounds like a strategic mistake.


"Some bets didn't work so let's destroy lives and cause needless suicides. It wasn't my fault, I was only following orders." - Random Meta VP of Customer Misery.


Because hiring people and paying them a salary is somehow hurting those people?


No but purposely forcing economic hardship on people when you're one of the most profitable entities on earth will always be a shitty thing. I'm sorry but treating workers like replaceable cogs is disgusting behavior and I'm not shocked that big tech routinely turns to anti-worker devices to enforce control.


Nobody is forcing anything to happen. People chose to work there. They get paid a HUGE amount of money. Now their projects ended or whatever.

What world do you live in? Suicide? Crazy talk.


I'm not sure the 3 version of your account is going to fare better than the last[0] if you don't find a better way to contribute to the community.

[0]: https://news.ycombinator.com/user?id=matchbok


I like how the account is just nothing but attacking workers while supporting the rich, classic VC mindset.


Attacking? Nowhere did I attack anybody. Explaining what a layoff is not "attacking". If you are offended by market economics I suggest you get thicker skin.


Solid contribution!


Man if you don't understand how unemployment leads to suicide you are truly a privilege individual. I honestly envy you now.


That does tend to be the more experienced management decision among firms who survived through the dot-com bubble.


found the ceo


With that kind of mindset… man, so sorry for you


Care to explain? Rather than these jugemental one-offs?


You are normalising layoffs in companies that are not losing money. If you are a regular employee, this kind of behaviour affects you, but hereyou are saying “it’s alright folks, it’s just business “. Sure thing these kind of layoffs are not illegal, but there must be something else in life than raw corporate behaviour when it comes to work, don’t you think?

The other scenario is that Meta doesn’t layoff people. The big fishes will make less money, but won’t affect their lives in the minimum. What about that? That’s not illegal either, but ofc, “that’s not how businesses work!”. So brainwashed. We are the frogs, they are boiling us and you don’t care


Layoffs mean a company doesn't have productive, profitable work for a set of people. The broader profitability of the entire company is entirely irrelevant. Should employee x subsidize employee y? That's nonsense.

Should a company keep someone on payroll and have them do nothing until profit reaches 0?


First of all if a company is profitable and has a number of employees and has no idea how to use them that’s a failure of leadership. The board should look for an executive team that knows how to use what it has.

Secondarily layoffs don’t happen the way you say: they are across the board and when you are talking of 10% of a company there is no real way of targeting the inefficient people. More than anything is fiscal engineering: you need x amount, you fire people and then you rehire 75% offering less equity and at lower levels imposing more work on the remaining employees


It's a failure, sure. But also a reality of every single company, ever. It's the nature of business.

And yeah, this approach to layoffs is sound. Been there, done that.


> The board should look for an executive team that knows how to use what it has.

I was thinking the exact same thing. This makes them look pathetic.

Meta is very selective in their hiring process. If they can't figure out how to use these incredibly talented and driven people, then that's a failure of leadership. How do they not have an enormous backlog of promising and interesting ideas to pursue?

They've got the cash, they've got the people, they just don't have any imagination or ambition. Better management would see the current situation is an opportunity, not a problem.


> profitable work for a set of people

I think this is essential to the disagreement in this little part of the discussion.

Ending a product line and laying off the people who worked on that product line aligns more to your "profitable work for a set of people" phrasing. But a great deal of tech sector layoffs happen as a blanket action, not targeted at specific products, teams, or roles. Business units are directed to find X% to cut. When the business is making money, these blanket actions can feel pretty unfair to the affected employees. The decision to lay off any specific individual could be completely disconnected from the value that individual provides to the business.


Should employee X subsidize employee Y? Yes! Ideally, companies should structure themselves in a way where that's not even a question; it would be weird to say my coworkers are "subsidizing" me when they keep working while I'm out sick or taking a vacation. You can't keep a money-losing org running forever, but your job should not be dependent on whether your utility right this second crosses some threshold.


> Layoffs mean a company doesn't have productive, profitable work for a set of people.

That's only one of many things layoffs can mean. In this case, Meta seems to be laying people off so that it can make a bigger bet on its AI programs (which I assume are deeply unprofitable right now) at the expense of other lines of business.


Sadly a lot of people see profit as the only incentive.


> Would you rather them never hire them in the first place?

Isn't the obvious answer yes for everyone that sells their labor?

If I gave you the choice between being an employee in an economy where it is more difficult to land a job, but you could be sure that job would last, or an economy where it is easier to find a job, but it was completely insecure, I think most would choose the former. No? Worring about finding work while looking, or worrying about it all the time? Seems obvious.


This is a very depressing and mediocre outlook on innovation and growth.

Based on your logic we should make it impossible to fire anybody. That surely will solve our problems, right?

I want a dynamic, innovative economy where anyone can find a job if they work hard. Not because the law says they can't be fired. How depressing.


I guess the issue with the first one would be actually getting the job. If jobs were that valuable, I'd expect other factors not necessarily related to job performance to be reasons in getting a job, especially knowing (or being related to) the right person.


No, of course not. How silly. As an employee who's been laid off a couple times I greatly prefer an economy where it's easy to find a job.


If it's easy to find a job why would I care if I'm laid off? Just get another job.


I'm guessing a lot of these large companies will have massive layoffs followed by slightly less massive re-hiring in 6 to 18 months.


Correction, the layoffs will be followed by massive re-hiring overseas in 6 to 18 months.

The domestic jobs aren't coming back.


why do we feel that way? it's becoming more and more likely that developments in AI lead to a K graph in experience / value - senior / self sufficient workers will be significantly more valuable than ever.

unless you mean that the quality of domestic workers is declining, which i'd agree in most things (tho for some things like software i think still has a chance)


I don't think the quality of US workers has to decline. The quality of workers in lower CoL places like India simply has to increase, and it has. Both of the companies I've worked for have opened India campuses in the past few years.


I hire for such companies and the quality of US workers vs foreign workers who move here on visas is much different. To be fair, foreign workers who move here on visas tend to be the rich and highly educated of their own country and US workers are more distributed across SES. They also have more education on paper bc they usually need a masters or more to be eligible to work here


The compensation of software tech (especially Silicon Valley) has also gotten much higher over the past number of years in the US compared to disciplines requiring the same level of education/experience both is the US and even Western Europe. I expect this will equalize with outsized tech salaries becoming a thing of the past except for a few individuals with proven track records.


I mean, the same can be said for consulting salaries, HFT salaries, hedge fund salaries, etc., which similar to software engineering only require a bachelor's and have a similarly grueling interview process.

Why would this equalize? As long as software companies make huge profits and have growth capability which the top ones clearly do, what change would make this happen?


Some software companies are making huge profits today. Many software jobs are at companies making returns comparable to other engineering job profits. There's also a supply side. If the market is flooded with a lot of people in it mostly for the money, salaries will supposedly shrink.


we've seen that most of the people who are only in it for the money don't actually bring much value to the company. a lot of middling software engineers are actually a liability. unlike operational work, engineering needs to have a higher bar than just a beating heart and hands


Hot take: their quality is possibly a reason these people were unable to leave their country in the first place.


Too simplistic of a hot take. People have families and other reasons _not_ to emigrate. I also know people who moved to big tech companies in the states, worked there for a number of years and then went back home to “emerging countries” to be closer to their roots.


>it's becoming more and more likely that developments in AI lead to a K graph in experience / value - senior / self sufficient workers will be significantly more valuable than ever.

I don't buy this at all, this narrative feels like pure cope to me. The skill ceiling for working with AI tooling is not that high (far lower than when everyone had to write all their code by hand, unquestionably). To me it seems far more likely that software engineering will become commoditized.

I'm sure everyone posting about the supposed K graph believes that they're on the valuable side of it, naturally.


American workers got uppity. Forgot their place. Started protesting company decisions and wouldn't return to office. Hiring may eventually come back but not any time soon. Workers need to be chastised first.


Offshoring has been a common practice for decades, it works great for some functions and not great for others. Why would it suddenly have a massive uptick in 2027?


Meta has done several rounds of such layoffs since the post COVID interest rate hikes and they do not have a larger employee presence abroad since then.

They also, unlike a lot of their cohorts in FAANG, don't have a significant engineering presence in India and it hasn't rapidly grown since COVID either.


I’m curious why this meme is so sticky. In the early 2000s people were also panicking that all the software jobs were going to India and never coming back. It was so pervasive it made the cover of Wired magazine, but it never happened. Why is this time different?


The reason it never happened wasn't that MANY jobs went off-shore (they did) but that the pace of this paled in comparison to number of new jobs that were opening up on-shore. Now that we are seeing demand stall on-shore this is going to hit the front more-so than before. Many layoff news later come with "oh by the way, we also hired x,xxx people off-shore. I think has generally been overblown but I think it is a thing if someone actually wanted to run "America First" campaign and actually mean it, to outlaw or make off-shore development cost-prohibitive. I work on a project in a company that employs now about 1k people and over 40% of that workforce is off-shore. Just about every colleague I have (DC metro area) that works at another joint is in the same spot (or much worse, like CGI etc which doesn't even have developers on-shore anymore...)


Maybe it did happen, but the expansion of broadband internet, and then mobile broadband internet, caused an enormous demand for additional and different types of programmers that was unable to be satiated by people outside of the US.


It "never happened" only in aggregate, which is sometimes irrelevant and always hard to see for an individual employee who's worried about their individual career. IBM had 150,000 US employees in 2000 and 50,000 today.


>Why is this time different?

The humiliation of all of the disastrous failures has been lost to history and PMC are once again bullish about their cost cutting genius.


Remote coordination tools are no longer utter dogshit.


Sure, but there's no getting around how terrible it is to communicate and coordinate between PST and IST. One of the divisions I currently work with operates in a model where the "drivers" are all in the US and there's a large IST-based team that "executes". It's ... not great, and nobody on either side of the equation likes it. And all the people involved are very smart! But it really does matter, and we're seeing a lot of things move far slower than initially thought.


Why are people so focused on India when it comes to outsourcing?

US dev salaries are so much higher than the rest of the world that basically you could hire anywhere in Europe and still save most of the cost per person.

You could go to LATAM if you want the same timezone.

On the corollary, salaries of capable Indian developers have certainly caught up to most Western countries, so that you wont be saving much per person.


AI: actually an indian

Seen in foreign workers remote driving ai cars, foreign workers training ai robots, etc etc


Not buying it personally, I think this is the start of a slow unwinding.

AI won't replace everybody overnight, but it'll make 10% layoffs year after year a real possibility.

Either people are simply made redundant because bots in the hand of a bot wrangler can do much of their work, or people are relatively less efficient than their peers because they refuse to adapt to a world where AI is a force multiplier.


Not going to argue about what will or will not happen (predictions are hard, especially about the future), but you absolutely don't need AI to explain layoffs at Meta. On one hand they have a failed investment in Metaverse and an underwhelming attempt to participate in AI race. On the other hand they have a stable advertising business that doesn't need much innovation, but can always benefit from some cost cutting


I think this is broadly correct too.

They obviously biffed it by hiring for a bad moonshot when the pandemic money printers were turned on, and now they have plenty of belt tightening to do.


Also doesn't help that nobody can say how many people it needed to develop and maintain software even before AI. Elon declared the emperor had no clothes.


He really didn’t tho. X was constantly breaking and falling apart in his hands, so he repackaged it in xAI where he got a bunch of money to hire a bunch of engineers to develop features and keep it running. It’s still not profitable. But people have no critical thinking skills so they haven’t noticed this


I'd argue Twitter not breaking down after layoffs is good for the industry. It means you can roughly see investment in software as capex - once it's built, it's built.

You still need engineers to innovate though, but industry has no idea what innovation still makes sense except, maybe, AI. That's why everyone is investing in it, there are just not many other places to invest.


Did he really? X is constantly more buggy than Twitter ever was.

Right now they have a bug where post appears duplicated as a reply to itself (you can tell it's a bug because liking one automatically likes the other).


The obvious problem is that you can't run a consumer economy without consumers. No one cares about warehouse robots if no one has the income to buy what's in the warehouses.

For "no one" substitute "more and more of the working population."

I suspect oligarchs believe they can automate their way out of this. The little people will be surplus to requirements, and measures will be taken to eliminate most of us in due course.

But the manufacture of everything is both global and industrial. You need to run things at a certain scale.

Even if we had AGI tomorrow there's still a huge gap between where we are today and a hypothetical low-population global post-AGI robot economy.

And if burn through that straight into ASI no one knows - or likely can even imagine - what that would look like.


but why rehire at all? if AI is even half as competent as they say it is, then they don't need all those employees. Afterall, some of the latest models are passing the GDPW benchmark with flying colors. wouldn't it make sense to just keep laying off more and more and replacing it all with AI?

I think there's a big disconnect between how competent the AI crowd says it is vs reality.


It depends what your company does. In my case we are double our output and probably will be triple by summer. We are building new adjacent products and more complex features. Smoking our competition. So they better keep up or we will eat them. We let go of one person in the fall who just couldn't work this new way. Our head count is going to stay the same or go up by one more hire in the next few months. We are a dev/qa team of five people now, do billing systems...


Do people in the US enjoy that kind of bullshit? I'm not saying we have to go back to the days when people worked for a company all their life. But this constant chaos, fear and looking at job offers can't be good for morale.


> But this constant chaos, fear and looking at job offers can't be good for morale.

Definitely makes it harder to make long term plans/commitments. It was tolerable at least when the market was decent, ie, if you were reasonably good at what you did you could be confident about landing a new role before your severance ran out (typically within a couple months-ish). If this current state of the tech market is the new normal, where it takes many months of searching to land something, that alone will likely cause many to reconsider this field, I think.


> They're ... laying off people, maximizing profits, and giving up. Cowards.

To play devil’s advocate, what they’re doing is not remotely cowardly, it is the entire point of their existence

They have a lever they can pull that will increase profits and the stock price. Why the hell else does a company like Meta even exist? It sure as hell isn’t to provide jobs to meat bags, and anyone that thinks it is needs a very quick lesson about the real world.


They are maximizing profits this quarter at the expense of profits every future quarter.

That's not at all the point of a company's existence. That's what a few companies do, for a short time, if they think they have no place to go but down.

That said, IMO they are right...


> They are maximizing profits this quarter at the expense of profits every future quarter

Oh sure, but the MBAs running stuff don’t care about that. Their bonuses are tied to the now, so the system has optimized for that.


This makes a good point. A lot of people think that big tech has a duty to provide jobs to smart, ambitious people.

They assume that we live in some kind of socialist system. They feel like it's a kind of deal; they accept all the regulations, monopolies bureaucratic bullshit and, in return, the corporate monopolies pay them to keep quiet and stay out of politics.

I understand the sentiment but what's horrible about this mindset is that these people think it's OK to support corrupt political power to enrich themselves at the expense of everyone who doesn't work for a big corporate monopoly. They think that all the smart people work for big tech and everyone else is trash... And they set the criteria for entry into the big tech monopoly club (I.e. screenings and interviews). But the irony is that they're trash! Their pseudo-socialist view of the word is crooked.

The reason I support UBI is because I don't see a meaningful difference between ambitious people and random people. Every generation from boomers onwards are spoiled brats. Mostly monetizing and gatekeeping the ingenuity and labor of past generations by playing dumb social games. The whole system doesn't make sense. As meritocracy declines, the rewards increase and false narratives fill the gaps... They'll have you believe that the person who painted Facebook HQ's walls contributed more to society than the guy who actually invented the paint...


It isn't good optics at the moment, or good politics, for a company to loudly proclaim "we're firing people because of AI taking their jobs".

That doesn't mean that's what happened, it only means that whether or not its true, most companies aren't going to say it. The few that have said anything of the sort have suffered some backlash, and they aren't even as prominent as Meta or Microsoft (which also just announced plans to reduce by ~7% through buybacks, the first in their > 50 years) And this is on top of their decline to ~210,000 employees after 2025 firing of 15,000.


It's probably not fun for executives to admit "we overhired and invested in the wrong things" either.


Didn't Square do that a couple weeks ago?


this seems a little hyperbolic without knowing details. they probably already cut around 5% every year for performance anyway (their performance reviews probably just came out). i could pretty easily see the rest of the reduction being unprofitable businesses like VR that they don't want to invest in anymore, it might not be due to AI at all


Given facebook/Zuckerberg’s history it’s tough to give them the benefit of the doubt. From day one it’s been ruthless, harmful ambitions and business practices. It is a bad company that does bad things.

They also burn capital at insane rates on projects nobody wants then fire everybody involved (see: the metaverse, the very reason they rebranded to that dumb name)


I can pretty much agree with everything you said in the first line

but for the second, I guess I don't consider that terrible? they make risky bets, pay people tons and tons of money to try them, then if it doesn't work out they shut down the projects and let the people go? that feels like every startup except the employees actually get compensated. if that's driving the extra layoffs, it's hard to feel too bad for people who have probably been paid millions already


have any of their risky bets paid off though? most of their main products have been acquisitions.


who cares? I'm saying the people that take the jobs for the incredibly risky bets (and everyone knows what is risky) understand the tradeoff--if the bet doesn't work their job is at risk. In the meantime they get paid millions of dollars. That seems like a fair situation to me


I think we should care about poor stewardship at the top of a major company.


You make fair points there. I think what bothers me is that they can be so irresponsible with money/their projects, but still somehow manage to make very high margins, and yet they continue to just lay off thousands at a time like this repeatedly. There doesn’t seem to be any logic to it other than typical “number go up” nonsense.

The fact is Facebook had serious red flags going up that the AI boom has papered over (for now?) as well. They don’t make a lot of sense to me.

I don’t know how to tie this all together to be honest. It’s a lot of feelings/emotional response. But frankly it just feels cruel how they treat their employees and our society, so it colors my perception of everything they do.


meta has laid off 34,800 people in just the large scale rounds we know about in the past 5 years.

they're growing at high teens % a year and have record profits and a centi-billionaire has complete control. whats going on there is gross, even compared to the finance world of yearly culling of the bottom few % its gross.

There are a few US companies that crossed beyond the carelessness of us work culture to flat out hostile and metas one of them.


Literally, what else can they possibly do that hasn't been done? there's just limited opportunity.


I agree. A lot of people have an unspoken assumption that there are unlimited amounts of positive EV investments for any given company to make. This also underpins the extremely common idea that dividends and buybacks are always happening at a direct cost to growth and R&D.


Meta has Facebook and Instagram, and Facebook has been slowing down for a while. Everything else is neutral, a net loss, or not very significant.


> It's an honest surprise that this isn't spun as "internal AI efficiency gains."

Meta is working on "personal AI that will empower you". Saying they are firing people because of AI would be a bad marketing move.


Facebook is of course a company that had ONE idea, which wasn't even original - trick people to use the service and then use their data in inappropriate ways. I believe their original business plan was "People just submitted it. I don't know why. They 'trust me'. Dumb fucks."

They scaled that idea, made a lot of money doing it because of course, bought up a bunch of companies who themselves had original and ethical ideas. But they were never allowed to shine brighter or step out of the shadow that is Facebook, who still believes their customers are "dumb fucks". That never changed and Facebook's current customers, employees, shareholders, and targets of acquisitions need to remember that and never kid themselves about who Facebook is.


When is it ok to lay people off?


Laying off 10% of your workforce at a company this size means someone high up has been making some pretty significant mistakes.

So the answer is, when an executive is held accountable for disrupting this many people's lives. When they claw back bonuses they have probably received for hitting or setting those previous hiring targets.


Laying off 10% of your workforce at a company this size means someone high up has been making some pretty significant mistakes.

Why must a mistake have been made, as opposed to just changes in the market? Doesn't this presuppose that people are entitled to keep their job as long as they want to, and if the company no longer needs them, it's a violation of that right?

And even if it's because the leaders of the company misjudged something, I'm unclear how that means that employees who were laid off have had some great injustice visited upon them.

I got laid off from Block a little over a year ago, and I wasn't salty about it at all. They paid me millions of dollars over the years I was there, they gave me great severance, and I don't view myself as entitled to be able to sell my labor to them, just as I don't view them as being entitled to buy my labor. I wouldn't have felt bad ending my employment if it was best for me, why should they feel bad for doing the same?


> Why must a mistake have been made, as opposed to just changes in the market?

If you're high up at a company like Meta, you likely have a compensation package worth millions a year.

The question is what are they being paid for if not to be "better" at steering the ship than others? They always tell us they are brilliant leaders who bring more value to the company than others could or would.

So if they're just following the market like everyone else, and having to react with large reversals, then to me, it starts to poke some pretty large holes in this idea that they are somehow the best of the best. It starts to look like their only real skills are self-promotion and career advancement. Not because they're better at operating the company, but because they're better at office politics.

This is nothing new of course, this is the way most organizational structures have worked since the dawn of time. The people with power are given deference and privilege commensurate with being elite, but really they're just average at doing their actual job and kind of guessing their way through it. I'm not saying Meta is special or uniquely culpable for this mistake here. I'm saying it's a sad fact of life and maybe, just maybe, if we all start saying out loud this truth, that this is something we could change as a society.


class war is the only answer ever given


BIG FAX


Imagine a world where people could just be happy with returns on investments. Even treasury bills.

Can't we all just be happy?


If the richest people in the world are chronically unhappy then that indicates that excess wealth does not bring happiness.


It's more that the psychologically broken people who are also somewhat lucky and intelligent and hard-working end up being those "richest people" - they almost all have some kind of impostor/self-esteem issue. Pretty sure there are a lot of anonymous people with $25M net worth who are happily out rock climbing, traveling, etc.


It must be true what Schopenhauer said: "Wealth is like sea water; the more we drink, the thirstier we become."


If you make 900,000 but your rent and healthcare are 850000, how rich are you?


As a pratical lens on this advice: people are excellent at giving feedback on their problems. They are terrible at identifying how to fix it.

"It felt too long" was right. The solution was not to make the story shorter. The solution was to look at the parts that felt long, and believe that feedback.

If you're building something, and your users tell you it's complicated or it's slow or it's not useful, they're right! The fix may or may not be to make it simpler, faster, or more useful. Maybe it needs to be organized better, or to create deliberate moments of action, or to be used at a different time. The problems are real, but the obvious solutions are not always right.


I’ve heard exactly the same advice re: focus groups. A focus group can give excellent feedback but terrible advice. Probably applies to comment sections in the modern day too.

So if they didn’t like your movie the movie probably is bad. But don’t listen to them about what they would change about the movie. They don’t know anything about the creative process.


A phrase I heard from a tv writer on a podcast was "note behind the note".

The gist of the conversation was about TV execs giving all sorts of bonkers notes all the time that are usually terrible. This writer tried to think about what might have triggered the exec to make a note. Maybe the characters are not engrossing enough, or the plot is too complex, or the dialogue isn't snappy enough. If the exec had been engrossed in the story they wouldn't have made a note. This writer rarely implemented any note from an exec, but did make all sorts of changes in and around noted sections.


This reminds me of the infamous Sid Sheinberg memo to Steven Speilberg and Bob Zemeckis on changing the title of “Back to the Future”.

https://imgur.com/gallery/producers-memo-to-speilberg-during...

“Behind the note”, it’s about emphasizing the goofy fun of the film, rather than the genre elements, and in that it’s right on.


I would never have gone to see a spaceman from Pluto.


Based on how “Pluto Nash” performed: that’s a deeply and widely held sentiment.


The strength of a focus group is (or should be, anyway) that it's representative. It makes sense that their overall reception of a work is a more accurate estimate of its eventual popularity than the maker's.

However, the maker has tried many things, and among them will be things which are obviously bad (to anyone) if you actually try it.

Story time: in 2008, I went to the big board game fair in Essen and got to try the then-new game Dominion. I think most people who did, knew that this game was going to be hugely popular and influential, which it was. Donald X. Vaccarino is a really, really good game designer. And sure enough, it spawned the genre of deck building games, games where you build a deck as you play (as opposed to collectible card games, which are an important ancestor). But the first few attempts to adopt improve on the formula were pretty lousy.

What's interesting is that Donald X. posted dev diaries, writing at length about what he had tried and rejected. And although I'm pretty sure he did not follow the Dominion-likes closely (the dev diaries may even have been written before many of them), the things he'd tried and rejected were exactly what the Dominion-likes tried to add as their twist. Multiple currencies, like Thunderstone had, he'd tried rejected because it was too high variance. "Pick one of the cards on offer" like Ascension had, he'd also tried first, and found that the game was deeper and more fun if everyone had access to buy the same cards. (The "Pick one of three" mechanic would turn out to work much better in solo/computer games, however, as Slay the Spire's success is proof of!)


This is true for casual users, but if you're getting feedback from enthusiasts or even experts, their solutions are often -- not always, but often -- quite good.


Yes. People who live and breathe your product should absolutely be listened to. Especially when they don't have the super-user tools you do for support (or unconsciously rounding off sharp edges).


But they may not be a very representative user. Especially for things like games, they may be far removed from what originally got them into it.

Fan-modded games are often great fun if you're seriously into a game. But they're rarely better if you've never played the game.


It's true. There's probably not a clean rule for when to listen and when not to.

I would propose as a heuristic that for the early stage, when your product relies on true-fans, you at least consider the content of a complaint more and don't just treat it as a signal that something somewhere is wrong. Bringing it back to the OP, I'm sure this is what Card did himself.


> "It felt too long" was right. The solution was not to make the story shorter. The solution was to look at the parts that felt long, and believe that feedback.

smells like LLM


> Smells like LLM

Smells like not adding value to the discussion.


LLM writing is not adding value to the discussion.

The slightly inflated rhetoric whose tells are false contrast and unnecessary parallelism let me know that a human did not spend time writing that comment. Why should humans spend time reading it?


I read the same thing and didn’t get that vibe. The parent comment added to the discussion regardless of origin.


lol sorry, I just spend to much time on LinkedIn, I think. I promise this was 100% human-written.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: