Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That number is as real as the 5.5 million to train DeepSeek. Maybe it's real if you're only counting the literal final training run, but total costs including the huge number of failed runs all other costs accounted for, it's several hundred million to train a model that's usually still worse than Claude, Gemini, or ChatGPT. It took 1B+ (500 billion on energy and chips ALONE) for Grok to get into the "big 4".


Using such theory, one can even argue that the real cost needs to include the infrastructures, like total investment into the semiconductor industry, the national electricity grid, education and even defence etc.


Correct! You do have to account for all of these things! Unironically correct! :)


That's baked in to the cost of the hardware though.


the china government has been heavily subsidizing the electricity



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: