Haha, I like to joke that we were on track for the singularity in 2024, but it s...

ai-christianson · 2025-10-24T17:01:45 1761325305

There's massive hardware and energy infra built out going on. None of that is specialized to run only transformers at this point, so wouldn't that create a huge incentive to find newer and better architectures to get the most out of all this hardware and energy infra?

Mehvix · 2025-10-24T17:07:34 1761325654

>None of that is specialized to run only transformers at this point

isn't this what [etched](https://www.etched.com/) is doing?

imtringued · 2025-10-24T17:18:33 1761326313

Only being able to run transformers is a silly concept, because attention consists of two matrix multiplications, which are the standard operation in feed forward and convolutional layers. Basically, you get transformers for free.

kadushka · 2025-10-24T18:28:50 1761330530

devil is in the details

Davidzheng · 2025-10-24T19:54:12 1761335652

how do you know we're not at recursive self-improvement but the rate is just slower than human-mediated improvement?