It is deeply offensive to the serious players of the game to suggest Mornington Crescent is "made up". Yes, to neophytes it can seem random and unstructured, but it is preposterous to suggest a game with such a lineage is fictional.
The Mornington Crescent Players Association (MCPA, often lovingly referred to as The Scottish Father) unanimously voted through the Flodden amendments last year. The Mornington Crescent Rules Committee (not to be confused with the Rules Committee of Mornington Crescent) will be voting on the topic on December 25th, whereupon it will be passed to the International Board for ratification.
The only controversial point is that it will be applied retroactively over the last decade, changing the results of no fewer than three world championship matches.
> There are decisions you didn't realize you needed to make, until you get there.
That is the key insight, and the biggest stumbling block for me at the moment.
At the moment (encouraged by my company) I'm experimenting with agent usage for coding that is as hands-off as possible. And it is _unbelievably_ frustrating to see the agent get 99% of the code right in the first pass, only to misunderstand why a test is now failing and then completely mangle both its own code and the existing tests as it tries to "fix" the "problem". If I'd just given it a better spec to start with, it probably wouldn't have started producing garbage.
But I didn't know that before working with the code! So to develop a good spec I either have to have the agent stop constantly so I can intervene, or dive into the code myself to begin with. And at that point I may as well write the code anyway, since writing the code is not the slow bit.
And my process now (and what we're baking into the product) is:
- Make a prompt
- Run it in a loop over N files. Full agentic toolkit, but don't be wasteful (no "full typecheck, run the test suite" on every file).
- Have an agent check the output. Look for repeated exploration, look for failures. Those imply confusion.
- Iterate the prompt to remove the confusion.
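The loop above can be sketched roughly like this. Everything here is a hypothetical stand-in: `run_agent`, `review_transcript`, and `refine_prompt` are stubs for whatever agent harness and reviewer you actually use, and the "confusion" signal is simulated rather than real.

```python
# Minimal runnable sketch of the prompt-iteration loop described above.
# run_agent / review_transcript / refine_prompt are hypothetical stubs;
# here they simulate "confusion goes away once the prompt covers the
# case the agent kept tripping over".

def run_agent(prompt: str, path: str) -> dict:
    # Stub agent run: "fails" on files the prompt doesn't yet cover.
    return {"path": path, "failed": "edge case" not in prompt}

def review_transcript(transcript: dict) -> dict:
    # Stub reviewer agent: flag failures / repeated exploration as confusion.
    return {"path": transcript["path"], "confused": transcript["failed"]}

def refine_prompt(prompt: str, confused: list[dict]) -> str:
    # Stub: fold whatever confused the agent back into the prompt.
    paths = ", ".join(c["path"] for c in confused)
    return prompt + f" Handle the edge case seen in: {paths}."

def iterate_prompt(prompt: str, files: list[str], rounds: int = 5) -> str:
    for _ in range(rounds):
        # One cheap agent run per file (no full typecheck/test suite each time).
        reviews = [review_transcript(run_agent(prompt, f)) for f in files]
        confused = [r for r in reviews if r["confused"]]
        if not confused:
            break  # no failures or repeated exploration left
        prompt = refine_prompt(prompt, confused)
    return prompt

final = iterate_prompt("Migrate this file to Vue 3.", ["a.vue", "b.vue"])
print(final)
```

The point of the sketch is only the shape of the loop: cheap per-file runs, a second agent scoring the transcripts for confusion, and the prompt absorbing whatever confused the first agent.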
First pass on the current project (a Vue 3 migration) went from 45 min of agentic time on 5 files to 10 min on 50 files, and the latter passed tests/typecheck/my own scrolling through it.
It was not a CPU hog; that's a myth that needs to die. The Flash runtime itself was pretty modest.
Now, the code people wrote in it was a CPU hog, because lots of non-coders were writing code and they would do anything to make it work. The Flash runtime was not what caused Punch the Monkey to peg your CPU; the Punch the Monkey ad was fucking awful code.
All those Flash programmers went on to write the first wave of HTML5 stuff which, shock horror, was vastly CPU-inefficient.
Yeah, that's what I'm trying to explain (maybe unsuccessfully). I do know backprop; I studied and used it back in the early 00s when it was very much not cool. But I don't think that knowledge is especially useful for using LLMs.
We don't even have a complete explanation of how we get from backprop to the emergent abilities we use and love, so who cares (for that purpose) how backprop works? It's not as if we're actually using it to explain anything.
As I say in another comment, I often give talks to laypeople about LLMs and the mental model I present is something like supercharged Markov chain + massive training data + continuous vocabulary space + instruction tuning/RLHF. I think that provides the right abstraction level to reason about what LLMs can do and what their limitations are. It's irrelevant how the supercharged Markov chain works, in fact it's plausible that in the future one could replace backprop with some other learning algorithm and LLMs could still work in essentially the same way.
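To make the "supercharged Markov chain" part of that mental model concrete, here is a toy word-level Markov chain. This is an illustrative sketch only: real LLMs condition on thousands of tokens through learned weights in a continuous vocabulary space, not a bigram lookup table, which is exactly the gap the "supercharged" qualifier hides.

```python
# Toy bigram Markov chain text generator: the un-supercharged baseline
# of the mental model above. The corpus and function names are made up
# for illustration.
import random
from collections import defaultdict

corpus = "the cat sat on the mat the cat ate the rat".split()

# Transition table: word -> list of words observed to follow it
# (duplicates preserved, so sampling reflects observed frequencies).
transitions = defaultdict(list)
for prev, nxt in zip(corpus, corpus[1:]):
    transitions[prev].append(nxt)

def generate(start: str, length: int = 8, seed: int = 0) -> str:
    """Sample a word sequence by repeatedly picking an observed successor."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length - 1):
        choices = transitions.get(out[-1])
        if not choices:  # dead end: word never appeared mid-corpus
            break
        out.append(rng.choice(choices))
    return " ".join(out)

print(generate("the"))
```

The useful intuition the toy version carries over: the model only ever produces continuations that are plausible given its training data, and it has no notion of truth, just of what tends to follow what.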
In line with your first paragraph: probably many teens who had a lot of time on their hands when Bing Chat was released, and enough critical spirit not to get misled by the VS, have better intuition about what an LLM can do than many ML experts.
I disagree in the case of LLMs, because they really are an accidental side effect of another tool. Not understanding the inner workings will make users attribute false properties to them. Once you understand how they work (how they generate plausible text), you get a far deeper grasp on their capabilities and how to tweak and prompt them.
And in fact this is true of any tool: you don't have to know exactly how to build one, but any craftsman has a good understanding of how the tool works internally. LLMs are not a screw or a pen; they are more akin to an engine, and you have to know an engine's subtleties if you build a car. Even screws have to be understood structurally in advanced usage. Not needing to understand the tool is maybe true only for hobbyists.
Could you provide an example of an advanced prompt technique or approach that one would be much more likely to employ if they had knowledge of X internal working?
None of this is about an end user in the sense of the user of an LLM. It is aimed at the prospective user of a training framework that implements backpropagation at a high level of abstraction. As such, it draws attention to training problems which arise inside the black box, in order to motivate learning what is inside that box. There aren't any ML engineers who shouldn't know all about single-layer perceptrons, I think, and that makes for a nice analogy to real-life issues in using SGD and backpropagation for ML training.
The post I was replying to was about "colleagues, who are extremely invested in capabilities of LLMs", and then mentions how they are uninterested in how LLMs work and interested only in what they can do and in the societal implications.
It sounds to me very much like end users, not people who are training LLMs.
The analogy is: if you don't understand the limitations of the tool, you may try to make it do something it is bad at, and never understand why it will never do the thing you want, despite looking like it potentially could.
The Bitter Lesson is that, with enough VC-subsidised compute, those things are useful.