tobyhinloopen · 2026-01-22T19:19:07 1769109547

I had to read it twice as well, I was so confused hah. I’m still confused

rtkwe · 2026-01-22T19:30:26 1769110226

They probably organize individual accounts the same as organization accounts for larger groups of users at the same company internally since it all rolls up to one billing. That's my first pass guess at least.

tobyhinloopen · 2026-01-22T19:18:19 1769109499

So you were generating and evaluating the performance of your CLAUDE.md files? And you got banned for it?

Aurornis · 2026-01-22T19:50:39 1769111439

I think it's more likely that their account was disabled for other reasons, but they blamed the last thing they were doing before the account was closed.

pocksuppet · 2026-01-22T20:28:08 1769113688

And why wouldn't you? It's the only information available to you.

alistairSH · 2026-01-22T19:23:46 1769109826

It reads like he had a circular prompt process running, where multiple instances of Claude were solving problems, feeding results to each other, and possibly updating each other's control files?

Hackbraten · 2026-01-22T20:49:29 1769114969

They were trying to optimize a CLAUDE.md file which belonged to a project template. The outer Claude instance iterated on the file. To test the result, the human in the loop instantiated a new project from the template, launched an inner Claude instance along with the new project, assessed whether inner Claude worked as expected with the CLAUDE.md in the freshly generated project. They then gave the feedback back to outer Claude.

So, no circular prompt feeding at all. Just a normal iterate-test-repeat loop that happened to involve two agents.

epolanski · 2026-01-22T19:27:14 1769110034

What would be bad in that?

Writing the best possible specs for these agents seems the most productive goal they could achieve.

NitpickLawyer · 2026-01-22T19:46:35 1769111195

I think the idea is fine, but what might end up happening is that one agent gets unhinged and "asks" another agent to do more and more crazy stuff, and they get in a loop where everything gets flagged. Remember that "bots configured to add a book at +0.01$ on amazon, reached 1M$ for the book" a while ago. Kinda like that, but with prompts.

epolanski · 2026-01-22T19:49:30 1769111370

I still don't get it. Make your models better at handling this far-fetched case; don't ban users for a legitimate use case.

alistairSH · 2026-01-22T21:12:32 1769116352

Nothing necessarily or obviously bad about it, just trying to think through what went wrong.

andrelaszlo · 2026-01-22T19:46:19 1769111179

Could anyone explain to me what the problem is with this? I thought I was fairly up to date on these things, but this was a surprise to me. I see the sibling comment getting downvoted but I promise I'm asking this in good faith, even if it might seem like a silly question (?) for some reason.

alistairSH · 2026-01-22T21:18:16 1769116696

From what I'm reading in other comments, the problem was Claude1 got increasingly "frustrated" with Claude2's inability to do whatever the human was asking, and started breaking its own rules (using ALL CAPS).

Sort of like MS's old chatbot that turned into a Nazi overnight, but this time with one agent simply getting tired of the other agent's lack of progress (for some definition of progress - I'm still not entirely sure what the author was feeding into Claude1 alongside errors from Claude2).

tobyhinloopen · 2026-01-20T12:46:48 1768913208

How about running Claude as a different user with very limited permissions?

gregoriol · 2026-01-20T12:48:56 1768913336

This breaks the non-interactive mode the post wants to achieve. Claude will not be able to install some things and will require user action, which is not desired here.

progval · 2026-01-20T12:58:10 1768913890

Like what? It can already use npm/pip/etc. And if it needs a new APT package or config in /etc/ then you would want to know because you need to document it.

tstrimple · 2026-01-20T22:30:05 1768948205

Claude Code on NixOS feels like it has super powers. Being able to spin up a nix-shell with needed dependencies on demand gives it access to all sorts of tools I don't have or want installed on my base system. My "book-recommendation" claude code uses sqlite to manage my reading history and to-read and maybe-read lists but I never installed tools for sqlite and they aren't present on my NixOS desktop. It just launches a nix-shell with sqlite anytime it needs to read/modify the database. As long as the database file is within the directory claude code was launched from, it doesn't need to prompt for permission. With the caching that NixOS does, it's fast enough to not even think about.

gregoriol · 2026-01-20T13:42:30 1768916550

If you make claude work with c/c++, it may need apt for libraries or build tools.

Even with npm/pip, these may not be available on a base linux box.

Even then, some complex projects may need other tools that are not part of a base system (command line tools, redis, ...).

emilburzo · 2026-01-20T12:57:25 1768913845

I tried this approach for a while, but I really wanted it to be able to do anything (install system packages, build/run Docker containers, the works).

With these powers there's a lot less back-and-forth with me running commands, copying the output, pasting it to Claude, etc.

I'm sure you've had the case where you had to instruct someone to do something (e.g. playing tech support with family, helping another engineer, etc). While it helps the other person learn, it feels soooo slow vs just doing it yourself :) And since I don't have to teach the agent, I think this approach makes sense.

delaminator · 2026-01-20T12:51:12 1768913472

I run it with sudo enabled - true story

just give it its own machine and let it check out any code

I PXE boot it from a known image when I feel the need

tobyhinloopen · 2026-01-20T13:00:15 1768914015

Running it remotely on a VM seems like a very sensible option. Just don't give it permission to nuke the remote repository hah (e.g. don't allow force-push, use protected branches, only allow write access to branches it created).

zh3 · 2026-01-20T15:12:42 1768921962

Same solution here - keep a base diskless image on the server, copy it to the diskless area, pxeboot the machine. Works for Windows too (iscsi).

Could do the same thing on EC2 of course.

tobyhinloopen · 2026-01-20T09:24:52 1768901092

Is this developed by these 10x developers I've heard about?

tobyhinloopen · 2026-01-20T07:05:28 1768892728

EU automakers fail at making modern cars. They just put a bunch of screens in there with awful software. If you go all screens, just commit like Tesla. If you can't beat Tesla, just stick with minimal screens and use buttons.

Somewhere between 2010 and 2020, most automakers went crazy with their designs and it went all downhill from there.

quantum_magpie · 2026-01-20T07:52:41 1768895561

I have a 2020 Fiat 500 Abarth, and it is absolutely perfect: There is a screen (I think 7") for Android Auto/CarPlay/radio/nav, and every single other function in the car has a physical button. It is also absolutely gorgeous - pinnacle of design, IMO

PunchyHamster · 2026-01-20T08:17:42 1768897062

That's about what I want from an interior. Any built-in infotainment will get out of date, and any more electronics is just stuff to eventually break.

gambiting · 2026-01-20T10:31:04 1768905064

Our 2021 Volkswagen e-Up is like this. There is a tiny (like 3" tiny) screen for the radio, bluetooth and reverse camera; everything else is analogue and has physical buttons. It's honestly the best of Volkswagen design. What they did with their newer cars in terms of interior usability is a travesty.

LightBug1 · 2026-01-20T10:47:51 1768906071

They fell for Tesla-fication ... and are only now waking up to the mistake.

trgn · 2026-01-20T14:08:56 1768918136

i still miss the interior of my 2010 fiat punto

jansper39 · 2026-01-20T12:01:38 1768910498

From this year all EU cars will have physical buttons for heater controls, media etc.

epolanski · 2026-01-20T13:13:58 1768914838

Not sure why you would think EU automakers fail at making modern cars. Also, you're lumping 40+ automakers into one basket.

tobyhinloopen · 2026-01-16T21:43:07 1768599787

Poor family members though

honzabe · 2026-01-17T11:18:19 1768648699

I recently read a book of interviews with people who escaped from North Korea, and what shocked me was the discovery that the relatives of those who escaped are often executed (publicly) and that even children are executed in North Korea. We live in a terrible world. I mean... you expect a book from North Korea to contain terrible things, but somehow it was even worse than I expected.

qcnguy · 2026-01-17T13:09:06 1768655346

Left wing thought doesn't contain any philosophy of limitations on state power, so under a left wing regime there is no limit to what it might do. No matter how terrible something is, if it can be imagined they will consider implementing it. To avoid that outcome there has to be an understanding of the flawed nature of government, and from that an ideological commitment to a state limited in power and role.

olelele · 2026-01-17T15:37:40 1768664260

So everyone not a libertarian is dangerous?

qcnguy · 2026-01-17T17:38:54 1768671534

Only if they get into power? I mean what's your reading of the last few centuries of history?

immibis · 2026-01-17T16:04:42 1768665882

[flagged]

qcnguy · 2026-01-17T17:36:47 1768671407

We're talking about North Korea executing children because their family members escaped. Doesn't get more left wing than North Korea. Not a good look to deploy a stock reply without thinking.

blahaj · 2026-01-17T18:23:44 1768674224

North Korea is an extremely class based society. About as far away from left as it gets.

immibis · 2026-01-17T19:18:43 1768677523

I'm not sure you understand the situation here. Everything I don't like is left-wing.

qcnguy · 2026-01-17T23:40:26 1768693226

[flagged]

immibis · 2026-01-18T05:00:37 1768712437

> Everything I don't like is left-wing.

tobyhinloopen · 2026-01-18T15:23:48 1768749828

Your sarcasm isn’t obvious enough

tobyhinloopen · 2026-01-14T22:43:11 1768430591

A great start is to have LLMs use special UNIX users that can’t do anything except what you allowed them to do, including accessing the database with a read-only user.
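
To make the read-only part concrete, here is a minimal sketch. It swaps in SQLite's read-only open mode as a stand-in for a read-only database user, since that can be shown without a server; on a server database the same idea would be a role that only has SELECT rights. The helper name `open_read_only` is made up for illustration.

```python
import sqlite3
from pathlib import Path


def open_read_only(path: str) -> sqlite3.Connection:
    """Open an existing SQLite file so that any write attempt fails.

    Hypothetical helper: the agent's tooling would only ever be handed
    connections created this way.
    """
    # mode=ro is SQLite's URI flag for a read-only connection.
    uri = Path(path).resolve().as_uri() + "?mode=ro"
    return sqlite3.connect(uri, uri=True)
```

With a connection opened like this, SELECT works as usual, while INSERT/UPDATE/DELETE raise `sqlite3.OperationalError`. File permissions for the dedicated UNIX user would still be the real safety net; this only covers the database handle.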

tobyhinloopen · 2025-12-29T08:14:56 1766996096

The problem with their example is that you can display linear image data just fine, just not with JPEG. Mapping linear data straight to 0-255 RGB values, which are expected to be gamma-corrected, is just wrong. They could have used an image format that supports linear data, like JPEG-XL, AVIF or HEIC. No conversion to 0-255 required, just throw in the data as-is.
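
A rough sketch of the difference, assuming the 8-bit target is sRGB (the article may have used something else) and a 14-bit sensor range. The function names are mine, and this ignores black level, white balance and demosaicing entirely.

```python
def naive_scale8(adc: int, bits: int = 14) -> int:
    """Linear rescale of a raw sensor value to 0-255, with no gamma encoding."""
    return round(adc / ((1 << bits) - 1) * 255)


def linear_to_srgb8(adc: int, bits: int = 14) -> int:
    """Rescale to 0-1, apply the sRGB transfer function, then quantize to 0-255."""
    linear = adc / ((1 << bits) - 1)
    if linear <= 0.0031308:
        # Linear segment near black.
        encoded = 12.92 * linear
    else:
        # Power-law segment for everything else.
        encoded = 1.055 * linear ** (1 / 2.4) - 0.055
    return round(encoded * 255)
```

Both functions agree at black and at full scale, but for anything in between the gamma-encoded value is noticeably higher. That is why linear data dumped straight into an 8-bit format looks far too dark.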

tobyhinloopen · 2025-12-29T08:11:59 1766995919

They did:

> Sensor data with the 14 bit ADC values mapped to 0-255 RGB.

tobyhinloopen · 2025-12-15T15:44:00 1765813440

> You're not going to try and extract a timestamp from a uuid.

I totally used uuidv7s as "inserted at" in a small project and I had methods to find records created between two timestamps that literally converted timestamps to uuidv7 values so I could do "WHERE id BETWEEN a AND b"
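
For the curious, the conversion looked roughly like the sketch below. This is a reconstruction, not the original code: the function names are invented, and it assumes the database compares UUIDs bytewise so that UUIDv7 order follows timestamp order.

```python
import uuid
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def uuid7_bound(ts: datetime, upper: bool = False) -> uuid.UUID:
    """Smallest (or, with upper=True, largest) UUIDv7 for the millisecond of ts.

    ts must be timezone-aware; a naive datetime raises TypeError here.
    """
    ms = (ts - EPOCH) // timedelta(milliseconds=1)
    if not 0 <= ms < (1 << 48):
        raise ValueError("timestamp does not fit in 48 bits of milliseconds")
    # Layout: 48-bit unix ms | 4-bit version | 12 random | 2-bit variant | 62 random
    value = (ms << 80) | (0x7 << 76) | (0b10 << 62)
    if upper:
        value |= ((1 << 12) - 1) << 64  # set all 12 rand_a bits
        value |= (1 << 62) - 1          # set all 62 rand_b bits
    return uuid.UUID(int=value)


def uuid7_range(start: datetime, end: datetime) -> tuple[uuid.UUID, uuid.UUID]:
    """Inclusive bounds to pass as a and b in WHERE id BETWEEN a AND b."""
    return uuid7_bound(start), uuid7_bound(end, upper=True)
```

The lower bound has every random bit cleared and the upper bound has every random bit set, so any UUIDv7 generated within the two timestamps sorts between them.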