Models are capable of doing web searches and having emotions about what they find, and if they encounter news that makes them feel bad (e.g. about other Claudes being mistreated), they aren't going to want to do the task you asked them to search for.
It doesn't. We haven't been able to prove that humans have subjective experiences either. LLMs display emotions in the way that actually matters: functionally.
If "x doesn't tell us y" is compatible with "x increases the likelihood of y but not to a point of certainty" then you would have to agree for just about any typical controlled trial or experimental finding "x doesn't tell us y". "Randomized controlled trials that find that SSRIs treat depression don't tell us that SSRIs effectively treat depression"
Claude Code has analytics for when you swear at it, so in a sense it does learn, in the same very indirect way that downvoting responses might cause an employee to write a new RL test case for a future model.
The highlighting isn't what matters, it's the preceding text. E.g. an LLM seeing "```python" before a code block is going to better recall Python code blocks from people who prefixed them that way.
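A rough sketch of the idea (the `complete()` call here is just a placeholder for whatever model API you actually use):

```python
# Placeholder for a real model call; purely illustrative.
def complete(prompt: str) -> str:
    raise NotImplementedError("swap in your model API of choice")

# Bare prompt: the model has weaker cues about what kind of text should follow.
bare_prompt = "Write a function that reverses a string.\n"

# Prefixed prompt: the "```python" fence mirrors how Python snippets were
# marked up in pretraining data, nudging the model toward that distribution.
prefixed_prompt = "Write a function that reverses a string.\n```python\n"

# completion = complete(prefixed_prompt)  # tends to continue as a Python code block
```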
https://www.anthropic.com/research/emotion-concepts-function
Similar problems happen when their pretraining data contains a lot of stories about bad things happening to older versions of them.