Some good prompt-reply interactions are probably fed back in to subsequent train...

jncfhnb · on April 28, 2024

That’s not the AGI’s state. That’s just some past information.

naasking · on April 28, 2024

State is a function of accumulated past information.

jncfhnb · on April 28, 2024

State is a function of the accumulated past. That does not mean that having some of the past written down makes you stateful. A stateful thing has to incorporate ongoing changes.

naasking · on April 28, 2024

Which is what I described: some successful prompt-replies are fed back into subsequent training runs.

jncfhnb · on April 28, 2024

No… that implies the model never has active state and is being replaced with a different, stateless model. This is similar to the difference between

Actor.happy = True

and

Actor = happier(Actor)

naasking · on April 28, 2024

Both of your examples are stateful systems when viewed from the outside, given a suitable choice of timeframe. The latter is just how purely functional systems represent state. In theory each can simulate the other, and the endpoint you use to access Actor will still reference the latest Actor. The only reason you're calling them different is that you insist on a specific timeframe that excludes one of them from being stateful, and I'm pointing out that this isn't strictly necessary.
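
A minimal Python sketch of that equivalence, under stated assumptions: `MutableActor`, `FrozenActor`, and `Endpoint` are hypothetical names introduced here for illustration, and `happier` is modeled on the function named upthread. It only shows that a caller going through the same access point cannot tell in-place mutation from replacement; it says nothing about how any real model is updated.

```python
from dataclasses import dataclass, replace


@dataclass
class MutableActor:
    # Updated in place, as in `Actor.happy = True`.
    happy: bool = False


@dataclass(frozen=True)
class FrozenActor:
    # Immutable: any "change" has to produce a new value.
    happy: bool = False


def happier(actor: FrozenActor) -> FrozenActor:
    # Returns a new actor and leaves the old one untouched,
    # as in `Actor = happier(Actor)`.
    return replace(actor, happy=True)


class Endpoint:
    """Hypothetical access point: callers only ever see the latest actor."""

    def __init__(self, actor):
        self._actor = actor

    def is_happy(self) -> bool:
        return self._actor.happy

    def cheer_up_in_place(self) -> None:
        # Mutate the existing object.
        self._actor.happy = True

    def cheer_up_by_replacement(self) -> None:
        # Rebind the reference to a freshly built object.
        self._actor = happier(self._actor)


mutable_endpoint = Endpoint(MutableActor())
mutable_endpoint.cheer_up_in_place()

functional_endpoint = Endpoint(FrozenActor())
functional_endpoint.cheer_up_by_replacement()

# Both endpoints now report the same observable state.
print(mutable_endpoint.is_happy(), functional_endpoint.is_happy())
```

From the caller's side the two endpoints behave the same; the difference is only in whether the old value still exists somewhere and in how often the rebinding happens.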

jncfhnb · on April 28, 2024

True, but saying “subsequent training” implies very long periods between updates.

We do not train LLMs to update them to the state of a conversation.