That's someone with an evidently successful enough career to have access to Mythos in its currently limited rollout, and who is confident enough not to take themselves terribly seriously online.
Realistically their opinion deserves to hold more weight than the median HN comment.
I would prefer a pseudonymous account if possible. Obviously, if this is a marketing stunt, the decidedly non-anonymous feedback is called into question immediately.
That said: I was already aware of Mozilla's account and, despite what you are thinking, it essentially confirms everything.
> The biggest differentiating factor was the use of an agent harness, a piece of code that wraps around an LLM to guide it through a series of specific tasks. For such a harness to be useful, it requires significant resources to customize it to the project-specific semantics, tooling, and processes it will be used for.
Yep. Sounds exactly right. So the question is: do we really need Mythos for this, or can almost any AI model reasonably close to the frontier accomplish similar results with a sufficiently advanced harness?
Jury's out, but my vote is "probably most of the way". After all, alongside all of the splashy zero days dropped by eager AI companies, Greg Kroah-Hartman has been posting many useful, if minor, patches to the Linux kernel produced by nothing more than a single 128 GiB Framework Desktop. So apparently even small models can be very useful if you can find a way to get the noise out.
Mythos could be very useful and effective and still be mostly a marketing ploy, because until very recently there has been too little investment in making LLMs work for security auditing. Without more substantial information, it's difficult to tell how much better Mythos is at security research than, say, Opus or DeepSeek 4 coupled with a good agent harness.
And in that sense, it's the same sort of crap as the GPT-2 and GPT-3 releases. A lot of hoopla about how dangerous it is to humanity. Then it turns out it's only dangerous enough that it needs to be gated behind an additional monthly subscription.
The most intelligent person involved in the highest-level projects at my current company introduces themselves as an out-of-work circus clown.
There is an incredible amount of competency signaled by someone who was given access to this model but doesn’t treat their online presence like a professional resume.
They won’t until the winds change, and people start talking about the tradeoffs of Claude Code vs any of the other thousand good quality agent harnesses out there that recognize AGENTS.md.
Opencode is good enough for most workflows IME, even if it doesn’t have the kitchen sink of features that cc has.
Rootless Docker's networking (slirp4netns) is still terribly buggy, and in edge cases it often locks up at 100% CPU until you notice that your laptop has become a lapwarmer and kill it.