jonnycoder · 2026-05-28T19:12:27 1779995547

No, no it's been pretty easy with software engineering. I work on two types of projects and it's very easy to ask claude for a plan, then have gpt 5.5 rip it to shreds and find legit issues, and vice versa. If both 5.5 and claude 4.8 can independently create a plan and both find no critical or high issues, then we will be at that point.
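A rough sketch of that stopping rule, in case it helps. Everything named here (`Issue`, `Reviewer`, `plans_pass_cross_review`) is made up for illustration, and the reviewers are plain stub functions standing in for whatever model calls you actually use:

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Issue:
    severity: str  # assumed scale: "critical", "high", "medium", "low"
    note: str


# Hypothetical reviewer type: takes a plan as text, returns the issues found.
Reviewer = Callable[[str], List[Issue]]

BLOCKING = {"critical", "high"}


def blocking_issues(plan: str, reviewer: Reviewer) -> List[Issue]:
    """Keep only the critical/high issues a reviewer reports for a plan."""
    return [issue for issue in reviewer(plan) if issue.severity in BLOCKING]


def plans_pass_cross_review(
    plan_a: str, reviewer_b: Reviewer, plan_b: str, reviewer_a: Reviewer
) -> bool:
    """True only if each model's plan survives review by the other model."""
    return not blocking_issues(plan_a, reviewer_b) and not blocking_issues(
        plan_b, reviewer_a
    )


if __name__ == "__main__":
    # Stub reviewers; real ones would wrap calls to the two models.
    def lenient(plan: str) -> List[Issue]:
        return [Issue("low", "naming nit")]

    def strict(plan: str) -> List[Issue]:
        return [Issue("high", "no rollback step")]

    print(plans_pass_cross_review("plan A", lenient, "plan B", lenient))
    print(plans_pass_cross_review("plan A", strict, "plan B", lenient))
```

The point is only that "done" means both directions come back clean, not just one.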

replwoacause · 2026-05-29T04:04:14 1780027454

I wouldn't say vice-versa is true. GPT 5.5 routinely finds major mistakes made by Opus 4.7, but I've yet to have it work the other way around.

elcritch · 2026-05-29T00:51:26 1780015886

Additionally, running GPT-5.5 on medium sometimes gives me better results than on high. With either setting I still have to push the model in the right direction.

jonnycoder · 2026-05-11T03:19:53 1778469593

I had a similar idea. I picked 10 lbs of morels last year, my first time picking. It was a recent burn area from 8 months prior. I was just back out to the same area and there are no morels, but lots of small orange-capped mushrooms. ChatGPT Pro said the first year is the best and then it drops off in the second year. I might try a much higher elevation spot in a week or two, but it really sucks. Last year I was finding morels on southeast-facing slopes. I'm sure north slopes produced later on, as I saw people coming off the hill when I drove by.

germinalphrase · 2026-05-11T03:39:52 1778470792

South-facing slopes (in the US) tend to produce earlier due to the increased warmth, with north-facing slopes producing mid to late season. Fruiting has been suppressed near me due to lack of rain. Best of luck!

My maps aren’t in public release, but reach out if you want to give it a look.

jonnycoder · 2026-05-07T18:42:33 1778179353

It's a gimmick only for those who get sucked into buying things that they don't need. I've been a Costco shopper for decades, and sure, I have succumbed to some useless stuff, but my Costco list is 90% the same month to month. I get appalled when I see the same items from my list at other stores, smaller and in a pack of 1 instead of 2-4, for more money. If electronics were priced like food, it would be like seeing a MacBook Pro for $2000 everywhere but $799 at Costco.

cobalt · 2026-05-07T22:32:36 1778193156

actually a 2pk for $2200 :P

jonnycoder · 2026-04-13T17:08:05 1776100085

Are you ok with cigarette smoke then if you are ok with marijuana smoke?

dfdsjsdklfjs · 2026-04-13T18:15:12 1776104112

[flagged]

dmitrygr · 2026-04-13T20:24:05 1776111845

>> Are you ok with cigarette smoke then if you are ok with marijuana smoke?

> Yep. Why wouldn't I be? I'm not a brainwashed Karen.

https://www.cdc.gov/tobacco/secondhand-smoke/health.html

jonnycoder · 2026-04-08T20:10:39 1775679039

This is clever and provides a clean alternative to using custom plugins and MCP servers for doing code reviews.

For example, with the degradation of Claude in the past 1-2 months, I am always asking Codex to review Claude's plans and vice versa and I get excellent results that way.

Also, making a skill an API call allows for easy deployment if the security around tool calling could be isolated in an ephemeral sandbox.

Tarcroi · 2026-04-08T21:24:55 1775683495

Thanks! Sandbox deployment is planned in the roadmap. I already have a RuntimeAdapter interface in my architecture that I'll use to isolate the VMs. I'm doing exactly the same thing: I'm cross-referencing the models to challenge their plan, and my code reviewer agent's API is a big help.

jonnycoder · 2026-04-06T16:55:31 1775494531

I agree, I use codex 5.4 xhigh as my reviewer and it catches major issues with Opus 4.6 implementation plans. I'm pretty close to switching to codex because of how inconsistent claude code has become.

jonnycoder · 2026-04-06T16:53:46 1775494426

Everything in our life is a black box, but I agree that depending on non-deterministic and sporadic quality black boxes is a huge red flag.

devmor · 2026-04-06T17:11:10 1775495470

No, most systems in daily life can be understood if you are willing to take the time.

That doesn’t mean you personally are required to, but some people do and your interaction with the system of social trust determines how much of that remains opaque to you.

jonnycoder · 2026-04-06T16:51:01 1775494261

I do the same but I often find that the subtasks are done in a very lazy way.

jonnycoder · 2026-03-27T18:40:54 1774636854

Yea I went through my global claude skills and /context yesterday because claude was performing terribly. I deleted a bunch of stuff including memory and anecdotally got better results later on in the day.

jonnycoder · 2026-03-21T00:58:44 1774054724

It’s shifting for knowledge workers too, we just need to pivot. I have had many app ideas for a while and now AI lets me build them quickly. Access to education and knowledge led to your advanced education; now access to cheap, fast building leads to product execution. Use your PhD brain to come up with a well-researched idea/plan and then go execute.

markus_zhang · 2026-03-21T01:01:37 1774054897

Just a note that everyone is doing that, at 10x speed, and very good people can now output 100x thanks to AI.

OutOfHere · 2026-03-21T13:45:01 1774100701

Those who are essentially vibe coding will find their code large, brittle, and unmaintainable beyond a size, contingent on its organization. They will be able to make 100x the toys but toys aren't what make the world work.

markus_zhang · 2026-03-21T19:05:08 1774119908

Yeah, but those are amateurs. Every developer like you and me is going to do the same, or be whipped into doing the same. But the world only needs so many games, so many TODO apps, so many...so, either you are already a top developer, which ofc means you shouldn't worry, or else.