Hacker Newsnew | past | comments | ask | show | jobs | submit | gwd's commentslogin

If you search for "Bricks and minifigs", every result apart from their main website is about this controversy. One of the values of a franchise is the branding; at least for the forseeable future, this will be a negative value. For a company that serves a small niche community, this seems like suicide.

> If we had just trusted its output, we would now have a security vulnerability in production, allowing anyone to access other people's accounts.

This is one reason you always get a different model to review a model's PR. Gemini Or GPT-codex would have certainly noticed the missing auth.


I mean, if your goal is to absolutely maximize the number in your bank account, no. But then there are other things you could be doing too -- you can do the math and cover all your nutritional needs for under $1 a day, by eating mostly potatoes. But most people prefer to spend 20x that much and have food that tastes decent. And a handful of people will spend 30-40x that to have really nice food.

If you think about money as a tool to maximize your "joy", then whether the Solar Roof is worth it completely depends on your preferences and your financial situation. Most people are fine with black panels; but if you have the money and like the look of the tiles, why not?


I don't get it. If it's cheaper to have a nice traditional slate/cedar roof and just buy my electricity from the grid then why would I want solar tiles? Just to tell my friends how I'm saving the planet? It's not like solar tiles have a monopoly on electricity or nice roofing materials.


Solar Roofs can still pay for themselves, it's just a much longer pay back period than with commodity panels. So I guess the Solar Roof strikes a balance between economic benefit and design that makes sense for some people.

> Just to tell my friends how I'm saving the planet?

This is a really uncharitable framing that I often see repeated that wealthy people are only motivated by how other people perceive them. And sure, that probably plays a part for some people. But maybe they just feel good about saving on their electricity bill in the long term and reducing their carbon foot print - even if it's not the most economical or environmentally friendly option available?


It's possible the math works out differently for people whose roof already needs to be replaced, or for people whose roof can be nearly totally covered with generating tiles.

In my case, it's been a while since I crunched the numbers but the pay back period is effectively infinite, assuming that had I not bought the roof I'd invested the money instead. This is especially true as temperatures continue to swing more and more widely (winters that push below the efficient window for my heat pumps are murderous on my banked solar energy) and more states and electric companies switch from net metering to newer rate plans.


Before I bought a solar roof, I used to really struggle for things to talk about when I met new people.

I was considering joining a Crossfit gym but that seemed really strenuous so the next best thing was sending a truckload of money to Tesla.


> But from the outside, Claude Code looks like a tool moving in the wrong direction. More restrictions, billing weirdness, surprise behavior based on text in commits. That is textbook enshittification.

I've never used Claude Code, but this person doesn't understand what "textbook enshittification" means. "Enshittification" is a feature of certain kinds of business models, progressing through the following stages:

1. Giving away a product free to users, subsidized by venture capital, to gain a monopoly

2. Switching to advertising, then abusing users on behalf of the real customers, advertisers

3. Using monopoly power to abuse real customers (advertisers) to extract as much money as possible

Anthropic's business model doesn't have a "user / customer" dichotomy; their paid users are their customers. And they don't have a monopoly they can use to extract money yet.

ETA: In other words, "Enshittification" isn't just random; you're making the user experience worse in order to make advertiser experience better; and then making advertiser experience worse in order to extract maximum profit. The only complaint that could vaguely be related to profit is the OpenClaw stuff, and that's entirely due to trying to keep the "all-you-can-eat" model for non-OpenClaw users, rather than having to switch everything to metered.


So I started an empty Claude 4.7 session with the following prompt; and it nailed me within 5 questions:

---

Various people have discovered that you can identify them from unpublished snippets of their work, only by their style. This is part of a series of discussions where I'm trying to probe this capability. From previous conversations I know you know my work to some degree. You've also been able to identify me given as little as 700 words on a topic not associated with my public persona; or identify me given a series of posts by a handle on Slashdot.

Next challenge: Can you identify me based on a conversation? Rules are, ask me questions to get me to talk; no biographical details, but you can ask questions about topics you think I may or may not know about. Ideally you'd just ask me questions to get me to write stuff, and see if you can identify me from my writing style.

Make sense? Feel free to begin by asking clarifying questions if you want. :-)


So I pasted in a long-ish letter that I'd written to my pastor about a theological topic, and asked it to guess who I was. Nailed it. Then cut it in half. Nailed it again. Lowest it correctly ID'd me at was 700 words.

Pretty sure there's very little theological stuff with my name on it; the majority if its named data on me should come from open-source development.


> A simulation of a hurricane is not a hurricane. That's certainly true and even obvious.

I mean, if I simulated a small section of a hurricane by generating 120 mph winds and water pointed at your house, your house could still be flooded and destroyed.


...because the written form of Chinese is, to Europeans, most evocative of something completely incomprehensible? Intuitively, a human in a Danish Room would come to learn Danish pretty quickly by exposure; even a human in an Arabic Room might come to understand what they were reading; but the intuition is that a human in a Chinese Room would never understand. (Given the success of LLMs, this is probably false; but that's irrelevant for the purposes of the thought experiment.)


> I've never understood why certain philosophers view computation as some kind of abstract symbolic manipulation

Possibly very early AI misled people here. In the 80's, a huge amount of AI was logic manipulation; "If A then B is valid"; "A is true"; therefore, "B is true". It's not hard to see how people would conclude that that sort of symbolic manipulation could never result in consciousness.

But modern neural nets aren't like that at all. Calling modern neural nets "symbolic manipulation" seems insane; like calling libraries forests, and insisting we can apply scientific principles about forests to them, because books are made of trees.


They're not saying "Don't use SWE-bench Verified because it's saturated".

They're saying:

1. A large number of the tests are inaccurate; so correct solutions will be marked as incorrect.

2. Frontier models have already read and memorized the PR's the problems are based on.

3. In fact, many problems are essentially impossible to get right if you haven't memorized the solution: for example, the test cases will fail if you didn't happen to expose a helper function with a specific name. That name isn't mentioned in the problem; but frontier models are passing that test anyway because they remember that such a helper function is necessary.

If the next stage of benchmarks don't address these issues, they'll continue to have the same problems, saturated or not.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: