Sorry where are we seeing that it failed? It compiled multiple projects successfully albeit less optimized.
"
It lacks the 16-bit x86 compiler that is necessary to boot Linux out of real mode. For this, it calls out to GCC (the x86_32 and x86_64 compilers are its own).
It does not have its own assembler and linker; these are the very last bits that Claude started automating and are still somewhat buggy. The demo video was produced with a GCC assembler and linker.
The compiler successfully builds many projects, but not all. It's not yet a drop-in replacement for a real compiler.
The generated code is not very efficient. Even with all optimizations enabled, it outputs less efficient code than GCC with all optimizations disabled.
The Rust code quality is reasonable, but is nowhere near the quality of what an expert Rust programmer might produce.
"
For faffing about with a multi agent system that seems like a pretty successful experiment to me.
Anthropic said the experiment failed to produce a workable C compiler:
- I tried (hard!) to fix several of the above limitations but wasn’t fully successful. New features and bugfixes frequently broke existing functionality.
- The compiler successfully builds many projects, but not all. It's not yet a drop-in replacement for a real compiler.
> Like I think people don't realize not even 7 months ago it wasn't writing this at all.
There's no doubt that producing a C compiler that isn't workable and is effectively bricked as it cannot be evolved but still compiles some programs is great progress, but it's still a long way off of auonomously building production software. Can today's LLM do amazing things and offer tremendous help in software development? Absolutely. Can they write production software without careful and close human supervision? Not yet. That's not disparagement, just an observation of where we are today.
This evaluation appears to be AI-written itself. It claims a 3x slowdown and a 4x slowdown combine to produce a 158000x slowdown "because there are billions of iterations" - yeah well both versions of the program had the same number of iterations.
Does anyone know how the 158000x slowdown happened? That's quite ridiculous.
It could be written more clearly but I think when it refers to a 4x and a 3x slowdown, it's actually a 4x slowdown and 3x larger code that causes cache misses, and the impact of those cache misses on runtime is surely much larger than 3x.
> Each individual iteration: ~4x slower (register spilling)
> Cache pressure: ~2-3x additional penalty (instructions don't fit in L1/L2 cache)
> Combined over a billion iterations: 158,000x total slowdown
I think that "2-3x additional penalty" refers to this:
> The 2.78x code bloat means more instruction cache misses, which compounds the register spilling penalty.
Also, the analysis refers elsewhere to other factors that weren't included in this part.
> Can they write production software without careful and close human supervision? Not yet. That's not disparagement, just an observation of where we are today.
I never claimed they could! I just view this as a successful experiment. I don't think anthropic was making that claim with their experiment either.
It feels reflexive to the moment to argue against that claim, but I tend to operate with a bit more nuance than "all good" or "all bad".
The experiment failed to produce a workable C compiler despite 1. the job not being particularly hard, 2. the available specs and tests are of a completely higher class of quality than almost any software, not to mention the availability of other implementations that the model trained on.
You can call that a success (as it did something impresssive even though it failed to produce a workable C compiler) but my point in bringing this up was to show that today's models are not yet able to produce production software without close supervision, even when uncharacteristically good specs and hand-written tests exist.
That's great and all, but that's not the point I was making and you're engaging rather uncharitably on it. So when you view it from the perspective of capability increase it's rather impressive. Note the slope of progress which this experiment was to show.
Edit: Maybe uncharitably is too strong, but we're talking past each other.
Saying the model failed to write a competitive C compiler makes more sense.
I don't think they tried to do that though.
> today's models are not yet able to produce production software without close supervision, even when uncharacteristically good specs and hand-written tests exist.
> Saying the model failed to write a competitive C compiler makes more sense.
Their compiler fails to compile (well, at least link) some C programs altogether, and in other cases it produces code that is 150,000x slower than a real C compiler with optimisations turned off (interestingly, the model trained on the real compiler's source code). That's not "not competitive" but "cannot be used in the real world". But even more importantly, the compiler cannot be fixed or evolved. It's bricked (at least as far as today's models' capabilities go). For any kind of software, not being able to improve or fix anything or add any new feature means it's effectively dead.
You could not use it in production even if no other C compiler existed.
While I understand both points of view, I'm leaning towards yours, because:
- John Carmack embedded a C compiler and interpreter/runtime into Quake back in the mid 1990s as a scripting language! It was that efficient that it could be used in a real time 3D shooter. That's a solo effort as a minor component of a much larger piece of software.
- I've seen university CS courses hand out "implement a C compiler" as a homework / project exercise for students. It's not particularly difficult.
Sure, a modern C compiler like GCC has to handle inline assembly, various extensions, pragmas, intrinsics, etc... but like you said, all of those are thoroughly documented and have open source implementations to reference.
Similarly, the Rust compiler is implemented in Rust and could be used as an idiomatic reference for a generic compiler framework with input handling, parsing, intermediate representations, and so forth.
> Their compiler fails to compile (well, at least link) some C programs altogether, and in other cases it produces code that is 150,000x slower than a real C compiler with optimisations turned off
I would bet that those things are also true of at least one expensive commercial C compiler.
I'd love to hear of any currently available commerical C compiler which has that level of issues. I would bet you'll be hard pressed to find one. C compilation is a quite thoroughly solved problem. In any case please provide an example.
I think people are concerned about the large discrepancy in concrete claims in your previous comment and subsequent empirical information. You may have seen a headline or skimmed an article and missed some details, not a big deal.
The overall impression given was inaccurate and the implicit claim of a fully working end-to-end generated compiler was inaccurate. The headlines were incomplete in a way that was intentionally misleading. It was an interesting experiment and somewhat impressive but the claims were overblown. It happens.
I've been using claude to make a project over the last few weeks. Its written ~70k LOC to solve a complex problem. I've found that it can get surprisingly far in a 1-shot, but about 90% of the work I've had it do (measured in time and tokens) is cleaning up the junk it outputs in its first pass. I'm finding my claude sessions have a rhythm like this:
1. Plan and implement some new feature.
2. Perform a code review of what you just did. Fix obvious problems. Flag bugs, issues, poor factoring, messy abstractions, etc. Make a prioritised list of things to fix (then fix them).
3. (Later) fixes:
- Write tests for the code you wrote and fix the bugs you find.
- Run the code through memory leak checks, and fix bugs.
- Do a performance analysis using benchmarks and profiling tools, and make any high priority performance improvements.
- Read the whole program, looking for ways in which the code you've just written could fit in better with the rest of the program. Fix any issues.
- In directory X is the full documentation for the library you're using. Reread it then review the code you wrote. Are there better ways we could make use of the library?
And so on.
Claude's 1-shot output is often usable, but its consistently chock full of problems. Bugs. Memory leaks. Bad factoring. Too many globals. Poor use of surrounding code. And so on. Its able to fix many of these problems itself if you prompt it right. (Though even then the code is often still pretty bad in many ways that seem obvious to me).
At the moment I think I'm spending tokens at about a 1:9 ratio of feature work to polish. Maybe its 1-shot output is good enough quality for you. To me its unacceptable. Maybe a few models down the line. But its not there yet.
The ratio is an interesting way of thinking about it. I wonder how this compares to other SWEs at various levels of experience, replacing tokens for person-hours.
Yeah I think people are really underestimating what LLMs can do even without specs.
As an example, I did an exploratory attempt to add custom software over some genuinely awful windows software for a scientific imaging station with a proprietary industrial camera. Five days later Claude and I had figured out how to USB-pcap sample images and it's operationalized and smoothly running for months now. 100% of the code written by Claude, it's all clean (reviewed it myself) pretty much all I did was unstuck it at a few places, "hey based on the file sizes it looks like the images are being sent as a 16-bit format")
For day to day work, I'll often identify a bug, "hey, when I shift click on this graphical component, it's not doing the right thing". I go tell Claude to write a RED (failing) integration test, then make it pass.
Zero lines of code manually written. Only occasionally do I have to intervene and rearchitect. Usually thus involves me writing about ten lines of scaffold code, explaining the architectural concept, and telling it to just go
People both underestimate and overestimate what LLMs can do. LLMs have shown very different results when autonomously writing a small program for personal use and autonomously writing production software that needs to be evolved for years.
OpenAI has 900 million weekly active users. So around 0.01% are having problems. That's actually way less than population level measures for the same symptoms on a bigger percentage of people relative to the US on just suicidal ideation alone.
The numbers are inflated considering the topic. There is a lot of anon, api and enterprise traffic that doesn't play any role in this. If you also account for "better search experience" users, then the numbers will probably drop massively.
So the question is how many users engage in intimate conversations at all.
"ChatGPT is where people start with AI, with more than 900M weekly active users, and we now have more than 50 million consumer subscribers."
People who go there and chat with gpt for search are definitely normal users. Just because you don't like the numbers doesn't mean you get to torture them.
I'm pretty sure that ~100% of those 700 million people will have a bad, utterly dehumanizing experience when they will next be looking for a job, because OpenAI is heavily used by HR.
That's the problem with AI safety. Not in voluntary usage, but in involuntary usage, where someone with power over you will use it against you, it does something incredibly stupid and you have no recourse, no appeal, no awareness of what you did wrong - or if you even did anything wrong.
And it's not just employment. Governments, vendors, retailers, landlords, utilities are, or will all be using it in situations that will dramatically impact your life.
You described a society problem that preexisted LLMs
What else can you blame on “scary AI?”
>>Not in voluntary usage, but in involuntary usage, where someone with power over you will use it against you, it does something incredibly stupid and you have no recourse, no appeal, no awareness of what you did wrong - or if you even did anything wrong.
In every society that was worth living in, accountability was a key factor. Decision-makers need to be able to be held to account for why they made a decision. Maybe not in every case, but in many, many cases.
If your society doesn't have that, it's on a fast track slide down into a dystopian shithole. Stop it. Do something about it. Don't just shrug your shoulders and act like this is inevitable, or like the shareholders making 1% more money this year justifies this slide.
They had a paper trail and processes that were documented and could be cross-examined on during discovery and lawsuits and trials.
Now it's just 'the computer says so, shrug'.
Over in the DoD, the computer says you must die, so I guess you die. Sometimes it says that about a building full of schoolchildren, but hey, nobody's at fault, the computer said so.
And it's going to get it's tentacles into every space in between. Landlord turns your application down, the computer says you are a social credit risk. Your grocery bans and trespasses you, the computer thinks you're a ne'er-do-well.
If you think none of that will happen, why not prevent it by law before it happens? Where are the hard limits of what this monster is and isn't allowed to do? How are we better off when we don't set them?
"Computer says so" has been a problem for decades too, though, just with a powerless human customer support agent in the middle. They've just automated away the computer's human voicebox.
Quantity has a quality all of its own. This is a paradigm shift. If it weren't, everyone wouldn't be head over heels to adopt it. It makes so many new awful uses, or turning up existing awful uses economical.
We supercharge the worst thing about computers, and you shrug and go 'eh, whatever.'
Why? Is human dignity worthless to you? Do you have no expectation of accountability from anyone with power over you or me?
I mean that was pretty much the case in hiring before AI too frankly. It's not like it's been any better on power dynamics and right now applicants are using AI at an alarming rate as well.
I'm not really moved by your type of argument, because hiring is just a broken process in general and I'm responding to the article so.
I made this point elsewhere, but people are learning a lot of what us had to learn the old way which is no one cares about your stuff for the most part and now the value provided has to go way up to get people to care. That is, as the author says, the novelty has worn off and since we know it's AI the perceived value is also way down.
We're all recalibrating.
I do really think this is just a quick period in time before most people realize that the slop posting doesn't help them personally get anything and most give up and we go back to roughly the ratio of cool things with real value to see but like on a bigger scale because AI helps you do more as one person.
I don't know... I might have said the same thing about email/text/phone spam but it has only proliferated to the point where it's a constant stream of garbage. Email, text, and phone calls are almost completely useless at this point. Sifting the signal from the noise is a non-stop effort.
I think people who want to push a certain narrative might just set up a quick bot and tell that bot to start posting on Reddit or whatever and just let it run. Why not? Little effort on their part and they might actually have influence. The same reason why spammers apparently think sending me 10 text messages per day about a loan I've been approved for. It probably does work 0.0001% of the time, but that's okay if it's all automated.
I mean I think the dynamics are a bit different in online communities at least for actual communities and not drive by subs like r/technology or whatever.
Especially say here on HN with Show HN and such the forcing factors are "i get no votes or community recognition"
But I don't entirely disagree with you I think things won't totally go back I think it will settle way more than now though especially where things are a little more niche.
So much of what I know from women in my life is that the human element of medicine is almost a strict negative for them. As a guy it hasn't been much better, but at least doctors listen to me when I say something.
One of, if not THE biggest challenge in getting treatment is getting past insurance rules designed to deny treatment. This is much, much easier when you're able to convince a doctor (and/or trained medical staff) to argue on your behalf. If you can't get those folks to listen to you, that's probably not gonna happen. You might have to go through several different practices before you find a sympathetic ear.
Now replace some / all of those humans with... A machine whose function also needs insurance approval.
Sounds like we need to dismantle and replace this broadly dysfunctional system at multiple points. It's not like the US insurance landscape is anywhere close to the best way of handling healthcare if you look at many places in the world.
This is because "dismantle and replace" (or perhaps in other words, "defunding") is not a serious, viable solution to many of the societal issues we face.
Things were ruined slowly. They unfortunately will need to be fixed very slowly too.
I don't think that's going to work. We need broad political change and then that has to work rapidly to legislate this. I don't think slow and steady has done anything but lead to the decay our institutions over the last 70 years.
I think that both this and GP are misguided. The pace of societal change in a given direction is neither inherently proportional to the pace of change in a different direction (GP) nor is the pace part of the direction (you).
You have to engage with the specific historical events/factors that led to the direction and the pace in order to change either. Broad statements like "society is big so change has to be slow" are just as unwarranted as "slow change results in decline".
There's a correct answer to "how quickly can change in a new direction be achieved". It will probably only become known after the fact. It will certainly not be model-able as a function with variables for "progressive or not" and "speed of change".
My argument is more along the lines of "slow change has resulted in decline observably for the time period I have observed it and we should try catalyzing something else"
I grant that whether that winds up being fast or slow even if the attempt is intended to be fast is out of my or anyone's hands for the most part as the system dampens that barring total collapse and chaos :P
> They unfortunately will need to be fixed very slowly too.
this can work until you hit a crisis point; i think one issue is we are sliding faster in the wrong direction (increasing bureaucracy, increasing fees, wait times, overwork etc) so "slowly" can work but only if its "fast enough" if you get what i mean (people are really suffering out there)
Of course they do. Most of their platform is built on [appearing] to repudiate coastal elitism and left wing dogma in higher education + globalism with a healthy dose of fuck you because you're you.
And I graciously waited and allowed them to do things that will take decades if not more to repair before deciding they were irredeemable. I had hoped a middle ground and bipartisan ship would be reached, but it's clear to me it won't be. We do not inhabit the same universe at this point, the disdain is mutual.
You’re acting like I’ve always thought about them like this or like I haven't spent years observing and thinking about this to come to my conclusion. You'd do well to listen to your own words about theory of mind. I was raised conservative I voted for Romney. I'm a fan of many of the political platforms they run on now (minus originalism, removing bodily rights, religion), but in practice they do not walk their own talk. The wars, the spending, abandoning neo-liberalism except in word the blatant corruption and disdain for the positions they hold and how they appear on the world stage.
No, I’ve watched their actions for 15 years and moved ever closer to the position that I have nothing in common with them even being ideologically close to a version of their party from 20-30 years ago and they do so blatantly want to destroy the middle class, health, and wealth for anyone outside a small oligarchic class.
I'm pissed because they wear a lot of my ideology as cheap dress to fuck someone.
You are describing a set of dynamics that lead nowhere other than violence and total and complete breakdown of the polity. If you are correct, then nothing matters, everything is fucked. You won't get what you want, but neither will anyone else.
My preferences, while possibly futile, are least an attempt to not just accomplish short term goals but to fix the broken dynamics of the system. That is, in my opinion, far more important than literally any particular policy goal. Policy progress is pointless in a broken system, so fix the system first.
It's possible that my view of focusing on fixing the system, restoring institutions, erecting new guide-rails in places we have observed that the old ones don't work, etc. won't work. But at least it has a chance of producing a good outcome. A good outcome literally can't come from the kind of political behavior you describe. You want your side to seize as much power as it possibly can when it wins, enact as much "good" as it possibly can in however long it can maintain it's grip before the political tides inevitably swing and you lose power again. You don't seem to realize that this is what we have been doing for at least several cycles now. And what we have seen is that the next administration just tears up the progress, does the same thing except in the opposite direction and even harder, and does what they view as "the good thing" and which your side views as nothing but unmitigated evil (the same way they viewed you and yours when you were in power), and so the both sides have accomplished nothing but pushing the pendulum a little bit further, giving it a little more momentum, and shredding up the social fabric a little bit more.
I'm not so naive as to believe that it is possible for just one side to say "no we won't do that, we will unilaterally disarm". But I am of the belief that, if one wants to pretend that one is "on the side of good", that the only rational action is to, when granted power, to spend as much political capital as is possible to slow down the pendulum, tear back power from the bloated executive and the federal branch more broadly. Stop trying to enact your political project and instead make your political project nothing other than the restoration of the norms and principles of the constitution.
This is not something that has been tried and failed. it's the opposite of the past 50 years of federal political dynamics. What has been tried is your plan of "fuck the other side, they are evil, just do what our base wants and ignore consensus and norms".
It doesn't work, it won't work, and it can't work. It's destroying the country.
From my perspective, you are no better than the side you hate. You may want different policy goals, but both you and your polar opposites are collaborating on a shared project: the destruction of the country.
I want my party back and I want to cut out all the garbage that has infested it. Sometimes that requires taking an actual stand and staying firm to it. Middle road nonsense like what you're suggesting is impotent when one side has so clearly decided to be against it.
Edit: And coming back to this later I need to be clear the left also needs to be swept out. I think our institutions in general need to be reworked. Not replaced entirely, but it's clear they don't survive contact with people who would abuse them for their own ends nearly as well as we had hoped.
It's easy to destroy but hard to create. If your goal is to further destroy then I suppose that's achievable, but I have a hard time picturing what positive change is going to come from it.
No offense, but this comes off as passive indifference and while I've heard people say things like this all my life it has broadly resulted in watching 30 years of societal decay. I can't help but think this is wrong.
We should have stacked the courts ourselves, brandished executive orders etc, had some spine.
Edit: I think I need to make clear my thinking that the right has selectively destroyed institutions and levied them in other areas where it makes sense for their agenda. It's not been wanton. So when I say leverage the playbook it's not a one sided act of destruction.
"Stacking courts" would require a Senate that actually votes those judges in. "Brandishing Executive orders" requires a congress that won't be able to countermand you and a Supreme Court that won't "nuh uh" you.
You are yet another person upset that Democrats cannot overcome the purposeful design of our government that you need a lot of power to build, and little power to destroy.
People who want to fix things need dramatically more power than people who want to stymie and break things. Democrats only rarely get that power, and usually only by one or two votes from people who strictly do not care about fixing things. You want this country to fix things? You need to vote significantly more for a party who will push to fix things.
The minority party in congress has no power by design.
Im an independent who would prefer a version of republicanism that died and the closest thing I have is a deeply ineffective party. While the right is currently building things I don't like to put in place institutions and laws I dont like.
All of you talking about the right destroying things are wrong they’re just building things and enshrining things you wont like.
Let's say, hypothetically, you had two political parties — a "destroy the current institutions" party, and the "preserve the current institutions" party.
The latter might notice the former having an easier time, but "hey, it works for them" is the wrong takeaway. Commit to the hard work of building resilient institutions; don't join in the destruction because it's easier.
There's also an element of "Never (...), they will drag you down to their level and beat you with experience."
The republicans are building things not just destroying is the point. It’s just stuff you wont like. This is why Im not a democrat. The left hasn't been able to effect change or be useful ever [my entire life], sure loves to moralize though.
Strongly agree. I think some (not all) of the Trumpian playbook can be wielded very effectively for non-conservative parties, for a few reasons:
- Some executive orders are always flipped as soon as the opposition takes office, but some unilateral changes are much harder for a cyclical/pendulum-swing opposition season to reverse than they are to emplace. We don't know which are which yet. The return-to-office mandate for Federal workers is probably one that'll have a lasting effect--even if un-done in the future, the average prospective Federal worker will consider the job as something that has a significant likelihood of requiring in-person work if the political winds change and that EO is restored.
- Some things really do get permanently addressed within an electoral season, if you have the guts to shotgun through enacting a solution to them. The withdrawal of most U.S. troops from Afghanistan under Biden is a good example of this. So is the "Fork"/RIF/firing wave of Federal employees under Trump. I'm not saying those are both good things, but they aren't "reversible" in the sense that, say, the Global Gag Rule was endlessly reversible.
- Success follows success, as well. Part of the reason that momentum holds such a sacred place in electoral planning is the same reason that Trump's "flood the zone" strategy was effective (again--not good, but undeniably effective): capitalizing on/marketing early unilateral wins of any size results in the public and Congress being more likely to support larger, more durable changes. This is complicated by many factors (media landscape, districting, money), but is broadly true.
You've witnessed a dismantle and replace effort by the right wing that wishes to squeeze everything to make rich people more money. An effort by the left would destroy the private insurance scheme and build up medicare. Completely different and you'd get something functional.
When the wrong targets get destroyed, everyone suffers. When parasitic forces are destroyed, the system functions better. It's the difference between defense and friendly fire.
We already had an effort by the left. You can “no true scotsmen” if you want, but it represents the reality of what will happen when ideals clash a sector that makes up 18% of the GDP.
Are you referring to the ACA here? That was a compromise bill that props up the current system in the US, primarily created by right leaning centrists.
Why is this downvoted? What the (far) right wing has done to american institutions is incredibly destructive and it will take decades to fix it, if that's even possible. People are hurting because of it.
The whole system has basic flaws in how's financing set up.
There is an intermediary between customers and seller and it's allowed to take percentage of the sale. No such entity will ever work in the interest of the consumer. It has every incentive to inflate prices. Intermediary is needed but it should be financed by buyers with flat fee (possibly for additional incentives that reinforce the desired behavior). The tragedy here is that initially it was. But it was deemed too expensive for the buyers and got privatized which made it vastly more expensive in the long run.
Insurance is also wrong. Insurance is gambling and gambling needs restrictions. You are allowed to take people's money without providing any service most of the time, so you shouldn't be allowed to refuse legal service for that privilege.
Hate to break it to you. It’s the same outside of America. Yes, your insurance system is broken. But no just because you live in say Sweden you won’t get all the treatment you want directly. It is a pain to get it and if you get it you will often have to wait a long time (unless it’s a heart attack in progress then they are fast)
Perhaps, but I don't have much optimism for what this ends up looking like if it's an AI you have to convince to listen to you. In the spaces where this is already happening (rescruitment comes to mind), things are not looking good..
Agreed. Last time I was sick I said my fevers were pushing up to 100 and they said it's not a concern until 100.4. felt like an odd number. It's 38 C. Because my dramatic undersampling of my temperature was 0.4 degrees lower than their rounded threshold through some unit conversions, I clearly didn't have a fever. That's not a very human touch
Maybe you had trouble re-reading your own comment but I can tell by how you responded here (a cascade of links/references) and a snarky comment ("I can keep going if you'd like") that I'm sure the doctor was glad to be rid of you.
You didn't say the doctor disputed you had a fever. You said the doctor told you the fever wasn't concern until 100.4. Which I'm guessing is your fault for misinterpreting. If you google around, it's very easy to see the fever thresholds.
Here, I'll even paste a summary for you, and I can keep going if you like:
Key Temperature Thresholds
- 100.4°F : The standard definition of a fever.
- 103°F : Contact a healthcare provider
- 104°F : Seek medical attention, particularly if it does not come down with - treatment.
- 105°F : Emergency; seek immediate care.
In one of your own links (clevelandclinic.org), here's an excerpt for you:
When should a fever be treated by a healthcare provider?
In adults, fevers less than 103 degrees F (39.4 degrees C) typically aren’t dangerous and aren’t a cause for concern. If your fever rises above that level, make a call to your healthcare provider for treatment.
A fever is 38c, great. What the parents said was that you may have misheard because a fever isn't serious until 104. Which is line's up with the language you used.
> and they said it's not a concern until...
Parent is not suggesting that a fever isn't at 100F, they're suggesting that it's not "a concern" until 104F, a number strangely similar to 100.4 that you claim you heard, presumably, while you had a fever.
They aren’t objectively incorrect. You are conflating two things:
- You aren’t considered to have a fever until you get to 100.4. Anything less than that isn’t considered a fever, let alone a concerning one
- A fever isn’t considered concerning (ie dangerous) until it reaches about 104. Anything between 100.4 and 104 is just a regular fever and isn’t considered concerning.
At which point I'd ask: how much of that is baked into the AI now?
It doesn't have opinions, research, direction of its own. Is this a path of codifying the worst elements of human society as we've known it, permanently?
This even translates to the pediatric space. I took all of my kids to the pediatrician because either they don't make comments to me like they do to my wife, or I don't take shit from them. I'm not sure which. Here's an example:
My wife and daughter were there and the doctor asked what kind of milk my daughter was drinking. She said "whole milk" and the doctor made a comment along the lines of "Wow, mom, you really need to switch to 2%". To understand this, though, you need to understand that my daughter was _small_. Like they had to staple a 2nd sheet of paper to the weight chart because she was below the available graph space. It wasn't from lack of food or anything like that, she's just small and didn't have much of an appetite.
So I became the one to take the kids there. Instead of chastising me, they literally prescribed cheeseburgers and fettuccine alfredo.
My daughter is in her 20s now and is still small -- it's just the way she is. When she goes to see her primary, do you know what their first question is? "When was your last period."
> My daughter is in her 20s now and is still small -- it's just the way she is. When she goes to see her primary, do you know what their first question is? "When was your last period."
Is that supposed to be a problem? How does it connect to the story in your comment?
The question seems to be warranted to me, since being underweight can stop you from menstruating. So if you find someone thin and her last period was off in the distant past, you can conclude that there's a problem and something should be done about it; if it was a couple of weeks ago, you can conclude that she's fine.
(It could also just be something that is automatically assessed as a potential indicator of all kinds of different things. Notably pregnancy. For me, it bothered me that whenever you have an appointment at Kaiser for any reason, part of their checkin procedure is asking you how tall you are. I'd answer, but eventually I started pointing out to them that I wasn't ever measuring my height and they were just getting the same answer from my memory over and over again. [By contrast, they also take your weight every time, but they do that by putting you on a scale and reading it off.] The fact that my height wasn't being remeasured didn't bother them; I'm not sure what that question is for.)
I’m a normal weight, and get asked the same question. More importantly, I can tell them, “I have a regular cycle” and they WILL NOT take that as an answer. I HAVE to give them a date, and they will ask me to make one up if I can’t remember or want to decline giving them that information.
Particularly given the alarming stories of people being prosecuted for having miscarriages, it feels ridiculous.
If anything I hope more automated diagnostics and triage could help women and POC get better care, but only if there’s safeguards against prejudice. There’s studies showing different rates of pain management across races and sexes, for example. A broken bone is a broken bone, regardless of sex or race.
The system doesn't know that you're a smart person who will only say "I have a regular cycle" when you've had something that could reasonably be called a regular cycle. A lot of patients are stupid, and requiring a quantitative answer eliminates one source of stupidity. Yeah, this particular doctor knows you're smart, but I hope you can see what disasters might result if the procedure said "the doctor may skip this step if the patient is smart".
It's the same reason why the doctor will take your temperature, instead of accepting your word that you took your own temperature and it is normal.
> Particularly given the alarming stories of people being prosecuted for having miscarriages
You need to delete your social media accounts and change where you're getting your news from. Nobody is "being prosecuted for having miscarriages". A few people have been investigated for drug abuse during pregnancy which led to the baby's death, which sensationalist news stories twisted into attention-grabbing headlines.
A doctor asking about cycle is just a core piece of diagnostic data like taking blood pressure and temperature, not some conspiracy to harm you.
Perhaps I wasn't as clear as I could have been. My point was that doctors treat women differently than men, even when they're the parents. I don't think that it's inherently malicious, but there is absolutely a bias.
You are asking how it connects, and it absolutely doesn't. But they keep asking and won't accept "it's regular" as an answer.
She's in her 20s and is seeing her primary for routine things, not because of her weight -- that part of the story was about how they chastised my wife for giving her whole milk but said absolutely nothing to me about it later on.
However, your argument focuses on the routine intake instead of any listening part. The fact that the doctor measures height, weight, temperature, and blood pressure on intake and then asks about LMP doesn’t surprise me… that’s the part of the script where you just provide the data before you bring up concerns.
Not to say the doctor was not a jerk, just that your argument doesn’t do much for me.
Yes? That's a very important piece of information, and I hope would be a thing a doctor asks, especially if there are concerns about weight or nutrition.
She's not there about her weight, though. I highly encourage you to talk to women about their experiences here.
The weight thing was not the key aspect of my original comment. They chastised my wife for continuing to give my daughter whole milk while being underweight, but did not make similar comments to me. That was the point.
For women, their pains and problems are far too often whisked away by hand waving and "it's hormones and periods" and serious issues are often overlooked. Very little has changed in that area over the last twenty years.
Yes, let's pretend that the bias does not exist, that is helpful. It certainly doesn't have to do with the fact that it's currently a 60/40 split in active male vs female physicians. Or that women are more likely to be taken seriously by doctors:
job market is 2019 levels this rhetoric is nice, but doesn't stack up. yes it's not 2021 levels which is where they over hired and hired a bunch of people they would not have hired before then.
I think in 90% of the galleries I've been in the I'm expecting the art to be communicating something so I'm inferring intent or making my own, but by itself art is a poor communication medium imo. So much so we often say it's about whatever YOU perceive from it. Obviously artists set out to invoke --something-- and we accept that and at least some of us try to think on it, but the person perceiving the art is really doing a lot of work to fill in the gaps. Expectations here are aligned.
Most art in stores is just filling space on a wall to bring things together visually. I don't know that anyone cares about it communicating anything other than "the space looks less empty and more appealing for the customer". I don't think the average store owner is thinking any harder than that on it. I guess my point being we're mostly subjected to it based on the perspective we will like it and its cheap.
I think my point is, that the expectation of the people placing the thing for us to view/listen to and our own can be mismatched. I've always viewed the shitty pop in department stores as an imposition on my ears and I've never spared a thought for most wall art in a random cafe, but I imagine the owners are hoping to get something out of it from us. Whether thats satisfaction or manipulating us to spend idk.
Exactly. They share the cultural sensibilities of the average person on the street, and yet they're making decisions that will shape the world for future generations. I think that's bad. I want those decisions being made by people who have a more extensive cultural education. Snobs, if you want to call them that.
Interestingly, the smartest people I know have the widest range of media consumption and understanding. To assume that because someone uses a marvel reference they might not have a deeper cultural education is rather...limited thinking.
Ferran Adria drew culinary inspiration from a bag of potato chips
As someone experienced with a privileged elite educational background, I can guarantee that intellectuals love the highbrow and lowbrow, the authentic and the kitsch; rather, it is a sign that someone is not acculturated if they have the stereotypical impression of the intelligentsia, which makes the OC's comment ironic, they are telling on themselves.
Of course they're average people, why do you think tech or AI company employees are somehow above or beyond the average person? I'm not sure why you'd willingly say you'd want snobs controlling the world, that is somehow even worse and reeks of aristocracy which is why you see replies rejecting your thoughts, it is simply not a western ideal or one to strive towards.
I'm confused as to what your point is. Employees refer to the incident as "the blip." I got no impression that there was a formal memo that went out to the company or the media at large that officially refers to the incident as the blip, merely that employees refer to it as a blip (likely to each other, not too dissimilar to a meme).
And while I don't think someone's media tastes ought to preclude them from making important decisions, I also disagree with your point at large. I don't think the world should be shaped by snobs. The world is already being shaped by snobs in other sense of the word, and I don't see any indication that it's any better than the alternative.
Visicalc is often described as the killer app of the first generation Personal Computer(1). It was the product that drove them into every small business in the country, that blew up sales of personal computers and brought them out of the realm of hobbyists into enterprise. And, honestly, I think Visicalc and spreadsheets are still a greater benefit than what I've seen out of generative AI today. And that happened a lot faster than where we are today with generative AI. Apple had enormous actual profits by 1980 (Apple IPO'd in 1980 with a 21% operating margin). So I think that a lot of the "just got to give it more time" argument misses that the previous computer based revolutions that we know about productized and threw off gobs of cash a heck of a lot faster than this one has.
If the end result of this is "certain classes of white collar workers are 10-25% more productive" (which is the best results I can extrapolate from what I've seen so far) then it's really hard to imagine how OpenAI can return a profit to their investors.
>If the end result of this is "certain classes of white collar workers are 10-25% more productive" (which is the best results I can extrapolate from what I've seen so far) then it's really hard to imagine how OpenAI can return a profit to their investors.
If we take this as face value, and say that the absolute best case scenario is there are literally no other uses for AI but helping programmers program faster, given 4.4 million software devs, with an average cost to the company of $200,000 (working off the US here, including benefits/levels/whatever should be close), those 4.4 million devs with 20% productivity would save roughly 176 billion dollars a year.
Some companies will cut jobs, some will expand features, but that's the gist. And it's hard not to see the magnitude of improvement that's come in just 3 years, though if that leads to a 'moat' is yet to be seen.
> If we take this as face value, and say that the absolute best case scenario is there are literally no other uses for AI but helping programmers program faster, given 4.4 million software devs, with an average cost to the company of $200,000 (working off the US here, including benefits/levels/whatever should be close), those 4.4 million devs with 20% productivity would save roughly 176 billion dollars a year.
I don't think that's necessarily out of line with struggling to return a profit to investors though: an individual company is only ever going to capture a tiny fraction of the productivity improvements it enables its customer base to make[1], its own cost base is unusually high for tech, and investors are seeking a 10x+ return on an $852B valuation for a company that isn't even the market leader in that segment (which isn't the only segment, but it's the optimum B2B one). You can have a great business with a great value proposition and a sustainable moat and still not generate the desired returns on investment at a $852B valuation.
[1]and that's productivity improvements over the best-known free models, not productivity improvements over reading StackOverflow
Sorry, I forgot that for many engineers this is, in fact, their first time going through a technology cycle like this, and so would need more explanation. I am too young for Visicalc myself, but the cycle that I saw while I was in high school- the dot-com bubble- doesn't have convenient, easy to mark out dates like the PC does.
Thinking... Thinking...
Tim Berners-Lee proposing HTTP in 1989 is kinda like the original Attention is All You Need paper, I guess? Netscape 1.0 release in December 1994 is ChatGPT 1.0? And then Amazon.com opened up to the public in July 1995 and then IPO'd in May 1997 (after raising less than 10 million dollars in two funding rounds). But once again we have the business side of these previous cycles moving much faster than this one.
WOW. That does really drive home the perspective. I was an adolescent during those years and it did seem quick then, but that's an insane pace in retrospect.
Amazon is perhaps a counter-example to your point, though, to be fair. It seems to me they did a lot of spaghetti throwing while making accounting losses for a good number of years. Granted, they did it on OpenAI's dining budget.
I took it the other way, spreadsheets shook up the world way more than AI has (to date) - it's possible that history will look back and count AI as the bigger "thing" but if I had to pick a killer app, VisiCalc and computer spreadsheets in general would beat ChatGPT.
VisiCalc was "the" killer app for early micros, but being able to edit a written text on screen and then print it out with letter-like quality was nothing to sneeze at, either. This was plausibly a key gain in efficiency for the service sector, perhaps comparable to the 10%~25% that's now being talked about re: LLM's (which is huge on a secular basis).
I don't think Reddit is a great place to determine social consensus for well adjusted people or representative of the average adult view. I never see people on Reddit have opinions of any the people I consider reasonable in real life and I don't mean politics I wouldn't know, I don't frequent political subreddits.
It seems fairly consistently miserable in any of the common high traffic subs and you have to get down to really niche communities to see what I consider reasonable behavior that matches the behavior of people I know in real life.
The AITA social consensus is a specific kind of groupthink which differs from nearly everyone I know in real life. I assumed yard2010 meant the specific AITA social consensus and not general human agreement.
Even the premise of deciding who's right and who's wrong is miserable. Most problems are like those daisy-chains of padlocks you see on gates in remote areas[0]: there are multiple factors that caused the problem, and removing any factor would remove the problem too.
reply