Scott Shambaugh mulls about an AI alignment issue following his run-in with a bot last week
TechTakes
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
i've collided with an article* https://harshanu.space/en/tech/ccc-vs-gcc/
you might be wondering why it doesn't highlight that it fails to compile linux kernel, or why it states that using pieces of gcc where vibecc fails is "fair", or why it neglects to say that failing linker means it's not useful in any way, or why just relying on "no errors" isn't enough when it's already known that vibecc will happily eat invalid c. it's explained by:
Disclaimer
Part of this work was assisted by AI. The Python scripts used to generate benchmark results and graphs were written with AI assistance. The benchmark design, test execution, analysis and writing were done by a human with AI helping where needed.
even with all this slant, by their own vibecoded benchmark, vibecc is still complete dogshit with sqlite compiled with it being slower up to 150000x times in some cases
This is why CCC being able to compile real C code at all is noteworthy. But it also explains why the output quality is far from what GCC produces. Building a compiler that parses C correctly is one thing. Building one that produces fast and efficient machine code is a completely different challenge.
Every single one of these failures is waved away because supposedly it's impressive that the AI can do this at all. Do they not realize the obvious problem with this argument? The AI has been trained on all the source code that Anthropic could get their grubby hands on! This includes GCC and clang and everything remotely resembling a C compiler! If I took every C compiler in existence, shoved them in a blender, and spent $20k on electricity blending them until the resulting slurry passed my test cases, should I be surprised or impressed that I got a shitty C compiler? If an actual person wrote this code, they would be justifiably mocked (or they're a student trying to learn by doing, and LLMs do not learn by doing). But AI gets a free pass because it's impressive that the slop can come in larger quantities now, I guess. These Models Will Improve. These Issues Will Get Fixed.
spent $20k on electricity blending them
They would probably be even more impressed that you only spent $20k
AI bros do new experiments in making themselves even stupider. Going from 'explain what you did but dumb it down for me and my degraded attention span' into 'just make a simplified cartoon out of it'.
Proud of not understanding what is going on. None of these people could hack the Gibson.
E: If they all hate programming so much, perhaps a change of job is in question, sure might not pay as much, but it might make them happier.
E: If they all hate programming so much, perhaps a change of job is in question, sure might not pay as much, but it might make them happier.
Surely at least a few of them have worked up enough seed capital to try their hand at used-car dealerships. I can attest that the juicier markets just outside the Bay Area are fairly saturated, but maybe they could push into lesser-served locales like Lost Hills or Weaverville.
my current favorite trick for reducing "cognitive debt" (h/t @simonw ) is to ask the LLM to write two versions of the plan:
- The version for it (highly technical and detailed)
- The version for me (an entertaining essay designed to build my intuition)
I don't know about them, but I would be offended if I was planning something with a collaborator, and they decide to give me a dumbed down, entertaining, children's storybook version of their plan while keeping all the technical details to themselves.
Also, this is absolutely not what "cognitive debt" means. I've heard technical debt refers to bad design decisions in software where one does something cheap and easy now but has to constantly deal with the maintenance headaches afterwards. But the very concept of working through technical details? That's what we call "thinking". These people want to avoid the burden of thinking.
Eh, one might say that going by the broad strokes version while letting the expert do their thing is basically what management is all about, especially if they ignore the part where he wants his version to be light and entertaining.
This isn't about managing subordinates though, this is about devising ways to be complacent about not double checking what the LLM generates in your name.
@Soyweiser @BlueMonday1984 I like* how the structure of the boat changes from moment to moment. I like* how the radio dishes just beam from some random place between the transmitter and the dish. I like* that the original person who was waiting for a live stream doesn't get it (because it goes to a different group of people) and is just eating popcorn watching the mess unfold. I like* how the "audience" have their backs to the "live stream" screen and are excited to be looking away from it.
I think I understand it. Think of an alcoholic that's trying every sort of miracle hangover "cure" instead of drinking less.
in today's news about magical prompts that super totes give you superpowers:
We introduced SKILLSBENCH, the first benchmark to systematically evaluate Agent Skills as first-class artifacts. Across 84 tasks, 7 agent-model configurations, and 7,308 trajectories under three conditions (no Skills, curated Skills, self-generated Skills), our evaluation yields four key findings: (1) curated Skills provide substantial but variable benefit (+16.2 percentage points average, with high variance across domains and configurations); (2) self-generated Skills provide negligible or negative benefit (–1.3pp average), demonstrating that effective Skills require human-curated domain expertise
I am jack's surprised face
...and given I have other yaks, I shall not step on my "software and tools don't have to suck" soapbox right now
This reminds me of when Steve Jobs would introduce every new Mac release by talking about how fast it could render in Photoshop. I wonder how he would do in our brave new era of completely ass-pulling your own bespoke benchmark frameworks.
Somebody on bsky talking about various Ben Goertzel Epstein file emails.
Goertzel discusses with Epstein future scenario for "an AGI economy":
"for the AGI parasite to overcome to regular-human-economy host (so it can grow to be more than a parasite and eventually gain its own superhuman autonomy) it first needs to suck off the resources of the host"
The Rationalists!

who up continvoucly morging they branches
Timn
yes, I certainly do know how to handle software development over Timn
it is actually kinda incredible that this shit has invented a way to be terrible that we can't actually easily riff off by what's expressable in unicode. an unholy clusterfuck of what would otherwise be be joked about as keming (but isn't because it's straight-up an artefact of the process used to encode visual data from source data, badly), a mindless automaton outputting garbage, and then also the shitty model
and people keep telling me this shit is good
and people keep telling me this shit is good
I mean, this one is really good, I got like half an hour of jokes with my friend off it
okay I can't argue with that outcome
wait, was this brain-rotting cognitive hazard posted at the linked page on microsoft dot com documentation? if so they have already removed it
edit: archive caught it
I checked yesterday and it was there, can confirm

what I'm thinking about is for how many years now they have been promising that just one more datacenter will fix the "hallucinations", yet this mess is indistinguishable from nonsense output from three years ago. I see "AI" is going well
Two thoughts:
That this is not just some random AI generated graphic, but from official Microsoft tutorial is unpleasantly unsurprising.
I think the tinm (timn ?) axis goes the wrong way.
It's morgin' time
*morgin' timn
morgin' all muh featues
Microsoft is really putting the "git" in GitHub thanks to copilot.
Mighty Morgin' Power-Sloppers
You have to wonder about that Tim traveler; Merlin?
A little exchange on the EA forums I thought was notable: https://forum.effectivealtruism.org/posts/EDBQPT65XJsgszwmL/long-term-risks-from-ideological-fanaticism?commentId=b5pZi5JjoMixQtRgh
tldr; a super long essay lumping together Nazism, Communism and religious fundamentalism (I didn't read it, just the comments). The comment I linked notes how liberal democracies have also killed a huge number of people (in the commenter's home country, in the name of purging communism):
The United States presented liberal democracy as a universal emancipatory framework while materially supporting anti-communist purges in my country during what is often called the “Jakarta Method". Between 500,000 and 1 million people were killed in 1965–66, with encouragement and intelligence support from Western powers. Variations of this model were later replicated in parts of Latin America.
The OP's response is to try to explain how that wasn't real "liberal democracy" and to try to reframe the discussion. Another commenter is even more direct, they complain half the sources listed are Marxist.
A bit bold to unqualifiedly recommend a list of thinkers of which ~half were Marxists, on the topic of ideological fanaticism causing great harms.
I think it's a bit bold of this commenter to ignore the empirical facts cited in how many people 'liberal democracies' had killed and to exclude sources simply for challenging your ideology.
Just another reminder of how the EA movement is full of right wing thinking and how most of it hasn't considered even the most basic of leftist thought.
@scruiser @BlueMonday1984 funny how they mock left wingers for «that wasn't real communism» and then come up with the same excuses for liberal democracies and capitalism whenever one points out all the shit that came out of that. It's really ALWAYS projection with them, isn't it?
Claudio Nastruzzi of The Reg chimes in on the inherent shittiness of AI writing, coining the term "semantic ablation" to describe its capacity to destroy whatever unique voice a text has.
https://softcurrency.substack.com/p/the-dangerous-economics-of-walk-away
- Anthropic (Medium Risk) Until mid-February of 2026, Anthropic appeared to be happy, talent-retaining. When an AI Safety Leader publicly resigns with a dramatic letter stating “the world is in peril,” the facade of stability cracks. Anthropic is a delayed fuse, just earlier on the vesting curve than OpenAI. The equity is massive ($300B+ valuation) but largely illiquid. As soon as a liquidity event occurs, the safety researchers will have the capital to fund their own, even safer labs.
WTF is "even safer" ??? how bout we like just don't create the torment nexus.
Wonder if the 50% attrition prediction comes to pass though...
"even safer" in this case means some combination of two things:
-
The new organization is more ideologically aligned with the transhumanist doom cult that apparently managed to eat the brains of the people with money to burn.
-
The new organization, largely as a result of this, is capable of sinking an unending amount of capital into buying compute time and Nvidia chips but due to their commitments to safety is even less inclined to actually deliver anything.
new interview with Dario Amodei dropped https://youtu.be/n1E9IZfvGMA basically exponential curve real soon, nice skepticism from both the interviewer and the comment section
On a related note, I really gotta stop browsing r/singularity man, some of the AI hype in there is just painful. though it is funny to see people with "AGI 2024/2025/2026" flairs
EDIT: this is also the same podcast where Dario said we could have AGI in 2-3 years back in 2023. So lol
Last megathread I posted about some LWer writing about "demographic collapse" in Japan (indistinguishable from anyone with any ambition leaving for greener pastures). I've read the comments to see if there's any pushback, and found this absolute doozy
yawn, i diagnose that LWer with weeb. this is something happening across entire industrialized world, causes being high performance mechanization of agriculture, old people being stubborn in regards to moving, lack of specialized work in countryside and couple of other factors. germany has patched their hospice staff shortage (not sure how effectively) with migrants, but japanese are way too racist for that. same thing happens in moldova, but you never hear sob stories about retired moldovans because they're broke and nobody cares, while moldovan govt can't really do much about it (because broke) to degree that it has not just economic and demographic, but even strategic effects. whole lotta drs strangelove in there
What in tarnation is happening at adafruit?
https://blog.adafruit.com/2026/02/14/heres-our-first-gemini-deep-think-llm-assisted-hardware-design/
@o7___o7 @techtakes It was Arduino who emitted the llm generated circuit but I was tired and conflated two companies with similar names (whose products I don't use) before going to bed. Now they're trying to throw a flame war. Pay no attention.
You handled that whole unhinged situation like a champ.
Seeing once respectable folks get trapped in this bullshit feels like watching my college pal smoke his first clove cigarette.
It seems like, for certain kind of geek, code LLMs sit somewhere between "The One Ring but rubbish" and "mechanical meth." The users certainly feel powerful and productive, and god help anyone who hints otherwise.
You can always blame an LLM for confusing the 2 companies ;)
Limor Fried and I had a class together at MIT in 2001. This has no bearing on the present circumstances and offers me no real insight (anything I could say about our extremely limited interactions would amount to confirmation bias). It's just the odd little factoid that comes to mind whenever adafruit Does Something Online.