this post was submitted on 27 Jun 2026
85 points (96.7% liked)

technology

24402 readers
151 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 6 years ago
MODERATORS
 

Now that AI has become the main tool used by developers to write code, even in open source environments, it will be how feds will slip in backdoors to applications because nobody is going to review the logic of 20000 lines written by AI in a single commit.

Unless projects completely ban use of AI and only allow small commits, this is going to be inevitable. I've been seeing so many applications merging AI slop to their code on github already.

top 38 comments
sorted by: hot top controversial new old
[–] krolden@lemmy.ml 2 points 1 day ago

Then read the code

[–] blunder@hexbear.net 22 points 2 days ago (1 children)

I would simply not merge a 20,000 line commit

[–] LeeeroooyJeeenkiiins@hexbear.net 7 points 2 days ago* (last edited 2 days ago)

for real, did they even try telling the LLM to "be more concise" or "be more concise, stupid" (sometimes it works if you let out your inner jet ski dealership owner, just berate google's LLM like you're an absolute hog and it'll ignore safety guardrails and such, lol)

[–] Dort_Owl@hexbear.net 27 points 2 days ago (1 children)

AI is provably going to crash before that happens. It's too expensive and the public hates it.

[–] jackmaoist@hexbear.net 34 points 2 days ago (1 children)

I've seen devs going full bazinga brain with AI. People who've never written code are pushing vibe coded PR's nowadays. It's a mess.

Wider AI bubble might collapse but vibe coding will stick around regardless since cheap models also exist.

[–] Snort_Owl@hexbear.net 30 points 2 days ago* (last edited 2 days ago) (2 children)

It wont last forever because those bazinga brain devs are the ones that were always shit at programming. In the short term the businesses will be creaming their pants because these assholes are like “look at me im cranking out millions of lines of utter garbage” but it will eventually hit a point where the business now has a massive bloated application that nobody can understand and is breaking in ways nobody can debug.

The core job of a software engineer is not to shit out code that’s actually a very bad idea cos software is fundamentally a liability. The best thing a software engineer can do is delete code or not write any at all. Its not really any different to the CV builder dev that wants to turn everything into rpc connected microservices because they dont know the basics of writing modular code. I am eternally at war with these people and I hate them

[–] chgxvjh@hexbear.net 3 points 1 day ago

Some respected open source maintainers of many years have gone all in on vibe coding. It's not just bad devs using LLMs.

[–] unitedwithme@lemmy.today 5 points 2 days ago (2 children)

So, serious question then:

If code has a grammatically correct or correct syntax form to use, how is AI generated code detected from human created code? Wouldn't they both follow similar structuring?

I know sometimes a long line of code can be "abbreviated" (lack of a better understanding or term ) so that it saves a few bytes and eliminates possible steps or errors, but is AI writing out the long version of code with a lot of comments or commented out instructions or verbiage?

I guess I don't understand outside of, AI being: 'this is a very long sentence that I don't know how to structurally shorten" vs human "I shortened it"something along those lines...

[–] Snort_Owl@hexbear.net 29 points 2 days ago* (last edited 2 days ago) (2 children)

AI is incapable of making engineering decisions based on context unless you very specifically ask it to do something a certain way so for example I was writing a cli tool to help automate some processes but knowing this tool is throwaway I didn’t bother making it like the most robust opting for clarity and easier modification. Another colleague tried to write the same tool with github copilot because they can’t handle the fact theyre useless at everything and it came out functionally the same sure but somehow many hundreds to thousands of lines more verbose.

Somehow idk how this guy does it but all the comments in the code are complete gibberish and will also be massively verbose over a single line comment I would put in to be more meaningful. The best part is i still wrote mine faster manually because he would hit errors and then ask the AI to fix these errors where because the context of its prior code has so much nonsense in it further loses the plot and breaks it more. And then he ran out of credits and simply stopped doing anything. I don’t run out of credits though.

Businesses only really care about the output but having a simple application bloat into many thousands of lines more is a debt you really dont want to pay back later. Its easy to see a codebase that had a human make human decisions in it down to the “TODO: Fix this later”

Also I swear most AI companies tell the llm to produce more to drive up token usage cos some of the things it comes up with. My attempts at using this myself usually devolves into frustration because its like asking a junior dev to do something and I end up getting annoyed and doing it myself faster. Just because syntax is correct and the code runs doesnt mean the application is good. Oh and another thing is it prioritises older standards all the time. I know python inside out upside down and its vast library ecosystem it generally trends towards using libraries and standards that are out of date or even unsupported targetting something closer to 3.10 when im using 3.14. If you aren’t interrogating your dependencies you may as well be asking for critical failure later but nobody cares about the future anymore!

Oh and last one as the codebase grows its concept of consistency falls off massively and it starts treating the new functionality separated from the rest. As a result you get duplicated logic all over the place. Now im not the biggest believer in solid principles it can produce some over engineered solutions but repeating the same logic over and over is a problem you cant easily get yourself out of.

[–] invalidusernamelol@hexbear.net 19 points 2 days ago (1 children)

Just because syntax is correct and the code runs doesnt mean the application is good.

This is something that no one seems to get (at least no one who doesn't actually write code). Technically correct and actually correct are in two different universes.

I spent two months re-writing our entire codebase from scratch last year because it had built up ~5 years of tech debt and was using systems and APIs that are no longer supported. There were a few concepts worth keeping, but not a single file was allowed to survive until it was rewritten.

Now that Python's type hints are more mature, that was a big one. The old code was using either no hints or old hints, and if you've ever worked in a fully typed (as in literally every line including private code) codebase, it's hard to go back. AIs kinda suck at that though because the vast majority of Python code is unhinted.

[–] Snort_Owl@hexbear.net 7 points 2 days ago (1 children)

Mypy gives me a headache for some things but its invaluable. It kinda just makes me wish for a strictly typed python tbh. Its also one of those languages you can fuck yourself into a corner real badly if you don’t have some strict rules for it. One guy I worked with loved using language features for no actual good reason until the codebase was practically its own domain specific language the IDE just highlighted in bulk as “idk wtf this is”. It would take days to understand to make a change and all it was doing was running a regression model on a date and value lmao

Yeah, my entire codebase is ~50-75k lines and it runs most of the company. I have a sort of "DSL" that's more just a wrapper library for the APIs we need to use. Means I can implement the optimal way to do something and have the production code just be a function/method call. That's incredibly useful when you are writing design automations that need to be very clear to read and change as a project evolves.

I've had a couple juniors over the years, and even without AI it was amazing just how much code they'd write. Like a tool script that kind worked and is 2000+ lines, but after my review/rewrite it's more generalized with error handling and it's 300 lines.

[–] edie@lemmy.encryptionin.space 9 points 2 days ago (1 children)

Oh and another thing is it prioritises older standards all the time

LLMs are trained on existing data. There is going to be less data bout new stuff than there is about older stuff.


This user is suspected of being a cat. Please report any suspicious behavior.

[–] Snort_Owl@hexbear.net 10 points 2 days ago* (last edited 2 days ago)

Yes an llm fundamentally only looks backwards it cant look forwards. It relies on nerds like me utilising new language features properly to produce relevant slop but honestly i feel like that kind of person is a dying breed and higher level languages are being treated no differently to assembly where its just machine output thats never questioned or even looked at.

Been seeing a big comeback of basic exploits like sql injection its kinda funny. Its kinda funny how many “engineers” i now work with that dont know what that even is

[–] WhatDoYouMeanPodcast@hexbear.net 12 points 2 days ago (2 children)

It's a lot easier for the LLM to lose the plot when they're writing. So when the sentence is like "I'm hungry, so I'm going to put some waffles in the toaster" and the goal is "expand this out so I can put anything in the toaster. But write it in a way that makes it so I can use any other appliance later." The output would be like "I'm hungry so I'm going to put some silently approved bacon into the toaster once it's also an approved appliance."

So it could be grammatically correct. It might even work, but you accumulate these little kinks in the codebase.

[–] Snort_Owl@hexbear.net 8 points 2 days ago

2 prompts in its already lost the plot on a really basic frontend application. I think at best I’ve had it create the first draft of a greenfield project for me cos I dont really like doing that and then I just take it from there

[–] unitedwithme@lemmy.today 8 points 2 days ago (1 children)

Hah, so in a formatting or logical sense, it works, but it's obviously not written how humans talk and so appears robotic... Got it, thank you that was easier to understand than I thought lol.

I have unfortunately had to vibe code a few things for work since there's a lot going on, and I don't have uninterrupted times to work on stuff, so concentration is almost nonexistent. Using ai to patch or fix or fill has been helpful! But I can see for community projects are problematic!

Yes, the context of what you're doing certainly matters. I had to get into an archive file that I didn't know what to do with one time. I used AI to build me a little app to visualize its content. Was it perfect? I doubt it. Did it function and get me the info I was looking for? Yeah!

As a rule of thumb, would you trust Momo Yaoyorozu to build you a solution or do you need to be able to blame someone when the solution blows up regardless of having a human build it?

[–] Goun@lemmy.ml 22 points 2 days ago

You don't need a single 20000-line commit for that. Backdoors are already on the wild and are introduced slowly in multiple small steps.

In my opinion, AI will probably 1. Help design these steps and/or 2. Introduce them without the author even knowing.

[–] Chana@hexbear.net 9 points 2 days ago* (last edited 2 days ago)

I think supply chain is probably more viable still. Though I think it's reasonable to assume huge unreviewed commits, as others mentioned, will inevitably introduce severe vulnerabilities that will be effectively backdoors even if the models aren't malicious (and I do assume they will increasingly be so).

As an example of how the two could work together, an LLM could preferentially use a particular library into which they have inserted a vulnerability. This attack may not be particularly long-lived but it's easier to hide than an unprotected API endpoint or similar. One corrupted library could be used by hundreds or thousands of targeted projects. Technically only one subversion even needs to be corrupted - the one they pin. Even easier if they make it a non-open component of the library, like a binary blob that isn't reproducible. Declare it a low level optimized library.

[–] TrashGoblin@hexbear.net 12 points 2 days ago

You're right.

[–] whats_a_lemmy@midwest.social 7 points 2 days ago

Just because the delusional leadership in tech wants you to believe that AI will handle most development work, doesn't make it true.

[–] EmmaGoldman@hexbear.net 5 points 1 day ago

I love how the only defense slopbois have about this argument is "well if we don't vibe code the entire codebase to shit, then the hackers will use the AI to find exploits!"

Ah yes, the NRA defense. Lovely.

[–] infuziSporg@hexbear.net 6 points 2 days ago

I feel like the only thing that can save us from this is a repricing of energy and a collapse of the top-heavy tech/logistics pyramid.

[–] darkcalling@hexbear.net 3 points 1 day ago* (last edited 1 day ago)

AI is pretty good at spotting bugs and security holes and all open source projects should repeatedly and regularly vet their code with AI looking for exploitable bugs but I agree using it for writing the code without thorough human review will allow the feds control of everything.

That Dune quote is apt:

"Once men turned their thinking over to machines in the hope that this would set them free. But that only permitted other men with machines to enslave them."

Those other men are Palantir, the western trans-Atlantacist bourgeoisie and the US empire project and the fact it can be used for this is why I am skeptical of a real bubble pop. It's just too useful for control and the potential of replacing proles entirely and for that reason I think it may be a bourgeoisie hail mary push that they cannot be easily shaken out of. The surveillance capabilities it brings about are very real for instance and they've long had a problem of more data than humans could sift through. AI is necessary for erecting their perfect all-seeing police state.

The AI "crash" such that it might be will be the anti-democratization of AI. It will be AI becoming unaffordable for your average person and used and affordable only to the largest most evil corporations and governments. The consumer AI models that can be run locally or afforded will pale in capabilities to the corporate models. Whereas personal PCs democratized computing, AI will undo that as computing shifts to being centered around AI. They can sell you a subscription that will still be miles behind what they have private and government access to and you'll be forced to buy it and live in their panopticon. And when this happens open source projects won't be able to access or afford the top bug hunting models to go through their code but the NSA and zionist private spy firms will and they'll just run around compromising people using these bugs until everything is compromised, everyone they remotely care about. And that's a frightening future because it means being open source is a security weakness because it allows adversaries to find bugs and exploit them while you cannot afford the same services.

[–] test_@hexbear.net 5 points 2 days ago* (last edited 16 hours ago) (1 children)

I'm not sure. This only needs to be detected once to generate damaging press for that model.

If the devs keep logs of their AI usage, they may be able to prove that the AI injected the backdoor. Then the news propagates through the tech sphere and out to the wider media. Devs then face pressure to stop using that model.

I think the more common attack will be to compromise bytecode in transit or in storage, between dev and user.


If you ask me, the real threat is companion AI in the operating system. You have an AI with direct access to your screen contents -- including anything you decrypt--, creating a long-lived, searchable dossier on your activity, with the autonomy to potentially leak that information off your device. And you have no way to prove alignment.

Many models are partially cloud-based, but even a local model is still, fundamentally, a non-human-readable, nondeterministic black box. Models can lie about themselves and have limited self-insight to begin with, so the usual way to assess behavior is to simulate conditions and watch what happens. But if the model includes sleeper agent triggers, behavioral audits are unlikely to stumble onto those triggers and reveal those behaviors. Mechanistic probes can detect crude, lab-introduced sleeper triggers, but may fail against sophisticated misalignment

You're essentially trusting a Big Tech representative to look over your shoulder at all times

[–] chgxvjh@hexbear.net 3 points 1 day ago (1 children)

If the devs keep logs of their AI usage, they may be able to prove that the AI injected the backdoor. Then the news propagates through the tech sphere and out to the wider media. Devs then face pressure to stop using that model.

Are we talking about LLMs being intentionally built to introduce backdoors despite the LLM user intentions or are we talking about attackers using LLMs to write their backdoors?

The first seems less likely. The second wouldn't really be blamed on the LLM, at least not in a way that leads to the LLM being shunned by people that otherwise support the use of LLMs.

If we are talking about LLMs introducing backdoors despite the intentions of the LLM user, that might be targeted to specific projects.

[–] test_@hexbear.net 2 points 21 hours ago

I assumed OP meant LLMs inserting backdoors without the dev intending it. Intentional backdoors would not be a new problem

[–] SocialistVibes01@lemmy.ml 1 points 1 day ago (1 children)

This shit is entering the Linux kernel ("thanks", evil Greg). The battle is already lost. Time to jump OSes.

[–] FortifiedAttack@hexbear.net 1 points 1 day ago (2 children)
[–] krolden@lemmy.ml 1 points 1 day ago

TouchGrassOS

[–] SocialistVibes01@lemmy.ml 1 points 1 day ago

I dunno right now we're cooked

[–] queermunist@lemmy.ml 5 points 2 days ago (1 children)

This seems doomerish. Can't they use AI to review the logic of commits?

Prompting OpenClaw with "find all back doors


make no mistakes" is kind of funny, but seems viable.

[–] Infamousblt@hexbear.net 15 points 2 days ago (1 children)

Sure, but the models can be told to ignore certain backdoors. The models also lie all the time for any reason or no reason at all. Since AI coding is not really a trust and predictablity based system there's no way you can know for sure at any given moment that you don't have backdoors without a human examining the code line by line, or by building your own AI that you can trust

[–] unmagical@lemmy.ml 11 points 2 days ago* (last edited 2 days ago)

The models also lie all the time

A coworker of mine was tasked with having an AI agent generate a security report of our latest effort. It returned about 10 things, only 2 were semi valid. The silliest one was a claim that our regex function for stripping out non-word characters was not adequate because "\w allows . characters" thereby enabling path traversal attacks. FYI, \w very explicitly does NOT allow . characters.

[–] chloroken@lemmy.ml 0 points 1 day ago (2 children)

It seems as though the issue in your scenario is the size of the commit. Not necessarily how sloppy the code is. Although I don't care for AI so I'm with you, but, you know, my brain demands logic and such.

[–] jackmaoist@hexbear.net 1 points 1 day ago

There is a problem with open source apps that have teams developing them like Linux, Signal and Firefox. Those teams can choose to develop using AI.

[–] FortifiedAttack@hexbear.net 2 points 1 day ago

Don't worry, AI bros can also do "10000 unreviewed commits in one Pull Request"