this post was submitted on 16 Jun 2026

216 points (95.0% liked)

Web Development

5690 readers

432 users here now

Welcome to the web development community! This is a place to post, discuss, get help about, etc. anything related to web development

What is web development?

Web development is the process of creating websites or web applications

Rules/Guidelines

Follow the programming.dev site rules
Keep content related to web development
If what you're posting relates to one of the related communities, crosspost it into there to help them grow
If youre posting an article older than two years put the year it was made in brackets after the title

Related Communities

Wormhole

!cool_github_projects@programming.dev

Some webdev blogs

Not sure what to post in here? Want some web development related things to read?

Heres a couple blogs that have web development related content

https://frontendfoc.us/ - [RSS]
https://wesbos.com/blog
https://davidwalsh.name/ - [RSS]
https://www.nngroup.com/articles/
https://sia.codes/posts/ - [RSS]
https://www.smashingmagazine.com/ - [RSS]
https://www.bennadel.com/ - [RSS]
https://web.dev/ - [RSS]

Credits

Icon base by Delapouite under CC BY 3.0 with modifications to add a gradient

founded 3 years ago

MODERATORS

snowe@programming.dev

erlingur@programming.dev

Ategon@programming.dev

Vacant@programming.dev

216

“I’m calling it now, the adoption of AI agents into software development will be one of the most costly mistakes in the field’s history. Agents cannot program…” (geohot.github.io)

submitted 2 days ago by cat_fishing@feddit.online to c/webdev@programming.dev

72 comments fedilink hide all child comments

I’m calling it now, the adoption of AI agents into software development will be one of the most costly mistakes in the field’s history. Agents cannot program, and it’s taking longer and longer to realize that they can’t. They are a highly sophisticated statistical model designed to mimic the distribution of programming. The output is broken, but in a way that’s getting harder and harder to detect. Which is exactly what you’d expect from an increasingly accurate statistical model.

top 50 comments

sorted by: hot top controversial new old

[–] BehindTheBarrier@programming.dev 3 points 4 hours ago* (last edited 4 hours ago) (1 children)

But they do work, maybe not as a full replacement but my god the amount of boilerplate I can avoid in creating unit tests from scratch. Extracting and finding information in the code base is also useful, not everything is an easy text search of tracing a few code paths. It's an incredible tool for these kinds of work.

If it becomes harder to tell the difference then it also means it's closer to matching reality. And todays AI can do very impressive "reasoning", managing to debug complex issues I have had.

The most important part is that you as developer is fully responsible and can stand behind what they do and deliver using AI agents.

[–] matdave@lemmy.ml 1 points 4 hours ago

Right? The bottle has opened. It has taken so much mundane work out of programming. Also, I feel like a human is just as likely to create great looking code changes with a possible flaw. You just have to review the code. Whether it's a person or a bot, "lgtm" can only be used sparingly.

[–] agamemnonymous@sh.itjust.works 3 points 7 hours ago (1 children)

Unpopular prediction: AI agents are going to get better at coding. Not great, but halfway decent at cranking out basic features. Once everything levels out in like 3-5 years, AI agents will be a cherished part of the toolbox most software developers. It will be useful for skimming code, it will be useful for tedious parts of tasks that are just a degree off from boilerplate.

People are definitely gonna try to use it for things more complicated than that, and that'll be a mistake, and it will be costly, but the far side of it could be pretty cool actually. Admittedly I have an optimistic disposition.

[–] 0t79JeIfK01RHyzo@lemmy.ml 1 points 6 hours ago* (last edited 6 hours ago) (2 children)

~~I have yet to be impressed.~~ I’m not very convinced*. I asked for format type mappings between Pipewire, WebGpu, and Vulcan and both ChatGPT and Gemini failed very badly only providing the most common type mappings. This should be a wildly easy task, something any programmer or even beginner programmers can complete. It’s just very boring, mundane, buffer shifting like work. It almost feels like pencil pushing.

Why couldn’t they do it?

[–] BehindTheBarrier@programming.dev 1 points 4 hours ago (1 children)

Are you using real agents or the free chats on the web, because the latter ones are really dumb. Even when you ask them to search the web for basis you don't get much success.

[–] 0t79JeIfK01RHyzo@lemmy.ml 1 points 4 hours ago* (last edited 4 hours ago) (1 children)

I haven’t paid

edit: Well I did for grok, but not on purpose. Grok still failed too.

[–] BehindTheBarrier@programming.dev 1 points 4 hours ago* (last edited 4 hours ago)

The bigger paid models or potentially the local ones if given access to search the web can probably get you the right answers. The big ones have pretty much memorized half the internet, but can still be wrong so pushing them to verify their answers.

But the harder part is trusting what they say regardless. I can't just take an answer for truth, and unless I can verify the statement (fact checking myself, looking up the source, running/testing the code, etc) then it gets harder to do anything with AI. This is the thing I hate about AI in general, people just take whatever they say at face value. Lawyers with fake citations, random people asking chatgpt about random facts and such. Its a tool that people put too much faith in to do thinking for them.

[–] agamemnonymous@sh.itjust.works 3 points 6 hours ago* (last edited 6 hours ago)

Hence the "are going to get better" and "3-5 years".

[–] antrosapien@lemmy.ml 1 points 9 hours ago

If my job mandates me to use ai agents, idgaf, I'll use it. But my every oss contributions will be clanker-free

[–] Quetzalcutlass@lemmy.world 18 points 22 hours ago (1 children)

geohot

Now there's a name I haven't heard in a long time. George Hotz was the guy who first jailbroke iOS and the PlayStation 3 and made the towelroot exploit for early versions of Android, before legal threats drove him out of the scene.

[–] 0t79JeIfK01RHyzo@lemmy.ml 1 points 6 hours ago

And the self driving car company? Didn’t he also recently found another company for servers or something? I remember seeing him get like 1st place (or top 10) for one of the days on Advent of Code like a year or two ago. Thats very difficult, like Olympic swimmer of programmer type of thing.

[–] GreenKnight23@lemmy.world 7 points 1 day ago

remember, when you interview for a job and they ask you, "do you have any questions?", you ask;

has AI ever been used to develop your product?
what percentage of your product has been written by agenetic AI?
is the use of AI tracked as a performance indicator?

[–] Aceticon@lemmy.dbzer0.com 8 points 1 day ago

It's going to be a wonderful time to be a Freelance Senior Developer and above in a few years.

[–] Avicenna@programming.dev 19 points 1 day ago* (last edited 1 day ago) (1 children)

They are not the automated from 0 to 100 coders that some people claim them to be. But they are quite capable, definitely much more capable than what anyone could have imagined ten years ago. Given well defined problems they can excel at even relatively complex tasks. I pointed Claude at a latex file of a somewhat complicated nonparametric statistical estimate calculation to look for any mistakes and it was actually able to find some. I then pointed it at a code that replicates the calculations and it was also able to correctly identify some issues with the code. I think this is the way one should use LLMs, not let it loose on coding tasks. In the former way you won't even be able to burn through your first tier account quota where as in the latter the LLM will likely end up getting in weird loops burning tokens like there is no tomorrow. Also this method of sane usage of LLMs is much more suitable for open local LLMs. I don't think there is any doubt anymore that LLMs can be very useful tools, not just for doing stuff but learning it too. People should move past the stage of invalid criticisms like "they are just stochastic parrots" and move to more serious matters like environmental impact, greedy fucking CEOs pretending LLMs are replacements for humans, degredation of skills, getting lazy at checking AI code, ethics of capitalizing on collective human knowledge and the unsustainable AI bubble that tech companies are pushing for.

[–] davidagain@lemmy.world 22 points 1 day ago (9 children)

invalid criticisms like “they are just stochastic parrots”

That's not a criticism per se, it's a description of how they work.

load more comments (9 replies)

[–] NigelFrobisher@aussie.zone 16 points 1 day ago (5 children)

This is very obvious unless you are in tech leadership, in which case your job is now to push this at all costs and suppress dissenting voices.

load more comments (5 replies)

[–] ICastFist@programming.dev 57 points 2 days ago (2 children)

This alarm's being rung for over a year now, so "calling it now" means finally reading the writing on the wall

[–] FiniteBanjo@programming.dev 24 points 1 day ago

Let it be known that the first person to call it was actually Sam Altman when OpenAI's paper on AI Scaling Laws in 2020 subtly showed that the diminishing returns will stop showing improvement with infinite power, compute time, and data before 94% accuracy is reached.

load more comments (1 replies)

[–] Stefan_S_from_H@piefed.zip 13 points 1 day ago (1 children)

You know the feeling that you want to rewrite a project? But you know that most rewrites are a bad idea.

Be it your own, old code. Or code you inherited.

There is a small chance that the world realizes that they went in the wrong direction and nothing can get fixed. That will be the time of rewrites.

No, I don't expect this to be very likely. The agent code will remain, and human programmers get yelled at for not fixing it fast enough.

[–] GreenKnight23@lemmy.world 2 points 1 day ago

1000002112

[–] megopie@beehaw.org 5 points 1 day ago

part of the issue as well is that when they get something completely broken, people just re roll the output until they get something that’s broken in ways they don’t notice. Or re roll parts of it, or tell the system to judge if the output is broken and re roll the parts that it judges are broken automatically. Or increase the size of the context window to get it closer to that upper limit of accuracy.

All this together can get a more functional output with less effort, and as people find these tricks it gives them the illusion of an upward trend in capability, like this is all solvable issues that will improve as time goes on. Big problem with that though, theses tricks and methods explode the compute cost rapidly. That’s all fine and dandy when everyone is getting their compute costs for these tools subsidized by these model providers, but eventually they will need to charge the real cost of running this. The compute providers that host the model providers are also running at a loss, trying to help grow the market segment and maximize their market share. And then places that have the datacenters in them are giving tax breaks and discount utilities to attract new construction.

Everyone except the people making the chips is selling at a loss, and as people pile on usage to make up for the fundamental limitations of these systems, the demand balloons, validating to the providers at all levels that this is a growing market they should invest more in to.

But eventually… they need to make money. The bill comes due on all the debt and investment. What happens to the people who have fully embraced these to run their businesses? Or to all the people who have built their skill set around using these systems? It’s a crisis, a series of crisis, each time a debt wall gets hit by someone in the supply chain. A half decade of technical debt that just got really expensive to deal with, and not enough experienced people to handle it, since all the grey beared retired and not enough new people got brought in to replace them because the entry level work was automated.

[–] obviouspornalt@fedinsfw.app 14 points 1 day ago (18 children)

if it's broken in a way that can't be detected, is it actually broken?

all software is broken in some way. if the rate of bugs generated by llm and the severity of those bugs drops below the rate you would expect from a human programming team, then llm is offering something competitive.

[–] FiniteBanjo@programming.dev 10 points 1 day ago

It will eventually be detected, but it passes tests before hitting production, that is the problem.

load more comments (17 replies)

[–] FiniteBanjo@programming.dev 10 points 1 day ago (1 children)

It's so nice to see some people speaking reason. If only any of those people ran multibillion dollar companies.

load more comments (1 replies)

[–] Skullgrid@lemmy.world 11 points 1 day ago

No one paying the bills cares

load more comments