this post was submitted on 15 Jun 2026
52 points (91.9% liked)

chat

8624 readers
218 users here now

Chat is a text only community for casual conversation, please keep shitposting to the absolute minimum. This is intended to be a separate space from c/chapotraphouse or the daily megathread. Chat does this by being a long-form community where topics will remain from day to day unlike the megathread, and it is distinct from c/chapotraphouse in that we ask you to engage in this community in a genuine way. Please keep shitposting, bits, and irony to a minimum.

As with all communities posts need to abide by the code of conduct, additionally moderators will remove any posts or comments deemed to be inappropriate.

Thank you and happy chatting!

founded 5 years ago
MODERATORS
 

i was under the impression that were explicitly against genAI but i found that some people are actually VERY pro genai and quite dismissive of its risks. this happened on lemmy but i saw people getting downvoted for saying that genAI in its current state is harmful to the environment. i saw the old bullshit of how ”i use AI for art because i cant draw”, it was crazy. i called someone a Promptitute and someone unironically told me not to say that, just so much. so can everybody pls affirm you dont support that crap.

another user straight up admitted that they use AI as a friend and to gn to... i dont care how much we have in common ideologically, i cant support you gning to chatgpt and thinking its your friend...

you are viewing a single comment's thread
view the rest of the comments
[–] AltMaarri@hexbear.net 18 points 6 days ago* (last edited 6 days ago) (2 children)

We don't think it has remained static, we realize the "AI companies" have been implementing bandaid after bandaid to try and compensate for fundamental lacks in the technology itself, and that as a consequence results are better now (for like 1000x the cost but who's counting - well we all will, soon, but you get the point).

But this is still a credible text generator, ultimately close to a larger, N-dimensional markov chain. It still "hallucinates". It's still shit. It's still almost entirely useless for coding (you know, if you care about long or even medium-term maintainability or security), never mind the other use cases they're trying to sell it for. It does not work.

And if it did (it doesn't !), the aforementioned bandaids would have made the cost impossible to justify anyway.

[–] space_comrade@hexbear.net 4 points 6 days ago* (last edited 6 days ago) (2 children)

we realize the "AI companies" have been implementing bandaid after bandaid to try and compensate for fundamental lacks in the technology itself

You could argue in the same way that car safety technologies are "bandaid after bandaid to try and compensate for the fact that cars are fundamentally huge hunks of metal going really fast".

It's still almost entirely useless for coding (you know, if you care about long or even medium-term maintainability or security)

That's just not true at all, you still need to do quite a bit of handholding and double checking it gets stuff right but it definitely can get medium-high complexity stuff right and overall it is a productivity boost.

[–] AltMaarri@hexbear.net 10 points 6 days ago* (last edited 6 days ago)

You could argue in the same way that car safety technologies are "bandaid after bandaid to try and compensate for the fact that cars are fundamentally huge hunks of metal going really fast".

The parallel doesn't really work (there is nothing fundamentally incompatible between the purpose of a car and the basic design idea); but to try and answer, this would be a viable answer if the huge "AI labs" admitted the tech doesn't work and were collectively working on entirely new architectures (likely ones that include actual symbolic representation of the world, not just statistical links between words).

But they're not, they're using the same fundamental approach and doing ridiculous shit around that approach - ultimately a genuine, and interesting, NLP breakthrough - to try and sell it as what it's not and will never be.

you still need to do quite a bit of handholding and double checking it gets stuff right but it definitely can get medium-high complexity stuff right and overall it is a productivity boost.

See, that's one part I disagree about. First as a direct productivity boost; I realize you probably think it's making you more productive; but that's something you have to demonstrate, at scale. And the few genuine studies I've seen suggest the effect, if it exists, is marginal at best; at least two I remember suggest it decreases productivity (and at least one, possibly both, can't remember, said self-reporting from devs was that it increased it despite the effect).

More importantly, though: very little of the code you produce this way will be maintainable. Very little will also be secure, too (both "direct" vulns - they're not actually good at finding them, FYI - and more fundamental logic vulnerabilities - they'll never find these at all outside of trivial cases). If you want a concrete example of that, just look at the leaked Claude harness code: it's genuinely pathetic. They're vibe coding it all and it shows.

Also, I notice you say "you still need to..."; do you think this will eventually get fixed ? it's been nine years. There's a point where "it'll get better" is starting to become more of a meme than an answer. It was getting better, then a lot less, then lately almost not at all, despite greater and greater cost increases.

Shit, lately I've even started to wonder if they're even training new models anymore; I've been wondering if they're just not shipping the same with a different harness / different bullshit surrounding it. It's not like they can improve on the models themselves anyway (not without at least one, likely several, genuine fundamental breakthrough) - they're all out of human-produced data by now.

[–] Formerlyfarman@hexbear.net 3 points 6 days ago

Yes it is, just yesterday I was having trouble with a script, so I googled it, the Google ai told me to use a command that doesn't exist. It just made it all up. Completely worthless.

[–] LeeeroooyJeeenkiiins@hexbear.net 3 points 6 days ago (1 children)

for like 1000x the cost but who's counting

China isn't counting the same numbers i tell you that

It's still almost entirely useless for coding

It's incredibly useful for letting humans interact with a computer system in natural language and maybe you shouldn't take from that "hey we should have it do highly sensitive stuff where the slightest error could have great consequences"

Here's a use for it. You nerds hate Reddit, right? Well hey, it does a great job of summarizing reddit threads, since that's where half of its fucking training comes from. No more ever having to go back to reddit to browse dozens of inane responses hoping to find the one comment with relevant information. Congratulations, you have a machine that made reddit obsolete

[–] AltMaarri@hexbear.net 16 points 6 days ago* (last edited 6 days ago) (1 children)

China isn't counting the same numbers i tell you that

That's true; the very quality of results you can obtain with some recent models locally suggests the possible optimizations are huge. But it's also diminishing returns: the moat between state of the art models and one I run on a P40 locally is really small these days; the amounts they need to sink to get even slightly better results at this stage are more and more. They're text generators; yes, if you run ten of them in parallel, vote; have them cross-check their results, introduce harnesses at every steps, etc. (all examples of actual bandaids I evoked above) you can improve results. But all of this is trying to make a tech that fundamentally doesn't answer the problem answer it nevertheless. And all of it costs a lot.

And I dearly hope China is not spending too much effort on backing these domestic initiatives, because again: outside of a few limited use cases (easily identifiable, I've listed a few in another comment in this very thread: those where having no relation to the truth in a portion of the generated text is acceptable), the tech doesn't work.

It's incredibly useful for letting humans interact with a computer system in natural language and maybe you shouldn't take from that "hey we should have it do highly sensitive stuff where the slightest error could have great consequences"

There are use cases where having the computer completely invent actions or do shit randomly isn't that bad, I guess; games come to mind. But ultimately and more generally I disagree, it's shit for that too. Try one as a daily runner, just for laughs. Or just try an entire shell session where instead of typing the commands, you complete a description of what you want to do through one and then do it. It goes bad very fast, let me tell you.

And I get it: what you describe would be awesome - a SF dream. I like tech and I wish all of this would work; and like many initially I genuinely wondered as well if scaling/attention was all you needed, and had some measure of hope; quickly dashed, though.

it does a great job of summarizing reddit threads

Again, no it does not. You have no guarantee it won't pull shit out of its virtual ass; and just as crucially (and even more likely), no guarantee it won't ignore significant parts of the source material. You'll get a result that seem like it summarizes the thread, with no confidence level, and no guarantee of its reliability.

[–] LeeeroooyJeeenkiiins@hexbear.net 0 points 6 days ago (1 children)

Again, no it does not. You have no guarantee it won't pull shit out of its virtual ass; and just as crucially (and even more likely), no guarantee it won't ignore significant parts of the source material. You'll get a result that seem like it summarizes the thread, with no confidence level, and no guarantee of its reliability.

I've seen what it does and how often and the various ways it tends to fuck up, at least Google's search result llm because I'm not going out of my way to use this shit outside of trying to look up shit i was going to look up anyway (mentioning this because as a result i don't know if the model they're using is better but more expensive), and i really don't consider it a different experience than the last 6-10 years of having to look up a reddit thread to parse comments for information. I still have to figure out who knows what and how much they know and figure out how much I trust the information.

With the llm it's the same shit to me, I have to figure out if it's pulling from a source that knows anything, if the information is actually even present in the source it thinks it's from, if it's miscontextualizing the information, and if it's just trying to glaze me or frame its responses in relation to my prompt

The end result is i google shit and tend to find the information i'm looking for faster and without having to be exposed to 100s of tips m'fedora redditisms directly. In this way you could consider the llms use to be like those boxes you use to view an eclipse without hurting your eyes

[–] AltMaarri@hexbear.net 5 points 6 days ago* (last edited 6 days ago) (1 children)

Fair enough, if it works for you and you're aware of the downsides (which you seem to be, contrary to most users); two things I haven't even mentioned though are the power/environmental costs (obviously) but also the potential cognitive impact.

You say it doesn't change much compared to when you were parsing the reddit comments yourself; doesn't it ? how sure are you "wasting" time parsing these comments wasn't exercising an important muscle mentally for you (getting the jist of a text rapidly - excluding braindead content quickly, etc.) ?

Here the example is pretty ridiculous - I doubt your mental faculties depend much on parsing reddit comments - but you get the idea. It's very early to tell but several papers now suggest the negative cognitive impact is very real and potentially very fast.

[–] LeeeroooyJeeenkiiins@hexbear.net 4 points 6 days ago* (last edited 6 days ago)

how sure are you "wasting" time parsing these comments wasn't exercising an important muscle mentally for you (getting the jist of a text rapidly - excluding braindead content quickly, etc.) ?

Idk, pretty sure, since i feel the "is the lying machine lying to me or is it accurate" appraisal is about the same exercise and there's also the fact that an absolutely disgustingly large portion of reddit comments are themselves being made by llms now (as every single anti bot measure reddit ever uses only impacts real people, apparently)

Edit: Thank you for the concern though btw but if you ever do notice me exhibiting cognitive decline it is almost definitely gonna be a combination of the alcoholism, drug use, contraindicated drug use, and general medical decline