this post was submitted on 17 Jun 2026
204 points (100.0% liked)

Fuck AI

7353 readers
1142 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] ZDL@lazysoci.al 3 points 10 hours ago (1 children)

It is by no means based on the aggregate of all human knowledge. It is based on the aggregate of all human knowledge that techbrodudes could easily rip off on the Internet.

There are enormous swaths of material that is not incorporated into them. There are likely entire LANGUAGES that are under-represented if not flatly absent from the training data. Approximately 50% to 70% or even beyond (depending on the specific analyses involved) of the training material pulled into LLMs, according to the Allen Institute, is in English. About 17% of the planet speaks English. "All human knowledge" indeed. There are approximately 7000 living languages on the planet. The best of the LLMs barely cover 50 of them to any degree of linguistic or cultural competence. (I know ChatGPT claims coverage of 80+ languages. I've also seen its unfortunate attempts at the outlying ones...)

And then there is a whole lot of knowledge and information in print form which is not yet incorporated. As a trivial example of this, the very important book in tea production and consumption circles, 中国茶经 (not to be confused with the ancient classic 茶经), is not available in any electronic form anywhere. Its encyclopedic coverage of the fractally complicated Chinese tea sphere is not in any LLM anywhere. Books of this calibre number in the thousands, possibly hundreds of thousands, and are not in any LLM anywhere. This means that if you query an LLM about tea, you're going to get the amalgamated opinions of dumbasses on Reddit instead of authoritative sources like 中国茶经.

(And I'm not even going to start going down the epistemic rabbit warren of non-textual knowledge. Go ahead and ask your LLMbecile what it feels like when the clay is too wet on the wheel, or how to read a hostile room before negotiations. It will generate text ... but what is the source of the physicality and instinct? It has none. It regurgitates what some dumbass on Reddit said.)

[–] TheFrogThatFlies@lemmy.world 1 points 7 hours ago* (last edited 7 hours ago)

You're right, but I was hoping people wouldn't take my comment literally :) It's not ALL human knowledge, obviously. But if it was a tool from humans to humans, instead of from companies to make money of, we could add more and more of our global knowledge to it and have more to win from the tool.

I also am fully aware that this tool is not applicable to EVERY situation, and everyone should also be aware of this.