Today's Large Language Models are Essentially BS Machines
(quandyfactory.com)
And everyone in tech who has worked on ML before collectively says "yeah, that's what we've been trying to tell you." Don't get me wrong, LLMs are a huge leap, but god did it show how greedy corporations are, immediately jumping to "how quickly can we lay people off?" The tech is not to that spec. Yet. It will get there, but goddamn do we need to start demanding some regulations now.
I'm not sure it will. At least, not this tech, not this approach to the problem. From my understanding there's fundamentally no comprehension; it's not bugged, broken, or incomplete, it's just not there... it's missing from the design.
We don't know that for sure yet. We saw a lot of emergent intelligent properties appear as we scaled up, and we're nowhere near done scaling LLMs. I'm not saying it will be solved, just that we don't know one way or the other yet.
I think we might be done scaling. I remember hearing OpenAI was training on so much literary data that they couldn't find enough left over to test the model with. Though I may be misremembering.
No, that's definitely the case. However, Microsoft is now working on making LLMs more dependent on several high-quality sources. For example, encyclopedias will be weighted more heavily than random Reddit posts.
Microsoft is also using LinkedIn to help, getting users to correct articles generated by AI.
Cunningham's Law may be very helpful in this respect.
There are still plenty of videos to watch and games to play. We might be running short on books, but there are many other sources of information that aren't accessible to LLMs at the moment.
Also, just because the training set contained most of the books doesn't mean the model itself was large enough to learn from all of them. The more detailed your questions get, the bigger the chance it will get them wrong, even if that knowledge should have been in the training set. For example, ChatGPT is pretty terrible as a walkthrough for games, even though there should be more than enough walkthroughs in the training set to learn from. Same for summarizing movies: it will do the most popular ones, but quickly fall apart with anything a little lesser known.
There is of course also the possibility that using the LLM as a knowledge store is by itself a bad idea. Humans use books for that, not their brains. So an LLM that is very good at looking things up in a library could answer a lot more questions without the enormous model size and training cost.
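The "looking things up in a library" idea is basically retrieval-augmented generation: fetch a relevant passage first, then hand it to the model as context instead of relying on what's baked into the weights. Here's a minimal toy sketch of the retrieval half; the library contents, the word-overlap scoring, and the prompt format are all made up for illustration, not any real system's approach.

```python
# Toy retrieval step: find the library passage that best matches a
# question, so it can be prepended to an LLM prompt as context.
from collections import Counter

# Hypothetical stand-in for a real document store.
LIBRARY = [
    "The Hobbit was published in 1937 by George Allen & Unwin.",
    "TCP opens a connection with a three-way handshake: SYN, SYN-ACK, ACK.",
    "Crystal structures are classified into one of 230 space groups.",
]

def tokenize(text: str) -> list[str]:
    # Lowercase and strip trailing punctuation so words match across docs.
    return [w.strip(".,:;?!&").lower() for w in text.split()]

def retrieve(query: str, docs: list[str]) -> str:
    """Return the document sharing the most words with the query."""
    q = Counter(tokenize(query))
    return max(docs, key=lambda d: sum((q & Counter(tokenize(d))).values()))

question = "When was The Hobbit published?"
context = retrieve(question, LIBRARY)
# A real system would now send something like this to the model:
prompt = f"Context: {context}\nQuestion: {question}"
```

Real systems use embedding similarity rather than raw word overlap, but the division of labor is the same: the library holds the facts, and the model only has to read and rephrase them.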
Basically, there are still a ton of unexplored areas, even if we have collected all the digital books.