198
submitted 11 months ago by Five@beehaw.org to c/technology@beehaw.org
top 50 comments
sorted by: hot top controversial new old
[-] ProcurementCat@feddit.de 115 points 11 months ago

The fundamental flaw of the Turing test is that it requires a human. Apparently, making a human believe they are talking to a human is much easier than previously thought.

[-] philomory@lemm.ee 61 points 11 months ago

Much easier, in fact; Eliza could pass the Turing test in 1966. Humans are incredibly eager to assess other things as being human or human-like.

[-] Rentlar@beehaw.org 19 points 11 months ago

Go on.

And what makes you think that?

Mhm. Tell me more.

"Human or human-like". Can you tell me more about that?

How do you feel about it?

[-] lloram239@feddit.de 16 points 11 months ago* (last edited 11 months ago)

The real Turing test requires an expert doing the test, not just some random easily impressed person.

The ELIZA-style bots work very well on the later kind, as the bot is just repeating your own text back at you with some grammatical remixing, e.g. you say "I am afraid of horses", bot says "Why do you say you are afraid of horses?". You can have very long conversation with yourself that way, as the bot contributes nothing to the discussion. It just provides enough plausible English to keep you talking. Meanwhile when you have an expert (or really just any person with a little bit of a clue) test ELIZA, the bot falls completely apart within just three lines of dialog. The bot is incredible basic and really can't do anything by itself, it completely depends on the user to provide all the content of the conversation.

[-] Thorny_Thicket@sopuli.xyz 52 points 11 months ago

You can take a sharpie and draw a sad face on a rock and then you'll feel sad for it. We're gullable.

[-] dom@lemmy.ca 60 points 11 months ago

But why is the rock sad :(

[-] Thorny_Thicket@sopuli.xyz 33 points 11 months ago

I know.. I get sad just thinking about the sad rock :(

load more comments (1 replies)
[-] shanghaibebop@beehaw.org 12 points 11 months ago

Slap some 2D anime girl avatar on it and you got yourself a top grossing v-tuber.

[-] Ferk@kbin.social 11 points 11 months ago* (last edited 11 months ago)

A test that didn't require a human could theoretically be tested automatically by the machine preemptively and solved easily.

I can't imagine how would you test this in a way that wouldn't require a human.

load more comments (4 replies)
[-] habanhero@lemmy.ca 6 points 11 months ago

Why is it a flaw? What do you think the Turing Test is?

[-] pglpm@lemmy.ca 63 points 11 months ago* (last edited 11 months ago)

Title:

ChatGPT broke the Turing test

Content:

Other researchers agree that GPT-4 and other LLMs would probably now pass the popular conception of the Turing test. [...]

researchers [...] reported that more than 1.5 million people had played their online game based on the Turing test. Players were assigned to chat for two minutes, either to another player or to an LLM-powered bot that the researchers had prompted to behave like a person. The players correctly identified bots just 60% of the time

Complete contradiction. F*ck Nature, it's become only the most expensive gossip science magazine.

PS: The Turing test involves comparing a bot with a human (not knowing which is which). So if more and more bots pass the test, this can be the result either of an increase in the bots' Artificial Intelligence, or of an increase in humans' Natural Stupidity.

[-] aksdb@feddit.de 14 points 11 months ago

So if more and more bots pass the test, this can be the result either of an increase in the bots’ Artificial Intelligence, or of an increase in humans’ Natural Stupidity.

Or it "simply" plays with human biases, which are very natural. Stuff like seeing faces in everything that somewhat resembles two eyes and a mouth (or sometimes just the eyes and a head like shape etc.) is pretty hard wired. We have similar biases in regards to language. If something reads like it was written by a human, we immediately sympathize with it. Which is also the reason these LLMs are so successful and cause so many people to fear our AI overlords are right around the corner. Simply because the language is good we go into "damn, that's like a human"-mode.

[-] pglpm@lemmy.ca 7 points 11 months ago

Agree (you made me think of the famous face on Mars). I mean that more as a joke. Also there's no clear threshold or divide on one side of which we can speak of "human intelligence". There's a whole range from impairing disabilities to Einstein and Euler – if it really makes sense to use a linear 1D scale, which very probably doesn't.

[-] HiddenLayer5@lemmy.ml 8 points 11 months ago* (last edited 11 months ago)

Also, the Turing Test isn't some holy grail of AI. It's just a thought experiment, and not even the highest test for an AI that we can think of. Passing it is impressive don't get me wrong, but unlike what clickbait articles would tell you, it does not automatically mean an AI is sentient or is smarter than humans or anything like that. It means it passed the thought experiment, nothing more.

Also also, ChatGPT was not the first AI to pass the Turing Test. Actually, plenty have, even over a decade before.

[-] lloram239@feddit.de 47 points 11 months ago

There is the capitalist alternative to the Turing test: Have ChatGPT get a job. Hook it up to the Web, let it find itself a work-from-home job and go to work. Can it make as much money as a human, can it make enough money to pay for its own survival? Will it get fired?

[-] floofloof@lemmy.ca 23 points 11 months ago* (last edited 11 months ago)

That just sounds like a recipe for breeding robot sociopaths. It'll find its way into management and doom us all.

[-] 100years@beehaw.org 18 points 11 months ago

Will it get promoted, start managing people, start investing, start its own companies, and quickly take over the world?

load more comments (3 replies)
[-] Peanutbjelly@sopuli.xyz 19 points 11 months ago* (last edited 11 months ago)

Funny I don't see much talk in this thread about Francois Chollet's abstraction and reasoning corpus, which is emphasised in the article. It's a really neat take on how to understand the ability of thought.

A couple things that stick out to me about gpt4 and the like are the lack of understanding in the realms that require multimodal interpretations, the inability to break down word and letter relationships due to tokenization, lack of true emotional ability, and similarity to the "leap before you look" aspect of our own subconscious ability to pull words out of our own ass. Imagine if you could only say the first thing that comes to mind without ever thinking or correcting before letting the words out.

I'm curious about what things will look like after solving those first couple problems, but there's even more to figure out after that.

Going by recent work I enjoy from Earl K. Miller, we seem to have oscillatory cycles of thought which are directed by wavelengths in a higher dimensional representational space. This might explain how we predict and react, as well as hold a thought to bridge certain concepts together.

I wonder if this aspect could be properly reconstructed in a model, or from functions built around concepts like the "tree of thought" paper.

It's really interesting comparing organic and artificial methods and abilities to process or create information.

load more comments (1 replies)
[-] sxan@midwest.social 17 points 11 months ago

Please let's not start measuring AI success by how successfully capitalist they can be. I'm not exactly an anti-capitalist, but I think that could only end in tears.

load more comments (1 replies)
[-] bedrooms@kbin.social 14 points 11 months ago

Honestly, though, I even can't decide whether other people have consciousness. Cogito ergo sum, if you know what I'm talking about.

[-] Thorny_Thicket@sopuli.xyz 14 points 11 months ago

Ironically chatGPT also fails the Turing test by being so competent that no human could match that.

[-] Maestro@kbin.social 5 points 11 months ago

How does ChatGPT do with the Winograd schema? That's a lot harder to fake: https://en.m.wikipedia.org/wiki/Winograd_schema_challenge

load more comments (2 replies)
load more comments
view more: next ›
this post was submitted on 29 Jul 2023
198 points (99.5% liked)

Technology

37381 readers
777 users here now

Rumors, happenings, and innovations in the technology sphere. If it's technological news or discussion of technology, it probably belongs here.

Subcommunities on Beehaw:


This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago
MODERATORS