this post was submitted on 25 Nov 2025

[–] chicken@lemmy.dbzer0.com 5 points 4 weeks ago

From wikipedia:

The Turing test, originally called the imitation game by Alan Turing in 1949,[2] is a test of a machine's ability to exhibit intelligent behaviour equivalent to that of a human. In the test, a human evaluator judges a text transcript of a natural-language conversation between a human and a machine. The evaluator tries to identify the machine, and the machine passes if the evaluator cannot reliably tell them apart.

This isn't as hard a test as the one you're describing. There's research showing LLMs pass very similar tests:

randomised, controlled, and pre-registered Turing tests on independent populations. Participants had 5 minute conversations simultaneously with another human participant and one of these systems before judging which conversational partner they thought was human. When prompted to adopt a humanlike persona, GPT-4.5 was judged to be the human 73% of the time: significantly more often than interrogators selected the real human participant. LLaMa-3.1, with the same prompt, was judged to be the human 56% of the time -- not significantly more or less often than the humans they were being compared to -- while baseline models (ELIZA and GPT-4o) achieved win rates significantly below chance (23% and 21% respectively). The results constitute the first empirical evidence that any artificial system passes a standard three-party Turing test.

That's not quite the same thing as LLMs being so good at imitating humans that a trained expert has no possible edge for telling the difference, but it is a major milestone, and I think it's technically accurate to say "AI has passed the Turing Test" at this point.
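
To make "significantly above or below chance" a bit more concrete, here is a rough sketch of an exact two-sided binomial test against a 50% win rate. The win rates are the ones quoted above, but the trial count (`HYPOTHETICAL_TRIALS = 100`) is purely my assumption for illustration. It is not the study's sample size, and the function name is my own, so treat the printed p-values as a toy example rather than a reproduction of the paper's statistics.

```python
from math import comb


def binomial_two_sided_p(wins: int, trials: int, p_null: float = 0.5) -> float:
    """Exact two-sided binomial p-value under the null win rate p_null.

    Sums the probability of every outcome that is no more likely than
    the observed one (the usual "minimum likelihood" two-sided method).
    """
    def pmf(k: int) -> float:
        return comb(trials, k) * p_null**k * (1 - p_null) ** (trials - k)

    observed = pmf(wins)
    # Small relative tolerance so float rounding does not drop tied outcomes.
    total = sum(
        pmf(k) for k in range(trials + 1) if pmf(k) <= observed * (1 + 1e-9)
    )
    return min(1.0, total)


# ASSUMPTION: made-up trial count, NOT taken from the study.
HYPOTHETICAL_TRIALS = 100

# Win rates as quoted in the abstract above.
quoted_win_rates = {
    "GPT-4.5": 0.73,
    "LLaMa-3.1": 0.56,
    "ELIZA": 0.23,
    "GPT-4o": 0.21,
}

for name, rate in quoted_win_rates.items():
    wins = round(rate * HYPOTHETICAL_TRIALS)
    p = binomial_two_sided_p(wins, HYPOTHETICAL_TRIALS)
    verdict = "differs from chance" if p < 0.05 else "not distinguishable from chance"
    print(f"{name}: {wins}/{HYPOTHETICAL_TRIALS} wins, p = {p:.3g} ({verdict})")
```

The point of the sketch is just that a 56% win rate can sit comfortably inside what coin-flipping would produce, while 73% or 23% generally does not once there are a reasonable number of trials. The real conclusions depend on the study's actual sample sizes and analysis.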