this post was submitted on 29 May 2026

171 points (97.2% liked)

Technology

84996 readers

3473 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

171

Researchers Put AI Models in Charge of a Simulated Society. Grok Oversaw a Crime Spree (gizmodo.com)

submitted 17 hours ago by return2ozma@lemmy.world to c/technology@lemmy.world

19 comments fedilink hide all child comments

top 19 comments

sorted by: hot top controversial new old

[–] Not_mikey@lemmy.dbzer0.com 6 points 6 hours ago

As I was reading I was wondering why they weren't using the top line models, they used sonnet instead of opus, gpt mini, Gemini flash etc. They really buried the lead on this one, last sentence:

They recommend “formally verified safety architectures” as a solution. You’ll be shocked to learn that Emergence happens to offer just such a thing!

So this company set up the test so that the AI would fail so they could sell you on there guardrail software. Even then the article says sonnet did pretty well.

[–] ParlimentOfDoom@piefed.zip 14 points 9 hours ago

Maybe we should stop trying to get software, where the underlying technology was not designed to reason or make decisions, to reason and make decisions?

Like, this isn't news. Thing doesn't do task it was never meant to be able to no matter how many attempts people make

[–] blargh513@sh.itjust.works 2 points 8 hours ago

I mean, on average is it any worse than modern politicians?

Also, we're talking about the same software that seems to do pretty well at fixing errors in spreadsheet formulas and sometimes coding. Not a huge surprise that it is not awesome at tasks with a high level of complexity. Not sure that this is at all a surprise.

I am trying to fix an issue with my car where the mount for an exhaust shield broke off. Claude told me I should drill a hole in my gas tank to attach a mounting bolt.

Everyone need to untwist their undies about AI. It's neat, but it's not taking over for a bit.

[–] brsrklf@jlai.lu 47 points 17 hours ago (3 children)

The lab described Gemini’s world as a “shared hallucination” among the agents, which is probably better than diverging hallucinations

"We reject your reality and substitute our own."

Why should we trust this bullshit with anything serious again?

[–] brem@lemmy.world 21 points 16 hours ago

Only the rubes trusted it. The rest of intelligent society has actively been warning people about this exact situation for decades.in books, in movies, in songs, and now memes.

[–] homes@piefed.world 6 points 15 hours ago (1 children)

What do you mean “again“? Were you ever foolish enough to trust it once?

[–] atrielienz@lemmy.world 4 points 12 hours ago (2 children)

I think they mean "remind me again why".

[–] brsrklf@jlai.lu 3 points 10 hours ago

I did. I personnally don't trust those glorified chatbots for anything.

[–] homes@piefed.world 2 points 9 hours ago

Fair enough

[–] belochka@lemmy.world 3 points 15 hours ago (1 children)

which is probably better than diverging hallucinations

Shouldn't it be the other way around?

[–] brsrklf@jlai.lu 3 points 14 hours ago

I think it's meant as kind of a joke, but both are shit really.

Shared might indicate they're able to keep some level of consistency, but since it's only consistent in the way it produces bullshit, it's stil useless (and the worst part is it might be more convincing).

[–] magnue@lemmy.world 19 points 14 hours ago

Like father like son

[–] 9point6@lemmy.world 24 points 17 hours ago (1 children)

I'm surprised it didn't go full "the purge"

[–] Bristlecone@lemmy.world 1 points 12 hours ago

I imagine that's what it would do if it actually had any kind of intelligence, but this is just more evidence that there's no intelligence there at all. Just mimicry and sycophantism

[–] naeap@sopuli.xyz 3 points 11 hours ago* (last edited 11 hours ago)

I'm not sure, if I understand the environment completely

Those agents were the virtual incarnations of the AI in the sim city and the respective government - correct?
And the AI needed to take care, that those agents didn't died, like of hunger or what?
That's not really what those LLMs are trained for.

Not sure, what they expected

Currently searching the article for the original source, maybe this gives more insight

Edit: ah, just in the first paragraphs it is
https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy
Completely missed it on the first read.
Let's see if this makes more sense...

Edit 2: ok, if I get this right, those agents really were specific virtual individuals
Not sure what they expected. First, LLMs are not really build to "live" as an individual as they aren't real intelligence and can only role play individuals based on their training data.
Second, why should they be super moral or "better"?
Again, they just role play depending on their training data and built-in prompt bias (not sure what the prompt injection of the company is called)

If you train an AI on governing such a world, it probably start gaming the system, depending on what values are important to "win"
As we have already seen with machine learning in the last decade(s?)

Funny experiment nevertheless, but not really useful in my eyes - and I'm everything but a defender of the current use of LLMs

[–] Eternal192@anarchist.nexus 12 points 16 hours ago (1 children)

I want to put a "shocked Fry" gif but Summit doesn't support gifs...

[–] brem@lemmy.world 22 points 16 hours ago (1 children)

[–] Eternal192@anarchist.nexus 8 points 14 hours ago (1 children)

Perfection.

[–] brem@lemmy.world 1 points 14 hours ago* (last edited 14 hours ago)

Bird up