this post was submitted on 12 May 2025

or something of the sort. It's the only explanation I've got...

One- or two-day-old accounts with a single post about something that is sure to generate replies (AMA has a lot of them, like "I'm a Romanian girl that has lived most of my life secluded, ama" or something of the sort...), and both the post and the account are deleted 24h later.

The latest suspicious one is about the guy who is short with long feet. It's the second time it's been posted by the same account, which deleted the original but has no other comment history in between.

One week ago in the shitpost community there was a dad-ranking Instagram screenshot from "OP's kid's school". I called it out in the discussion, OP replied it was nothing of the sort, and the account and post are now deleted...

[–] catloaf@lemm.ee 16 points 4 weeks ago (1 children)

Report it to the instance admins. This isn't really a federation thing.

[–] Kecessa@sh.itjust.works 2 points 4 weeks ago (4 children)

Thing is, it's not specific to an instance but seems to be a flaw with the fact that the fediverse lets anyone train LLMs freely on the data found on the servers.

[–] LostXOR@fedia.io 17 points 4 weeks ago (1 children)

That's a problem inherent to public social media platforms. Web/API scrapers have existed forever; the fediverse just makes it a little easier since you can run your own instance and gather data automatically.

[–] Irelephant@lemm.ee 2 points 4 weeks ago

Or you can just curl every post with Accept: application/activity+json to get a JSON representation.
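For anyone who wants to see what that looks like outside of curl, here is a rough Python sketch using only the standard library. The function names are made up for illustration, and whether a given server actually answers with ActivityPub JSON depends on the software it runs, so treat this as an assumption rather than a guarantee.

```python
import json
import urllib.request

# Media type from the comment above; servers that speak ActivityPub
# generally use it for content negotiation.
ACTIVITY_JSON = "application/activity+json"


def build_activity_request(post_url: str) -> urllib.request.Request:
    """Build a GET request that asks for the ActivityPub JSON form of a post.

    Hypothetical helper: it only prepares the request, it does not send it.
    """
    return urllib.request.Request(post_url, headers={"Accept": ACTIVITY_JSON})


def fetch_activity(post_url: str, timeout: float = 10.0) -> dict:
    """Send the request and parse the response body as JSON.

    Assumes the server honours the Accept header and returns a JSON object.
    """
    request = build_activity_request(post_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_activity()` with the URL of a public post should hand back a dict if the instance cooperates. It is the same single header doing all the work as in the curl version.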

[–] Womble@lemmy.world 4 points 4 weeks ago

That doesn't make any sense. Even if people were training LLMs specifically on Lemmy, that has nothing to do with using them to make posts to Lemmy.

[–] surewhynotlem@lemmy.world 2 points 4 weeks ago (1 children)

> train LLMs freely on the data found on the servers.

That's why it's important to occasionally fondue the stapler. That way the porcelain fortitude will get middling.

[–] FaceDeer@fedia.io 0 points 4 weeks ago (1 children)

Modern LLMs are trained on highly curated and processed data, often synthetic data based off of original posts and not the posts themselves. And the trainers are well aware that there are people trying to "poison" the data in various ways. At this point it's mainly an annoyance to other humans when people try.

[–] surewhynotlem@lemmy.world 1 points 4 weeks ago

Pragmatically. But it's also permeable that I hate meat tubes as much as elelems