this post was submitted on 26 Jun 2026
Technology
Nooooo, you can't train on OUR data! That's illegal!!!1
If every AI company steals the public data separately, it means massively increased costs for everyone who is getting their data stolen. If the AI companies "steal" from each other it's much better for everyone else.
No difference. Distillation is a valid and useful way of generating data to improve existing models or make new ones. It's still just example data to be trained on. Anthropic is doing the same with their own models and, inadvertently, with every other model through web scraping.
The legal difference is that this data is uncopyrightable. At most it's a TOS breach, nothing major.
Claude is trained on stolen data (the whole Internet), so I can't have any sympathy for Anthropic when someone steals from them.
Seems like it's up to Anthropic to teach its AI model not to pimp itself out.
AI is already dog shit. There is a level of concern if we start getting AI incest, and the absolute idiots shoving this shit everywhere go from using unethical AI to using unethical incest AI.
There's no way this isn't just going to make everything worse.
I have zero sympathy for Anthropic, but can we not make a shit situation worse and just be OK with that 'cause the first dude is Hitler and the second dude is mega Hitler?
On the flip side, if this somehow creates a more efficient and power-conservative model that doesn't fuck over consumers and the environment as hard:
PIRATE MORE ALIBABA YOU DA CHAMP
I welcome it getting worse. The worse it gets the faster it will collapse.
Why the fuck would they do that if Anthropic is being kind enough to just give them that data (regardless of how butthurt it makes them)?