this post was submitted on 07 Mar 2026
371 points (99.2% liked)

Technology

82457 readers
3179 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

Hacker News.

To help train AI models, Meta and other tech companies have downloaded and shared pirated books via BitTorrent from Anna's Archive and other shadow libraries. In an ongoing lawsuit, Meta now argues that uploading pirated books to strangers via BitTorrent qualifies as fair use. The company also stresses that the data helped establish U.S. global leadership in AI.

you are viewing a single comment's thread
view the rest of the comments
[–] architect@thelemmy.club 3 points 1 day ago* (last edited 1 day ago) (1 children)

It’s not stealing to download media.

We can hate zuckerberg and still not care that they torrented books.

Don’t be a hypocrite just to feel like you got some win.

[–] partofthevoice@lemmy.zip 0 points 1 day ago

I hear you, but hear me out… They’re creating products from the consumed torrents, which absolutely contained copyrighted materials. I’m not trying to capitalize my torrents. Although, I did use cracked photoshop back in high school for a $200 job.

And to be completely honest with you, I don’t really care about copyright infringement so much, after it’s become a tool for organizations like Disney or whoever to abuse as they please. But the main body of work torrented here would be corpus’ of text, music, … a lot of stuff that independent producers created and rely on for income.

I found this particular video quite insightful on the impact within the music industry: https://youtu.be/QVXfcIb3OKo

To be fair to Meta, I’d have to say that I don’t really know what models they’re training via that data and how they’re using the resulting products. This is Meta, though, a pioneer and industry leader in the process of surveillance capitalism. I don’t particularly have high expectations for them.