this post was submitted on 06 Jan 2026
24 points (87.5% liked)

Asklemmy

52437 readers
969 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy ๐Ÿ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 6 years ago
MODERATORS
 

E.g. ollama/hugging face

you are viewing a single comment's thread
view the rest of the comments
[โ€“] digdilem@lemmy.ml 3 points 3 weeks ago (1 children)

You can't trust an inherantly untrustworthy industry.

The problem is that to make a good AI, you need a lot of input and we know from leaks and reports that many/most of the major players deliberately ignored copyright to train their models. If it was reachable, they used it. Are using it. Will use it. Like Johnny 5, there's no limit to the data they want, or that their handlers want to feed them with. They're the Cookie Monster at a biscuit factory.

So when the question of trust comes up, you'd have to be pretty forgiving to overlook that they're built on foundations of theft, and pretty naive to assume these companies have suddenly grown ethics and won't use your data and input to train with, even when you're using commercial systems that promise they won't.

Even in the event that there is an ethical provider that does their utmost to ensure your data doesn't migrate (these do exist, at least in intention), this is an incredibly fast moving, ultra-competitive market where huge amounts of data are shifted around constantly and guardrails being notoriously hard to accurately define, let alone enforce. It's inevitable stuff will leak.

[โ€“] ClassIsOver@hexbear.net 2 points 2 weeks ago

I'm just picturing Cookie Monster at a biscuit factory screaming "WHAT FUCK IS THIS? THIS COOKIE HAVE NO SUGAR!"