Technology
This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.
Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.
Rules:
1: All Lemmy rules apply
2: Do not post low effort posts
3: NEVER post naziped*gore stuff
4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.
5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)
6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist
7: crypto related posts, unless essential, are disallowed
view the rest of the comments
Yes a long time ago and they don't, but the AI models and training data are two different things.
Also as they are open source, there's nothing stopping anyone from running these AI models locally with your own training data.
There is no such thing like "running model with your training data". To change model's behavior you need to fine-tune it, which means: to continue training it on your own data set. For this to happen you need to have your own dataset, computing power and knowledge how to do it because you may as well make your model performing worse. It is not an easy task.
It's not easy, but it's done all the time. New models, new LoRAs, and in some cases, the training data doesn't even need to be very large for a specific task.
You don't need the entire training dataset that the model was built from.