this post was submitted on 05 Dec 2025
228 points (99.1% liked)
Game Development
5298 readers
2 users here now
Welcome to the game development community! This is a place to talk about and post anything related to the field of game development.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Name one example
No. I'll name three.
Pleias, an LLM family of models that train on the common corpus, compliant with EU copyright and fair use law. They filtered a public domain dataset for racism and other bias's, and released the results.
common canvas is a (suite) of text-to-image models trained on a data they know is well sourced.
Apertus, public ai is a chat-gpt style bot made in collaboration with the swiss government, with a commitment to using only training data that complies with swiss fair use. They've chosen a model design that let's them remove training data which is improperly labeled, or becomes no longer accessible (ie, by changing robots.txt).
Not to mention the hundreds of models academics in ML have trained using things like open diffusion and public datasets (see also these hobbyists).
They don't have advertising budgets (generally). But you see a steady stream of open models on arXiv.