this post was submitted on 16 May 2026
221 points (97.8% liked)
Programming
26951 readers
746 users here now
Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!
Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.
Hope you enjoy the instance!
Rules
Rules
- Follow the programming.dev instance rules
- Keep content related to programming in some way
- If you're posting long videos try to add in some form of tldr for those who don't want to watch videos
Wormhole
Follow the wormhole through a path of communities !webdev@programming.dev
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I still can't get over how the only fine tuning you can do for an LLM is yell at it with markdown files. We should be able to retrain local models so they can develop an actual experience without prefilling the context.
How many extra tokens get burned with all this pre filled context I wonder.
It isn't.
Great news, you can do exactly that.
Not GPT5.1 though lol
Yeah. It's proprietary. And you can't modify the Windows 11 source code, either.
But Microsoft can modify the Windows 11 source code. Or at least they used to be able to, before AI.
OpenAI should be able to re-train its poorly trained model. But of course it can't, that would take months, maybe years of datacenter time.
Now OpenAI since can't even re-train their own models, they resort to chastising it in its own system prompt.
This is the problem. If you're trying to imply this is normal and expected, it shouldn't be. It needs not to be. We cannot accept this as the normal way of doing things going forward. It is awful, and painfully stupid.
Not with that attitude!
Windows 11 isn't running in the cloud yet though. Unless it checks to make sure it hasn't been tampered with too much you should just be able to modify some of its binaries (the source code obviously isn't available). With the cloud based llms that is not possible.
If you have a model on your computer you can retrain it, which is like changing a binary just far less precise. The option of having a source code equivalent just isn't there beyond having the same dataset and seeds for the training program.
So I'd say it is worse than your average run of the mill proprietary software.
You can. Just not frontier models. Check out unsloth
lol how do you think LLMs are trained in the first place?
I think he (or she) is talking about the user of the LLM, not the creator.
but you can, as long as it's open weight. Fine tuning and training are pretty much the same process
That still falls into the category "creator" to me, if you need to rebuild. I was making the distinction to an end user, comparable to applications that you download and use and configure. Instead of rebuilding the source code with your modifications.
Do I misunderstand here something? Or is this a communication issue caused by different interpretations?
If you define "user" to be a set that excludes anyone capable of modifying the weights, then by definition, no user can modify the weights.
Any criticism about users being unable to modify weights becomes vacuous, so it's not an interpretation that makes sense.