this post was submitted on 10 Aug 2025

brucethemoose@lemmy.world 3 points 10 months ago* (last edited 10 months ago)

I feel like diffusion LLMs would handle this better.

After “position 5,” an autoregressive LLM has one chance, one pass, to emit the right next token instead of starting another bullet point. It can't go back and revise a token once it's sampled, so if it randomly picks another bullet point because the temperature is at 1 or whatever, the whole answer is hosed.
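
To put rough numbers on that, here's a minimal sketch of temperature sampling over two candidate next tokens. The token strings and logits are made up for illustration, not taken from any real model; the only point is how much probability the "wrong" bullet token keeps at temperature 1 compared to a low temperature.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidates after "position 5": stop the list, or start a sixth bullet.
# These logits are assumptions chosen for the example.
candidates = ["<stop list>", "<another bullet>"]
logits = [2.0, 0.0]

def bullet_probability(temperature):
    """Probability that a single sampling step picks the unwanted bullet token."""
    return softmax_with_temperature(logits, temperature)[1]

if __name__ == "__main__":
    for t in (0.2, 1.0):
        print(f"temperature={t}: P({candidates[1]}) = {bullet_probability(t):.5f}")
```

With these made-up logits the bullet token is sampled roughly 12% of the time at temperature 1, and an autoregressive decoder has no later pass in which to take it back.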

Not that OpenAI would ever do that. They just want to deep fry autoregressive transformers more and more instead of, you know, trying something actually interesting.