this post was submitted on 25 May 2026
1245 points (98.7% liked)
Fuck AI
7191 readers
1896 users here now
"We did it, Patrick! We made a technological breakthrough!"
A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.
AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It's much more likely to be like internet fiber. Some was needed/used.
datacenters will always leverage scale, and AI is only economic at 16+ concurrent users. delivers 3x the tokens/s of a single user. Current rental rates for H200s are below their runcosts. Capacity is already too high in US. Innovations for smaller, faster, cheaper models are providing significant value for less hardware. Gemini flash 3.5 is very small and fast, at much lower cost as top 2 US labs. Deepseek v4 has massive cost reductions that will filter down to rest on industry, especially for context compression which is what allows more users on a single GPU cluster. Qwen 3.6 does bring size down enough to run 3-4 month old state of the art models on consumer hardware, but again multi user service at (pro instead of industrial) 96gb ram.
MTP and Turboquant are other technologies that increase tps delivery at less ram. Software stacks making better use of GPUs is eating token demand growth by itself even as exaggerated capacity comes online at slower pace than hardware investment values justified.