This is really what blows my mind the most. With all this talk about how much power LLMs and diffusion models use companies are still constantly cramming it into places where it's just running all the time passively doing things no one asked for.
Overall power use by these things would probably be cut down by an order of magnitude by just limiting it to directed, intentional use only.
The suggestion that this is a "radical approach" might actually be the most insane part of what is already a fairly insane article.