this post was submitted on 03 Jun 2026
819 points (99.6% liked)
People Twitter
10037 readers
342 users here now
People tweeting stuff. We allow tweets from anyone.
RULES:
- Mark NSFW content.
- No doxxing people.
- Must be a pic of the tweet or similar. No direct links to the tweet.
- No bullying or international politcs
- Be excellent to each other.
- Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I do wonder about that though. The Big AI operating costs include being able to service a certain number of customers within a certain amount of time. So if they need to service 10,000 requests per minute and fulfill them within 2-4 seconds, that's a big datacenter.
Now if a company does a few dozen requests a minute and on average needs double-digit response times... the costs to implement could be much different. The thing is finding a model that will do that and provide accurate (enough) output versus how much it Claude's pricing is built around speed+volume versus accuracy.
A lot of cost is on training it as well. Which you need if you want to "build your own claude". If you run only the inferences with an open model then ya it's directly correlated to how fast you want the responses to come in.