Technology

85208 readers

3941 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

555

Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them (ea.rna.nl)

submitted 1 day ago by Trilogy3452@lemmy.world to c/technology@lemmy.world

153 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Wildmimic@anarchist.nexus 13 points 22 hours ago* (last edited 22 hours ago) (1 children)

Both Uber and Spotify (and AWS too) had economics of scale going for them - the more users they have, the more the infrastructure could be leveraged. This does NOT work for LLMs. More users means using more compute, more advanced tasks (like coding) uses exponential amounts of compute. A single user running a complex task can make 8 Blackwell GPUs run full tilt, and you don't even have any guarantee that the output will be useable.

There are a few narrow areas where LLMs might be successful, like scanning for security vulnerabilities or searching large amounts of documents. The massive amount of money invested will never be recouped with these usage scenarios.

[–] Imperious_melange@lemmy.world 1 points 11 hours ago* (last edited 11 hours ago)

I don't think anyone is assuming it will stay at its current efficiency and there will be zero improvements. A lot of the everyday AI use cases will likely be pushed to someone's personal device aka your phone. In the same way a lot of Uber and Spotify is handled by your personal device today. What we've seen for years now is the development of these gargantuan models that are then condensed down into much smaller models with 90%+ of the same effectiveness. Simultaneously we will see and are seeing devices sold with better NPU's for edge compute for AI the same we've seen the push for more edge compute to manage other services such as Uber and Spotify.

Across this thread and others there's like this implicit assumption AI will never progress beyond where it is right now in spite of the evidence of its almost exponential growth. It's really interesting.