this post was submitted on 14 Jun 2026
804 points (99.5% liked)

People Twitter

10090 readers
1283 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

  1. Mark NSFW content.
  2. No doxxing people.
  3. Must be a pic of the tweet or similar. No direct links to the tweet.
  4. No bullying or international politcs
  5. Be excellent to each other.
  6. Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.

founded 3 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] errer@lemmy.world 12 points 4 days ago (3 children)

I mean the poster above you is wrong, they use math tools internally now when you ask math questions. Very obvious in Gemini. Yes the raw LLM trying to autocomplete the answer to a math problem is gonna be wrong but that’s not the way they are used to solve problems like that anymore.

[–] sbv@sh.itjust.works 7 points 4 days ago

The LLM has to choose to use the calculating tools. Gemini tried to do this one solo:

4 + 2 + 2 + 2 + 1+ 2 + 0 = 15

Tbf, it did four of these calculations, and 75% were correct.

[–] baines@lemmy.cafe 5 points 4 days ago

no way i’d want to drive on a bridge built on their supposed math

[–] wonderingwanderer@sopuli.xyz 1 points 4 days ago

That makes sense. I clearly don't keep up on the frontier models...