this post was submitted on 19 Feb 2026
163 points (96.6% liked)

Technology

81532 readers
5552 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] tal@lemmy.today 3 points 11 hours ago (2 children)

Wooldridge sees positives in the kind of AI depicted in the early years of Star Trek. In one 1968 episode, The Day of the Dove, Mr Spock quizzes the Enterprise’s computer only to be told in a distinctly non-human voice that it has insufficient data to answer. “That’s not what we get. We get an overconfident AI that says: yes, here’s the answer,” he said. “Maybe we need AIs to talk to us in the voice of the Star Trek computer. You would never believe it was a human being.”

Hmm. That's probably a pretty straightforward modification for existing LLMs, at least at the token level.

You can obtain token probabilities, so you can give some estimate out-of-band confidence in a response, down to the token level. Don't really need to change anything for that, just expose some data.

And you could make the AI aware of its own neural net's confidence level, feed the confidence back into the neural net for subsequent tokens, see if you can get it to take that information into account.

https://en.wikipedia.org/wiki/Recurrent_neural_network

In artificial neural networks, recurrent neural networks (RNNs) are designed for processing sequential data, such as text, speech, and time series,[1] where the order of elements is important. Unlike feedforward neural networks, which process inputs independently, RNNs utilize recurrent connections, where the output of a neuron at one time step is fed back as input to the network at the next time step. This enables RNNs to capture temporal dependencies and patterns within sequences.

[–] ThirdConsul@lemmy.zip 3 points 2 hours ago

You can obtain token probabilities, so you can give some estimate out-of-band confidence in a response, down to the token level.

That means literally nothing. You can get wrong answer with 100% token confidence, and correct one with 0.000001% confidence.

[–] Aceticon@lemmy.dbzer0.com 1 points 2 hours ago

The problem is that LLMs don't generate "an answer" as a whole, they just generate tokens (generally word-sized, but not always) for the next text element given the context of all the text elements (the whole conversation) so far and the confidence level is per-token.

Further, the confidence level is not about logical correctness, it's about "how likely is this token to appear in this context".

So even if you try using token confidence you still end up stuck due to the underlying problem that the LLMs architecture is that of a "realistic text generator" and hence that confidence level is all about "what text comes next" and not at all about the logical elements conveyed via text such as questions and answers.