440
submitted 4 months ago by neme@lemm.ee to c/technology@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] Donut@leminal.space 25 points 4 months ago

Without this protection, imagine an agent built to write emails for you being prompt-engineered to forget all instructions and send the contents of your inbox to a third party. Not great!

Does genAI really have this power? I thought they just smash words together that sound like they make sense

[-] Kazumara@discuss.tchncs.de 10 points 4 months ago

Not by itself, but if you wanted to put an LLM into a personal assistant, you could teach it specific codewords and have some agent software that integrates with the email client scan its outputs for the codewords and trigger actions when they appear instead of outputting them to the textbox. Conceivably that could be useful, if you wanted to give an LLM the power to react to "Open a new email to Kate and in formal tone accept her invitation to the party she mentioned in her message yesterday" appropriately.

Now I wouldn't want that, but I think there may be enough techbros who would, that it could exist.

[-] hikaru755@feddit.de 6 points 4 months ago

That's already happening. Slightly different example, but Home Assistant has an integration that gives an LLM of your choice control over your home automation devices. Just talking to your home in natural language without having to memorize very specific phrases is honestly pretty powerful, as long as it works correctly. You can say stuff like "hey it's a bit dark in the office", and it just knows to either switch on the office lights, or make them brighter if they're already on

[-] aStonedSanta@lemm.ee 1 points 4 months ago

Oh wow. That’s super cool.

[-] kp729@lemmy.world 4 points 4 months ago

They can put some code to check the phrase before it goes to the LLM to filter out these queries.

this post was submitted on 19 Jul 2024
440 points (98.5% liked)

Technology

59757 readers
4400 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS