this post was submitted on 02 Jun 2025
677 points (98.8% liked)

Programmer Humor

23915 readers
3234 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] Schadrach 1 points 4 days ago

just curious, what kind of guardrails have you tried going against? i recently used the above to get a long and detailed list of instructions for cooking meth (not really interested in this, just to hone the technique)

Essentially the same kind of thing, just as a test. Older models you can usually just ask to roleplay such a character, later models you can cheat a bit and write up some JSON configuration as a prompt, because that apparently skips right past some of the input filtering. Look up the so-called "Dr. House" attack for an example of it. It's basically the typical roleplaying style attack wrapped in JSON.