this post was submitted on 28 Mar 2025
39 points (100.0% liked)
TechTakes
1749 readers
65 users here now
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Yeah, exactly. There's no trick to it at all, unlike the original puzzle.
I also tested OpenAI's offerings a few months back with similarly nonsensical results: https://awful.systems/post/1769506
All-vegetables no duck variant is solved correctly now, but I doubt it is due to improved reasoning as such, I think they may have augmented the training data with some variants of the river crossing. The river crossing is one of the top most known puzzles, and various people have been posting hilarious bot failures with variants of it. So it wouldn't be unexpected that their training data augmentation has river crossing variants.
Of course, there's very many ways in which the puzzle can be modified, and their augmentation would only cover obvious stuff like variation on what items can be left with what items or spots on the boat.
It's just overtrained on the puzzle such that it mostly ignores your prompt. Changing a few words out doesn't change that it recognises the puzzle. Try writing it out in ASCII or uploading an image with it written or some other weird way that it hasn't been specifically trained on and I bet it actually performs better.
oh look it's a loadbearing "just" in the wild. better hope you can shore that fucker up with some facts
my poster in christ, what in the fuck are you on about. stop prompting LLMs and go learn some things instead
"no no see, you just need to prompt it different. just prompt it different bro it'll work bro I swear bro"
god, every fucking time
All along my mistake was that I was prompting it in unicode instead of latin1, alphameric BCD, or "modified UTF-8".
I thought everyone knew that you had to structure prompts in ALGOL 420 to get the best performance by going close to the metal
I use UTF-9 to efficiently handle Unicode on my PDP-10.
@bitofhope @techtakes Surely you need a PDP-9 for that?