this post was submitted on 28 Mar 2025
39 points (100.0% liked)

TechTakes

1742 readers
94 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago
MODERATORS
 

So I signed up for a free month of their crap because I wanted to test if it solves novel variants of the river crossing puzzle.

Like this one:

You have a duck, a carrot, and a potato. You want to transport them across the river using a boat that can take yourself and up to 2 other items. If the duck is left unsupervised, it will run away.

Unsurprisingly, it does not:

https://g.co/gemini/share/a79dc80c5c6c

https://g.co/gemini/share/59b024d0908b

The only 2 new things seem to be that old variants are no longer novel, and that it is no longer limited to producing incorrect solutions - now it can also incorrectly claim that the solution is impossible.

I think chain of thought / reasoning is a fundamentally dishonest technology. At the end of the day, just like older LLMs it requires that someone solved a similar problem (either online or perhaps in a problem solution pair they generated if they do that to augment the training data).

But it outputs quasi reasoning to pretend that it is actually solving the problem live.

all 34 comments
sorted by: hot top controversial new old
[–] quediuspayu@lemmy.dbzer0.com 20 points 3 days ago

Take the potato and the carrot, show the carrot to the duck and make it follow you, dude can swim.

One trip, solved.

[–] blakestacey@awful.systems 12 points 3 days ago
[–] BlueMonday1984@awful.systems 11 points 3 days ago (2 children)

I'm kinda tired, but this puzzle's shoved itself into my brain. The obvious solution I can see is, roughly speaking:

  1. Take the duck and carrot across

  2. Take the duck back

  3. Take the duck and potato across

[–] shnizmuffin@lemmy.inbutts.lol 11 points 3 days ago (1 children)

My two solutions:

  1. Eat the carrot. Take the duck and potato across.
  2. It's a row boat. Take the carrot and potato, supervise the duck as it swims behind you.

I'm not doing three river crossings, you can't make me.

[–] skillissuer@discuss.tchncs.de 11 points 3 days ago* (last edited 3 days ago)

another solution:

take duck, carrot and potato at once. if boat is fine if you put duck and carrot in but will sink if you put in duck, carrot and potato then you're already on horrifyingly narrow engineering margins and probably shouldn't use it in the first place

in the worst case you can put duck on a leash if it'll run away otherwise

[–] diz@awful.systems 9 points 3 days ago* (last edited 3 days ago) (1 children)

Yeah, exactly. There's no trick to it at all, unlike the original puzzle.

I also tested OpenAI's offerings a few months back with similarly nonsensical results: https://awful.systems/post/1769506

All-vegetables no duck variant is solved correctly now, but I doubt it is due to improved reasoning as such, I think they may have augmented the training data with some variants of the river crossing. The river crossing is one of the top most known puzzles, and various people have been posting hilarious bot failures with variants of it. So it wouldn't be unexpected that their training data augmentation has river crossing variants.

Of course, there's very many ways in which the puzzle can be modified, and their augmentation would only cover obvious stuff like variation on what items can be left with what items or spots on the boat.