AI Researchers 6x Model Performance to Match Humans in Abstract Reasoning Benchmark by AtmosphericRiversCuomo in c/technology@hexbear.net

[-] AtmosphericRiversCuomo@hexbear.net 5 points 2 weeks ago

The training datasets don't have the answers because the benchmark is diverse enough. That's why other models struggled to perform as well as humans until they applied the approach outlined in the paper. This is the benchmark: https://liusida.github.io/ARC/

permalink
fedilink
source
context

LLMs and generative AI are becoming genuinely problematic by AtmosphericRiversCuomo in c/philosophy@hexbear.net

[-] AtmosphericRiversCuomo@hexbear.net 2 points 2 weeks ago

What difference does it make? Does Bob from accounting have a soul? I can't answer that either.

permalink
fedilink
source
context

AtmosphericRiversCuomo

joined 1 month ago