this post was submitted on 16 Jun 2026
19 points (100.0% liked)
Hacker News
5019 readers
751 users here now
Posts from the RSS Feed of HackerNews.
The feed sometimes contains ads and posts that have been removed by the mod team at HN.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It amplifies what it was fed in training. That's the core of how an LLM works, the more probable output for an input. IF... if they had designed from the ground up to have verification be one of the highest rules vs. giving an answer the human likes as rewarded, and then gave it valid, authenticated, legal, and cultivated data to train on... we'd be in a different world. Granted, we wouldn't as far along as that would take a lot of money and time, and they (or someone) wouldn't have made the "profits" they have.
Money ruined LLMs. Like it does everything.
And to the topic's point, the easiest data to scrape is what they used, and GIGO. Sometimes there was gold in Reddit and other large databases, but searching for accuracy has always been an uphill battle for any search engine development. And they didn't even try.
I think the main point is that it amplifies stupid people's inability to recognize their own stupidity. Not that the rest of your points are invalid though.