230
AI models fed AI-generated data quickly spew nonsense
(www.nature.com)
A community to post scientific articles, news, and civil discussion.
rule #1: be kind
<--- rules currently under construction, see current pinned post.
2024-11-11
As long as you verify the output to be correct before feeding it back is probably not bad.
That’s correct, and the paper supports this. But people don’t want to believe it’s true so they keep propagating this myth.
Training on AI outputs is fine as long as you filter the outputs to only things you want to see.
How do you verify novel content generated by AI? How do you verify content harvested from the Internet to "be correct"?
Same way you verified the input to begin with. Human labor
The issue is that A.I. always does a certain amount of mistakes when outputting something. It may even be the tiniest, most insignificant mistake. But if it internalizes it, it'll make another mistake including the one it internalized. So on and so forth.
Also this is more with scraping in mind. So like, the A.I. goes on the internet, scrapes other A.I. images because there's a lot of them now, and becomes worse.