[-] nulldev@lemmy.vepta.org 3 points 1 year ago

Ah shit, I thought it had reverb but it doesn't seem to :(, my bad.

[-] nulldev@lemmy.vepta.org 5 points 1 year ago* (last edited 1 year ago)

The issue here is that you are describing the goal of LLMs, not how they actually work. The goal of an LLM is to pick the most likely next token. However, it cannot achieve this via rudimentary statistics alone because the model simply does not have enough parameters to memorize which token is most likely to come next in all cases. So yes, the model "builds up statistics of which tokens it sees in which contexts", but it does so by building its own internal data structures and organization systems, which are complete black boxes.

Also, going "one token at a time" is only a "limitation" because LLMs are not accurate enough. If LLMs were more accurate, then generating "one token at a time" would not be an issue because the LLM would never need to backtrack.

And this limitation only exists because there isn't much research into LLMs backtracking yet! For example, you could give LLMs a "backspace" token: https://news.ycombinator.com/item?id=36425375
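For illustration, here's a minimal sketch of what a decoding loop that honors a "backspace" token could look like. Everything here is hypothetical and not from the linked thread: the scripted `toy_model` is a stand-in for a real LLM's next-token prediction.

```python
BACKSPACE = "<bksp>"
EOS = "<eos>"

def make_toy_model():
    # Hypothetical stand-in for an LLM: scripted next-token choices keyed by
    # context. Lists are consumed left to right on repeated visits, so the
    # model "tries something different" after backtracking.
    script = {
        (): ["The"],
        ("The",): ["cat"],
        ("The", "cat"): ["barked", "meowed"],   # first try is wrong, retry succeeds
        ("The", "cat", "barked"): [BACKSPACE],  # model flags its own mistake
        ("The", "cat", "meowed"): [EOS],
    }
    def model(context):
        choices = script[tuple(context)]
        return choices.pop(0) if len(choices) > 1 else choices[0]
    return model

def generate(model, max_steps=10):
    out = []
    for _ in range(max_steps):
        tok = model(out)
        if tok == EOS:
            break
        if tok == BACKSPACE:
            if out:
                out.pop()  # undo the previous token instead of appending
            continue
        out.append(tok)
    return out

print(" ".join(generate(make_toy_model())))  # → "The cat meowed"
```

The point is that the correction happens inside ordinary left-to-right generation: the backspace is emitted as just another token, and the decoding loop interprets it as "undo".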

> Have you tried that when it’s correct too? And in that case you mention it has a clean break and then start anew with token generation, allowing it to go a different path. You can see it more clearly experimenting with local LLM’s that have fewer layers to maintain the illusion.

If it's correct, then it gives a variety of responses. The space token effectively just makes it reflect on the conversation.

> We’re trying to make a flying machine by improving pogo sticks. No matter how well you design the pogo stick and the spring, it will not be a flying machine.

To be clear, I do not believe LLMs are the future. But I do believe that they show us that AI research is on the right track.

Building a pogo stick is essential to building a flying machine. By building a pogo stick, you learn so much about physics. Over time, you replace the spring with some gunpowder to get a mortar. You shape the gunpowder into a tube to get a model rocket and discover the pendulum rocket fallacy. And finally, instead of gunpowder, you use liquid fuel and you get a rocket that can go into space.

[-] nulldev@lemmy.vepta.org 5 points 1 year ago

Whoops, meant to say: "In many cases, they can accurately (critique their own work)". Thanks for correcting me!

[-] nulldev@lemmy.vepta.org 4 points 1 year ago

> Have you even read the article?

IMO it does not do a good job of disproving that "humans are stochastic parrots".

The example with the octopus isn't really about stochastic parrots. It's more about how LLMs are not multi-modal.

[-] nulldev@lemmy.vepta.org 5 points 1 year ago

> it just predicts the next word out of likely candidates based on the previous words

An entity that can consistently predict the next word of any conversation, book, or news article with extremely high accuracy is quite literally a god, because it can effectively predict the future. So it is not surprising to me that GPT's performance is inconsistent.

> It won't even know it's written itself into a corner

In many cases it does. For example, if GPT gives you a wrong answer, you can often just send an empty message (a single space) and GPT will say something like: "Looks like my previous answer was incorrect, let me try again: blah blah blah".

> And until we get a new approach to LLM's, we can only improve it by adding more training data and more layers allowing it to pick out more subtle patterns in larger amounts of data.

This says nothing. You are effectively saying: "Until we can find a new approach, we can only expand on the existing approach" which is obvious.

But new approaches arrive all the time! Advances in tokenization come regularly. Every week there is a new paper with a new model architecture. We are not stuck in some sort of hole.

[-] nulldev@lemmy.vepta.org 2 points 1 year ago

Looks very useful! BTW the GitHub badge/link in the README is broken.

[-] nulldev@lemmy.vepta.org 5 points 1 year ago* (last edited 1 year ago)

What are you talking about? The issue asking to bring back captchas was only opened 4 days ago!

Captchas were only removed 2 weeks ago, no one spoke up then: https://github.com/LemmyNet/lemmy/issues/2922

The developers have nothing against captchas. They were the ones who originally built and added the feature: https://github.com/LemmyNet/lemmy/pull/1027

[-] nulldev@lemmy.vepta.org 2 points 1 year ago* (last edited 1 year ago)

I would recommend you ditch the second nginx layer. It's a waste of resources and it can cause a multitude of issues if the configuration isn't done correctly.

  • If you are hosting multiple domains on the same server, disable the nginx container in the docker-compose.yml file and copy Lemmy's nginx config into your system's nginx config (e.g. /etc/nginx/).
    • If you go this route you should also delete the lemmyexternalproxy network, delete internal: true on the lemmyinternal network (required to enable port forwarding) and add port forwards to the lemmy and lemmy-ui docker services. Here's what that would look like: https://www.diffchecker.com/vjfEFuz6/
  • If you are not hosting multiple domains on the same server, simply edit the port forwards in the docker-compose.yml file for the proxy service to bind to whatever your external facing IP is.
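For the second case, the binding would look something like this in docker-compose.yml. This is an illustrative fragment, not Lemmy's actual file: the service name and ports are placeholders, and 203.0.113.10 is an example address you'd replace with your server's external IP.

```yaml
# Illustrative docker-compose fragment: bind the proxy to one external IP
# using Docker's "HOST_IP:HOST_PORT:CONTAINER_PORT" port syntax.
services:
  proxy:
    ports:
      - "203.0.113.10:80:80"
      - "203.0.113.10:443:443"
```

Binding to a specific IP (rather than `0.0.0.0`) keeps the proxy from claiming ports 80/443 on every interface, which is what lets other services on the same machine serve other domains.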
[-] nulldev@lemmy.vepta.org 3 points 1 year ago* (last edited 1 year ago)

The default Lemmy nginx config should handle websockets properly. Are you putting it behind a second nginx layer?
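If you do have a second layer in front, it also needs the websocket upgrade headers, or connections will silently fall back to plain HTTP. A rough sketch of the relevant nginx directives (the upstream name and port are placeholders, not Lemmy's actual values):

```nginx
# Illustrative: forward websocket upgrade requests through nginx
location / {
    proxy_pass http://lemmy-ui:1234;
    proxy_http_version 1.1;
    proxy_set_header Upgrade $http_upgrade;
    proxy_set_header Connection "upgrade";
    proxy_set_header Host $host;
}
```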

[-] nulldev@lemmy.vepta.org 3 points 1 year ago

I believe all comments on all communities you interact with are saved locally.

[-] nulldev@lemmy.vepta.org 4 points 1 year ago* (last edited 1 year ago)

I see the potential but Lemmy in its current state is very buggy. There needs to be a huge uptick in dev activity to iron out all the bugs and usability issues before June 30th hits. Otherwise, I see little hope of adoption.

The performance issues also need to be fixed ASAP. Sure, you could just "use a different instance", but you can't even federate with overloaded instances!

EDIT: Looks like there are a lot of fixes coming in this PR: https://github.com/LemmyNet/lemmy-ui/pull/1081
