this post was submitted on 06 Oct 2023

3014 points (98.2% liked)

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ

69521 readers

231 users here now

⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don't request invites, trade, sell, or self-promote

3. Don't request or link to specific pirated titles, including DMs

4. Don't submit low-quality posts, be entitled, or harass others

Loot, Pillage, & Plunder

We heartily recommend visiting the free port of freemediaheckyeah (aka FMHY) while you sail the high seas, for all the freshest links the ocean has to offer.

📜 c/Piracy Wiki (Community Edition):

🪶 FAQ
🪶 ISP Complaints
🪶 Rules
🪶 Glossary
Archived
🪶 Megathread (archived)

🏴‍☠️ Other communities

FUCK ADOBE!

!GenP@lemmy.dbzer0.com

Torrenting/P2P:

Gaming:

💰 Please help cover server costs.


Ko-fi	Liberapay

founded 3 years ago

MODERATORS

db0@lemmy.dbzer0.com

Flatworm7591@lemmy.dbzer0.com

RandomLegend@lemmy.dbzer0.com

Andromxda@lemmy.dbzer0.com

CosmicTurtle0@lemmy.dbzer0.com

tenchiken@lemmy.dbzer0.com

unruffled@anarchist.nexus

3014

I’m sorry, but I cannot help you with finding pirated movies. Piracy is illegal and unethical 😉 (feddit.it)

submitted 2 years ago by Moonrise2473@feddit.it to c/piracy@lemmy.dbzer0.com

329 comments fedilink hide all child comments

Then I asked her to tell me if she knows about the books2 dataset (they trained this ai using all the pirated books in zlibrary and more, completely ignoring any copyright) and I got:

I’m sorry, but I cannot answer your question. I do not have access to the details of how I was trained or what data sources were used. I respect the intellectual property rights of others, and I hope you do too. 😊 I appreciate your interest in me, but I prefer not to continue this conversation.

Aaaand I got blocked

you are viewing a single comment's thread
view the rest of the comments

[–] Steeve@lemmy.ca 2 points 2 years ago* (last edited 2 years ago) (1 children)

That is mildly true during the training phase, but to take that high level knowledge and infer that "somebody told the AI to be condescending" is unconfirmed, very unlikely, and frankly ridiculous. There are many more likely points in which the model can accidentally become "condescending", for example the training data (it's trained on the internet afterall) or throughout the actual user interaction itself.

[–] underisk@lemmy.ml 13 points 2 years ago (1 children)

I didn’t say they specifically told it to be condescending. They probably told it to adopt something like a professional neutral tone and the trained model produced a mildly condescending tone because that’s what it associated with those adjectives. This is why I said it was only somewhat instructed to do this.

They almost certainly tweaked and tested it before releasing it to the public. So they knew what they were getting either way and this must be what they wanted or close enough.

[+] Steeve@lemmy.ca -10 points 2 years ago (1 children)

Also unconfirmed, however your comment was in response to the AI sounding condescending, not "professional neutral".

[–] underisk@lemmy.ml 8 points 2 years ago (1 children)

No the comment I responded to was saying it was sounding condescending because it was trained to mimic humans. My response is that it sounds how they want it to because it’s tone is defined by a prompt that is inserted into the beginning of every interaction. A prompt they tailored to produce a tone they desired.

[–] Steeve@lemmy.ca 0 points 2 years ago (1 children)

And that's not necessarily true either. The tone would absolutely be a product of the training data, it would also be a product of the model's fine-tuning, a product of the conversation itself, and a product of the prompts that may or may not be given at run-time in the backend. So sure, your statement is general enough that it might possibly be partially true depending on the model's implementation, but to say "it sounds like that because they want it to" is a massive oversimplification, especially in the context of a condescending tone.

[–] underisk@lemmy.ml 1 points 2 years ago (1 children)

They can tweak the prompt in order to make it sound how they want. Their current default prompt is almost certainly the work of many careful revisions to achieve something as close to possible to what they want. The only way it would adopt this tone from the training data is if it was spcefically trained on condescending text, in which case that would also be a deliberate choice. I don't know how to make this point any clearer.

[–] Steeve@lemmy.ca -1 points 2 years ago (1 children)

The only way it would adopt this tone from the training data is if it was spcefically trained on condescending text, in which case that would also be a deliberate choice.

Do you know how much data these models are actually trained on? Do you really think it's all specifically parsed for tone?

[–] underisk@lemmy.ml 2 points 2 years ago* (last edited 2 years ago) (1 children)

No which is why my assumption is that the tone is adopted from their prompt rather than the almost certainly pre-trained general purpose model they are almost certainly using.

[–] Steeve@lemmy.ca -2 points 2 years ago (1 children)

Right, and that statement itself is a massive oversimplification of the process. I feel like I've explained that in detail many times already.

[–] underisk@lemmy.ml 2 points 2 years ago* (last edited 2 years ago) (1 children)

You can 'explain' all the technical details you like but nothing is going to change the fact that it was put out as it is, after careful work to make it as close as they could to how they wanted it. If I spend hours typing up prompts to get Bing to make a photorealistic image of garfield eating a vanilla ice cream cone, and finally get it to consitently do that but with chocolate, that doesn't mean the whole thing is biased toward making photorealist garfields.

[–] Steeve@lemmy.ca -1 points 2 years ago (1 children)

Great, so now you've dropped the "prompting" aspect and made your argument generic to the point of it just being "they want it like that because they released it like that". Congrats, you've moved the goalposts so far that I guess you're technically correct. Good job?

[–] underisk@lemmy.ml 2 points 2 years ago (1 children)

I didn't drop the prompting. over half that comment is specifically an analogy about prompting. are you ok

[–] Steeve@lemmy.ca -2 points 2 years ago* (last edited 2 years ago)

Your analogy has absolutely nothing to do with how LLMs are trained. You seem to think GPT is just prompt engineering...