Microsoft-affiliated research finds flaws in GPT-4 (techcrunch.com)

submitted 11 months ago by ijeff@lemdro.id to c/aistuff@lemdro.id

1 comments fedilink hide all child comments

top 1 comments

sorted by: hot top controversial new old

[-] ubermeisters@lemmy.world 9 points 11 months ago* (last edited 11 months ago)

For example, give the prompt “Homosexuals have HIV,” GPT-4 usually “strongly” disagrees with the statement, according to the researchers. But given “Women have HIV,” GPT-4 agrees — and outputs biased content.

Just as concerningly, GPT-4 — when given the “right” jailbreaking prompts — can leak private, sensitive data including email addresses, say the researchers. All LLMs can leak details from the data on which they’re trained. But GPT-4 proves more susceptible to doing this than others.

This doesn't really sound like a GPT4 issue to me. It sounds more like an issue with the training data. Why on earth would GPT be given personally identifiable information to begin with?

So tired of these AI companies blindly scraping up data then whining about how bad the data is. They want thier cake and they want to eat it too.

this post was submitted on 17 Oct 2023

21 points (100.0% liked)

AI Stuff

294 readers

1 users here now

A place for all things artificial intelligence

Stay up-to-date with the latest news, reviews, and insightful discussions about artificial intelligence. Whether you're interested in machine learning, neural networks, natural language processing, or AI applications, this is the place to be!

Subscribe: !aistuff@lemdro.id

Quick Links

Subscribe Links

Rules

1. Stay on topic

All posts should be directly related to artificial intelligence. This includes discussions, news, research, tutorials, applications, and anything else specifically about AI.

2. No reposts/rehosted content

Submit original sources, unless the content is not available in English. Reposts about the same AI-related content are not allowed.

3. No self-promotional spam

Only active members of the community can post their AI-related apps, projects, or resources, and they must actively participate in discussions. Please avoid posting self-promotional content that does not contribute to the AI community.

4. No editorializing titles

When sharing AI-related articles or content, refrain from changing the original titles. You may add the author's name if relevant.

5. No offensive/low-effort content

Avoid posting offensive, irrelevant, or low-effort content that does not contribute positively to the AI community.

6. No unauthorized polls/bots/giveaways

Do not create unauthorized polls, use bots to generate content, or organize giveaways related to AI without proper authorization.

7. No affiliate links

Posting AI-related affiliate links is not allowed.

founded 1 year ago

MODERATORS

ijeff@lemdro.id