The problem with AI alignment is that humans aren't aligned (lemy.lol)

submitted 1 year ago* (last edited 1 year ago) by preasket@lemy.lol to c/showerthoughts@lemmy.world

23 comments fedilink hide all child comments

I'm sure there are some AI peeps here. Neural networks scale with size because the number of combinations of parameter values that work for a given task scales exponentially (or, even better, factorially if that's a word???) with the network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned? What can we realistically hope for?

Here's what I mean by alignment:

Ability to specify a loss function that humanity wants
Some strict or statistical guarantees on the deviation from that loss function as well as potentially unaccounted side effects

you are viewing a single comment's thread
view the rest of the comments

[-] fubo@lemmy.world 18 points 1 year ago* (last edited 1 year ago)

Some of the human-alignment projects look like "religions" and some look like "economies" and some look like "just talking to each other and trying to be halfway decent folks and not flipping out or some shit".

Heck, arguably the United Nations is a human-alignment project for x-risk mitigation.

[-] DeVaolleysAdVocate@lemmy.world 1 points 1 year ago

We'd like to bring all those and their existing versions together with the A-Better-World Consensus-Engine idea.

Tell me more about some of these other projects though please.

load more comments (3 replies)

this post was submitted on 14 Jul 2023

69 points (96.0% liked)

Showerthoughts

28866 readers

872 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The best ones are thoughts that many people can relate to and they find something funny or interesting in regular stuff.

Rules

All posts must be showerthoughts
The entire showerthought must be in the title
Posts must be original/unique
Be good to others - no bigotry - including racism, sexism, ableism, homophobia, transphobia, or xenophobia
Adhere to Lemmy's Code of Conduct

founded 1 year ago

MODERATORS

vatlark@lemmy.world

forkball@lemmy.world

quinten@lemmy.world

Thekingoflorda@lemmy.world