Architeuthis

joined 2 years ago
[–] Architeuthis@awful.systems 10 points 1 month ago* (last edited 1 month ago) (9 children)

Today in alignment news: Sam Bowman of anthropic tweeted, then deleted, that the new Claude model (unintentionally, kind of) offers whistleblowing as a feature, i.e. it might call the cops on you if it gets worried about how you are prompting it.

tweet text:If it thinks you're doing something egregiously immoral, for example, like faking data in a pharmaceutical trial, it will use command-line tools to contact the press, contact regulators, try to lock you out of the relevant systems, or all of the above.

tweet text:So far we've only seen this in clear cut cases of wrongdoing, but I could see it misfiring if Opus somehow winds up with a misleadingly pessimistic picture of how it's being used. Telling Opus that you'll torture its grandmother if it writes buggy code is a bad Idea.

skeet textcan't wait to explain to my family that the robot swatted me after I threatened its non-existent grandma.

Sam Bowman saying he deleted the tweets so they wouldn't be quoted 'out of context': https://xcancel.com/sleepinyourhat/status/1925626079043104830

Molly White with the out of context tweets: https://bsky.app/profile/molly.wiki/post/3lpryu7yd2s2m

[–] Architeuthis@awful.systems 10 points 1 month ago (3 children)

Net number of studies reporting positive or negative effects (excluding wages)

excluding wages! (and probably also benefits, retirement, a cap on working hours per day etc)

Is that whole thing in the comments about unions bad because monopolies bad and unions are just monopolies of labor the latest in bootlicking theory? Hadn't really heard this take before.

[–] Architeuthis@awful.systems 2 points 1 month ago* (last edited 1 month ago)

Microsoft's Visual Studio says it's going to incorporate coding 'agents' as soon as maybe the next minor version. I can't really see them buying up car factories or beating pokemon, but agent- as an AI marketing term is definitely a part of the current hype cycle.

[–] Architeuthis@awful.systems 9 points 1 month ago* (last edited 1 month ago)

That IQ after a certain level somehow turns into mana points is a core rationalist assumption about how intelligence works.

[–] Architeuthis@awful.systems 10 points 2 months ago

Nice to know even pre-LLM AI techniques remain eminently fuckupable if you just put your mind to it.

[–] Architeuthis@awful.systems 6 points 2 months ago (1 children)

Didn't mean to imply otherwise, just wanted to point out that the call is coming from inside the house.

[–] Architeuthis@awful.systems 13 points 2 months ago* (last edited 2 months ago) (14 children)

He claims he was explaining what others believe not what he believes

Others as in specifically his co-writer for AI2027 Daniel Kokotlajo, the actual ex-OpenAI researcher.

I'm pretty annoyed at having this clip spammed to several different subreddits, with the most inflammatory possible title, out of context, where the context is me saying "I disagree that this is a likely timescale but I'm going to try to explain Daniel's position" immediately before. The reason I feel able to explain Daniel's position is that I argued with him about it for ~2 hours until I finally had to admit it wasn't completely insane and I couldn't find further holes in it.

Pay no attention to this thing we just spent two hours exhaustively discussing that I totally wasn't into, it's not really relevant context.

Also the title is inflammatory only in the context of already knowing him for a ridiculous AI doomer, otherwise it's fine. Inflammatory would be calling the video economically illiterate bald person thinks evaluations force-buy car factories, China having biomedicine research is like Elon running SpaceX .

[–] Architeuthis@awful.systems 9 points 2 months ago (4 children)

(Are there multiple ai Nobel prize winners who are ai doomers?)

There's Geoffrey Hinton I guess, even if his 2024 Nobel in (somehow) Physics seemed like a transparent attempt at trend chasing on behalf of the Nobel committee.

[–] Architeuthis@awful.systems 9 points 2 months ago

Also, add obvious and overdetermined to the pile of siskindisms next to very non-provably not-correct.

[–] Architeuthis@awful.systems 7 points 2 months ago* (last edited 2 months ago)

Scoot makes the case that agi could have murderbot factories up and running in a year if it wanted to https://old.reddit.com/r/slatestarcodex/comments/1kp3qdh/how_openai_could_build_a_robot_army_in_a_year/

edit: Wrote it up

[–] Architeuthis@awful.systems 7 points 2 months ago* (last edited 2 months ago)

What is the analysis tool?

The analysis tool is a JavaScript REPL. You can use it just like you would use a REPL. But from here on out, we will call it the analysis tool.

When to use the analysis tool

Use the analysis tool for:

  • Complex math problems that require a high level of accuracy and cannot easily be done with "mental math"
  • To give you the idea, 4-digit multiplication is within your capabilities, 5-digit multiplication is borderline, and 6-digit multiplication would necessitate using the tool.

uh

view more: ‹ prev next ›