If I mathed right that'd be one waymo every 350 feet of road on average. Is that a lot? It sounds like it might be a lot. Especially since self-driving cars greatest weakness appears to be driving in the vicinity of other self-driving cars.
I woke up and immediately read about something called "Defense Llama". The horrors are never ceasing: https://theintercept.com/2024/11/24/defense-llama-meta-military/
Scale AI advertised their chatbot as being able to:
apply the power of generative AI to their unique use cases, such as planning military or intelligence operations and understanding adversary vulnerabilities
However their marketing material, as is tradition, include an example of terrible advice. Which is not great given it's about blowing up a building "while minimizing collateral damage".
Scale AI's response to the news pointing this out -- complaining that everyone took their murderbot marketing material seriously:
The claim that a response from a hypothetical website example represents what actually comes from a deployed, fine-tuned LLM that is trained on relevant materials for an end user is ridiculous.
"Yeah I thought about going into civil engineering but the department of hustling really spoke to me y'know?"
Oh hey looks like another Chat-GPT assisted legal filing, this time in an expert declaration about the dangers of generative AI: https://www.sfgate.com/tech/article/stanford-professor-lying-and-technology-19937258.php
The two missing papers are titled, according to Hancock, “Deepfakes and the Illusion of Authenticity: Cognitive Processes Behind Misinformation Acceptance” and “The Influence of Deepfake Videos on Political Attitudes and Behavior.” The expert declaration’s bibliography includes links to these papers, but they currently lead to an error screen.
Irony can be pretty ironic sometimes.
Here are the results of these three models against Stockfish—a standard chess AI—on level 1, with a maximum of 0.01 seconds to make each move
I'm not a Chess person or familiar with Stockfish so take this with a grain of salt, but I found a few interesting things perusing the code / docs which I think makes useful context.
Skill Level
I assume "level" refers to Stockfish's Skill Level option.
If I mathed right, Stockfish roughly estimates Skill Level 1 to be around 1445 ELO (source). However it says "This Elo rating has been calibrated at a time control of 60s+0.6s" so it may be significantly lower here.
Skill Level affects the search depth (appears to use depth of 1 at Skill Level 1). It also enables MultiPV 4 to compute the four best principle variations and randomly pick from them (more randomly at lower skill levels).
Move Time & Hardware
This is all independent of move time. This author used a move time of 10 milliseconds (for stockfish, no mention on how much time the LLMs got). ... or at least they did if they accounted for the "Move Overhead" option defaulting to 10 milliseconds. If they left that at it's default then 10ms - 10ms = 0ms so 🤷♀️.
There is also no information about the hardware or number of threads they ran this one, which I feel is important information.
Evaluation Function
After the game was over, I calculated the score after each turn in “centipawns” where a pawn is worth 100 points, and ±1500 indicates a win or loss.
Stockfish's FAQ mentions that they have gone beyond centipawns for evaluating positions, because it's strong enough that material advantage is much less relevant than it used to be. I assume it doesn't really matter at level 1 with ~0 seconds to produce moves though.
Still since the author has Stockfish handy anyway, it'd be interesting to use it in it's not handicapped form to evaluate who won.
When the reporter entered the confessional, AI Jesus warned, “Do not disclose personal information under any circumstances. Use this service at your own risk.
Do not worry my child, for everything you say in this hallowed chamber is between you, AI Jesus, and the army of contractors OpenAI hires to evaluate the quality of their LLM output.
The guy running a hostile workplace while hanging out with Logan Paul, selling junk food to children, and putting on reality shows so hostile to the contestants that they get compared to torture is... into cryptocurrency?! I'm shocked! Shocked!
Goodness kids need some better role models because sometimes it seems 90% of people on the social networks are morally bankrupt.
Microsoft’s excuse is that many of these attacks require an insider.
Sure we made phishing way easier, more dangerous, and more subtle; but it was the user's fault for trusting our Don't Trust Anything I Say O-Matic workplace productivity suite!
Edit: and really from the demos it looks like a user wouldn't have to do anything at all besides write "summarize my emails" once. No need to click on anything for confidential info to be exfiltrated if the chatbot can already download arbitrary URLs based on the prompt injection!
The one catch is that because responses from the blockchain can take variable amounts of time, it’s best to request and receive from blockchains using asynchronous methods.
"You may be used to writing websites that actually load in fractions of a second, and so rely on obsolete web2 technologies like synchronous fetches. But don't worry! With modern techniques like async / await your loading spinner will animate flawlessly while the blockchain spends 20 minutes burning down a forest in the background."
You can practically taste the frustration in the "prompt engineering" here. Just one more edge case bro, one more edge case and then the prompt will be perfect!
TL;DR: "I'm no longer a white nationalist because a lot of white people are cucked. Now my new identity is with sufficiently pilled white people. This is much more pragmatic. Also I would totally have a cool black friend if any actually existed which means I'm totally not racist."
Also: did anyone else feel a chill reading about a literal nazi talking about how society accepts him now? America used to despise nazis and now they feel like they can walk around openly. It's terrifying.
Ah yes Africa, the small country on the northern coast of Africa.