Cloudflare, for instance, shared that Mythos Preview was particularly adept at exploit chain construction: spotting how several bugs can be combined into a sequence of attacks that does more damage than any single exploited flaw.
But Anthropic also revealed that Mythos isn’t necessarily ready for primetime, which may be part of why the company is limiting it to such a small, controlled group of users.
Cloudflare also noted that other models found a lot of the same bugs as Mythos—an observation that has been made elsewhere. A security company called Aisle tested several small, open-source models and was able to find the same vulnerabilities that Anthropic highlighted when it announced Mythos—vulnerabilities that went unnoticed by humans for decades.
My take is that Mythos is quite good but, of course, overhyped by Anthropic. Current models are already quite capable, and experts can deploy them effectively. The good news is that a script kiddie will still have a hard time finding flaws and building workable exploits.
