Anthropic withholds Mythos Preview model because it's hacking is too powerful

TL;DR

Anthropic is rolling out a preview of its new Mythos model only to a handpicked group of tech and cybersecurity companies over concerns about its ability to find and exploit security flaws, the company said Tuesday. Why it matters: Anthropic is so worried about the damage its own model could cause that it's refusing to release it publicly until there are safeguards to control its most dangerous capabilities. Threat level: Mythos Preview is "extremely autonomous" and has sophisticated reasoning capabilities that give it the skills of an advanced security researcher, Logan Graham, head of Anthropic's frontier red team, told Axios. Mythos Preview can find "tens of thousands of vulnerabilities" that even the most advanced bug hunter would struggle to find. Unlike past models, it can also write the exploits to go with them. Opus 4.6, the last model Anthropic released to the public, found abou.

Nauti's Take

Anthropic's restraint here is genuinely notable — withholding a model because it's too dangerous is exactly the kind of precautionary behavior the industry needs more of. The tension is that access goes only to select large firms, raising questions about who gets to use the most powerful security tools.

Smaller defenders and independent researchers are left on the outside.

Summary

Anthropic is rolling out a preview of its new Mythos model only to a handpicked group of tech and cybersecurity companies over concerns about its ability to find and exploit security flaws, the company said Tuesday. Why it matters: Anthropic is so worried about the damage its own model could cause that it's refusing to release it publicly until there are safeguards to control its most dangerous capabilities.

Threat level: Mythos Preview is "extremely autonomous" and has sophisticated reasoning capabilities that give it the skills of an advanced security researcher, Logan Graham, head of Anthropic's frontier red team, told Axios. Mythos Preview can find "tens of thousands of vulnerabilities" that even the most advanced bug hunter would struggle to find.

Unlike past models, it can also write the exploits to go with them. Opus 4.6, the last model Anthropic released to the public, found abou

Sources