Anthropic investigates report of rogue access to hack-enabling Mythos AI
TL;DR
‘Handful’ of people allegedly gain unauthorised access to model adept at detecting cybersecurity vulnerabilities
The AI developer Anthropic has confirmed it is investigating a report that unauthorised users have gained access to its Mythos model, which it has warned poses risks to cybersecurity. The US startup made the statement after Bloomberg reported on Wednesday that a small group of people had accessed the model, which has not been released to the public because of its ability to enable cyber-attacks.
Nauti's Take
Anthropic going public with the investigation is the right call — transparency under pressure is exactly what responsible AI development looks like. But the incident underlines a hard paradox: building a model specifically good at enabling cyberattacks creates a high-value target the moment it exists.
Anyone operating critical infrastructure should closely watch what the investigation reveals about how Mythos was accessed and by whom.