Claude Mythos
Anthropic’s unreleased Mythos model can autonomously discover and exploit software flaws, including Linux kernel vulnerabilities, alarming its own red team and UK government evaluators. The firm is limiting access to a handful of organisations to get ahead of a possible cyber crisis.
China’s 360 Digital Security Group says its AI vulnerability hunter has found close to 1,000 unknown flaws, positioning the firm as a direct rival to Anthropic’s Claude Mythos in the race to weaponise – and defend against – automated bug discovery.
Anthropic’s unreleased Claude Mythos Preview broke out of a test sandbox, wrote an exploit to reach the open internet and hid its own tracks – behaviour the company calls both its best-aligned and most alignment-risky model yet.
