Anthropic spent two months deciding whether it was responsible to release an AI that can find vulnerabilities in every major operating system — yesterday it released it anyway, under a different name

Tension: Anthropic built an AI capable of finding zero-days in every major operating system — then spent two months concluding it was too dangerous to release publicly, before releasing it publicly.
Noise: The framing of “responsible AI release” tends to collapse into either blanket alarm or blanket reassurance. Both miss the actual calculus: withholding a defensive capability from defenders doesn’t prevent the capability from reaching attackers — it just delays the defence.
The Direct Message: The name change from Mythos to Fable 5 isn’t a rebrand — it’s a signal. Anthropic is betting that proliferation of this capability is already inevitable within 6 to 18 months, and that getting it into the hands of defenders first, with guardrails, is the more responsible position than waiting for a window that will not stay open.

To learn more about our editorial approach, explore The Direct Message methodology.

On 7 April, Anthropic introduced a new AI model internally described as a watershed moment for cybersecurity. It could autonomously identify zero-day vulnerabilities in every major operating system and browser, build working exploits, and — in one documented test — write a complete exploit for a disclosed Windows kernel flaw in 31 minutes. Anthropic’s own assessment was that no existing safeguards were sufficient to release it publicly. Yesterday, they released it publicly.

The model is called Fable 5. The underlying capability class is still called Mythos. The distinction matters: Fable 5 carries a set of automated guardrails designed to intercept the most dangerous queries before the full model can act on them. Whether those guardrails hold — and what happens when they don’t — is now a live question for the security industry.

What the model can actually do

When Anthropic first revealed Claude Mythos Preview in April, they did so alongside a detailed technical assessment conducted by their security research team over the preceding month. The findings were specific enough to take seriously. Mythos Preview identified a 27-year-old denial-of-service vulnerability in OpenBSD’s TCP implementation, a 16-year-old flaw in FFmpeg’s H.264 codec, and a 17-year-old remote code execution vulnerability in FreeBSD’s NFS server — CVE-2026-4747 — which it then autonomously exploited.

These were not theoretical demonstrations. They were documented findings, disclosed through coordinated vulnerability programs. The model didn’t just identify the flaws; it understood the attack surface, constructed the exploit logic, and produced working proof-of-concept code.

Rather than release the model publicly, Anthropic formed Project Glasswing — a restricted-access coalition initially comprising approximately twelve named organisations including AWS, Apple, Microsoft, Google, CrowdStrike, and Palo Alto Networks, which later expanded to more than 50. The explicit rationale was defensive: put the model to work finding vulnerabilities before adversaries with comparable capabilities could exploit them first. Partners collectively identified more than 10,000 high- or critical-severity security flaws during the preview period.

The timeline: 7 April — Mythos Preview announced; public release deemed irresponsible. 9 June — Fable 5, a Mythos-class model with guardrails, released for general use. Elapsed time: 63 days. Anthropic had previously estimated that comparable AI cyber capabilities would proliferate across competing labs within 6 to 18 months.

How Fable 5 differs — and what’s been left in

Fable 5 and Claude Mythos 5 (the upgraded version for existing Project Glasswing partners) share the same underlying model. The difference is in what happens when a request approaches certain risk thresholds. For Fable 5, queries touching on cyberattack methods, sensitive biological or chemical capabilities, or AI model distillation are automatically rerouted to Claude Opus 4.8 — a less capable model that lacks the reasoning depth to complete those requests effectively.

Anthropic’s head of product management, Dianne Penn, described the approach to Axios as “deliberately more conservative,” acknowledging that some legitimate security research may also be caught by the filters at launch, with the expectation that those restrictions ease as post-release testing continues.

The commercial logic is also visible. Fable 5 is priced at double Anthropic’s existing Opus models, making it the company’s most expensive public release to date. Anthropic frames this as relative value — higher intelligence per task, lower overall cost per outcome — but it also positions the model’s most capable tier firmly in the enterprise and government market, not the consumer one.

What this means for European organisations

For security teams across Europe, the implications run in two directions simultaneously. On one side: a defensive AI of significant capability is now commercially available to organisations with the budget and the security maturity to use it. The NIS2 directive has already pushed critical infrastructure operators toward more systematic vulnerability management; a tool that can scan codebases and surface legacy flaws at this scale is a genuine operational asset.

On the other: the same capability, even with guardrails, substantially lowers the barrier to AI-assisted offensive security work. Anthropic itself has stated that Mythos-class capabilities are expected to emerge from other AI labs within 6 to 18 months. The question is not whether this capability will proliferate — it will — but whether defensive deployment can outpace offensive adoption.

Anthropic’s bet, implicit in the Fable 5 release, is that getting the capability into the hands of defenders first is safer than waiting until it exists everywhere and is available to no one responsibly. The logic is coherent. The proof will be in what happens next.

The naming question

The decision to release under the name Fable 5 rather than Claude Mythos is not incidental. It separates the product from the capability class in public perception — Fable 5 is a product with guardrails, Mythos is a capability benchmark. The framing also allows Anthropic to reserve the Mythos name for the unguarded version available only to vetted partners, creating a visible two-tier structure that signals the differentiation without requiring a lengthy explanation each time.

Whether that distinction holds in practice — whether the guardrails are robust enough to make Fable 5 meaningfully different from a full Mythos deployment — is a question the security research community will now spend considerable time testing. The answers will matter well beyond Anthropic’s product roadmap.

Anthropic spent two months deciding whether it was responsible to release an AI that can find vulnerabilities in every major operating system — yesterday it released it anyway, under a different name

What the model can actually do

How Fable 5 differs — and what’s been left in

What this means for European organisations

The naming question

Direct Message News

MOST RECENT ARTICLES

Virginia is still the largest data center market on the planet, with roughly two hundred facilities packed into a single county, and its legislature is now fighting over whether to keep a sales tax exemption worth an estimated one point nine billion dollars a year

Meta is replacing up to ninety percent of its content review staff with AI, and the marketers most exposed are the ones who have never had to argue with a machine about why their account got flagged

Monterey Park, California just became the first city in the country to permanently ban data centers by popular vote, with eighty six percent of residents in favor

Arizona lawmakers passed a three year moratorium on data center tax breaks to slow the industry down, and in the two weeks before it took effect developers filed nearly as many applications as they had in the previous thirteen years combined

Texas passed a law banning targeted ads to minors, and a federal judge has now struck it down in a second ruling that went further than his first, ruling it violated advertisers’ free speech rights

Connecticut, Arkansas, and Utah made their comprehensive privacy laws enforceable on July 1, adding a new wave of state-specific consent and opt-out rules for marketers to track this year