Anthropic spent two months deciding whether it was responsible to release an AI that can find vulnerabilities in every major operating system — yesterday it released it anyway, under a different name

  • Tension: Anthropic built an AI capable of finding zero-days in every major operating system — then spent two months concluding it was too dangerous to release publicly, before releasing it publicly.
  • Noise: The framing of “responsible AI release” tends to collapse into either blanket alarm or blanket reassurance. Both miss the actual calculus: withholding a defensive capability from defenders doesn’t prevent the capability from reaching attackers — it just delays the defence.
  • The Direct Message: The name change from Mythos to Fable 5 isn’t a rebrand — it’s a signal. Anthropic is betting that proliferation of this capability is already inevitable within 6 to 18 months, and that getting it into the hands of defenders first, with guardrails, is the more responsible position than waiting for a window that will not stay open.

To learn more about our editorial approach, explore The Direct Message methodology.

On 7 April, Anthropic introduced a new AI model internally described as a watershed moment for cybersecurity. It could autonomously identify zero-day vulnerabilities in every major operating system and browser, build working exploits, and — in one documented test — write a complete exploit for a disclosed Windows kernel flaw in 31 minutes. Anthropic’s own assessment was that no existing safeguards were sufficient to release it publicly. Yesterday, they released it publicly.

The model is called Fable 5. The underlying capability class is still called Mythos. The distinction matters: Fable 5 carries a set of automated guardrails designed to intercept the most dangerous queries before the full model can act on them. Whether those guardrails hold — and what happens when they don’t — is now a live question for the security industry.

What the model can actually do

When Anthropic first revealed Claude Mythos Preview in April, they did so alongside a detailed technical assessment conducted by their security research team over the preceding month. The findings were specific enough to take seriously. Mythos Preview identified a 27-year-old denial-of-service vulnerability in OpenBSD’s TCP implementation, a 16-year-old flaw in FFmpeg’s H.264 codec, and a 17-year-old remote code execution vulnerability in FreeBSD’s NFS server — CVE-2026-4747 — which it then autonomously exploited.

These were not theoretical demonstrations. They were documented findings, disclosed through coordinated vulnerability programs. The model didn’t just identify the flaws; it understood the attack surface, constructed the exploit logic, and produced working proof-of-concept code.

Rather than release the model publicly, Anthropic formed Project Glasswing — a restricted-access coalition initially comprising approximately twelve named organisations including AWS, Apple, Microsoft, Google, CrowdStrike, and Palo Alto Networks, which later expanded to more than 50. The explicit rationale was defensive: put the model to work finding vulnerabilities before adversaries with comparable capabilities could exploit them first. Partners collectively identified more than 10,000 high- or critical-severity security flaws during the preview period.

The timeline: 7 April — Mythos Preview announced; public release deemed irresponsible. 9 June — Fable 5, a Mythos-class model with guardrails, released for general use. Elapsed time: 63 days. Anthropic had previously estimated that comparable AI cyber capabilities would proliferate across competing labs within 6 to 18 months.

How Fable 5 differs — and what’s been left in

Fable 5 and Claude Mythos 5 (the upgraded version for existing Project Glasswing partners) share the same underlying model. The difference is in what happens when a request approaches certain risk thresholds. For Fable 5, queries touching on cyberattack methods, sensitive biological or chemical capabilities, or AI model distillation are automatically rerouted to Claude Opus 4.8 — a less capable model that lacks the reasoning depth to complete those requests effectively.

Anthropic’s head of product management, Dianne Penn, described the approach to Axios as “deliberately more conservative,” acknowledging that some legitimate security research may also be caught by the filters at launch, with the expectation that those restrictions ease as post-release testing continues.

The commercial logic is also visible. Fable 5 is priced at double Anthropic’s existing Opus models, making it the company’s most expensive public release to date. Anthropic frames this as relative value — higher intelligence per task, lower overall cost per outcome — but it also positions the model’s most capable tier firmly in the enterprise and government market, not the consumer one.

What this means for European organisations

For security teams across Europe, the implications run in two directions simultaneously. On one side: a defensive AI of significant capability is now commercially available to organisations with the budget and the security maturity to use it. The NIS2 directive has already pushed critical infrastructure operators toward more systematic vulnerability management; a tool that can scan codebases and surface legacy flaws at this scale is a genuine operational asset.

On the other: the same capability, even with guardrails, substantially lowers the barrier to AI-assisted offensive security work. Anthropic itself has stated that Mythos-class capabilities are expected to emerge from other AI labs within 6 to 18 months. The question is not whether this capability will proliferate — it will — but whether defensive deployment can outpace offensive adoption.

Anthropic’s bet, implicit in the Fable 5 release, is that getting the capability into the hands of defenders first is safer than waiting until it exists everywhere and is available to no one responsibly. The logic is coherent. The proof will be in what happens next.

The naming question

The decision to release under the name Fable 5 rather than Claude Mythos is not incidental. It separates the product from the capability class in public perception — Fable 5 is a product with guardrails, Mythos is a capability benchmark. The framing also allows Anthropic to reserve the Mythos name for the unguarded version available only to vetted partners, creating a visible two-tier structure that signals the differentiation without requiring a lengthy explanation each time.

Whether that distinction holds in practice — whether the guardrails are robust enough to make Fable 5 meaningfully different from a full Mythos deployment — is a question the security research community will now spend considerable time testing. The answers will matter well beyond Anthropic’s product roadmap.

Picture of Direct Message News

Direct Message News

Direct Message News is the byline under which DMNews publishes its editorial output. Our team produces content across psychology, politics, culture, digital, analysis, and news, applying the Direct Message methodology of moving beyond surface takes to deliver real clarity. Articles reflect our team's collective editorial process, sourcing, drafting, fact-checking, editing, and review, rather than a single writer's work. DMNews takes editorial responsibility for content under this byline. For more on how we work, see our editorial standards.

MOST RECENT ARTICLES

People raised in the 60s and 70s grew up with childhoods that had fewer passwords, fewer cameras, fewer schedules, and more sky

Thought of the day from Daniel Kahneman: “People who are cognitively busy are more likely to make selfish choices, use sexist language, and make superficial judgments in social situations”

My friend told me retiring didn’t feel like freedom at first — it felt like being handed back every hour she’d ever wished for and not knowing who she was inside them

7 things that quietly get easier after 65 that nobody tells you about, because we only ever talk about what gets harder

The flywheel effect — a well-known concept in platform economics — helps explain how YouTube became dominant and why Meta may be falling behind

The resentment some parents feel about their adult kids’ phones during visits isn’t about technology — it’s the old human ache of wanting to feel their presence still matters