Anthropic Unlocks Its Restricted Cyber-Weapon AI
We need to talk about what just happened, because the tech landscape shifted underneath our feet this week and barely anyone outside of cybersecurity circles has realized it. For the last month, a ghost has been living in the machine at some of the world’s biggest infrastructure and software companies. It wasn’t a hacker, it was an artificial intelligence model so profoundly capable of offensive cyber warfare that its creators locked it in a digital vault the second they finished training it.
Now, Anthropic is taking its first steps toward a broader release. The company confirmed it is preparing to expand access to Mythos-class models to more customers in the coming weeks, though a full general public rollout has not yet been officially announced. If you’ve been keeping up with the Claude ecosystem, you know the naming structure. Haiku is the lightweight speedster, Sonnet is the reliable mid-tier worker, and Opus is the heavy-lifting flagship. Mythos doesn’t fit the pattern. It is an entirely separate tier of intelligence, sitting firmly above Claude Opus. When Anthropic dropped the first hint of the Claude Mythos Preview System Card, they made it clear that this model had reached a threshold where it could comfortably outclass almost any human engineer at finding and exploiting software vulnerabilities.
What followed was a high-stakes corporate and national security thriller. Anthropic withheld the model from the public, quietly initiating a restricted, elite defense program called Project Glasswing. For weeks, a tiny circle of tech titans used Mythos in the dark. The numbers they just came back with are staggering, and they explain exactly why the incoming public launch has the security industry sweating through its suit jackets.
The Secret Trial That Found 10,000 Bugs

When Anthropic realized what they had on their hands back in April, they didn’t just hide the model away, they weaponized it for defense. Under Project Glasswing, Anthropic provided significant compute resources to an exclusive roster of launch partners, including Amazon Web Services, Google, Mozilla, and the Linux Foundation, along with dozens of other critical infrastructure organizations. The objective outlined in the official Project Glasswing announcement was straightforward. Find the flaws in the world’s digital foundation before bad actors could build a model capable of doing the same.
The results of that one-month trial are nothing short of a paradigm shift. Collectively, the partners used Mythos Preview to uncover over 10,000 high- or critical-severity vulnerabilities across systemically vital software.
To give you an idea of the sheer scale of this automated pentesting, consider a few individual report cards. Some partners reported discovering hundreds to thousands of bugs across their internal systems, with a significant portion flagged as severe threats. Most impressively, the engineering team noted that the AI’s false-positive rate was actually better than that of veteran human testers. Meanwhile, Mozilla deployed the model against the Firefox browser codebase. Mythos reportedly identified and helped developers fix 271 vulnerabilities in Firefox 150. A diagnostic output ten times more effective than what they achieved using previous models.
Furthermore, Anthropic used Mythos to audit over 1,000 open-source infrastructure projects underpinning the modern internet, flagging 6,202 severe vulnerabilities. Independent security firms double-checked a massive sample of these findings, validating an eye-watering 90.6% of them as true positives. This wasn’t just a tool finding typos in CSS. Mythos autonomously discovered a 27-year-old remote crash vulnerability in OpenBSD, an operating system widely legendary for being an un-hackable fortress. It found a 16-year-old flaw hidden deep inside FFmpeg, a video processing library used by almost every app on your phone, in a line of code that traditional automated fuzzing tools had tested five million times without ever triggering the bug.
The Zero-Day Factory and the Triage Trap

The technical superpower that sets Mythos apart from a model like the newly minted Claude Opus 4.8 on AWS Bedrock is its advanced, multi-step autonomous reasoning. It doesn’t just read code, it actively executes a mental loop of hypothesis, testing, and exploitation. It can discover a zero-day exploit (a completely unknown software vulnerability), write a script to leverage it, and then instantly flip sides to generate a corresponding patch.
But this brings us to a terrifyingly practical bottleneck. AI can now find software flaws exponentially faster than human developers can fix them.
In cybersecurity, the process of sorting, verifying, and prioritizing threats is called triaging. Right now, according to reports on the Project Glasswing trial telemetry, it takes an average of two weeks for a human development team to verify a serious Mythos bug and successfully deploy a patch. Open-source maintainers (many of whom work as volunteers) are already crying foul. They are drowning in floodwaters of highly technical, AI-generated bug reports.
If it takes a human team fourteen days to clean up a vulnerability that an AI can find in fourteen seconds, the math stops working in our favor. We have built an automated zero-day factory, but our assembly line for patches is still entirely hand-crafted by tired humans fueled by espresso.
Rolling Out the Ghost in the Code

This glaring imbalance is exactly why Anthropic hesitated to let Mythos see the light of day. A tool that can flawlessly match an exploit to a patch is inherently dual-use. In the hands of a network defender, it’s the ultimate shield. In the hands of a state-sponsored hacking collective or a ransomware cartel, it is an automated master key to global infrastructure.
So, why release it now? The reality is that frontier AI labs are trapped in a fierce competitive sprint. With companies rolling out massive subscription upgrades and rival models breathing down their necks, keeping a crown-jewel model locked away indefinitely is a commercial impossibility. Leaked code fragments recently exposed by Testing Catalog reportedly revealed references to claude-mythos-1-preview integrated into Anthropic’s product interfaces, suggesting a launch is being actively prepared.
Anthropic’s public stance is one of calculated optimism. They argue that while attackers might get a short-term boost, defenders will ultimately win the long game by using Mythos-class intelligence to secure software pipelines before code ever ships to production. To smooth the transition, they are launching it around the same time as Claude Opus 4.8, which introduces massive quality-of-life updates to agentic workflows, long-context code navigation, and adaptive thinking gates.
But don’t expect Mythos to replace your daily chatbot anytime soon. Beyond the intense safety alignment checks Anthropic is scrambling to finalize, there is the brutal reality of the data-center balance sheet. Mythos requires an immense amount of reasoning effort. What I mean is that it spends vastly more compute time thinking internally before it spits out an answer. The economic cost to run inference on a model that routinely solves end-to-end cyber simulations is astronomically high. When it rolls out via the Claude API and specialized security platforms, it will likely be priced as a premium enterprise utility, far out of reach for casual tinkering.
The New Baseline for Trust

We are entering an era where software can no longer be evaluated by human eyes alone. If a piece of infrastructure hasn’t been hardened by a frontier-class AI model, it should be assumed vulnerable by default.
Anthropic’s transition from locking Mythos in a bunker to preparing its public debut proves that the defensive containment of high-consequence AI tools is a temporary strategy at best. The capability exists, therefore, the world has to adapt to it. As Mythos-class access expands to a broader market over the coming weeks, the tech industry will face its ultimate test. Can we automate our defenses fast enough to survive our own innovations?
The window to patch the world is officially open. Let’s hope the human engineers can keep up with the typing speed of the AI.
Thanks for reading everyone! Visit my site to learn more about me and explore what I’m building at Learn With Hatty. I hope everyone has a great day and as I always say, stay curious and keep learning.
Original article on PublishOX
