Mozilla's Firefox team used an early preview of Anthropic's Claude Mythos to identify 271 vulnerabilities patched in Firefox 150, contributing to a massive April surge of 423 total bug fixes versus just 31 a year earlier, marking AI's shift from coding assistant to production security triage layer.
The operational change at Mozilla is the real story. The Firefox security team deployed Claude Mythos Preview as part of Project Glasswing, Anthropic's restricted-access program for critical partners. The model scanned the browser's source code, surfacing 271 vulnerabilities that landed in the Firefox 150 release on April 22, 2026. This followed a prior collaboration where Claude Opus 4.6 found 22 security bugs for Firefox 148. Mozilla CTO Bobby Holley described the Mythos results as producing vertigo before landing on the conclusion that defenders now have a decisive edge. The fixes spanned memory safety flaws, sandbox escapes, a 15-year-old HTML legend element parser error, and a 20-year-old XSLT bug. Human engineers triaged the AI findings, validated them, and coordinated patches across more than 100 contributors.
April's fix volume exploded to 423 total bugs, up from 31 the prior year. More than 40 CVEs were addressed in Firefox 150 alone, though only three are publicly credited to Claude in the advisory: CVE-2026-6746, CVE-2026-6757, and CVE-2026-6758. The scale reflects AI's throughput advantage. Traditional bug hunting relies on elite human researchers scanning specific code paths. Mythos evaluates the entire codebase systematically, tracing execution paths, identifying memory corruption risks, and flagging conditions that could lead to remote code execution. Mozilla improved their prompting techniques and integration workflow to harness the model effectively. The result is not just more findings, but findings across long-dormant code sections that no single team would audit manually.
For SF readers, this episode moves AI security tooling from conference demos to the maintenance cycle of a major open-source project. Firefox processes billions of web requests daily. Vulnerabilities in its JavaScript engine or sandbox create real attack surfaces for malware, phishing, and zero-days. Startups can now target code audit platforms that integrate frontier models for continuous vulnerability discovery. Enterprise vulnerability management follows. Companies running large codebases face the same scale problem as browsers. AI triage layers could automate initial scans, prioritization, and even patch suggestions. The economics favor incumbents with model access, but open-source alternatives or fine-tuned smaller models create entry points for new players.
AI fundamentally changes open-source security maintenance economics. Human-led audits cost thousands per day for top researchers. Mythos operates at API scale, potentially costing pennies per scan. Mozilla's blog notes the model excels at discovery but struggles with exploit generation. In tests, Claude required disabling Firefox's sandbox to produce working exploits, a condition irrelevant to real attacks. The value lies in surfacing latent issues before attackers find them. Open-source projects benefit most. Firefox relies on volunteer contributors. AI-generated bug lists create actionable work items that scale participation. The 423 fixes in April show how one model can bootstrap a contributor army. Cost drops, volume rises, patching cadence accelerates. Traditional bounties like HackerOne become supplements to automated discovery.
Governance risks emerge alongside the gains. False positives burden triage teams. Mythos produced vertigo-level volume, meaning engineers spent weeks validating findings. Not every flagged issue was exploitable, but the signal overwhelmed the noise initially. Disclosure becomes tricky. Anthropic restricts Mythos access to prevent offensive use. Mozilla credits the model but only publicly ties three CVEs, likely to avoid tipping attackers. Dependence on closed systems raises questions. Firefox is open-source, but Mythos is proprietary. What happens if Anthropic changes terms, raises prices, or prioritizes commercial clients. Open alternatives like Code Llama or fine-tuned OSS models lag in capability. Startups building on top must navigate API costs, data leakage risks, and model hallucination mitigations.
The browser precedent applies broadly. Security teams at startups treat AI as a junior researcher that never sleeps. It finds use-after-free bugs in JavaScript engines, sandbox bypasses, and parser errors that have lingered for decades. Human oversight remains essential for validation and patching. The Firefox case proves the workflow viable at production scale. Founders should build tools that chain model discovery to human review, automated testing, and continuous integration. The opportunity lies in making AI bug hunting reliable enough for daily use across codebases, not just one-off audits. Mozilla's results suggest that point is closer than most expect.
Also read: Arm's 5% slide reveals AI chip supply constraints even as smartphone weakness bites • AWS Bedrock AgentCore Payments makes stablecoin micropayments the default for autonomous agents • Block's 40 percent AI-driven layoffs make Dorsey's automation thesis a Wall Street reality