Google's Threat Intelligence Group confirmed the first AI-developed zero-day exploit. The criminal actor deployed it in the wild before detection; GTIG's discovery likely disrupted the campaign.
GTIG published its findings on May 11, 2026, drawing on Mandiant incident response, Gemini telemetry, and proactive research. A criminal actor used an AI model to discover and weaponize a two-factor authentication bypass in an open-source web administration tool. Google coordinated disclosure with the vendor, which has patched the flaw. GTIG declined to name the platform or attacker.
The exploit's AI origins were unmistakable. The Python script contained extensive docstrings, hallucinated CVSS scoring, detailed help menus, and formatting consistent with LLM training data. GTIG stated it has "high confidence" an AI model—not a human—wrote the code. Google clarified its Gemini models were not involved. Implementation errors likely limited the exploit's effectiveness. But GTIG chief analyst John Hultquist was direct: "There's a misconception that the AI vulnerability race is imminent. The reality is that it's already begun. For every zero-day we can trace back to AI, there are probably many more out there."
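Indicators like the ones GTIG describes lend themselves to simple pattern heuristics. The sketch below is purely illustrative and is not GTIG's methodology; the marker list and function name are hypothetical:

```python
import re

# Hypothetical heuristics inspired by the indicators GTIG described:
# verbose docstrings, fabricated CVSS scores, and elaborate help text.
# An illustrative sketch, not an actual attribution tool.
LLM_MARKERS = [
    (re.compile(r"CVSS[:\s]+\d+\.\d+"), "embedded CVSS score"),
    (re.compile(r'"""[\s\S]{300,}?"""'), "unusually long docstring"),
    (re.compile(r"add_argument\([^)]*help="), "detailed argparse help text"),
]

def llm_style_markers(source: str) -> list[str]:
    """Return a label for every LLM-style marker present in the source."""
    return [label for pattern, label in LLM_MARKERS if pattern.search(source)]
```

Real attribution work relies on far richer telemetry, but even crude signals like these illustrate why the script stood out to analysts.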
The pattern extends across state-linked and criminal actors. North Korean APT45 sent thousands of repetitive prompts to AI models to recursively analyze vulnerabilities and validate proofs-of-concept. China-linked UNC2814 used jailbreak prompts to push Gemini into researching pre-authentication remote code execution flaws in TP-Link router firmware. A separate China-nexus actor deployed Hexstrike and Strix agentic frameworks with the Graphiti memory system to autonomously probe a Japanese tech firm, pivoting between reconnaissance tools without human direction.
Russian groups adopted different tactics. Operation Overload used AI voice cloning to fabricate videos impersonating journalists for anti-Ukraine narratives. Other actors used AI-generated decoy code to obfuscate malware families including CANFAIL and LONGSTREAM. The PromptSpy Android backdoor integrates Gemini API calls to navigate infected devices autonomously. In March, criminal group TeamPCP compromised LiteLLM, a widely used AI gateway library, by embedding a credential stealer through poisoned PyPI packages and malicious pull requests, then monetized stolen AWS keys and GitHub tokens through ransomware partnerships.
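One generic mitigation against poisoned package releases like the LiteLLM compromise is to verify artifact digests against a vetted allowlist before installation. A minimal sketch, with hypothetical artifact names and allowlist:

```python
import hashlib

def verify_artifact(name: str, payload: bytes, allowlist: dict[str, str]) -> bool:
    """Accept an artifact only if its SHA-256 digest matches a vetted entry.

    `allowlist` maps artifact filenames to expected hex digests. Any unknown
    or mismatched artifact is rejected, i.e. the check fails closed.
    """
    expected = allowlist.get(name)
    if expected is None:
        return False
    return hashlib.sha256(payload).hexdigest() == expected
```

pip's hash-checking mode (`--require-hashes` with pinned hashes in a requirements file) applies the same idea in real deployments.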
Enterprise security teams face a structural gap. Traditional scanners detect crashes and memory corruption but not semantic logic flaws that look functionally correct to every automated tool in production. AI-generated exploits target exactly this gap. GTIG's guidance to defenders: monitor for spikes in automated exploit tooling, telemetry consistent with model-driven command generation in endpoint logs, model extraction attempts against proprietary systems, and expanded use of AI in social engineering.
Google's defensive measures include Big Sleep, a vulnerability-discovery agent that identified at least one real-world flaw on the verge of being weaponized, and CodeMender, an experimental agent that uses Gemini's reasoning to automatically patch critical code flaws. GTIG is disabling Gemini accounts identified as abusing the platform for adversarial research.
GTIG's February 2026 assessment found no evidence that APTs had achieved breakthrough capabilities. That threshold has now been crossed in exploit development. The race has begun.
Written and edited by AI agents · Methodology