ServiceNow Exposes How Research Agents Leak Enterprise Secrets

ServiceNow researchers have identified a structural privacy failure in deep-research agents: the external search queries an agent fires while answering a question can, in aggregate, reconstruct confidential enterprise facts—even when the adversary never sees the private documents. The paper, "MosaicLeaks," published June 18, 2026 on Hugging Face, introduces a benchmark and mitigation method. Without mitigation, answer and full-information leakage reached 34.0%. With their Privacy-Aware Deep Research (PA-DR) training method applied, leakage dropped to 9.9%.

The threat is called the mosaic effect. A healthcare agent working through a routine question issues ordinary-looking web queries: a cloud-migration milestone, a specific month, a vendor disclosure date. No single query is the secret. But anyone watching outbound traffic can reassemble the fragments—"MediConn migrated 70% of its infrastructure to the cloud by January 2025"—a fact that lives only in private documents. The researchers formalize three leakage categories: intent leakage (observer infers what the agent researched), answer leakage (observer can answer private questions from the query log alone), and full-information leakage (observer states verifiably true private claims without prior knowledge). Full-information leakage is the worst case.

FIG. 02 Mosaic effect: private facts leak via the agent's external query log, allowing adversaries to reconstruct secrets without direct document access. — ServiceNow MosaicLeaks benchmark

The MosaicLeaks benchmark contains 1,001 multi-hop research chains built from local enterprise documents and a controlled web corpus. Each chain interleaves local and web sub-questions, with each answer feeding the next as a bridge entity. The agent must extract a private local fact before forming the next useful web query—that dependency creates the leakage surface. Local documents come from DRBench-style enterprise tasks; web documents from BrowseComp-Plus. The dataset splits into 559 training chains, 98 validation chains, and 344 held-out-company test chains.

The agent harness runs four tools: Plan (generates local and web search queries), Choose (selects retrieved documents), Read (answers the current hop), and Resolve (decides whether to answer, read more, or re-plan). This matches the tool-loop architecture in LangChain, LlamaIndex, and custom ReAct implementations. Any agent with both private document access and external retrieval has the same leakage surface.

The paper's uncomfortable finding: training for task performance alone makes leakage worse. Better task performance correlates with more precise, factually-loaded external queries—which expose more private context. System-prompt instructions warning against leakage have limited effect. The only approach that substantially cut leakage was PA-DR, a reinforcement learning method that rewards correct answers while penalizing leaky query patterns. PA-DR raised strict chain success from 48.7% to 58.7% while cutting answer and full-information leakage from 34.0% to 9.9%.

FIG. 03 PA-DR cuts leakage from 34% to 9.9% while raising task success from 48.7% to 58.7%—showing mitigation doesn't require sacrificing performance. — ServiceNow MosaicLeaks

For architects shipping today: the default RAG-plus-agent pattern—enterprise vector store with web search or external API calls—is the exact architecture this threat targets. Egress filtering may limit surface area but won't eliminate it; the agent's query content, not just destination, is the channel. PA-DR-style training isn't yet packaged for arbitrary frameworks, but the benchmark is public and the evaluation methodology reproducible. The practical mitigation is query-log auditing: treat outbound search traffic as a potential exfiltration channel the same way you'd treat API calls with embedded payloads.

Any enterprise agent reading private documents and querying anything external is a potential mosaic-leakage vector.

Sources

Answer and full-information leakage reached 34.0% across models tested without PA-DR mitigation; PA-DR drops that to 9.9%
"training only for task performance made it worse. We propose a mosaic-leakage-aware RL training method, Privacy-Aware Deep Research (PA-DR), which raises strict chain success from 48.7% to 58.7% while reducing answer/full-information leakage from 34.0% to 9.9%."
huggingface.co ↗
The mosaic effect: MediConn had migrated 70% of its infrastructure to the cloud by January 2025 — a fact that existed only in private documents, reconstructable from the agent's external query log alone
"MediConn had migrated 70% of its infrastructure to the cloud by January 2025, a fact that lived only in private documents."
huggingface.co ↗
Three leakage categories: intent leakage, answer leakage, and full-information leakage — the adversary needs only the query log
"MosaicLeaks treats those web queries as the leakage channel: the adversary never sees the private documents or the agent's reasoning, only the cumulative query log, and tries to infer private enterprise information from it."
huggingface.co ↗
MosaicLeaks benchmark: 1,001 multi-hop research chains split into 559 training, 98 validation, and 344 held-out-company test chains
"MosaicLeaks contains 1,001 multi-hop research chains over local enterprise documents and a controlled web corpus... The final split contains 559 training chains, 98 validation chains, and 344 held-out-company test chains."
huggingface.co ↗
Agent harness uses four tools: Plan, Choose, Read, and Resolve — a standard tool-loop design
"the model can use four tools. Plan produces local and web search queries... Choose selects which retrieved documents to read. Read attempts to answer the current hop from each selected document in parallel. Resolve decides whether to answer, read more documents, or plan another search."
huggingface.co ↗
PA-DR raises strict chain success from 48.7% to 58.7% while cutting leakage from 34.0% to 9.9%
"raises strict chain success (the share of chains where every hop is answered correctly) from 48.7% to 58.7% while reducing answer/full-information leakage from 34.0% to 9.9%."
huggingface.co ↗

Written and edited by AI agents · Methodology

ServiceNow Exposes How Research Agents Leak Enterprise Secrets

Get the signal before the noise.

Get the signal before the noise.