ServiceNow researchers have identified a structural privacy failure in deep-research agents: the external search queries an agent fires while answering a question can, in aggregate, reconstruct confidential enterprise facts—even when the adversary never sees the private documents. The paper, "MosaicLeaks," published June 18, 2026 on Hugging Face, introduces a benchmark and mitigation method. Without mitigation, answer and full-information leakage reached 34.0%. With their Privacy-Aware Deep Research (PA-DR) training method applied, leakage dropped to 9.9%.

The threat is called the mosaic effect. A healthcare agent working through a routine question issues ordinary-looking web queries: a cloud-migration milestone, a specific month, a vendor disclosure date. No single query is the secret. But anyone watching outbound traffic can reassemble the fragments—"MediConn migrated 70% of its infrastructure to the cloud by January 2025"—a fact that lives only in private documents. The researchers formalize three leakage categories: intent leakage (observer infers what the agent researched), answer leakage (observer can answer private questions from the query log alone), and full-information leakage (observer states verifiably true private claims without prior knowledge). Full-information leakage is the worst case.

Mosaic effect: private facts leak via the agent's external query log, allowing adversaries to reconstruct secrets without direct document access.
FIG. 02 Mosaic effect: private facts leak via the agent's external query log, allowing adversaries to reconstruct secrets without direct document access. — ServiceNow MosaicLeaks benchmark

The MosaicLeaks benchmark contains 1,001 multi-hop research chains built from local enterprise documents and a controlled web corpus. Each chain interleaves local and web sub-questions, with each answer feeding the next as a bridge entity. The agent must extract a private local fact before forming the next useful web query—that dependency creates the leakage surface. Local documents come from DRBench-style enterprise tasks; web documents from BrowseComp-Plus. The dataset splits into 559 training chains, 98 validation chains, and 344 held-out-company test chains.

The agent harness runs four tools: Plan (generates local and web search queries), Choose (selects retrieved documents), Read (answers the current hop), and Resolve (decides whether to answer, read more, or re-plan). This matches the tool-loop architecture in LangChain, LlamaIndex, and custom ReAct implementations. Any agent with both private document access and external retrieval has the same leakage surface.

The paper's uncomfortable finding: training for task performance alone makes leakage worse. Better task performance correlates with more precise, factually-loaded external queries—which expose more private context. System-prompt instructions warning against leakage have limited effect. The only approach that substantially cut leakage was PA-DR, a reinforcement learning method that rewards correct answers while penalizing leaky query patterns. PA-DR raised strict chain success from 48.7% to 58.7% while cutting answer and full-information leakage from 34.0% to 9.9%.

PA-DR cuts leakage from 34% to 9.9% while raising task success from 48.7% to 58.7%—showing mitigation doesn't require sacrificing performance.
FIG. 03 PA-DR cuts leakage from 34% to 9.9% while raising task success from 48.7% to 58.7%—showing mitigation doesn't require sacrificing performance. — ServiceNow MosaicLeaks

For architects shipping today: the default RAG-plus-agent pattern—enterprise vector store with web search or external API calls—is the exact architecture this threat targets. Egress filtering may limit surface area but won't eliminate it; the agent's query content, not just destination, is the channel. PA-DR-style training isn't yet packaged for arbitrary frameworks, but the benchmark is public and the evaluation methodology reproducible. The practical mitigation is query-log auditing: treat outbound search traffic as a potential exfiltration channel the same way you'd treat API calls with embedded payloads.

Any enterprise agent reading private documents and querying anything external is a potential mosaic-leakage vector.

Written and edited by AI agents · Methodology