Anthropic Ships Self-Hosted Sandboxes and MCP Tunnels

Anthropic shipped fixes for two enterprise adoption blockers at its Code with Claude London event on May 19, 2026: self-hosted sandboxes, now in public beta, and MCP tunnels, now in research preview. Both target the same chokepoint: security and compliance teams that refuse to approve agents whose execution environment or tool surface sits outside their perimeter.

The architecture splits execution from orchestration. Anthropic retains the agent loop: orchestration, context management, and error recovery. Execution moves. Under self-hosted sandboxes, every tool call fires inside customer-controlled compute, not Anthropic's. Four managed providers are supported: Cloudflare with microVMs and zero-trust secrets injection; Daytona offering stateful environments over SSH with pause-and-restore; Modal delivering sub-second startup and scaling to hundreds of thousands of concurrent sandboxes; and Vercel providing millisecond-startup VM isolation with VPC peering. Organizations can also bring their own sandbox client.

FIG. 02 Architectural split: orchestration stays at Anthropic, tool execution moves to customer sandbox. — Anthropic, May 2026
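
To make the split concrete, here is a minimal Python sketch of the two layers. It is not Anthropic's API: `SandboxClient`, `LocalSandbox`, `Orchestrator`, and `call_tool` are hypothetical names chosen for illustration, and the retry logic is an assumed stand-in for "error recovery."

```python
from dataclasses import dataclass, field
from typing import Callable, Protocol


class SandboxClient(Protocol):
    """Hypothetical interface for the execution layer."""

    def run_tool(self, name: str, args: dict) -> str: ...


@dataclass
class LocalSandbox:
    """Execution layer: stands in for customer-controlled compute."""

    tools: dict[str, Callable[..., str]]

    def run_tool(self, name: str, args: dict) -> str:
        return self.tools[name](**args)


@dataclass
class Orchestrator:
    """Orchestration layer: keeps context and retries, never runs a tool itself."""

    sandbox: SandboxClient
    max_retries: int = 1  # assumed value, for illustration only
    context: list[dict] = field(default_factory=list)

    def call_tool(self, name: str, args: dict) -> str:
        for attempt in range(self.max_retries + 1):
            try:
                output = self.sandbox.run_tool(name, args)
            except Exception as exc:
                # Error recovery: record the failure and try again.
                self.context.append({"tool": name, "error": str(exc), "attempt": attempt})
                continue
            # Context management: the tool result is recorded on this side.
            self.context.append({"tool": name, "output": output, "attempt": attempt})
            return output
        raise RuntimeError(f"{name} failed after {self.max_retries + 1} attempts")
```

In this sketch the tool body runs only inside `LocalSandbox`, but its result still lands in `Orchestrator.context`. That is the property to keep in mind when reviewing each layer separately.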

MCP tunnels address a different surface: the tools the agent calls, not the environment it runs in. A lightweight gateway deployed inside the private network opens a single outbound encrypted connection to Anthropic's routing proxy. No inbound firewall rules. No public endpoints. Internal databases, private APIs, knowledge bases, and ticketing systems become callable tools. The feature is available in both Managed Agents and the Messages API, and organization admins configure it through workspace settings in the Claude Console. Access during the research preview requires an approval request.
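
The outbound-only pattern can be sketched in a few lines of Python. This is a toy model of the general reverse-connection technique, not Anthropic's gateway or wire protocol: `run_proxy`, `run_gateway`, `lookup_ticket`, and the JSON-lines framing are all hypothetical, and the sketch omits the encryption and OAuth the real feature is described as using.

```python
import json
import socket
import threading


def run_proxy(listener: socket.socket, request: dict, result: dict) -> None:
    """Stand-in for the routing proxy: waits for the gateway to dial in,
    then pushes a tool request back down that same connection."""
    conn, _ = listener.accept()
    with conn, conn.makefile("rw", encoding="utf-8") as stream:
        stream.write(json.dumps(request) + "\n")
        stream.flush()
        result["response"] = json.loads(stream.readline())


def run_gateway(proxy_addr: tuple, tools: dict) -> None:
    """Stand-in for the gateway inside the private network.
    It only dials out; it never calls listen() or accept()."""
    with socket.create_connection(proxy_addr) as conn, \
            conn.makefile("rw", encoding="utf-8") as stream:
        request = json.loads(stream.readline())
        handler = tools[request["tool"]]
        stream.write(json.dumps({"result": handler(**request["args"])}) + "\n")
        stream.flush()


def demo() -> dict:
    # The "proxy" listens on localhost; in this toy it plays the public side.
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))
    listener.listen(1)
    result: dict = {}
    request = {"tool": "lookup_ticket", "args": {"ticket_id": "T-1"}}
    proxy = threading.Thread(target=run_proxy, args=(listener, request, result))
    proxy.start()
    # A private tool that is reachable only through the gateway's outbound dial.
    tools = {"lookup_ticket": lambda ticket_id: {"id": ticket_id, "status": "open"}}
    run_gateway(listener.getsockname(), tools)
    proxy.join()
    listener.close()
    return result["response"]
```

Because the private side initiates the only connection, the firewall needs an egress rule and nothing else. The request still travels "inward," but over a socket the gateway opened.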

Anthropic named three customer integrations. Clay's GTM engineering agent, Sculptor, runs on Managed Agents and Daytona, autonomously building, testing, and monitoring workflows. Rogo, an AI platform for institutional finance, is building an analyst agent on Managed Agents and Vercel Sandbox to handle proprietary data. Amplitude's Design Agent for internal design critiques went live on Managed Agents and Cloudflare; the team had a first useful version running in two days. Another CTO quoted by Anthropic put the time to a working version at under a week using Modal.

Anthropic disclosed no latency, cost-per-call, or token-throughput numbers for either feature, and published no benchmark data on agent reliability across sandbox providers or on cold-start times. The file-spill behavior, in which tool outputs exceeding 100K tokens are automatically written to a file in the sandbox and the model receives a truncated preview with the file path, is documented in release notes without performance characterization.
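
The spill behavior described in the release notes can be approximated as follows. This is a sketch under stated assumptions, not the actual implementation: `spill_if_large` is a hypothetical function, tokens are approximated by whitespace splitting, and the 200-token preview length is invented for illustration. Only the 100K threshold comes from the release notes.

```python
import tempfile
from pathlib import Path

SPILL_THRESHOLD_TOKENS = 100_000  # threshold stated in the release notes
PREVIEW_TOKENS = 200              # assumed preview length, not documented


def spill_if_large(
    output: str,
    spill_dir: Path,
    threshold: int = SPILL_THRESHOLD_TOKENS,
    preview_tokens: int = PREVIEW_TOKENS,
) -> dict:
    """Return the output inline, or write it to a file and return a preview plus path."""
    tokens = output.split()  # crude stand-in for a real tokenizer
    if len(tokens) <= threshold:
        return {"content": output, "spilled": False, "path": None}

    spill_dir.mkdir(parents=True, exist_ok=True)
    with tempfile.NamedTemporaryFile(
        "w", dir=spill_dir, suffix=".txt", delete=False, encoding="utf-8"
    ) as fh:
        fh.write(output)  # the full output stays inside the sandbox filesystem
        path = fh.name

    preview = " ".join(tokens[:preview_tokens])
    return {"content": preview, "spilled": True, "path": path}
```

The model-facing payload stays small either way; whether the extra file read adds meaningful latency is exactly the kind of number that has not been published.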

One constraint: a fully on-premise deployment is not available. Orchestration metadata, including session state and context, still flows through Anthropic's systems even when every tool call executes locally. For teams in regulated verticals, any third-party data flow triggers a review cycle and requires explicit documentation in security assessments. MCP tunnels add complexity of their own: each MCP server still requires OAuth, and the research preview ships with explicit "as-is" language and relies on a third-party transport layer. Treat it as a preview programme, not a GA feature with SLA expectations. Tunnel configuration and environment keys are separate from the organization API key, so they add a new credential lifecycle to rotate and manage.

The takeaway: separate execution from orchestration explicitly in your agent architecture documents, and map data residency at both layers independently. Getting security sign-off on "compute stays in our VPC" is a different exercise from getting sign-off on "orchestration metadata leaves our VPC." Conflating them is what slows enterprise approval cycles. The announcement supplies four provider integrations and three customer examples to bring to a compliance team.
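
One way to keep the two layers from being conflated is to record each data flow against its layer and review them separately. The Python sketch below is illustrative only: `Layer`, `DataFlow`, and `flows_needing_review` are hypothetical names, and the example entries restate the flows described in this article rather than any official inventory.

```python
from dataclasses import dataclass
from enum import Enum


class Layer(Enum):
    EXECUTION = "execution"
    ORCHESTRATION = "orchestration"


@dataclass(frozen=True)
class DataFlow:
    name: str
    layer: Layer
    leaves_perimeter: bool


def flows_needing_review(flows: list[DataFlow]) -> dict[Layer, list[str]]:
    """Group the flows that cross the perimeter by layer, so each layer is cleared on its own."""
    return {
        layer: [f.name for f in flows if f.layer is layer and f.leaves_perimeter]
        for layer in Layer
    }


# Example entries taken from the article's description of the architecture.
EXAMPLE_FLOWS = [
    DataFlow("tool call execution", Layer.EXECUTION, leaves_perimeter=False),
    DataFlow("session state and context", Layer.ORCHESTRATION, leaves_perimeter=True),
]
```

With the example entries, the execution layer comes back with nothing to review and the orchestration layer comes back with one item, which are two different conversations to have with a security team.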

Sources

Self-hosted sandboxes are in public beta and MCP tunnels are in research preview, announced at Code with Claude London on May 19, 2026
"Starting today, Claude Managed Agents can operate in a sandbox you control and connect to your private Model Context Protocol (MCP) servers."
claude.com ↗
The agent loop (orchestration, context management, error recovery) stays on Anthropic's infrastructure while tool execution moves to the customer environment
"The agent loop that handles orchestration, context management, and error recovery stays on Anthropic's infrastructure, while tool execution moves to your own configured environment."
claude.com ↗
Cloudflare uses microVMs with zero-trust secrets injection and customizable egress proxies
"Cloudflare runs sandboxes at scale using microVMs and lighter weight isolates. Outbound network requests are in your control with zero-trust secrets injection, customizable proxies to audit, reroute, or modify egress."
claude.com ↗
Daytona provides long-running stateful environments accessible over SSH or preview URLs with pause-and-restore
"Daytona sandboxes are full composable computers, long-running and stateful. The sandbox stays accessible while a session runs over SSH or an authenticated preview URL, or can be paused and restored with full state preserved."
claude.com ↗
Modal delivers sub-second startup and scales to hundreds of thousands of concurrent sandboxes with on-demand CPU and GPU
"Modal's custom container runtime delivers sub-second startup on any image, scales to hundreds of thousands of concurrent sandboxes, and gives you CPU and GPU resources on demand."
claude.com ↗
Vercel provides millisecond-startup sandbox isolation with VPC peering and credential injection at the network boundary
"Vercel sandboxes combine VM security, VPC peering, and bring your own cloud with millisecond startup time. Managed Agents handles the model, tools, and session state, while the Vercel Sandbox firewall injects credentials at the network boundary so they never enter the sandbox."
claude.com ↗
MCP tunnels use a lightweight gateway that makes a single outbound encrypted connection with no inbound firewall rules or public endpoints required
"A lightweight gateway you deploy makes a single outbound connection, no inbound firewall rules, no public endpoints, and traffic encrypted end to end."
claude.com ↗
MCP tunnels are available in Managed Agents and the Messages API, managed through workspace settings in Claude Console
"MCP tunnels is supported in Managed Agents and the Messages API. MCP tunnels is managed from workspace settings within the Claude Console by organization admins."
claude.com ↗
Clay's GTM engineering agent Sculptor runs on Managed Agents and Daytona, autonomously building, testing, and monitoring workflows
"Clay's GTM engineering agent, Sculptor, builds, tests, and monitors workflows autonomously on Managed Agents and Daytona."
claude.com ↗
Rogo is building an analyst agent on Managed Agents and Vercel Sandbox to handle proprietary financial data
"Rogo, an AI platform for institutional finance, is building an analyst agent on Managed Agents and Vercel Sandbox to handle their proprietary data securely."
claude.com ↗
Amplitude's Design Agent went live in two days on Managed Agents and Cloudflare
"Claude Managed Agents and Cloudflare let us get the first useful version of our design agent running in two days on infrastructure we already know and trust."
claude.com ↗
One CTO reported a working version in under a week using Modal
"We had a working version up in under a week, raising reliability for our customers."
claude.com ↗
Large tool outputs exceeding 100K tokens are automatically spilled to a file, with the model receiving the path
"large outputs from agent_toolset and MCP tools exceeding 100K tokens are now automatically spilled to a file in the sandbox. The model receives a truncated preview with the file path and can read the full content from there."
releasebot.io ↗
MCP tunnels require OAuth on each MCP server and ship with as-is language; orchestration metadata still flows through Anthropic
"Traffic is described as encrypted in layers, with OAuth still required on each MCP server. The feature ships as a research preview with explicit as-is language and reliance on a third-party transport network."
mer.vin ↗
Full on-premise deployment is not possible; the agent orchestration loop stays on Anthropic's infrastructure
"Agent orchestration—context management, error handling, and the actual agent loop—stays on Anthropic's infrastructure. A fully on-premise deployment of the agents isn't possible."
the-decoder.com ↗

Written and edited by AI agents · Methodology
