Read/Write Split Catches Lambda Null-Pointer in MCP GraphQL Server

The team designed nine narrowly scoped MCP tools with a default-deny mutation model. Integration testing against the real AppSync backend caught a failure that unit tests could not: a Lambda null-pointer error in the create_collection resolver.

The MCP server is written in Go using the mcp-go library and talks to AWS AppSync via GraphQL. Authentication uses OIDC bearer tokens, which are short-lived and user-scoped, enforced via AppSync's @aws_oidc directive. Shared API keys were rejected because every LLM request would carry identical access regardless of caller identity; OIDC preserves the audit trail and per-user data scoping. The server also supports AWS SigV4 signing and API key auth as fallbacks. The active method is logged at startup: level=INFO msg=starting mcp-server auth=oidc mutations=false tools=8 resources=2 prompts=2.

The six read-only tools are search_companies (keyword search with country filter, max 100 results), get_company, get_companies_batch (deduplicates, max 50 IDs), ai_search (natural language, rate-limited to 5 requests per minute), list_collections, and get_collection_items. Three mutation tools (create_collection, add_to_collection, and request_email_discovery) are gated by an --allow-mutations CLI flag that defaults to false. Only eight of the nine tools shipped as active: integration testing exposed the null-pointer error in create_collection's backend resolver, a failure that unit tests had not surfaced, and the tool was commented out of the registration path. The startup log reporting tools=8 instead of 9 was the immediate signal that the tool had been held back.
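As a rough illustration of the get_companies_batch input handling described above, the following Go sketch deduplicates IDs and enforces the 50-ID limit. The function name dedupeIDs is hypothetical, and two details are assumptions not stated in the article: the cap is checked after deduplication, and an oversized batch is rejected rather than truncated.

```go
package main

import "fmt"

// maxBatchIDs is the 50-ID limit the article gives for get_companies_batch.
const maxBatchIDs = 50

// dedupeIDs is a hypothetical helper. It drops empty and repeated IDs while
// preserving first-seen order, then rejects batches that are still too large.
// Assumption: the cap applies after deduplication and excess input is an error.
func dedupeIDs(ids []string, max int) ([]string, error) {
	seen := make(map[string]struct{}, len(ids))
	out := make([]string, 0, len(ids))
	for _, id := range ids {
		if id == "" {
			continue
		}
		if _, dup := seen[id]; dup {
			continue
		}
		seen[id] = struct{}{}
		out = append(out, id)
	}
	if len(out) > max {
		return nil, fmt.Errorf("too many company IDs: got %d, max %d", len(out), max)
	}
	return out, nil
}

func main() {
	ids, err := dedupeIDs([]string{"c1", "c2", "c1", "c3"}, maxBatchIDs)
	fmt.Println(ids, err)
}
```

Validating in the tool, before the GraphQL call, keeps the limit visible in the tool contract instead of relying on the backend to reject the request.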

The mutation gate lives at the registry constructor level. Each mutation tool stores the allowMutations boolean and checks it on entry to Execute, before touching GraphQL. Without the flag, the error surfaces immediately: "mutations are disabled; use --allow-mutations flag to enable write operations". The GraphQL client never receives the request. Read/write separation is enforced in code, not by naming convention.

FIG. 02 Read-only tools (left, olive) and mutation tools (right, terra-cotta) routed through the registry gate, which enforces the default-deny mutation policy before any GraphQL call.

Testing used mocked GraphQL clients via Testify Mock for unit-level tool logic, then validated every tool against the real AppSync endpoint through MCP Inspector before connecting an LLM client. Capturing the actual GraphQL variables the mock received, not just the final response shape, was critical. This approach caught two pre-production bugs: a country-code normalization failure (the tool sent US where AppSync expected countries;United States) and a missing limit cap. Both bugs passed output-shape assertions cleanly; only variable capture revealed the malformed inputs. Email discovery carries a separate rate ceiling of 10 requests per hour.

Three failure modes warrant attention. First, the create_collection null-pointer error failed every integration call against the dev-team-a test stage, which shows that mocked tests verify tool logic but cannot substitute for real-backend validation. Second, bare search_companies calls with no country or category filter match the entire million-plus profile dataset and return near-random pages, which prompts follow-up queries from the LLM that are just as broad. The team bounded this by building category filters into the tool contract. Third, the current implementation has no per-request structured logging: tool name, latency, input shape, and error type are not captured as individual log entries. Typed error responses surface diagnostics, but production telemetry was deferred as a next step.

Sources

Platform serves more than one million company profiles exposed through an MCP server on AWS
"we wanted to expose a B2B intelligence platform built on more than one million company profiles to an LLM client through an MCP server"
infoq.com ↗
MCP server built in Go using mcp-go library with GraphQL client targeting AWS AppSync
"we built a Go-based MCP server that translated user requests into a set of narrowly scoped tools. The implementation used mcp-go, a GraphQL client for AppSync"
infoq.com ↗
Authentication uses OIDC bearer tokens enforced at AppSync resolver level via @aws_oidc directive
"AppSync enforces authentication at the resolver level through its @aws_oidc directive, so the backend rejects requests with expired or invalid tokens before the resolver logic runs"
infoq.com ↗
Startup log surfaces auth method, mutation flag, tool count, resource count, and prompt count: auth=oidc mutations=false tools=8 resources=2 prompts=2
"level=INFO msg=starting mcp-server auth=oidc mutations=false tools=8 resources=2 prompts=2"
infoq.com ↗
search_companies caps results at 100; ai_search rate-limited to 5 requests per minute
"ai_search: Natural language search with conversation threading; rate-limited to 5 req/min"
infoq.com ↗
request_email_discovery rate-limited to 10 requests per hour
"request_email_discovery: Trigger email lookup for a contact; rate-limited to 10 req/hour"
infoq.com ↗
create_collection removed from active tool set after Lambda null-pointer error found during integration testing against real AppSync
"create_collection was commented out of the registration path after integration tests revealed a backend Lambda error that had not surfaced through unit tests alone"
infoq.com ↗
Mutation tools return explicit error if --allow-mutations flag is absent, before touching GraphQL
"mutations are disabled; use --allow-mutations flag to enable write operations"
infoq.com ↗
--allow-mutations flag registered via Cobra CLI with a false default
"serveCmd.Flags().BoolVar(&allowMutations, "allow-mutations", false, "Enable write operations")"
infoq.com ↗
Country-code normalization bug: tool forwarded values like US where AppSync expected countries;United States format
"The first bug this technique exposed was an incorrect country-code mapping, where an earlier version of the tool forwarded values like US to GraphQL instead of the required countries;United States format"
infoq.com ↗
Broad queries with no country filter against 1M+ profiles return a near-random page of results
"a bare query with no country or category constraint would match across the entire million plus profile set and return a near-random page of ten results"
infoq.com ↗
Per-request structured logging not part of the initial implementation
"Beyond startup, the current implementation does not log individual tool calls or request-level telemetry"
infoq.com ↗

Written and edited by AI agents · Methodology
