Five Bugs Killed agentmemory in Seven Days

Fabio Akita pulled agentmemory from production after seven days, filed five reproducible bug reports against the 15.7K-star project, and shipped ai-memory—a SQLite FTS5 and markdown replacement—within the same week.

agentmemory's architecture pairs a TypeScript MCP server with a separate Rust iii-engine: four open ports, three processes, and in-memory BM25 indices persisted to a remote KV store. Past 10,000 observations, state::set calls begin timing out and the BM25 index file (mem:index:bm25.bin) is left at roughly 96 bytes, effectively empty, so every restart spends about five minutes rebuilding the index from scratch (issue #309, still open). Every write routes through a five-second IndexPersistence debounce; when the upstream state::set call hits its 30-second timeout, the Node process exits and takes everything buffered in memory with it, so each timeout is a data-loss window (issue #204).
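The data-loss window described above can be illustrated with a toy simulation. This is a minimal sketch, not agentmemory's code: the `DebouncedStore` class and its methods are hypothetical names, and only the five-second debounce figure comes from the article. It shows why any write made after the last flush disappears if the process exits before the debounce timer fires.

```python
from dataclasses import dataclass, field


@dataclass
class DebouncedStore:
    """Toy model of debounced persistence: writes sit in RAM until flushed."""

    debounce_s: float = 5.0  # debounce interval reported in the article
    durable: list = field(default_factory=list)  # what reached durable storage
    pending: list = field(default_factory=list)  # buffered in process memory
    last_write_at: float = 0.0

    def write(self, now: float, item: str) -> None:
        # Each write stays in RAM and restarts the debounce timer.
        self.pending.append(item)
        self.last_write_at = now

    def tick(self, now: float) -> None:
        # Flush only once the store has been quiet for debounce_s seconds.
        if self.pending and now - self.last_write_at >= self.debounce_s:
            self.durable.extend(self.pending)
            self.pending.clear()

    def crash(self) -> list:
        # Process exit: whatever is still pending is lost.
        lost, self.pending = self.pending, []
        return lost


store = DebouncedStore()
store.write(0.0, "obs-1")
store.tick(6.0)           # quiet for 6 s, so obs-1 is flushed
store.write(7.0, "obs-2")
store.write(9.0, "obs-3")
store.tick(10.0)          # only 1 s since the last write, nothing is flushed
lost = store.crash()      # simulated exit after an upstream timeout
print(store.durable, lost)
```

In this run only obs-1 reaches durable storage; obs-2 and obs-3 are lost with the process. A design that writes each observation to durable storage before acknowledging it closes the window at the cost of more I/O per write.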

FIG. 02 agentmemory's architecture: TypeScript MCP server, Rust engine, four open ports, and three processes sharing volatile in-memory indices via remote KV.

Three more bugs compounded the problem. The codebase reads configuration through process.env in one path and getMergedEnv() in another, so the two paths can disagree (issue #456). The Claude Code hook read data.tool_output, but Claude Code sends tool_response; the mismatch silently dropped observations for roughly 47% of tool calls over six weeks (issue #539). The Rust engine also resolves its state store relative to the caller's working directory, so on Windows each terminal opened a store at a different path and users believed their memories were lost (issue #303, still open). The bugs are structural: closing them would require rewriting half the system.
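The hook mismatch is the kind of failure that stays silent only when a missing field is treated as "nothing to record". The sketch below is a hypothetical parser, not the project's hook: the function name, the return shape, and the sample payloads are assumptions, and only the field names tool_output and tool_response come from the article. It reports a missing tool_response instead of dropping the event.

```python
import json


def extract_tool_result(raw_event: str) -> dict:
    """Parse a hook event and return the tool result, failing loudly if absent."""
    data = json.loads(raw_event)
    if "tool_response" in data:
        return {"ok": True, "result": data["tool_response"]}
    # Surface the miss, with the keys that did arrive, rather than returning nothing.
    return {"ok": False, "error": f"no tool_response; keys seen: {sorted(data)}"}


# Hypothetical payloads for illustration only.
good = extract_tool_result('{"tool_name": "Bash", "tool_response": "done"}')
bad = extract_tool_result('{"tool_name": "Bash", "tool_output": "done"}')
print(good)
print(bad)
```

Logging or counting the `ok: False` cases would likely have exposed a field-name mismatch within hours rather than weeks.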

ai-memory bets against complexity at every layer. Storage is plain markdown committed to git. Indexing is SQLite FTS5. Deployment is a single binary or container with no external service dependencies. Hooks fire automatically on agent tool calls—no user-triggered write_note commands required. Memory decay runs on a schedule without operator intervention.
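To make the indexing layer concrete, here is a minimal sketch of full-text search over markdown notes with SQLite FTS5, using Python's standard sqlite3 module. It is not ai-memory's schema: the table name, column names, and sample notes are assumptions, and it requires a SQLite build with FTS5 enabled, which most Python distributions ship.

```python
import sqlite3

# In-memory database for the example; a real index would live in a file.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE notes USING fts5(path, body)")

# Hypothetical note contents standing in for markdown files on disk.
sample_notes = [
    ("notes/persistence.md", "IndexPersistence debounce delays every write by five seconds"),
    ("notes/hooks.md", "Claude Code emits tool_response in hook payloads"),
    ("notes/paths.md", "Resolve the state store path from a fixed base directory"),
]
conn.executemany("INSERT INTO notes (path, body) VALUES (?, ?)", sample_notes)


def search(query: str, limit: int = 5) -> list:
    """Return note paths matching the FTS5 query, best BM25 match first."""
    rows = conn.execute(
        "SELECT path FROM notes WHERE notes MATCH ? ORDER BY bm25(notes) LIMIT ?",
        (query, limit),
    )
    return [row[0] for row in rows]


print(search("debounce"))
print(search("state store"))
```

FTS5's bm25() returns lower values for better matches, so ascending order puts the strongest hit first. The default tokenizer does no stemming, which is one reason retrieval quality is worth checking against real notes.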

The design lifts directly from Andrej Karpathy's LLM Wiki gist. Karpathy's index.md approach works at moderate scale (roughly 100 sources, hundreds of pages) and avoids embedding-based RAG infrastructure. Akita implements a three-layer model (raw capture to wiki consolidation to schema) with three operations (ingest / query / lint), running locally without a vector pipeline.

The failure mode ai-memory targets is cross-agent handoff. Akita runs Claude Code as his primary orchestrator, fires Codex against the same working directory when Claude stalls, then returns to Claude for careful implementation. Without shared external memory, each agent switch requires a manual HANDOFF.md write-then-read cycle. Claude Code, Codex, and opencode each handle in-session compaction on their own: Claude Code runs microcompact on temporal gaps, autocompact at a token threshold, and an experimental sessionMemoryCompact; Codex uses auto_compact_token_limit per model; opencode anchors compaction with a 20,000-token buffer. None of that compacted context survives a switch from one agent to another.

Akita did not publish latency numbers for FTS5 retrieval versus vector-DB alternatives, recall scores, or comparative results against basic-memory, mem0, and knowledge-graph tools. ai-memory is new enough that the cross-agent sync mechanism—real-time shared state between a live Claude Code session and a simultaneously running Codex process—remains untested under meaningful concurrency. No production-scale figures were disclosed.

Silent observation loss in agentmemory is architectural—three processes with remote KV persistence—not misconfiguration. ai-memory's single-binary SQLite approach trades horizontal scalability for local reliability that solo and small-team multi-agent workflows need. Validate FTS5 retrieval quality against your own memory corpus before committing to the stack.
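One way to run that validation is to hand-label a small set of queries with the notes that should come back, then measure recall at k over whatever the search layer returns. The sketch below is an assumed evaluation harness, not something Akita published: the function name, the query strings, and the file names are all hypothetical, and it computes micro-averaged recall across queries.

```python
def recall_at_k(results: dict, relevant: dict, k: int) -> float:
    """Micro-averaged recall: relevant notes found in the top k, over all relevant notes."""
    hits = 0
    total = 0
    for query, expected in relevant.items():
        top_k = set(results.get(query, [])[:k])
        hits += len(top_k & expected)
        total += len(expected)
    return hits / total if total else 0.0


# Hand-labelled ground truth: which notes should be retrieved for each query.
relevant = {
    "debounce timeout": {"a.md", "b.md"},
    "hook payload": {"c.md"},
}
# Ranked output of the retrieval layer under test (hypothetical).
results = {
    "debounce timeout": ["a.md", "x.md", "b.md"],
    "hook payload": ["y.md", "c.md"],
}

print(recall_at_k(results, relevant, k=2))
print(recall_at_k(results, relevant, k=3))
```

With these sample rankings, two of the three relevant notes appear in the top two results and all three appear in the top three. A few dozen labelled queries drawn from real sessions are usually enough to show whether keyword search misses paraphrased memories.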

Sources

agentmemory has 15.7K GitHub stars
"o repo tem 15.7K estrelas"
akitaonrails.com ↗
Past 10K observations, state::set starts timing out, the BM25 index file is left at ~96 bytes, and each restart requires a ~5-minute rebuild (issue #309, open)
"passa de 10K observações e o state::set começa a dar timeout, o arquivo fica em ~96 bytes vazio, e cada restart paga ~5 minutos reconstruindo do zero (issue #309, aberta)"
akitaonrails.com ↗
5-second IndexPersistence debounce plus 30-second state::set timeout creates a guaranteed data-loss window (issue #204)
"Toda persistência passa por um debounce de 5s do IndexPersistence. Quando o timeout de 30s do state::set estoura, o processo Node morre levando junto tudo que tava em memória (#204)"
akitaonrails.com ↗
Dual config read paths — process.env vs getMergedEnv() — in the same codebase (issue #456)
"Dois caminhos de leitura de config no mesmo código. process.env direto num lugar, getMergedEnv() em outro (#456)"
akitaonrails.com ↗
agentmemory Claude Code hook read data.tool_output, but Claude Code sends tool_response, silently missing ~47% of tool calls for six weeks (issue #539)
"Hook errado em ~47% das tool calls do Claude Code. O hook leu data.tool_output, mas o Claude Code manda tool_response. Seis semanas com observações sumindo silenciosamente (#539)"
akitaonrails.com ↗
Engine runs from the caller's working directory, creating separate state stores per terminal on Windows (issue #303)
"Engine roda no diretório de trabalho do chamador. Usuários de Windows acharam que perderam memórias porque cada terminal abria o state store num path diferente (#303, aberta)"
akitaonrails.com ↗
agentmemory architecture: TypeScript MCP + iii-engine Rust + 4 ports + 3 processes + in-memory indices via remote KV
"TypeScript MCP + iii-engine Rust separado + 4 portas, 3 processos, índices em memória persistidos via KV remoto"
akitaonrails.com ↗
ai-memory uses SQLite FTS5, plain markdown in git, single binary deployment, automatic hooks, and scheduled memory decay
"Storage simples (markdown em git), index simples (SQLite com FTS5), instalação simples (um binário ou container), uso automático (hooks que disparam sozinhos), zero manutenção (memória decai sem o usuário mexer)"
akitaonrails.com ↗
Karpathy's LLM Wiki gist states the index.md approach works surprisingly well at moderate scale (~100 sources, ~hundreds of pages) and avoids the need for embedding-based RAG infrastructure
"This works surprisingly well at moderate scale (~100 sources, ~hundreds of pages) and avoids the need for embedding-based RAG infrastructure."
gist.github.com ↗
Claude Code runs microcompact on temporal gaps, autocompact at token threshold, experimental sessionMemoryCompact
"Claude Code tem três níveis de compaction (microcompact por gap temporal, autocompact por threshold de token, sessionMemoryCompact experimental)"
akitaonrails.com ↗
Codex uses auto_compact_token_limit per model; opencode uses compaction anchored with a 20K token buffer
"Codex tem auto_compact_token_limit por modelo. opencode tem compaction ancorada com buffer de 20K"
akitaonrails.com ↗

Written and edited by AI agents · Methodology