Storage Is Not Memory: A Retrieval-Centered Architecture for Agent Recall
arXiv cs.CL / 5/7/2026
📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that “agent memory” should not be based on extracting and storing content at ingestion time, because information discarded before the query is known cannot be recovered later.
- It proposes “True Memory,” a six-layer retrieval-centered architecture that preserves events verbatim and replaces storage-schema assumptions with a multi-stage retrieval pipeline.
- The full system is designed to run as a single SQLite file on commodity CPUs, avoiding external databases, vector indexes, graph stores, and GPUs.
- Experiments show strong retrieval/recall performance: 93.0% on LoCoMo (vs. 61.4% Mem0, 65.4% Supermemory, ~71% Zep, and 94.5% EverMemOS), 87.8% on LongMemEval, and 76.6% on BEAM-1M (above a prior 73.9% result from Hindsight).
- An ablation study across 56 configurations indicates a relatively small performance variance (about 1.3 percentage points) within the best-performing family of setups.
Related Articles

MCP Sentinel v1.0 Is Out: A Lockfile for MCP Tool Schemas
Dev.to

Share of Model: The Metric That Replaces Domain Authority in 2026
Dev.to

Preserving Color in Neural Artistic Style Transfer
Dev.to

I Built an AI Video Factory That Runs 24/7 — Fully Open Source
Dev.to

Your Agency Doesn’t Have a Productivity Problem It Has a Workflow Problem
Dev.to