Memanto: Typed Semantic Memory with Information-Theoretic Retrieval for Long-Horizon Agents

arXiv cs.AI / 4/27/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper argues that memory is a key bottleneck for deploying long-horizon, multi-session autonomous agents, where existing approaches often rely on costly hybrid semantic graph pipelines.
  • It introduces Memanto, a universal typed semantic memory layer that uses 13 predefined memory categories, automated conflict resolution, and temporal versioning to improve agent memory fidelity without complex knowledge graph maintenance.
  • Memanto is powered by Moorcheh’s Information-Theoretic Search engine, which uses a no-indexing semantic database to enable deterministic retrieval with sub-90 ms latency and to avoid ingestion delays.
  • Benchmarking on LongMemEval and LoCoMo shows state-of-the-art performance (89.8% and 87.1%), outperforming tested hybrid graph and vector systems while using only a single retrieval query.
  • The work includes a progressive ablation study and discusses implications for scalable deployment of agentic memory systems by reducing operational complexity.

Abstract

The transition from stateless language model inference to persistent, multi session autonomous agents has revealed memory to be a primary architectural bottleneck in the deployment of production grade agentic systems. Existing methodologies largely depend on hybrid semantic graph architectures, which impose substantial computational overhead during both ingestion and retrieval. These systems typically require large language model mediated entity extraction, explicit graph schema maintenance, and multi query retrieval pipelines. This paper introduces Memanto, a universal memory layer for agentic artificial intelligence that challenges the prevailing assumption that knowledge graph complexity is necessary to achieve high fidelity agent memory. Memanto integrates a typed semantic memory schema comprising thirteen predefined memory categories, an automated conflict resolution mechanism, and temporal versioning. These components are enabled by Moorcheh's Information Theoretic Search engine, a no indexing semantic database that provides deterministic retrieval within sub ninety millisecond latency while eliminating ingestion delay. Through systematic benchmarking on the LongMemEval and LoCoMo evaluation suites, Memanto achieves state of the art accuracy scores of 89.8 percent and 87.1 percent respectively. These results surpass all evaluated hybrid graph and vector based systems while requiring only a single retrieval query, incurring no ingestion cost, and maintaining substantially lower operational complexity. A five stage progressive ablation study is presented to quantify the contribution of each architectural component, followed by a discussion of the implications for scalable deployment of agentic memory systems.