Prism: An Evolutionary Memory Substrate for Multi-Agent Open-Ended Discovery

arXiv cs.AI / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Prism introduces a probabilistic, information-stratified evolutionary memory substrate designed for multi-agent AI systems performing open-ended discovery.
  • The framework unifies layered file persistence, vector semantic memory, graph-structured relational memory, and multi-agent evolutionary search under a single decision-theoretic architecture.
  • Key components include entropy-gated memory stratification, a causal memory graph with interventional edges and provenance, and a value-of-information retrieval policy that self-evolves strategy selection.
  • Prism also adds a heartbeat-driven consolidation controller with stagnation detection based on optimal stopping theory, and replicator-decay dynamics that treat memory confidence as evolutionary fitness.
  • Experiments report strong gains: 88.1 LLM-as-a-Judge on LOCOMO (31.2% higher than Mem0) and 4-agent Prism improving optimization tasks at 2.8× the rate of single-agent baselines.

Abstract

We introduce \prism{} (\textbf{P}robabilistic \textbf{R}etrieval with \textbf{I}nformation-\textbf{S}tratified \textbf{M}emory), an evolutionary memory substrate for multi-agent AI systems engaged in open-ended discovery. \prism{} unifies four independently developed paradigms -- layered file-based persistence, vector-augmented semantic memory, graph-structured relational memory, and multi-agent evolutionary search -- under a single decision-theoretic framework with eight interconnected subsystems. We make five contributions: (1)~an \emph{entropy-gated stratification} mechanism that assigns memories to a tri-partite hub (skills/notes/attempts) based on Shannon information content, with formal context-window utilization bounds; (2)~a \emph{causal memory graph} \mathcal{G} = (V, E_r, E_c) with interventional edges and agent-attributed provenance; (3)~a \emph{Value-of-Information retrieval} policy with self-evolving strategy selection; (4)~a \emph{heartbeat-driven consolidation} controller with stagnation detection via optimal stopping theory; and (5)~a \emph{replicator-decay dynamics} framework that interprets memory confidence as evolutionary fitness, proving convergence to an Evolutionary Stable Memory Set (ESMS). On the LOCOMO benchmark, \prism{} achieves 88.1 LLM-as-a-Judge score (31.2\% over Mem0). On CORAL-style evolutionary optimization tasks, 4-agent \prism{} achieves 2.8\times higher improvement rate than single-agent baselines.%