GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

arXiv cs.AI / 2026/3/24

📰 ニュースSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper introduces GSEM (Graph-based Self-Evolving Memory), a clinical memory framework that improves how clinical decision experiences are stored and retrieved for experience-augmented clinical reasoning agents.
  • Unlike prior memory methods that treat experiences as independent records, GSEM organizes experiences into a dual-layer graph to capture both intra-experience decision structure and inter-experience relational dependencies.
  • GSEM supports applicability-aware retrieval and uses online feedback to calibrate node quality and edge weights, aiming to reduce noisy retrieval and improve reliable reuse.
  • Experiments on MedR-Bench and MedAgentsBench using two LLM backbones (DeepSeek-V3.2 and Qwen3.5-35B) report the best average accuracy versus baselines, achieving 70.90% and 69.24% respectively.
  • The authors provide public code at the linked GitHub repository to enable replication and further development of the proposed approach.

Abstract

Clinical decision-making agents can benefit from reusing prior decision experience. However, many memory-augmented methods store experiences as independent records without explicit relational structure, which may introduce noisy retrieval, unreliable reuse, and in some cases even hurt performance compared to direct LLM inference. We propose GSEM (Graph-based Self-Evolving Memory), a clinical memory framework that organizes clinical experiences into a dual-layer memory graph, capturing both the decision structure within each experience and the relational dependencies across experiences, and supporting applicability-aware retrieval and online feedback-driven calibration of node quality and edge weights. Across MedR-Bench and MedAgentsBench with two LLM backbones, GSEM achieves the highest average accuracy among all baselines, reaching 70.90\% and 69.24\% with DeepSeek-V3.2 and Qwen3.5-35B, respectively. Code is available at https://github.com/xhan1022/gsem.