APEX-EM: Non-Parametric Online Learning for Autonomous Agents via Structured Procedural-Episodic Experience Replay
arXiv cs.AI / 4/1/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces APEX-EM, a non-parametric online learning framework for LLM-based autonomous agents that reuses prior procedural plans via structured procedural-episodic experience replay without updating model weights.
- APEX-EM defines a structured experience representation capturing planning steps, artifacts, iteration history with error analysis, and quality scores, and uses a PRGII workflow with task verifiers to generate multi-dimensional reward signals.
- It also proposes a dual-outcome experience memory that performs hybrid retrieval using semantic search, structural signature matching, and plan-DAG traversal to enable transfer across tasks with little/no lexical overlap but similar operational structure.
- Experiments on BigCodeBench, KGQAGen-10k, and Humanity’s Last Exam show large accuracy/SR gains from memory, including 89.6% vs 41.3% on KGQAGen-10k and 83.3% SR vs 53.9% on BigCodeBench, with ablations indicating feedback usefulness depends on task type.
- The approach treats successful executions as positive in-context examples and failures as negative examples annotated with structured error information to improve iterative planning and reuse over time.
Related Articles

Black Hat Asia
AI Business

Mistral raises $830M, 9fin hits unicorn status, and new Tech.eu Summit speakers unveiled
Tech.eu
ChatGPT costs $20/month. I built an alternative for $2.99.
Dev.to

OpenAI shifts to usage-based pricing for Codex in ChatGPT business plans
THE DECODER
Why I built an AI assistant that doesn't know who you are
Dev.to