Hybrid Self-evolving Structured Memory for GUI Agents
arXiv cs.AI / 3/12/2026
📰 NewsModels & Research
Key Points
- HyMEM is a graph-based memory that couples discrete high-level symbolic nodes with continuous trajectory embeddings to enable structured, multi-hop retrieval in GUI agents.
- It supports self-evolution through node update operations and on-the-fly working-memory refreshing during inference, inspired by human memory organization.
- Extensive experiments show HyMEM consistently improves open-source GUI agents, enabling 7B/8B backbones to match or surpass strong closed-source models, notably boosting Qwen2.5-VL-7B by +22.5% and outperforming Gemini2.5-Pro-Vision and GPT-4o.
- The work points to broad implications for GUI automation tasks with long-horizon workflows and diverse interfaces by providing a memory-augmented approach.
Related Articles

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.
Reddit r/LocalLLaMA
QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!
Reddit r/LocalLLaMA
acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan
Reddit r/LocalLLaMA

**Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding**
Hugging Face Blog

Newest GPU server in the lab! 72gb ampere vram!
Reddit r/LocalLLaMA