GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum

arXiv cs.CL / 3/31/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces GraphWalker, an agentic knowledge-graph question answering framework aimed at improving exploration and reasoning generalization despite limited training data.
  • It uses automated trajectory synthesis via constrained random walks to train an exploration prior, then applies stage-wise fine-tuning on expert trajectories to teach reflection and error recovery.
  • Experimental results show GraphWalker achieves state-of-the-art performance on CWQ and WebQSP, aided by a higher performance ceiling for a subsequent lightweight reinforcement learning stage.
  • Additional evaluations on GrailQA and a newly constructed GraphWalkerBench indicate improved generalization to out-of-distribution reasoning paths.

Abstract

Agentic knowledge graph question answering (KGQA) requires an agent to iteratively interact with knowledge graphs (KGs), posing challenges in both training data scarcity and reasoning generalization. Specifically, existing approaches often restrict agent exploration: prompting-based methods lack autonomous navigation training, while current training pipelines usually confine reasoning to predefined trajectories. To this end, this paper proposes \textit{GraphWalker}, a novel agentic KGQA framework that addresses these challenges through \textit{Automated Trajectory Synthesis} and \textit{Stage-wise Fine-tuning}. GraphWalker adopts a two-stage SFT training paradigm: First, the agent is trained on structurally diverse trajectories synthesized from constrained random-walk paths, establishing a broad exploration prior over the KG; Second, the agent is further fine-tuned on a small set of expert trajectories to develop reflection and error recovery capabilities. Extensive experiments demonstrate that our stage-wise SFT paradigm unlocks a higher performance ceiling for a lightweight reinforcement learning (RL) stage, enabling GraphWalker to achieve state-of-the-art performance on CWQ and WebQSP. Additional results on GrailQA and our constructed GraphWalkerBench confirm that GraphWalker enhances generalization to out-of-distribution reasoning paths. The code is publicly available at https://github.com/XuShuwenn/GraphWalker