RouterKGQA: Specialized-General Model Routing for Constraint-Aware Knowledge Graph Question Answering

arXiv cs.CL / 3/23/2026


Key Points

  • RouterKGQA introduces a specialized-general model routing framework for knowledge graph question answering (KGQA): a small specialized model generates reasoning paths, and a large general model steps in only when KG-guided repair is needed. This keeps answers grounded in the knowledge graph while limiting cost.
  • The specialized model is equipped with constraint-aware answer filtering, which reduces redundant answers.
  • Experimental results show RouterKGQA outperforms the previous best method by 3.57 points in F1 and 0.49 points in Hits@1 on average across benchmarks, while requiring only 1.15 LLM calls per question on average.
  • The paper also optimizes the general agent workflow to further lower inference cost and provides open-source code and models.
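
To make the idea of constraint-aware answer filtering concrete, here is a minimal Python sketch. It is not the paper's method: the toy graph, the `filter_answers` helper, and the "released after 2000" constraint are all assumptions made up for illustration. It only shows how checking an implicit constraint against the knowledge graph can prune candidate answers that a reasoning path returns.

```python
# Toy knowledge graph: (subject, relation) -> list of objects.
# All entities and values are invented for illustration.
KG = {
    ("Director_A", "directed"): ["Film_1", "Film_2", "Film_3"],
    ("Film_1", "release_year"): [1995],
    ("Film_2", "release_year"): [2004],
    ("Film_3", "release_year"): [2011],
}


def filter_answers(candidates, relation, predicate):
    """Keep candidates with at least one `relation` value that satisfies `predicate`.

    Hypothetical helper: the summary above does not describe how the
    paper implements its filtering step.
    """
    kept = []
    for candidate in candidates:
        values = KG.get((candidate, relation), [])
        if any(predicate(value) for value in values):
            kept.append(candidate)
    return kept


# Question (assumed): "Which films directed by Director_A were released after 2000?"
# The path Director_A -directed-> ? yields three candidates; the implicit
# constraint on release_year removes the one that does not qualify.
candidates = KG[("Director_A", "directed")]
filtered = filter_answers(candidates, "release_year", lambda year: year > 2000)
print(filtered)
```

Without the filter, all three films would be returned as answers, which would lower precision and therefore F1.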

Abstract

Knowledge graph question answering (KGQA) is a promising approach for mitigating LLM hallucination by grounding reasoning in structured and verifiable knowledge graphs. Existing approaches fall into two paradigms: retrieval-based methods utilize small specialized models, which are efficient but often produce unreachable paths and miss implicit constraints, while agent-based methods utilize large general models, which achieve stronger structural grounding at substantially higher cost. We propose RouterKGQA, a framework for specialized-general model collaboration, in which a specialized model generates reasoning paths and a general model performs KG-guided repair only when needed, improving performance at minimal cost. We further equip the specialized model with constraint-aware answer filtering, which reduces redundant answers. In addition, we design a more efficient general agent workflow, further lowering inference cost. Experimental results show that RouterKGQA outperforms the previous best by 3.57 points in F1 and 0.49 points in Hits@1 on average across benchmarks, while requiring only 1.15 average LLM calls per question. Code and models are available at https://github.com/Oldcircle/RouterKGQA.
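
The routing idea in the abstract can be sketched in a few lines of Python. This is an illustrative toy under stated assumptions, not the released implementation: `specialized_model` and `general_model_repair` are hypothetical stubs standing in for the two models, the graph is invented, and the routing trigger used here (the proposed path reaches no entity) is only one plausible reading of "only when needed". How the paper counts LLM calls is also not specified in the abstract, so the counter below simply counts model invocations.

```python
from typing import Optional

# Toy knowledge graph: entity -> relation -> list of neighbor entities.
KG = {
    "Paris": {"capital_of": ["France"]},
    "France": {"official_language": ["French"]},
}

# Canned outputs for the specialized-model stub. The second path
# contains a relation that does not exist in the graph.
PROPOSED_PATHS = {
    "q1": ["capital_of", "official_language"],
    "q2": ["capital_city_of", "official_language"],
}


def follow_path(start: str, relations: list[str]) -> list[str]:
    """Walk the relation path from `start`; return [] if it is unreachable."""
    frontier = [start]
    for rel in relations:
        frontier = [o for e in frontier for o in KG.get(e, {}).get(rel, [])]
        if not frontier:
            return []
    return frontier


def specialized_model(question: str) -> list[str]:
    """Stub for the small specialized model: returns a relation path."""
    return PROPOSED_PATHS[question]


def general_model_repair(start: str, path: list[str]) -> Optional[list[str]]:
    """Stub for KG-guided repair by the large general model.

    At each hop, a relation missing from the graph is replaced by the
    first relation that actually leaves the current frontier. A real
    system would ask an LLM to choose among the available relations.
    """
    frontier, repaired = [start], []
    for rel in path:
        available = sorted({r for e in frontier for r in KG.get(e, {})})
        if rel not in available:
            if not available:
                return None  # dead end: nothing to repair with
            rel = available[0]
        repaired.append(rel)
        frontier = [o for e in frontier for o in KG.get(e, {}).get(rel, [])]
    return repaired


def answer(question: str, start: str) -> tuple[list[str], int]:
    """Route: try the specialized path first, repair only if it fails."""
    calls = 1
    answers = follow_path(start, specialized_model(question))
    if not answers:
        calls += 1
        repaired = general_model_repair(start, PROPOSED_PATHS[question])
        answers = follow_path(start, repaired) if repaired else []
    return answers, calls


results = [answer("q1", "Paris"), answer("q2", "Paris")]
average_calls = sum(calls for _, calls in results) / len(results)
print(results, average_calls)
```

In this toy run the first question is answered by the specialized path alone and the second needs one repair call, so the average is 1.5 calls per question. The reported average of 1.15 would correspond to repair being triggered far less often than in this two-question example.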