Which bird does not have wings: Negative-constrained KGQA with Schema-guided Semantic Matching and Self-directed Refinement

arXiv cs.CL / 4/17/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper argues that large language models often lack faithfulness in Knowledge Graph Question Answering (KGQA), especially when handling negative constraints and negation.
  • It introduces a new KGQA task, NEgative-conSTrained (NEST) KGQA, and a dataset (NestKGQA) where each question includes at least one negative constraint.
  • The authors design PyLF, a Python-formatted logical form language aimed at making negation both clear and readable.
  • They propose CUCKOO, a framework that uses schema-guided semantic matching and only triggers self-directed refinement when execution produces an empty result, improving robustness while controlling cost.
  • Experiments show CUCKOO improves performance over baselines on both standard KGQA and the new NEST-KGQA benchmarks in few-shot settings.

Abstract

Large language models still struggle with faithfulness and hallucinations despite their remarkable reasoning abilities. In Knowledge Graph Question Answering (KGQA), semantic parsing-based approaches address the limitations by understanding constraints in a user's question and converting them into a logical form to execute on a knowledge graph. However, existing KGQA benchmarks and methods are biased toward positive and calculation constraints. Negative constraints are neglected, although they frequently appear in real-world questions. In this paper, we introduce a new task, NEgative-conSTrained (NEST) KGQA, where each question contains at least one negative constraint, and a corresponding dataset, NestKGQA. We also design PyLF, a Python-formatted logical form, since existing logical forms are hardly suitable to express negation clearly while maintaining readability. Furthermore, NEST questions naturally contain multiple constraints. To mitigate their semantic complexity, we present a novel framework named CUCKOO, specialized to multiple-constrained questions and ensuring semantic executability. CUCKOO first generates a constraint-aware logical form draft and performs schema-guided semantic matching. It then selectively applies self-directed refinement only when executing improper logical forms yields an empty result, reducing cost while improving robustness. Experimental results demonstrate that CUCKOO consistently outperforms baselines on both conventional KGQA and NEST-KGQA benchmarks under few-shot settings.