SEARCH-R: Structured Entity-Aware Retrieval with Chain-of-Reasoning Navigator for Multi-hop Question Answering

arXiv cs.CL / 4/28/2026

📰 NewsModels & Research

Key Points

  • The paper introduces SEARCH-R, a framework for multi-hop question answering that targets two common failure points: uncontrolled reasoning-path generation and low-utility retrieval.
  • SEARCH-R trains an end-to-end chain-of-reasoning path navigator, fine-tuning Llama 3.1 8B to improve sub-question decomposition for complex queries.
  • It proposes a dependency-tree-based retrieval method that quantitatively estimates a document’s informational contribution rather than relying mainly on similarity or matching scores.
  • Experiments on three challenging multi-hop datasets show that the proposed approach improves answer effectiveness compared with prior prompt-based and retrieval-combination methods.
  • The authors provide code and datasets publicly via the linked GitHub repository for replication and further research.

Abstract

Multi-hop Question Answering (MHQA) aims to answer questions that require multi-step reasoning. It presents two key challenges: generating correct reasoning paths in response to the complex user queries, and accurately retrieving essential knowledge in the face of potential limitations in large language models (LLMs). Existing approaches primarily rely on prompt-based methods to generate reasoning paths, which are further combined with traditional sparse or dense retrieval to produce the final answer. However, the generation of reasoning paths commonly lacks effective control over the generative process, thus leading the reasoning astray. Meanwhile, the retrieval methods over-rely on knowledge matching or similarity scores rather than evaluating the practical utility of the information, resulting in retrieving homogeneous or non-useful information. Therefore, we propose a Structured Entity-Aware Retrieval with Chain-of-Reasoning Navigator framework named SEARCH-R. Specifically, SEARCH-R trains an end-to-end reasoning path navigator, which is able to provide a powerful sub-question decomposer by fine-tuning the Llama3.1-8B model. Moreover, a novel dependency tree-based retrieval is designed to evaluate the informational contribution of the document quantitatively. Extensive experiments on three challenging multi-hop datasets validate the effectiveness of the proposed framework. The code and dataset are available at: https://github.com/Applied-Machine-Learning-Lab/ACL2026_SEARCH-R.