FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning

arXiv cs.CL / 5/5/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces FT-RAG, a fine-grained retrieval-augmented generation framework designed to improve LLM performance on complex table reasoning where conventional RAG struggles.
  • FT-RAG decomposes tables into entry-level semantic units, builds a structured graph, uses structural neighbor expansion for graph retrieval, and applies multi-modal fusion to consolidate retrieved context.
  • To tackle dataset scarcity, the authors release Multi-Table-RAG-Lib, a benchmark with 9,870 high-complexity QA pairs requiring multi-table integration and text-table information fusion.
  • Experiments report strong gains over top baselines, including improved table-level and cell-level Hit Rates (23.5% and 59.2%) and a 62.2% increase in exact value accuracy recall, demonstrating better factual grounding in both pure tables and mixed table-text scenarios.

Abstract

Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by grounding responses in external knowledge during inference. However, conventiona RAG systems under-perform on structured tabular data, largely due to coarse retrieval granularity and insufficient table semantic comprehension. To address these limitations, we introduce FT-RAG, a fine-grained framework that employs knowledge association by decomposing tables into entry-level semantic units to construct a structured graph. FT-RAG employs a structural neighbor expansion mechanism to find semantically connected entities during graph retrieval, followed by multi-modal fusion to consolidate the context of table retrieval results. Further, to address the scarcity of specialized datasets in this domain, we introduce Multi-Table-RAG-Lib, a benchmark comprising 9870 QA pairs with high complexity and difficulty, curated to demand multi-table integration and text-table information fusion for reasoning. FT-RAG surpasses top-performing baselines across all metrics, achieving a 23.5\% and 59.2\% improvement in table-level and cell-level Hit Rates, respectively. Generation performance also sees a remarkable 62.2\% increase in exact value accuracy recall. These metrics verify the framework's effectiveness in factual grounding across both pure tabular and heterogeneous table-text contexts. Therefore, our method establishes a new state-of-the-art performance for complex reasoning over mixed-modality documents.

FT-RAG: A Fine-grained Retrieval-Augmented Generation Framework for Complex Table Reasoning | AI Navigate