Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter

arXiv cs.AI / 3/31/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes Bridge-RAG, a new retrieval-augmented generation (RAG) framework aimed at improving both retrieval accuracy and computational efficiency for LLMs.
  • It introduces an “abstract-to-bridge” mechanism that maps query entities to document chunks, organized via an abstract tree and paired with a multi-level retrieval strategy to preserve sufficient context.
  • To address efficiency, it employs an improved Cuckoo Filter for fast membership queries/updates, speeding up entity location during retrieval.
  • The method further optimizes retrieval performance using a block linked list structure and an entity “temperature”-based sorting scheme to leverage spatial and temporal locality.
  • Experiments report about a 15.65% accuracy improvement and a 10× to 500× reduction in retrieval time versus other RAG frameworks.

Abstract

As an important paradigm for enhancing the generation quality of Large Language Models (LLMs), retrieval-augmented generation (RAG) faces the two challenges regarding retrieval accuracy and computational efficiency. This paper presents a novel RAG framework called Bridge-RAG. To overcome the accuracy challenge, we introduce the concept of abstract to bridge query entities and document chunks, providing robust semantic understanding. We organize the abstracts into a tree structure and design a multi-level retrieval strategy to ensure the inclusion of sufficient contextual information. To overcome the efficiency challenge, we introduce the improved Cuckoo Filter, an efficient data structure supporting rapid membership queries and updates, to accelerate entity location during the retrieval process. We design a block linked list structure and an entity temperature-based sorting mechanism to improve efficiency from the aspects of spatial and temporal locality. Extensive experiments show that Bridge-RAG achieves around 15.65% accuracy improvement and reduces 10x to 500x retrieval time compared to other RAG frameworks.