EHRAG: Bridging Semantic Gaps in Lightweight GraphRAG via Hybrid Hypergraph Construction and Retrieval
arXiv cs.AI / 4/21/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper introduces EHRAG, a lightweight GraphRAG framework designed to bridge semantic gaps that occur when entities are only connected via structural co-occurrence.
- EHRAG builds a hypergraph using both structural hyperedges (from sentence-level entity co-occurrence via lightweight extraction) and semantic hyperedges (from clustering entity text embeddings), capturing latent relationships.
- For retrieval, it uses a hybrid structure–semantic diffusion approach with topic-aware scoring and personalized PageRank (PPR) refinement to select the top-k relevant documents.
- Experiments across four datasets show EHRAG outperforms existing baselines while keeping linear indexing complexity and requiring zero tokens for hypergraph construction.
- The work is provided as an open-source implementation on GitHub, enabling replication and further research.



