CORAL: Adaptive Retrieval Loop for Culturally-Aligned Multilingual RAG

arXiv cs.CL / 4/29/2026

📰 NewsModels & Research

Key Points

  • CORAL introduces an adaptive retrieval loop for culturally aligned multilingual RAG that jointly refines the retrieval corpora and the query based on evidence quality.
  • The method addresses a key limitation of fixed mRAG retrieval spaces, where culturally grounded questions can suffer from retrieval-condition misalignment even with strong retrievers and generators.
  • CORAL iteratively performs corpus selection, document retrieval, cultural relevance critique, and sufficiency checks, and then reselects corpora and rewrites the query when evidence is insufficient.
  • On two cultural QA benchmarks, CORAL delivers up to a 3.58 percentage-point accuracy improvement on low-resource languages compared with the strongest baselines.

Abstract

Multilingual retrieval-augmented generation (mRAG) is often implemented within a fixed retrieval space, typically via query or document translation or multilingual embedding vector representations. However, this approach may be inadequate for culturally grounded queries, in which retrieval-condition misalignment may occur. Even strong retrievers and generators may struggle to produce culturally relevant answers when sourcing evidence from inappropriate linguistic or regional contexts. To this end, we introduce CORAL (COntext-aware Retrieval with Agentic Loop, an adaptive retrieval methodology for mRAG that enables iterative refinement of both the retrieval space (corpora) and the retrieval probe (query) based on the quality of the evidence. The overall process includes: (1) selecting corpora, (2) retrieving documents, (3) critiquing evidence for relevance and cultural alignment, and (4) checking sufficiency. If the retrieved documents are insufficient to answer the query correctly, the system (5) reselects corpora and rewrites the query. Across two cultural QA benchmarks, CORAL achieves up to a 3.58%p accuracy improvement on low-resource languages relative to the strongest baselines.