CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation
arXiv cs.AI / 5/7/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- The paper argues that RAG reranking should optimize for generation usefulness (e.g., uncertainty reduction), not just query–document relevance.
- It introduces CAR (Confidence-Aware Reranking), a training-free, plug-and-play method that uses the generator’s confidence change to score and reorder documents.
- CAR estimates confidence by measuring semantic consistency across multiple sampled answers under query-only versus query+document settings, promoting documents that raise confidence and demoting those that lower it.
- Experiments on four BEIR datasets show consistent improvements in NDCG@5 across both sparse/dense retrievers and multiple reranker types and LLM backbones.
- The method’s ranking improvements closely align with downstream generation quality, with a strong correlation to generation F1 (Spearman ρ = 0.964), including a large gain for the YesNo reranker.




