MAB-DQA: Addressing Query Aspect Importance in Document Question Answering with Multi-Armed Bandits
arXiv cs.CL / 4/13/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper targets multimodal Document Question Answering where multimodal RAG over page images often retrieves only a small top-K set, missing useful but less visually salient pages.
- It introduces MAB-DQA, which decomposes a query into aspect-aware subqueries and retrieves an aspect-specific candidate set for each subquery.
- MAB-DQA uses a multi-armed bandit strategy, treating each aspect subquery as an “arm,” to estimate aspect utility from rewards derived from reasoning on a few representative pages.
- An exploration–exploitation policy dynamically reallocates the retrieval budget toward higher-value aspects, using both informative pages and their correlations to produce final expected answers.
- Experiments on four benchmarks show 5%–18% average improvements over state-of-the-art baselines, and the authors provide released code on GitHub.
Related Articles

Black Hat Asia
AI Business

I built the missing piece of the MCP ecosystem
Dev.to

When Agents Go Wrong: AI Accountability and the Payment Audit Trail
Dev.to

Google Gemma 4 Review 2026: The Open Model That Runs Locally and Beats Closed APIs
Dev.to

OpenClaw Deep Dive Guide: Self-Host Your Own AI Agent on Any VPS (2026)
Dev.to