A3R: Agentic Affordance Reasoning via Cross-Dimensional Evidence in 3D Gaussian Scenes
arXiv cs.CV / 4/3/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that many failures in 3D Gaussian scene affordance reasoning come from missing task-relevant evidence under fixed observations rather than from weak prediction capability alone.
- It reformulates affordance reasoning as a sequential evidence-acquisition process that iteratively reduces ambiguity using complementary 3D geometric evidence and 2D semantic evidence.
- The proposed A3R framework uses an MLLM-based policy to choose evidence acquisition actions and update its affordance belief via cross-dimensional (2D+3D) evidence.
- To train the sequential policy effectively, the authors introduce a GRPO-based learning strategy aimed at improving evidence acquisition efficiency and reasoning accuracy.
- Experiments on scene-level benchmarks show that A3R outperforms static one-shot baselines, highlighting the benefits of agentic evidence gathering for fine-grained affordance reasoning in complex 3D environments.
Related Articles

Black Hat Asia
AI Business

Mistral raises $830M, 9fin hits unicorn status, and new Tech.eu Summit speakers unveiled
Tech.eu

ChatGPT costs $20/month. I built an alternative for $2.99.
Dev.to

OpenAI shifts to usage-based pricing for Codex in ChatGPT business plans
THE DECODER

Why I built an AI assistant that doesn't know who you are
Dev.to