Affordance Agent Harness: Verification-Gated Skill Orchestration
arXiv cs.CV / 5/4/2026
💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses affordance grounding in open-world scenes where actionable regions are often small, occluded, reflective, or visually ambiguous.
- It argues that existing multi-skill agents rely on fixed pipelines that don’t adapt to per-instance difficulty, recover well from intermediate errors, or reuse experience for recurring objects.
- The authors propose “Affordance Agent Harness,” a closed-loop runtime that combines heterogeneous skills via an evidence store with inference-cost control and a memory-augmented router for recurring categories.
- A verifier gates when the agent should commit to an affordance prediction using self-consistency, cross-scale stability, and evidence sufficiency, enabling targeted retries before a final judge fuses evidence and trajectories.
- Experiments on multiple affordance benchmarks show a better accuracy–cost tradeoff than fixed pipelines, reducing average skill calls and latency while improving grounding quality.
Related Articles
AnnouncementsBuilding a new enterprise AI services company with Blackstone, Hellman & Friedman, and Goldman Sachs
Anthropic News

Dara Khosrowshahi on replacing Uber drivers — and himself — with AI
The Verge
CLMA Frame Test
Dev.to
You Are Right — You Don't Need CLAUDE.md
Dev.to
Governance and Liability in AI Agents: What I Built Trying to Answer Those Questions
Dev.to