Part-Aware Open-Vocabulary 3D Affordance Grounding via Prototypical Semantic and Geometric Alignment
arXiv cs.CV / 3/19/2026
Key Points
- The paper presents a two-stage cross-modal framework for open-vocabulary 3D affordance grounding to improve semantic and geometric alignment.
- Stage 1 uses large language models to generate part-aware instructions that recover missing semantics and link semantically similar affordances.
- Stage 2 introduces Affordance Prototype Aggregation (APA) for cross-object geometric consistency and Intra-Object Relational Modeling (IORM) for refining within-object geometry to support precise semantic alignment.
- Experiments on one newly introduced benchmark and two existing benchmarks show that the framework outperforms existing methods.
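
To make the idea of cross-object prototypes more concrete, the sketch below shows one generic way to pool point features from several objects into a single affordance prototype and then score new points against it. This is only an illustration under stated assumptions: the function names (`aggregate_prototype`, `score_points`), the masked-average pooling, and the cosine scoring are hypothetical choices and are not taken from the paper, whose actual APA module may work differently.

```python
import math

def aggregate_prototype(features, masks):
    """Masked average of per-point features pooled over several objects.

    features: list of objects, each a list of per-point feature vectors.
    masks: matching list of per-point weights in [0, 1] (1 = affordance region).
    Assumption: a plain weighted mean stands in for the paper's aggregation.
    """
    dim = len(features[0][0])
    total = [0.0] * dim
    weight = 0.0
    for obj_feats, obj_mask in zip(features, masks):
        for vec, w in zip(obj_feats, obj_mask):
            for i in range(dim):
                total[i] += w * vec[i]
            weight += w
    if weight == 0.0:
        raise ValueError("no points are marked as belonging to the affordance")
    return [t / weight for t in total]

def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0

def score_points(point_feats, prototype):
    """Score each point of a new object by similarity to the prototype."""
    return [cosine(v, prototype) for v in point_feats]

# Toy data (hypothetical): two objects with 2-D point features, where only
# the first point of each object belongs to the affordance region.
mug = [[1.0, 0.0], [0.0, 1.0]]
bag = [[0.8, 0.2], [0.1, 0.9]]
masks = [[1.0, 0.0], [1.0, 0.0]]

prototype = aggregate_prototype([mug, bag], masks)
scores = score_points([[1.0, 0.0], [0.0, 1.0]], prototype)
```

In this toy setup the first query point scores higher than the second because it lies closer to the features of the masked regions; a real system would use learned, high-dimensional point features rather than hand-written vectors.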