Interact3D: Compositional 3D Generation of Interactive Objects
arXiv cs.CV / 3/18/2026
📰 NewsModels & Research
Key Points
- Interact3D introduces a framework for generating physically plausible interacting 3D compositional objects from a single image, addressing occlusions and maintaining object-object spatial relationships.
- The approach uses a two-stage composition pipeline: global-to-local registration to anchor the primary object and differentiable SDF-based optimization to integrate additional assets while penalizing intersections.
- A closed-loop refinement strategy leverages a Vision-Language Model to analyze multi-view renderings, generate corrective prompts, and guide an image editing module to self-correct.
- Experiments show enhanced geometric fidelity, reduced collisions, and consistent spatial relationships in collision-aware compositions compared with prior 3D compositional generation methods.
Related Articles

ラピダス、半導体設計AIエージェント「国内2社海外1社が使用中」
日経XTECH

Superposition and the Capsule: Quantum State Collapse Meets AI Identity
Dev.to

The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely
Dev.to

The Loop as Laboratory: What 3,190 Cycles of Autonomous AI Operation Reveal
Dev.to

MiMo-V2-Pro & Omni & TTS: "We will open-source — when the models are stable enough to deserve it."
Reddit r/LocalLLaMA