Part-Level 3D Gaussian Vehicle Generation with Joint and Hinge Axis Estimation

arXiv cs.AI / 4/8/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that autonomous-driving simulation needs part-level articulation rather than treating vehicles as rigid assets, because real-world perception increasingly relies on dynamics like wheel steering and door opening.
  • It proposes a generative framework that creates an animatable 3D Gaussian vehicle from a single image or sparse multi-view inputs, explicitly targeting faithful part behavior during animation.
  • To address distortions at part boundaries, the method adds a part-edge refinement module that enforces exclusive Gaussian ownership among parts.
  • Because segmentation outputs don’t provide motion parameters, the framework includes a kinematic reasoning head that predicts joint positions and hinge axes for movable components.
  • Overall, the approach bridges static 3D asset generation and kinematics-aware simulation by producing models that are both part-aware and motion-parameterizable.

Abstract

Simulation is essential for autonomous driving, yet current frameworks often model vehicles as rigid assets and fail to capture part-level articulation. With perception algorithms increasingly leveraging dynamics such as wheel steering or door opening, realistic simulation requires animatable vehicle representations. Existing CAD-based pipelines are limited by library coverage and fixed templates, preventing faithful reconstruction of in-the-wild instances. We propose a generative framework that, from a single image or sparse multi-view input, synthesizes an animatable 3D Gaussian vehicle. Our method addresses two challenges: (i) large 3D asset generators are optimized for static quality but not articulation, leading to distortions at part boundaries when animated; and (ii) segmentation alone cannot provide the kinematic parameters required for motion. To overcome this, we introduce a part-edge refinement module that enforces exclusive Gaussian ownership and a kinematic reasoning head that predicts joint positions and hinge axes of movable parts. Together, these components enable faithful part-aware simulation, bridging the gap between static generation and animatable vehicle models.