GEAR: GEometry-motion Alternating Refinement for Articulated Object Modeling with Gaussian Splatting

arXiv cs.CV / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

研究論文は、関節構造を持つ物体の3Dモデリングにおいて、幾何（形状）とモーションが相互に結びつくため最適化が不安定になりやすい点と、汎化性能の限界を課題として提示している。
提案手法GEARはGaussian Splattingの表現のもとで、幾何とモーションを相互依存の要素としてEMスタイルで交互最適化し、部分分割を潜在変数、関節運動パラメータを明示変数として段階的に更新する。
部分分割の品質を保ちつつ汎化を損なわないために、ベースの2Dセグメンテーションモデルからマルチビューの部分事前情報を得て、潜在変数には弱教師ありの制約で正則化を加える。
複数ベンチマークに加え、新たに構築したGEAR-Multiデータセットでも評価し、複数可動部を含む複雑な関節物体において、幾何再構成とモーション推定の両面で最先端（SOTA）性能を達成したと報告している。

Abstract

High-fidelity interactive digital assets are essential for embodied intelligence and robotic interaction, yet articulated objects remain challenging to reconstruct due to their complex structures and coupled geometry-motion relationships. Existing methods suffer from instability in geometry-motion joint optimization, while their generalization remains limited on complex multi-joint or out-of-distribution objects. To address these challenges, we propose GEAR, an EM-style alternating optimization framework that jointly models geometry and motion as interdependent components within a Gaussian Splatting representation. GEAR treats part segmentation as a latent variable and joint motion parameters as explicit variables, alternately refining them for improved convergence and geometric-motion consistency. To enhance part segmentation quality without sacrificing generalization, we leverage a vanilla 2D segmentation model to provide multi-view part priors, and employ a weakly supervised constraint to regularize the latent variable. Experiments on multiple benchmarks and our newly constructed dataset GEAR-Multi demonstrate that GEAR achieves state-of-the-art results in geometric reconstruction and motion parameters estimation, particularly on complex articulated objects with multiple movable parts.

Black Hat Asia

AI Business

GLM 5.1 tops the code arena rankings for open models

Reddit r/LocalLLaMA

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

My Bestie Built a Free MCP Server for Job Search — Here's How It Works

Dev.to

can we talk about how AI has gotten really good at lying to you?

Reddit r/artificial

GEAR: GEometry-motion Alternating Refinement for Articulated Object Modeling with Gaussian Splatting

Key Points

Abstract

Related Articles

Black Hat Asia

GLM 5.1 tops the code arena rankings for open models

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

My Bestie Built a Free MCP Server for Job Search — Here's How It Works

can we talk about how AI has gotten really good at lying to you?

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer