TSM-Pose: Topology-Aware Learning with Semantic Mamba for Category-Level Object Pose Estimation

arXiv cs.CV / 4/21/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces TSM-Pose, a topology-aware framework for category-level object pose estimation aimed at improving generalization to unseen instances.
  • It proposes a Topology Extractor that captures global point-cloud topology and integrates it with local geometric features to represent category-level structure more robustly.
  • It also adds a Mamba-based Global Semantic Aggregator that injects semantic priors into keypoints and uses multiple TwinMamba blocks to capture long-range dependencies for better global feature aggregation.
  • Experiments on REAL275, CAMERA25, and HouseCat6D show that TSM-Pose achieves better performance than existing state-of-the-art approaches.

Abstract

Category-level object pose estimation is fundamental for embodied intelligence, yet achieving robust generalization to unseen instances remains challenging. However, existing methods mainly rely on simple feature extraction and aggregation, which struggle to capture category-shared topological structures and conduct semantic keypoint modeling, limiting their generalization. To address these, we propose a \textbf{T}opology-Aware Learning with \textbf{S}emantic \textbf{M}amba for Category-Level \textbf{P}ose Estimation framework (TSM-Pose). Specifically, we introduce a Topology Extractor to capture the global topological representation of the point cloud, which is integrated into local geometry features and enables robust category-level structural representation. Simultaneously, we propose a Mamba-based Global Semantic Aggregator that injects semantics priors into keypoints to enhance their expressiveness and leverages multiple TwinMamba blocks to model long-range dependencies for more effective global feature aggregation. Extensive experiments on three benchmark datasets (REAL275, CAMERA25, and HouseCat6D) demonstrate that TSM-Pose outperforms existing state-of-the-art methods.

TSM-Pose: Topology-Aware Learning with Semantic Mamba for Category-Level Object Pose Estimation | AI Navigate