Point2Pose: Occlusion-Recovering 6D Pose Tracking and 3D Reconstruction for Multiple Unknown Objects Via 2D Point Trackers
arXiv cs.RO / 4/14/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- Point2Pose is a model-free approach for causal 6D pose tracking and 3D reconstruction of multiple rigid objects from monocular RGB-D video, initialized using only sparse points on the objects.
- It tracks unseen objects without CAD models or category priors by using a 2D point tracker to generate long-range correspondences and recover instantly after full occlusion.
- The method incrementally builds an online TSDF representation for the tracked targets, enabling simultaneous pose estimation and surface/geometry reconstruction.
- The authors also introduce a new multi-object tracking dataset (simulation + real-world) with motion-capture ground truth for evaluation.
- Experiments indicate performance comparable to state-of-the-art methods on a severe-occlusion benchmark, with added multi-object handling and complete-occlusion recovery beyond earlier model-free tracking approaches.
Related Articles

Black Hat Asia
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Don't forget, there is more than forgetting: new metrics for Continual Learning
Dev.to

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale
Dev.to
Bit of a strange question?
Reddit r/artificial