PCSTracker: Long-Term Scene Flow Estimation for Point Cloud Sequences

arXiv cs.CV / 3/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • PCSTracker is presented as the first end-to-end framework designed for consistent long-term scene flow estimation in point cloud sequences.
  • It features an iterative geometry motion joint optimization (IGMO) module that explicitly models temporal evolution of point features to reduce correspondence errors from dynamic geometry.
  • It introduces a spatio-temporal point trajectory update (STTU) module that uses broad temporal context to infer plausible positions for occluded points, improving motion coherence.
  • An overlapping sliding-window inference strategy with cross-window propagation and in-window refinement helps suppress error accumulation across long sequences, delivering real-time performance (32.5 FPS) on synthetic PointOdyssey3D and real ADT3D datasets.

Abstract

Point cloud scene flow estimation is fundamental to long-term and fine-grained 3D motion analysis. However, existing methods are typically limited to pairwise settings and struggle to maintain temporal consistency over long sequences as geometry evolves, occlusions emerge, and errors accumulate. In this work, we propose PCSTracker, the first end-to-end framework specifically designed for consistent scene flow estimation in point cloud sequences. Specifically, we introduce an iterative geometry motion joint optimization module (IGMO) that explicitly models the temporal evolution of point features to alleviate correspondence inconsistencies caused by dynamic geometric changes. In addition, a spatio-temporal point trajectory update module (STTU) is proposed to leverage broad temporal context to infer plausible positions for occluded points, ensuring coherent motion estimation. To further handle long sequences, we employ an overlapping sliding-window inference strategy that alternates cross-window propagation and in-window refinement, effectively suppressing error accumulation and maintaining stable long-term motion consistency. Extensive experiments on the synthetic PointOdyssey3D and real-world ADT3D datasets show that PCSTracker achieves the best accuracy in long-term scene flow estimation and maintains real-time performance at 32.5 FPS, while demonstrating superior 3D motion understanding compared to RGB-D-based approaches.