From Images2Mesh: A 3D Surface Reconstruction Pipeline for Non-Cooperative Space Objects

arXiv cs.CV / 5/4/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper proposes a neural implicit 3D surface reconstruction pipeline to recover the geometry of non-cooperative space objects from monocular on-orbit inspection imagery.
  • It shows results using publicly released ISS footage (STS-119) and publicly released on-orbit imagery of an H-IIA rocket upper stage.
  • The authors find segmentation-based background removal is critical because frame-to-frame background variation in real footage can cause pose estimation and reconstruction to fail when processing is applied directly.
  • They add photometric correction to handle per-frame exposure differences and analyze how shadow-region performance depends on the illumination characteristics of the input datasets.

Abstract

On-orbit inspection imagery is crucial as it enables characterization of non-cooperative resident space objects, providing the geometry and structural condition essential for active debris removal and on-orbit servicing mission planning. However, most existing neural implicit surface reconstruction methods have been confined to synthetic or hardware-in-the-loop data with known camera poses and controlled illumination. In this work, we present a pipeline for neural implicit surface reconstruction of non-cooperative space objects from monocular inspection imagery. We demonstrate it on publicly released ISS inspection footage from the STS-119 mission and publicly released on-orbit inspection footage of an H-IIA rocket upper stage. We find that segmentation-based background removal is essential for successful camera pose estimation from real on-orbit footage, where background variation between frames caused direct processing to fail entirely. We further incorporate photometric correction of per-frame exposure variations and analyze its behavior across datasets, finding that performance in shadowed regions varies with the illumination characteristics of the input footage.