GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence

arXiv cs.CV / 3/23/2026

📰 NewsModels & Research

Key Points

  • GravCal is a feedforward model that recalibrates a noisy gravity prior from an IMU using a single RGB image to produce a corrected gravity direction and a per-sample confidence score.
  • It fuses a residual correction of the input prior with a prior-independent image estimate through a learned gate that adaptively weighs the two sources.
  • In experiments, GravCal reduces mean angular error from 22.02 degrees (unnormalized IMU prior) to 14.24 degrees, with larger gains when the prior is severely corrupted.
  • The work introduces a dataset with over 148,000 frames containing VIO-derived gravity ground truth and Mahony priors across diverse scenes and orientations, and shows the learned gate correlates with prior quality as a useful confidence signal for downstream systems.

Abstract

Gravity estimation is fundamental to visual-inertial perception, augmented reality, and robotics, yet gravity priors from IMUs are often unreliable under linear acceleration, vibration, and transient motion. Existing methods often estimate gravity directly from images or assume reasonably accurate inertial input, leaving the practical problem of correcting a noisy gravity prior from a single image largely unaddressed. We present GravCal, a feedforward model for single-image gravity prior calibration. Given one RGB image and a noisy gravity prior, GravCal predicts a corrected gravity direction and a per-sample confidence score. The model combines two complementary predictions, including a residual correction of the input prior and a prior-independent image estimate, and uses a learned gate to fuse them adaptively. Extensive experiments show strong gains over raw inertial priors: GravCal reduces mean angular error from 22.02{\deg} (IMU prior) to 14.24{\deg}, with larger improvements when the prior is severely corrupted. We also introduce a novel dataset of over 148K frames with paired VIO-derived ground-truth gravity and Mahony-filter IMU priors across diverse scenes and arbitrary camera orientations. The learned gate also correlates with prior quality, making it a useful confidence signal for downstream systems.