GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence

arXiv cs.CV / 3/23/2026

📰 NewsModels & Research

共有:

Key Points

GravCal is a feedforward model that recalibrates a noisy gravity prior from an IMU using a single RGB image to produce a corrected gravity direction and a per-sample confidence score.
It fuses a residual correction of the input prior with a prior-independent image estimate through a learned gate that adaptively weighs the two sources.
In experiments, GravCal reduces mean angular error from 22.02 degrees (unnormalized IMU prior) to 14.24 degrees, with larger gains when the prior is severely corrupted.
The work introduces a dataset with over 148,000 frames containing VIO-derived gravity ground truth and Mahony priors across diverse scenes and orientations, and shows the learned gate correlates with prior quality as a useful confidence signal for downstream systems.

Abstract

Gravity estimation is fundamental to visual-inertial perception, augmented reality, and robotics, yet gravity priors from IMUs are often unreliable under linear acceleration, vibration, and transient motion. Existing methods often estimate gravity directly from images or assume reasonably accurate inertial input, leaving the practical problem of correcting a noisy gravity prior from a single image largely unaddressed. We present GravCal, a feedforward model for single-image gravity prior calibration. Given one RGB image and a noisy gravity prior, GravCal predicts a corrected gravity direction and a per-sample confidence score. The model combines two complementary predictions, including a residual correction of the input prior and a prior-independent image estimate, and uses a learned gate to fuse them adaptively. Extensive experiments show strong gains over raw inertial priors: GravCal reduces mean angular error from 22.02{\deg} (IMU prior) to 14.24{\deg}, with larger improvements when the prior is severely corrupted. We also introduce a novel dataset of over 148K frames with paired VIO-derived ground-truth gravity and Mahony-filter IMU priors across diverse scenes and arbitrary camera orientations. The learned gate also correlates with prior quality, making it a useful confidence signal for downstream systems.

Interactive Web Visualization of GPT-2

Reddit r/artificial

[R] Causal self-attention as a probabilistic model over embeddings

Reddit r/MachineLearning

The 5 software development trends that actually matter in 2026 (and what they mean for your startup)

Dev.to

iPhone 17 Pro Running a 400B LLM: What It Really Means

Dev.to

[R] V-JEPA 2 has no pixel decoder, so how do you inspect what it learned? We attached a VQ probe to the frozen encoder and found statistically significant physical structure

Reddit r/artificial

GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence

Key Points

Abstract

Related Articles

Interactive Web Visualization of GPT-2

[R] Causal self-attention as a probabilistic model over embeddings

The 5 software development trends that actually matter in 2026 (and what they mean for your startup)

iPhone 17 Pro Running a 400B LLM: What It Really Means

[R] V-JEPA 2 has no pixel decoder, so how do you inspect what it learned? We attached a VQ probe to the frozen encoder and found statistically significant physical structure

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer