ReManNet: A Riemannian Manifold Network for Monocular 3D Lane Detection

arXiv cs.CV / 3/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The Road-Manifold Assumption models the road as a smooth 2D manifold in R^3 with lanes as embedded 1D submanifolds, coupling geometry across surfaces, curves, and point sets to address depth ambiguity in monocular 3D lane detection.
  • ReManNet first predicts lanes with a visual backbone and heads, then encodes geometry as Riemannian Gaussian descriptors on the SPD manifold, fusing these with image features through a lightweight gate for coherent 3D reasoning.
  • The paper introduces the 3D Tunnel Lane IoU (3D-TLIoU) loss, a joint point-curve objective that computes slice-wise overlap of tubular neighborhoods along each lane to improve shape-level alignment.
  • Empirically, ReManNet achieves state-of-the-art or competitive results on standard benchmarks, notably boosting OpenLane F1 by 8.2 points over the baseline and 1.8 points over the previous best, with scenario-level gains up to 6.6%, and code will be released publicly.

Abstract

Monocular 3D lane detection remains challenging due to depth ambiguity and weak geometric constraints. Mainstream methods rely on depth guidance, BEV projection, and anchor- or curve-based heads with simplified physical assumptions, remapping high-dimensional image features while only weakly encoding road geometry. Lacking an invariant geometric-topological coupling between lanes and the underlying road surface, 2D-to-3D lifting is ill-posed and brittle, often degenerating into concavities, bulges, and twists. To address this, we propose the Road-Manifold Assumption: the road is a smooth 2D manifold in \mathbb{R}^3, lanes are embedded 1D submanifolds, and sampled lane points are dense observations, thereby coupling metric and topology across surfaces, curves, and point sets. Building on this, we propose ReManNet, which first produces initial lane predictions with an image backbone and detection heads, then encodes geometry as Riemannian Gaussian descriptors on the symmetric positive-definite (SPD) manifold, and fuses these descriptors with visual features through a lightweight gate to maintain coherent 3D reasoning. We also propose the 3D Tunnel Lane IoU (3D-TLIoU) loss, a joint point-curve objective that computes slice-wise overlap of tubular neighborhoods along each lane to improve shape-level alignment. Extensive experiments on standard benchmarks demonstrate that ReManNet achieves state-of-the-art (SOTA) or competitive results. On OpenLane, it improves F1 by +8.2% over the baseline and by +1.8% over the previous best, with scenario-level gains of up to +6.6%. The code will be publicly available at https://github.com/changehome717/ReManNet.