GeoReFormer: Geometry-Aware Refinement for Lane Segment Detection and Topology Reasoning

arXiv cs.CV / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • GeoReFormer is a unified transformer-based approach for 3D lane segment detection and topology reasoning that adds geometry- and topology-aware inductive biases to the decoder.
  • Instead of using generic object-detection-style query initialization and unconstrained refinement, it uses structured query initialization with data-driven geometric priors and bounded coordinate-space refinement for stable polyline deformation.
  • The method incorporates per-query gated topology propagation to selectively integrate relational context needed for directed-graph lane topology consistency.
  • On the OpenLane-V2 benchmark, GeoReFormer reports state-of-the-art results with 34.5% mAP and improved topology consistency compared with strong transformer baselines, suggesting the benefits of explicitly encoding lane geometry/relations.

Abstract

Accurate 3D lane segment detection and topology reasoning are critical for structured online map construction in autonomous driving. Recent transformer-based approaches formulate this task as query-based set prediction, yet largely inherit decoder designs originally developed for compact object detection. However, lane segments are continuous polylines embedded in directed graphs, and generic query initialization and unconstrained refinement do not explicitly encode this geometric and relational structure. We propose GeoReFormer (Geometry-aware Refinement Transformer), a unified query-based architecture that embeds geometry- and topology-aware inductive biases directly within the transformer decoder. GeoReFormer introduces data-driven geometric priors for structured query initialization, bounded coordinate-space refinement for stable polyline deformation, and per-query gated topology propagation to selectively integrate relational context. On the OpenLane-V2 benchmark, GeoReFormer achieves state-of-the-art performance with 34.5% mAP while improving topology consistency over strong transformer baselines, demonstrating the utility of explicit geometric and relational structure encoding.