GeoReFormer: Geometry-Aware Refinement for Lane Segment Detection and Topology Reasoning

arXiv cs.CV / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

GeoReFormer is a unified transformer-based approach for 3D lane segment detection and topology reasoning that adds geometry- and topology-aware inductive biases to the decoder.
Instead of using generic object-detection-style query initialization and unconstrained refinement, it uses structured query initialization with data-driven geometric priors and bounded coordinate-space refinement for stable polyline deformation.
The method incorporates per-query gated topology propagation to selectively integrate relational context needed for directed-graph lane topology consistency.
On the OpenLane-V2 benchmark, GeoReFormer reports state-of-the-art results with 34.5% mAP and improved topology consistency compared with strong transformer baselines, suggesting the benefits of explicitly encoding lane geometry/relations.

Abstract

Accurate 3D lane segment detection and topology reasoning are critical for structured online map construction in autonomous driving. Recent transformer-based approaches formulate this task as query-based set prediction, yet largely inherit decoder designs originally developed for compact object detection. However, lane segments are continuous polylines embedded in directed graphs, and generic query initialization and unconstrained refinement do not explicitly encode this geometric and relational structure. We propose GeoReFormer (Geometry-aware Refinement Transformer), a unified query-based architecture that embeds geometry- and topology-aware inductive biases directly within the transformer decoder. GeoReFormer introduces data-driven geometric priors for structured query initialization, bounded coordinate-space refinement for stable polyline deformation, and per-query gated topology propagation to selectively integrate relational context. On the OpenLane-V2 benchmark, GeoReFormer achieves state-of-the-art performance with 34.5% mAP while improving topology consistency over strong transformer baselines, demonstrating the utility of explicit geometric and relational structure encoding.

💡 Insights using this article

This article is featured in our daily AI news digest — key takeaways and action items at a glance.

📅 3/30DailyView insight →

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Simon Willison's Blog

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

Dev.to

I missed the "fun" part in software development

Dev.to

The Billion Dollar Tax on AI Agents

Dev.to

Hermes Agent: A Self-Improving AI Agent That Runs Anywhere

Dev.to

GeoReFormer: Geometry-Aware Refinement for Lane Segment Detection and Topology Reasoning

Key Points

Abstract

💡 Insights using this article

Related Articles

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

I missed the "fun" part in software development

The Billion Dollar Tax on AI Agents

Hermes Agent: A Self-Improving AI Agent That Runs Anywhere

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer