ARETE: Attention-based Rasterized Encoding for Topology Estimation using HSV-transformed Crowdsourced Vehicle Fleet Data

arXiv cs.RO / 4/28/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a method to estimate road topology—specifically centerlines and lane dividers—by learning from crowdsourced vehicle trajectory data to help keep HD maps accurate for autonomous driving.
  • It uses a DETR-based pipeline where rasterized trajectory tiles (encoding presence and direction) are transformed into predictions of vectorized lane representations.
  • Each predicted lane includes a centerline with an associated direction, and lane divider geometry is constrained to align with the centerline structure.
  • Experiments evaluate the approach on both an internal dataset and public benchmarks, including nuScenes and nuPlan, to validate performance and generalization.

Abstract

The continuous advancement of autonomous driving (AD) introduces challenges across multiple disciplines to ensure safe and efficient driving. One such challenge is the generation of High-Definition (HD) maps, which must remain up to date and highly accurate for downstream automotive tasks. One promising approach is the use of crowdsourced data from a vehicle fleet, representing road topology and lane-level features. This work focuses on the generation of centerlines and lane dividers from crowdsourced vehicle trajectories. We adopt a Detection Transformer (DETR)-based approach, where a rasterized representation of vehicle trajectories is used as input to predict vectorized lane representations. Each lane consists of a centerline with an associated direction and corresponding lane dividers that are geometrically constrained by the centerline. Our method includes the extraction of local tiles, from which crowdsourced vehicle trajectories are aggregated. Each tile undergoes a transformation into a rasterized representation encoding both the presence and direction of each trajectory, enabling the prediction of vectorized directed lanes. Experiments are conducted on an internal dataset as well as on the public datasets nuScenes and nuPlan.