BEVal: A Cross-dataset Evaluation Study of BEV Segmentation Models for Autonomous Driving

arXiv cs.CV / 4/29/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The study evaluates state-of-the-art BEV (bird’s-eye view) segmentation models across multiple training and testing datasets to measure robustness against domain shift beyond single-dataset (often nuScenes) assumptions.
  • It analyzes how different sensor modalities (e.g., cameras vs. LiDAR) affect a model’s ability to generalize to new environments and scenarios.
  • The paper includes experiments on multi-dataset training, showing that training on multiple datasets can improve BEV segmentation performance compared with training on a single dataset.
  • It addresses a key evaluation gap by providing cross-dataset validation insights, emphasizing the need for better generalizability and adaptability for reliable autonomous-driving BEV segmentation.
  • The authors release the associated code to support reproducibility and further research (GitHub repository linked in the paper).

Abstract

Current research in semantic bird's-eye view segmentation for autonomous driving focuses solely on optimizing neural network models using a single dataset, typically nuScenes. This practice leads to the development of highly specialized models that may fail when faced with different environments or sensor setups, a problem known as domain shift. In this paper, we conduct a comprehensive cross-dataset evaluation of state-of-the-art BEV segmentation models to assess their performance across different training and testing datasets and setups, as well as different semantic categories. We investigate the influence of different sensors, such as cameras and LiDAR, on the models' ability to generalize to diverse conditions and scenarios. Additionally, we conduct multi-dataset training experiments that improve models' BEV segmentation performance compared to single-dataset training. Our work addresses the gap in evaluating BEV segmentation models under cross-dataset validation. And our findings underscore the importance of enhancing model generalizability and adaptability to ensure more robust and reliable BEV segmentation approaches for autonomous driving applications. The code for this paper available at https://github.com/manueldiaz96/beval .