Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition

arXiv cs.CV / 4/8/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper proposes a more realistic, fine-grained vehicle classification framework that targets real surveillance conditions and complements Automatic License Plate Recognition (ALPR) rather than replacing it.
  • It introduces UFPR-VeSV, a publicly available dataset with 24,945 surveillance images of 16,297 vehicles, annotated for 13 colors, 26 makes, 136 models, and 14 vehicle types with challenging factors like occlusions and nighttime infrared imagery.
  • The authors validate FGVC labels using license plate information and provide additional text/corner annotations to support both classification and license plate-related tasks.
  • They report qualitative and quantitative comparisons against established datasets and run a benchmark across five deep learning models, identifying key difficulties such as multicolored vehicles, IR images, and model-level distinctions among vehicles sharing a platform.
  • Finally, the work combines two OCR approaches for ALPR and evaluates joint usage of FGVC with ALPR, showing promise for integrated real-world transportation and investigation applications.

Abstract

Extracting vehicle information from surveillance images is essential for intelligent transportation systems, enabling applications such as traffic monitoring and criminal investigations. While Automatic License Plate Recognition (ALPR) is widely used, Fine-Grained Vehicle Classification (FGVC) offers a complementary approach by identifying vehicles based on attributes such as color, make, model, and type. Although there have been advances in this field, existing studies often assume well-controlled conditions, explore limited attributes, and overlook FGVC integration with ALPR. To address these gaps, we introduce UFPR-VeSV, a dataset comprising 24,945 images of 16,297 unique vehicles with annotations for 13 colors, 26 makes, 136 models, and 14 types. Collected from the Military Police of Paran\'a (Brazil) surveillance system, the dataset captures diverse real-world conditions, including partial occlusions, nighttime infrared imaging, and varying lighting. All FGVC annotations were validated using license plate information, with text and corner annotations also being provided. A qualitative and quantitative comparison with established datasets confirmed the challenging nature of our dataset. A benchmark using five deep learning models further validated this, revealing specific challenges such as handling multicolored vehicles, infrared images, and distinguishing between vehicle models that share a common platform. Additionally, we apply two optical character recognition models to license plate recognition and explore the joint use of FGVC and ALPR. The results highlight the potential of integrating these complementary tasks for real-world applications. The UFPR-VeSV dataset is publicly available at: https://github.com/Lima001/UFPR-VeSV-Dataset.