AIDOVECL: AI-generated Dataset of Outpainted Vehicles for Eye-level Classification and Localization

arXiv cs.CV / 4/29/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes AIDOVECL, an AI-generated dataset creation method that uses image outpainting to reduce the labeling bottleneck in computer vision.
  • It generates new, eye-level vehicle images by detecting and cropping vehicles from selected seed images and then outpainting them onto larger canvases to simulate varied real-world contexts.
  • The outpainted images come with detailed, high-quality annotations to provide ground truth without requiring proportional manual labeling.
  • Experiments and ablation studies show detection performance improvements of up to ~10% overall, with larger gains (up to ~40%) when diversity in context, scale, and placement increases, and underrepresented classes seeing up to ~50% higher true positives.
  • The authors release code and dataset links to support replication and further research into automatic annotation via outpainting for fine-grained vision tasks.

Abstract

Image labeling is a critical bottleneck in the development of computer vision technologies, often constraining machine learning performance due to the time-intensive nature of manual annotations. This work introduces a novel approach that leverages outpainting to mitigate annotated data scarcity by generating artificial contexts and annotations, significantly reducing labeling efforts. We apply this technique to a particularly acute challenge in autonomous driving, urban planning, and environmental monitoring: the lack of diverse, eye-level vehicle images from desired classes. Our dataset comprises AI-generated vehicle images obtained by detecting and cropping vehicles from manually selected seed images, which are then outpainted onto larger canvases to simulate varied real-world conditions. The outpainted images include detailed annotations, providing high-quality ground truth data. Advanced outpainting techniques and image quality assessments ensure visual fidelity and contextual relevance. Ablation results show that incorporating AIDOVECL improves overall detection performance by up to about 10%, and delivers gains of up to about 40% in settings with greater diversity of context, object scale, and placement, with underrepresented classes achieving up to about 50% higher true positives. AIDOVECL enhances vehicle detection by augmenting real training data and supporting evaluation across diverse scenarios. By demonstrating outpainting as an automatic annotation paradigm, it offers a practical and versatile solution for building fine-grained datasets with reduced labeling effort across multiple machine learning domains. The code and links to datasets are available for further research and replication at https://github.com/amir-kazemi/aidovecl.