CAD 100K: A Comprehensive Multi-Task Dataset for Car Related Visual Anomaly Detection

arXiv cs.CV / 4/13/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces CAD Dataset, a large-scale benchmark for car-related multi-task visual anomaly detection with over 100K images across 7 vehicle domains and 3 tasks.
  • It is positioned as the first car-related anomaly dataset specifically specialized for multi-task learning (MTL) evaluation, aiming to overcome the lack of unified benchmarks.
  • The dataset includes synthesis-based data augmentation to better support few-shot anomaly image scenarios.
  • The authors provide a multi-task baseline and extensive empirical studies showing MTL can improve knowledge transfer and task interaction, while also revealing potential task conflicts.
  • CAD is intended to serve as a standardized research platform to accelerate future advances in multi-task anomaly detection for automotive manufacturing quality assessment.

Abstract

Multi-task visual anomaly detection is critical for car-related manufacturing quality assessment. However, existing methods remain task-specific, hindered by the absence of a unified benchmark for multi-task evaluation. To fill in this gap, We present the CAD Dataset, a large-scale and comprehensive benchmark designed for car-related multi-task visual anomaly detection. The dataset contains over 100 images crossing 7 vehicle domains and 3 tasks, providing models a comprehensive view for car-related anomaly detection. It is the first car-related anomaly dataset specialized for multi-task learning(MTL), while combining synthesis data augmentation for few-shot anomaly images. We implement a multi-task baseline and conduct extensive empirical studies. Results show MTL promotes task interaction and knowledge transfer, while also exposing challenging conflicts between tasks. The CAD dataset serves as a standardized platform to drive future advances in car-related multi-task visual anomaly detection.