AI Navigate

StructDamage:A Large Scale Unified Crack and Surface Defect Dataset for Robust Structural Damage Detection

arXiv cs.CV / 3/12/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • Introduces StructDamage, a large-scale crack and surface defect dataset with approximately 78,093 images spanning nine surface types to support robust structural damage detection.
  • The dataset is assembled by harmonizing and reannotating 32 public datasets, covering concrete structures, pavements, masonry, bridges, and historic buildings, with a folder-level hierarchy for training CNNs and Vision Transformers.
  • Baseline evaluations across fifteen DL architectures show macro F1-scores above 0.96 on twelve models, with DenseNet201 achieving 98.62% accuracy, indicating strong performance potential and generalization.
  • The work emphasizes reproducible research and fair evaluation by providing thorough documentation and a standardized structure to enable researchers and practitioners to compare methods consistently.

Abstract

Automated detection and classification of structural cracks and surface defects is a critical challenge in civil engineering, infrastructure maintenance, and heritage preservation. Recent advances in Computer Vision (CV) and Deep Learning (DL) have significantly improved automatic crack detection. However, these methods rely heavily on large, diverse, and carefully curated datasets that include various crack types across different surface materials. Many existing public crack datasets lack geographic diversity, surface types, scale, and labeling consistency, making it challenging for trained algorithms to generalize effectively in real world conditions. We provide a novel dataset, StructDamage, a curated collection of approximately 78,093 images spanning nine surface types: walls, tile, stone, road, pavement, deck, concrete, and brick. The dataset was constructed by systematically aggregating, harmonizing, and reannotating images from 32 publicly available datasets covering concrete structures, asphalt pavements, masonry walls, bridges, and historic buildings. All images are organized in a folder level classification hierarchy suitable for training Convolutional Neural Networks (CNNs) and Vision Transformers. To highlight the practical value of the dataset, we present baseline classification results using fifteen DL architectures from six model families, with twelve achieving macro F1-scores over 0.96. The best performing model DenseNet201 achieves 98.62% accuracy. The proposed dataset provides a comprehensive and versatile resource suitable for classification tasks. With thorough documentation and a standard structure, it is designed to promote reproducible research and support the development and fair evaluation of robust crack damage detection approaches.