R&D: Balancing Reliability and Diversity in Synthetic Data Augmentation for Semantic Segmentation

arXiv cs.CV / 3/20/2026

Key Points

  • The paper proposes a synthetic data augmentation pipeline that uses controllable diffusion models to balance diversity and reliability for pixel-level semantic segmentation.
  • It leverages class-aware prompting and visual prior blending to improve image quality and ensure precise alignment with segmentation labels.
  • Experiments on PASCAL VOC and BDD100K show substantial improvements in data-scarce settings and improved model robustness in real-world scenarios.
  • The authors release the code on GitHub to enable reproduction and adoption.
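
To make "class-aware prompting" concrete, here is a minimal sketch of one way a text prompt could be derived from a segmentation mask: list the classes present in the mask, ordered by pixel count. This is an illustration only, not the authors' implementation; the function name `class_aware_prompt`, the prompt template, and the ordering rule are all assumptions. The label IDs follow the standard PASCAL VOC convention (0 = background, 255 = void).

```python
from collections import Counter

# Subset of PASCAL VOC label IDs; 0 is background and 255 marks "void" pixels.
VOC_NAMES = {7: "car", 12: "dog", 15: "person"}
IGNORED = {0, 255}


def class_aware_prompt(mask, names=VOC_NAMES, template="a photo of {}"):
    """Build a text prompt naming the classes present in a label mask.

    `mask` is a 2D list of integer label IDs. Classes are ordered by pixel
    count (largest first), with ties broken by label ID. The template and
    ordering are hypothetical choices made for this sketch.
    """
    counts = Counter(
        label
        for row in mask
        for label in row
        if label not in IGNORED and label in names
    )
    if not counts:
        # No known foreground class: fall back to a generic prompt.
        return template.format("a scene")
    ordered = sorted(counts, key=lambda k: (-counts[k], k))
    return template.format(", ".join("a " + names[k] for k in ordered))


mask = [
    [0, 0, 15, 15],
    [0, 12, 15, 15],
    [255, 12, 12, 0],
]
prompt = class_aware_prompt(mask)
print(prompt)  # a photo of a person, a dog
```

A prompt built this way names only classes that actually appear in the label map, which is one plausible route to keeping generated images consistent with their segmentation labels.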

Abstract

Collecting and annotating datasets for pixel-level semantic segmentation is highly labor-intensive. Data augmentation provides a viable solution by enhancing model generalization without additional real-world data collection. Traditional augmentation techniques, such as translation, scaling, and color transformations, create geometric variations but fail to generate new structures. While generative models have been employed to extend the semantic information of datasets, they often struggle to maintain consistency between the original and generated images, particularly for pixel-level tasks. In this work, we propose a novel synthetic data augmentation pipeline that integrates controllable diffusion models. Our approach balances the diversity and reliability of the generated data, effectively bridging the gap between synthetic and real data. We utilize class-aware prompting and visual prior blending to further improve image quality, ensuring precise alignment with segmentation labels. By evaluating on benchmark datasets such as PASCAL VOC and BDD100K, we demonstrate that our method significantly enhances semantic segmentation performance, especially in data-scarce scenarios, while improving model robustness in real-world applications. Our code is available at https://github.com/chequanghuy/Enhanced-Generative-Data-Augmentation-for-Semantic-Segmentation-via-Stronger-Guidance.
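
For context on the traditional baseline the abstract mentions, the sketch below applies a horizontal flip and a translation to an image and its label mask together, and a brightness shift to the image only. It is a toy illustration on nested lists, not code from the paper; the helper names, the grayscale image, and the choice to pad the mask with the void label 255 are assumptions. The labeled pixels are only rearranged and no new class appears, which is the limitation the abstract describes as failing "to generate new structures".

```python
def hflip(grid):
    """Mirror a 2D grid (list of rows) left to right."""
    return [row[::-1] for row in grid]


def translate_right(grid, shift, fill):
    """Shift each row right by `shift` cells, padding the left with `fill`."""
    width = len(grid[0])
    return [([fill] * shift + row)[:width] for row in grid]


def brighten(image, delta):
    """Add `delta` to each grayscale pixel, clamped to [0, 255]."""
    return [[max(0, min(255, p + delta)) for p in row] for row in image]


def augment_pair(image, mask, shift=1, delta=20, void_label=255):
    """Apply the same geometric ops to image and mask; color ops to image only."""
    image = translate_right(hflip(image), shift, fill=0)
    # Pixels exposed by the shift have no ground truth, so mark them as void.
    mask = translate_right(hflip(mask), shift, fill=void_label)
    return brighten(image, delta), mask


image = [[10, 20, 30], [40, 50, 60]]
mask = [[0, 1, 1], [0, 0, 2]]
aug_image, aug_mask = augment_pair(image, mask)

labels_before = {v for row in mask for v in row}
labels_after = {v for row in aug_mask for v in row} - {255}
print(aug_mask)                       # [[255, 1, 1], [255, 2, 0]]
print(labels_after <= labels_before)  # True: no new classes are introduced
```

The key design point is that geometric transforms must be applied identically to the image and its mask, while photometric transforms leave the mask untouched.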