Synthetic Melanoma Image Generation and Evaluation Using Generative Adversarial Networks
arXiv cs.CV / 3/17/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper systematically benchmarks four GAN architectures (DCGAN, StyleGAN2, and two StyleGAN3 variants) for high-resolution melanoma image synthesis on ISIC 2018 and ISIC 2020, with unified preprocessing and hyperparameter exploration emphasizing R1 regularization.
- The evaluation uses a multi-faceted protocol including FID, FMD, qualitative dermoscopic inspection, a frozen EfficientNet melanoma detector, and independent dermatologists, to assess both statistical quality and diagnostic relevance.
- StyleGAN2 achieves the best balance of quantitative performance and perceptual quality, with FID scores of 24.8 (ISIC 2018) and 7.96 (ISIC 2020) at gamma=0.8, and the frozen classifier recognizing 83% of its synthetic images as melanoma.
- Dermatologists distinguish synthetic from real images with 66.5% accuracy (chance 50%), and exhibit low inter-rater agreement (kappa = 0.17), indicating substantial realism of the generated images.
- Augmenting real datasets with StyleGAN2-generated melanoma images improves melanoma detection AUC from 0.925 to 0.945 on a held-out real-test set, demonstrating practical benefit for addressing class imbalance in melanoma ML pipelines.




