VS-DDPM: Efficient Low-Cost Diffusion Model for Medical Modality Translation

arXiv cs.CV / 4/28/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces VS-DDPM, a 3D variable-step diffusion model framework designed to accelerate slow diffusion-model inference while preserving generative quality.
  • Experiments on BraTS2025 and SynthRAD2025 challenge tasks—including missing MRI synthesis, tumor removal, MRI-to-sCT, and CBCT-to-sCT—show that the method targets efficiency under strict hardware and time constraints.
  • VS-DDPM achieves state-of-the-art performance for missing MRI synthesis, reporting Dice scores up to 0.88 (whole tumor) and SSIM of 0.95.
  • For MRI tumor removal, it reports RMSE of 0.053, PSNR of 26.77, and SSIM of 0.918, while MRI-to-sCT and CBCT-to-sCT were competitive but not SOTA, potentially due to data preprocessing/postprocessing or loss-function sensitivities.
  • The authors provide an implementation repository via GitHub for reproducibility and further tuning.

Abstract

Diffusion models produce high-quality synthetic data but suffer from slow inference. We propose 3D Variable-Step Denoising Diffusion Probabilistic Model (VS-DDPM) a framework engineered to maintain generative quality while accelerating inference by several factors. We tested our approach on four tasks (missing MRI, tumor removal, MRI-to-sCT, and CBCT-to-sCT) within the BraTS2025 and SynthRAD2025 challenges. Designed for high efficiency under hardware and time constrains imposed by both challenges. VS-DDPM achieved state-of-the-art (SOTA) performance in missing MRI synthesis, yielding Dice scores of 0.80, 0.83, and 0.88 for the enhancing tumor, tumor core, and whole tumor regions, respectively, alongside a structural similarity index (SSIM) of 0.95. For MRI tumor removal, the model attained a root mean squared error (RMSE) of 0.053, a peak signal-to-noise ratio (PSNR) of 26.77, and an SSIM of 0.918. While the framework demonstrated competitive performance in MRI-to-sCT and CBCT-to-sCT tasks, it did not reach SOTA benchmarks, potentially due to sensitivities in data pre and post-processing pipelines or specific loss function configurations. These results demonstrate that VS-DDPM provides a robust and tunable solution for high-fidelity 3D medical image synthesis. The code is available in https://github.com/andre-fs-ferreira/SynthRAD_by_Faking_it.