TAFG-MAN: Timestep-Adaptive Frequency-Gated Latent Diffusion for Efficient and High-Quality Low-Dose CT Image Denoising

arXiv cs.CV / 3/24/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces TAFG-MAN, a latent diffusion-based framework for low-dose CT (LDCT) denoising that targets both noise suppression and preservation of subtle anatomical structures.
  • It uses a perceptually optimized autoencoder and conditional latent diffusion in a compact latent space to improve efficiency while maintaining reconstruction quality.
  • The key innovation is Timestep-Adaptive Frequency-Gated (TAFG) conditioning, which decomposes guidance into low- and high-frequency components and progressively releases high-frequency detail guidance in later denoising steps.
  • Experiments report a strong quality–efficiency trade-off versus baseline models, with TAFG-MAN improving detail/perceptual quality over a variant without TAFG at roughly the same inference cost.
  • Ablation studies support that the timestep-adaptive, frequency-gated conditioning mechanism is responsible for the observed gains in balancing denoising strength and detail retention.

Abstract

Low-dose computed tomography (LDCT) reduces radiation exposure but also introduces substantial noise and structural degradation, making it difficult to suppress noise without erasing subtle anatomical details. In this paper, we present TAFG-MAN, a latent diffusion framework for efficient and high-quality LDCT image denoising. The framework combines a perceptually optimized autoencoder, conditional latent diffusion restoration in a compact latent space, and a lightweight Timestep-Adaptive Frequency-Gated (TAFG) conditioning design. TAFG decomposes condition features into low- and high-frequency components, predicts timestep-adaptive gates from the current denoising feature and timestep embedding, and progressively releases high-frequency guidance in later denoising stages before cross-attention. In this way, the model relies more on stable structural guidance at early reverse steps and introduces fine details more cautiously as denoising proceeds, improving the balance between noise suppression and detail preservation. Experiments show that TAFG-MAN achieves a favorable quality-efficiency trade-off against representative baselines. Compared with its base variant without TAFG, it further improves detail preservation and perceptual quality while maintaining essentially the same inference cost, and ablation results confirm the effectiveness of the proposed conditioning mechanism.