LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction

arXiv cs.CV / 3/24/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces LPNSR, a diffusion-based image super-resolution framework designed to maintain high reconstruction quality while using an efficient compact 4-step inference trajectory.
  • It addresses performance drops in residual-shifting diffusion by deriving an analytical optimal intermediate noise solution and replacing unconstrained random Gaussian noise with an LR-guided, multi-input-aware noise predictor that injects low-resolution structural priors into the reverse process.
  • To fix initialization bias from naive bicubic upsampling, LPNSR uses a high-quality pre-upsampling network to produce a better diffusion starting point.
  • The method is trained end-to-end and reportedly achieves state-of-the-art perceptual quality on both synthetic and real-world datasets without using large-scale text-to-image priors, with code released on GitHub.

Abstract

Diffusion-based image super-resolution (SR), which aims to reconstruct high-resolution (HR) images from corresponding low-resolution (LR) observations, faces a fundamental trade-off between inference efficiency and reconstruction quality. The state-of-the-art residual-shifting diffusion framework achieves efficient 4-step inference, yet suffers from severe performance degradation in compact sampling trajectories. This is mainly attributed to two core limitations: the inherent suboptimality of unconstrained random Gaussian noise in intermediate steps, which leads to error accumulation and insufficient LR prior guidance, and the initialization bias caused by naive bicubic upsampling. In this paper, we propose LPNSR, a prior-enhanced efficient diffusion framework to address these issues. We first mathematically derive the closed-form analytical solution of the optimal intermediate noise for the residual-shifting diffusion paradigm, and accordingly design an LR-guided multi-input-aware noise predictor to replace random Gaussian noise, embedding LR structural priors into the reverse process while fully preserving the framework's core efficient residual-shifting mechanism. We further mitigate initial bias with a high-quality pre-upsampling network to optimize the diffusion starting point. With a compact 4-step trajectory, LPNSR can be optimized in an end-to-end manner. Extensive experiments demonstrate that LPNSR achieves state-of-the-art perceptual performance on both synthetic and real-world datasets, without relying on any large-scale text-to-image priors. The source code of our method can be found at https://github.com/Faze-Hsw/LPNSR.