AI Navigate

Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems

arXiv cs.LG / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper reframes the DDIM reverse diffusion process as a Partitioned Iterated Function System (PIFS), offering a unified design language for diffusion model schedules, architectures, and training objectives.
  • It derives three computable geometric quantities—per-step contraction threshold L*_t, diagonal expansion function f_t(lambda), and global expansion threshold lambda**—that fully characterize denoising dynamics without requiring model evaluations.
  • The PIFS view explains the two-regime denoising behavior: global context assembly at high noise via diffuse cross-patch attention and fine-detail synthesis at low noise via patch-by-patch suppression in strict variance order, with self-attention identified as the natural contraction primitive.
  • Analytically, the Kaplan-Yorke dimension of the PIFS attractor is determined through a discrete Moran equation on the Lyapunov spectrum, and the authors show four practical design choices (cosine schedule offset, logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) emerge as approximate solutions to these geometric optimization problems.

Abstract

What is a diffusion model actually doing when it turns noise into a photograph? We show that the deterministic DDIM reverse chain operates as a Partitioned Iterated Function System (PIFS) and that this framework serves as a unified design language for denoising diffusion model schedules, architectures, and training objectives. From the PIFS structure we derive three computable geometric quantities: a per-step contraction threshold L^*_t, a diagonal expansion function f_t(\lambda) and a global expansion threshold \lambda^{**}. These quantities require no model evaluation and fully characterize the denoising dynamics. They structurally explain the two-regime behavior of diffusion models: global context assembly at high noise via diffuse cross-patch attention and fine-detail synthesis at low noise via patch-by-patch suppression release in strict variance order. Self-attention emerges as the natural primitive for PIFS contraction. The Kaplan-Yorke dimension of the PIFS attractor is determined analytically through a discrete Moran equation on the Lyapunov spectrum. Through the study of the fractal geometry of the PIFS, we derive three optimal design criteria and show that four prominent empirical design choices (the cosine schedule offset, resolution-dependent logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) each arise as approximate solutions to our explicit geometric optimization problems tuning theory into practice.