Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems

arXiv cs.LG / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper reframes the DDIM reverse diffusion process as a Partitioned Iterated Function System (PIFS), offering a unified design language for diffusion model schedules, architectures, and training objectives.
It derives three computable geometric quantities—per-step contraction threshold L*_t, diagonal expansion function f_t(lambda), and global expansion threshold lambda**—that fully characterize denoising dynamics without requiring model evaluations.
The PIFS view explains the two-regime denoising behavior: global context assembly at high noise via diffuse cross-patch attention and fine-detail synthesis at low noise via patch-by-patch suppression in strict variance order, with self-attention identified as the natural contraction primitive.
Analytically, the Kaplan-Yorke dimension of the PIFS attractor is determined through a discrete Moran equation on the Lyapunov spectrum, and the authors show four practical design choices (cosine schedule offset, logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) emerge as approximate solutions to these geometric optimization problems.

Abstract

What is a diffusion model actually doing when it turns noise into a photograph? We show that the deterministic DDIM reverse chain operates as a Partitioned Iterated Function System (PIFS) and that this framework serves as a unified design language for denoising diffusion model schedules, architectures, and training objectives. From the PIFS structure we derive three computable geometric quantities: a per-step contraction threshold

L^*_t

, a diagonal expansion function

f_t(\lambda)

and a global expansion threshold

\lambda^{**}

. These quantities require no model evaluation and fully characterize the denoising dynamics. They structurally explain the two-regime behavior of diffusion models: global context assembly at high noise via diffuse cross-patch attention and fine-detail synthesis at low noise via patch-by-patch suppression release in strict variance order. Self-attention emerges as the natural primitive for PIFS contraction. The Kaplan-Yorke dimension of the PIFS attractor is determined analytically through a discrete Moran equation on the Lyapunov spectrum. Through the study of the fractal geometry of the PIFS, we derive three optimal design criteria and show that four prominent empirical design choices (the cosine schedule offset, resolution-dependent logSNR shift, Min-SNR loss weighting, and Align Your Steps sampling) each arise as approximate solutions to our explicit geometric optimization problems tuning theory into practice.

Astral to Join OpenAI

Dev.to

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

Reddit r/LocalLLaMA

Why Data is Important for LLM

Dev.to

The Inference Market Is Consolidating. Agent Payments Are Still Nobody's Problem.

Dev.to

YouTube's Deepfake Shield for Politicians Changes Evidence Forever

Dev.to

Fractals made Practical: Denoising Diffusion as Partitioned Iterated Function Systems

Key Points

Abstract

Related Articles

Astral to Join OpenAI

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

Why Data is Important for LLM

The Inference Market Is Consolidating. Agent Payments Are Still Nobody's Problem.

YouTube's Deepfake Shield for Politicians Changes Evidence Forever

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer