Beyond Fixed Formulas: Data-Driven Linear Predictor for Efficient Diffusion Models

arXiv cs.LG / 4/30/2026


Key Points

  • The paper addresses the high sampling cost of Diffusion Transformers (DiTs) and argues that hand-crafted feature-caching formulas break down when aggressive skipping is used.
  • It introduces L2P (Learnable Linear Predictor), a data-driven feature-caching framework that replaces the fixed coefficients of hand-crafted formulas with learnable, per-timestep weights to reconstruct current features from past trajectories (a minimal sketch follows this list).
  • L2P's predictor trains rapidly (about 20 seconds on a single GPU) and is designed for efficient DiT inference.
  • Experiments show substantial gains, including a 4.55× FLOPs reduction and a 4.15× latency speedup on FLUX.1-dev, and strong quality retention at up to 7.18× acceleration on Qwen-Image, where prior baselines show noticeable degradation.
  • The authors provide code publicly and conclude that learning linear predictors is an effective strategy for efficient diffusion model sampling.
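
The mechanics of the predictor are simple enough to sketch. Below is a minimal PyTorch illustration of a per-timestep linear predictor over cached features; the class name `LinearFeaturePredictor`, the `history` window, and the weight layout are our own assumptions for exposition, not the authors' implementation (see the linked repository for that).

```python
import torch

class LinearFeaturePredictor:
    """Hypothetical per-timestep linear predictor over cached DiT features.

    At timestep t, instead of running the expensive transformer block,
    the current feature is approximated as a learned linear combination
    of the k most recently cached features:
        x_t ≈ sum_i weights[t, i] * cache[i]
    """

    def __init__(self, weights: torch.Tensor, history: int):
        # weights: (num_timesteps, history) learned coefficients,
        # one small weight vector per denoising timestep.
        self.weights = weights
        self.history = history
        self.cache: list[torch.Tensor] = []

    def push(self, feature: torch.Tensor) -> None:
        # Store a freshly computed feature, keeping only the last `history`.
        self.cache.append(feature)
        if len(self.cache) > self.history:
            self.cache.pop(0)

    def predict(self, t: int) -> torch.Tensor:
        # Reconstruct the current feature from the cached trajectory
        # with the timestep-specific coefficients.
        stacked = torch.stack(self.cache)           # (k, ...feature dims)
        w = self.weights[t, : len(self.cache)]      # (k,)
        return (w.view(-1, *[1] * (stacked.dim() - 1)) * stacked).sum(dim=0)
```

Timesteps served by `predict` skip the full DiT computation entirely, which is where the FLOPs and latency savings in the results above come from.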

Abstract

To address the high sampling cost of Diffusion Transformers (DiTs), feature caching offers a training-free acceleration method. However, existing methods rely on hand-crafted forecasting formulas that fail under aggressive skipping. We propose L2P (Learnable Linear Predictor), a simple data-driven caching framework that replaces fixed coefficients with learnable per-timestep weights. Rapidly trained in ~20 seconds on a single GPU, L2P accurately reconstructs current features from past trajectories. L2P significantly outperforms existing baselines: it achieves a 4.55× FLOPs reduction and 4.15× latency speedup on FLUX.1-dev, and maintains high visual fidelity under up to 7.18× acceleration on Qwen-Image models, where prior methods show noticeable quality degradation. Our results show that learning linear predictors is highly effective for efficient DiT inference. Code is available at https://github.com/Aredstone/L2P-Cache.
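
Because each timestep's predictor is just a handful of linear coefficients, it can be fitted in closed form from a few recorded feature trajectories, which is consistent with the ~20-second training budget. The sketch below assumes one scalar weight per cached feature and an ordinary least-squares fit; the function name and array shapes are hypothetical, not taken from the paper.

```python
import numpy as np

def fit_timestep_weights(past_feats: np.ndarray, target: np.ndarray) -> np.ndarray:
    """Fit coefficients w minimizing ||sum_i w[i] * past_i - target||^2.

    past_feats: (n_samples, k, d) -- the k cached features per sample.
    target:     (n_samples, d)    -- the true feature at this timestep.
    Returns w:  (k,)              -- one scalar weight per cached feature.
    """
    n, k, d = past_feats.shape
    # Treat every feature dimension of every sample as one regression row:
    # the design matrix A is (n * d, k) and the target b is (n * d,).
    A = past_feats.transpose(0, 2, 1).reshape(n * d, k)
    b = target.reshape(n * d)
    w, *_ = np.linalg.lstsq(A, b, rcond=None)
    return w
```

With only k unknowns per timestep, each solve is a tiny regression, which makes a seconds-long fit across all timesteps plausible even on modest hardware.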