AI Navigate

Curriculum Sampling: A Two-Phase Curriculum for Efficient Training of Flow Matching

arXiv cs.LG / 3/16/2026


Key Points

  • The authors show that the common static middle-biased timestep sampling in Flow Matching creates a speed-quality trade-off: faster early convergence but worse long-term fidelity compared to Uniform sampling.
  • They reveal a U-shaped training difficulty across timesteps, with persistent errors near the boundaries, meaning endpoints can’t be under-sampled without losing detail.
  • They propose Curriculum Sampling: a two-phase schedule that starts with middle-biased sampling to learn structure quickly, then switches to Uniform sampling for boundary refinement.
  • Empirical results on CIFAR-10 show an FID improvement from 3.85 (Uniform baseline) to 3.22, with peak performance reached at 100k rather than 150k training steps, i.e., both faster and higher-quality training.
  • The broader implication is that timestep sampling should be treated as an evolving curriculum, not a fixed hyperparameter.
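The two-phase schedule in the key points can be sketched as a drop-in timestep sampler. This is a minimal illustration, not the authors' code: the switch point (`switch_step=50_000`) and the Logit-Normal parameters (mu=0, sigma=1) are assumed values for demonstration only.

```python
import numpy as np

def sample_timesteps(step, batch_size, switch_step=50_000, rng=None):
    """Curriculum Sampling sketch: middle-biased (Logit-Normal) timesteps
    early in training, then Uniform timesteps after `switch_step`.

    `switch_step` and the Logit-Normal parameters (mu=0, sigma=1) are
    illustrative assumptions, not values taken from the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    if step < switch_step:
        # Phase 1: Logit-Normal concentrates probability mass near
        # t = 0.5, emphasizing rapid mid-trajectory structure learning.
        z = rng.normal(loc=0.0, scale=1.0, size=batch_size)
        t = 1.0 / (1.0 + np.exp(-z))  # sigmoid maps R -> (0, 1)
    else:
        # Phase 2: Uniform sampling restores coverage of the boundary
        # regimes (t near 0 and 1) for fine-detail refinement.
        t = rng.uniform(0.0, 1.0, size=batch_size)
    return t
```

In a training loop, this replaces the usual `t = rng.uniform(0, 1, batch_size)` call; everything else in the Flow Matching objective stays unchanged.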

Abstract

Timestep sampling p(t) is a central design choice in Flow Matching models, yet common practice increasingly favors static middle-biased distributions (e.g., Logit-Normal). We show that this choice induces a speed-quality trade-off: middle-biased sampling accelerates early convergence but yields worse asymptotic fidelity than Uniform sampling. By analyzing per-timestep training losses, we identify a U-shaped difficulty profile with persistent errors near the boundary regimes, implying that under-sampling the endpoints leaves fine details unresolved. Guided by this insight, we propose Curriculum Sampling, a two-phase schedule that begins with middle-biased sampling for rapid structure learning and then switches to Uniform sampling for boundary refinement. On CIFAR-10, Curriculum Sampling improves the best FID from 3.85 (Uniform) to 3.22 while reaching peak performance at 100k rather than 150k training steps. Our results highlight that timestep sampling should be treated as an evolving curriculum rather than a fixed hyperparameter.
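The per-timestep loss analysis described in the abstract amounts to binning individual training losses by their sampled timestep and inspecting the resulting profile. A hedged sketch of that diagnostic follows; the binning scheme (`n_bins` equal-width bins) is an illustrative assumption, not the paper's exact procedure.

```python
import numpy as np

def per_timestep_loss_profile(ts, losses, n_bins=20):
    """Bin per-sample Flow Matching losses by timestep t to expose the
    training-difficulty profile across [0, 1]. The paper reports a
    U-shape: persistent error near t = 0 and t = 1. The equal-width
    binning here is an illustrative assumption.
    """
    ts = np.asarray(ts, dtype=float)
    losses = np.asarray(losses, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Map each timestep to a bin index in [0, n_bins - 1].
    idx = np.clip(np.digitize(ts, edges) - 1, 0, n_bins - 1)
    profile = np.full(n_bins, np.nan)
    for b in range(n_bins):
        mask = idx == b
        if mask.any():
            profile[b] = losses[mask].mean()
    return edges, profile
```

Plotting `profile` against the bin centers makes the U-shape (or its absence) visible, which is what motivates switching to Uniform sampling for the refinement phase.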