Isokinetic Flow Matching for Pathwise Straightening of Generative Flows

arXiv cs.LG / 4/7/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Flow Matching methods can produce a marginal velocity field with strong curvature from trajectory superposition, which increases numerical truncation errors and limits the quality of few-step sampling.
  • The paper proposes Isokinetic Flow Matching (Iso-FM), a lightweight, Jacobian-free regularizer that penalizes pathwise acceleration to enforce local velocity consistency during training.
  • Iso-FM estimates the material derivative via a self-guided finite-difference approximation, avoiding auxiliary encoders and expensive second-order autodifferentiation.
  • As a plug-and-play addition to single-stage FM training, Iso-FM substantially improves few-step image generation, including a CIFAR-10 DiT-S/2 result where non-OT FID at 2 steps drops from 78.82 to 27.13 (about 2.9x efficiency).
  • At 4 steps, Iso-FM reaches a best-observed FID of 10.23, supporting acceleration regularization as a principled and compute-efficient approach for faster generative sampling.

Abstract

Flow Matching (FM) constructs linear conditional probability paths, but the learned marginal velocity field inevitably exhibits strong curvature due to trajectory superposition. This curvature severely inflates numerical truncation errors, bottlenecking few-step sampling. To overcome this, we introduce Isokinetic Flow Matching (Iso-FM), a lightweight, Jacobian-free dynamical regularizer that directly penalizes pathwise acceleration. By using a self-guided finite-difference approximation of the material derivative Dv/Dt, Iso-FM enforces local velocity consistency without requiring auxiliary encoders or expensive second-order autodifferentiation. Operating as a pure plug-and-play addition to single-stage FM training, Iso-FM dramatically improves few-step generation. On CIFAR-10 (DiT-S/2), Iso-FM slashes conditional non-OT FID at 2 steps from 78.82 to 27.13 - a 2.9x relative efficiency gain - and reaches a best-observed FID at 4 steps of 10.23. These results firmly establish acceleration regularization as a principled, compute-efficient mechanism for fast generative sampling.