InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching

arXiv cs.CV / 3/24/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • Flow Matching (FM) models can suffer major semantic degradation on out-of-distribution or minority-class prompts due to dataset bias effects that FM is sensitive to.
  • The paper formalizes a “Bias Manifold” and attributes the inference-time performance drop to conditional expectation smoothing, which can cause trajectory lock-in.
  • InjectFlow is proposed as a training-free fix that injects orthogonal semantics into the initial velocity field computation to counter latent drift toward majority modes.
  • The method preserves generative quality and does not require changing random seeds, while experiments show strong gains, including fixing 75% of previously failed prompts on the GenEval dataset.
  • The work combines theoretical analysis with an implementable algorithm aimed at improving fairness and robustness in visual foundation models.

Abstract

Flow Matching (FM) has recently emerged as a leading approach for high-fidelity visual generation, offering a robust continuous-time alternative to ordinary differential equation (ODE) based models. However, despite their success, FM models are highly sensitive to dataset biases, which cause severe semantic degradation when generating out-of-distribution or minority-class samples. In this paper, we provide a rigorous mathematical formalization of the ``Bias Manifold'' within the FM framework. We identify that this performance drop is driven by conditional expectation smoothing, a mechanism that inevitably leads to trajectory lock-in during inference. To resolve this, we introduce InjectFlow, a novel, training-free method by injecting orthogonal semantics during the initial velocity field computation, without requiring any changes to the random seeds. This design effectively prevents the latent drift toward majority modes while maintaining high generative quality. Extensive experiments demonstrate the effectiveness of our approach. Notably, on the GenEval dataset, InjectFlow successfully fixes 75% of the prompts that standard flow matching models fail to generate correctly. Ultimately, our theoretical analysis and algorithm provide a ready-to-use solution for building more fair and robust visual foundation models.