JetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physics

arXiv cs.LG / 4/3/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper argues that standard Conditional Flow Matching (CFM) training loss can plateau too early in high-stakes physics settings, making it a misleading proxy for real convergence and physical fidelity.
It introduces JetPrism, a configurable CFM-based generative surrogate framework that diagnoses both unconditional generation quality and conditional detector unfolding using synthetic stress tests and a Jefferson Lab dataset relevant to the future Electron-Ion Collider.
JetPrism’s results show that physics-informed evaluation metrics keep improving well after generic training loss appears to converge, highlighting a fundamental disconnect between loss convergence and statistical agreement.
The authors propose a multi-metric evaluation protocol—combining marginal/pairwise χ² statistics, W1 distances, correlation-matrix distances (D_corr), and nearest-neighbor distance ratios (R_NN)—to ensure generated samples match ground truth without overfitting or memorizing training data.
Although demonstrated in nuclear physics, the diagnostic approach is presented as broadly extensible to other inverse problems and parameter generation tasks, with potential applications in medical imaging, astrophysics, semiconductor discovery, and quantitative finance.

Abstract

High-fidelity Monte Carlo simulations and complex inverse problems, such as mapping smeared experimental observations to ground-truth states, are computationally intensive yet essential for robust data analysis. Conditional Flow Matching (CFM) offers a mathematically robust approach to accelerating these tasks, but we demonstrate its standard training loss is fundamentally misleading. In rigorous physics applications, CFM loss plateaus prematurely, serving as an unreliable indicator of true convergence and physical fidelity. To investigate this disconnect, we designed JetPrism, a configurable CFM framework acting as an efficient generative surrogate for evaluating unconditional generation and conditional detector unfolding. Using synthetic stress tests and a Jefferson Lab kinematic dataset (

\gamma p \to \rho^0 p \to \pi^+\pi^- p

) relevant to the forthcoming Electron-Ion Collider (EIC), we establish that physics-informed metrics continue to improve significantly long after the standard loss converges. Consequently, we propose a multi-metric evaluation protocol incorporating marginal and pairwise

\chi^2

statistics,

W_1

distances, correlation matrix distances (

D_{\mathrm{corr}}

), and nearest-neighbor distance ratios (

R_{\mathrm{NN}}

). By demonstrating that domain-specific evaluations must supersede generic loss metrics, this work establishes JetPrism as a dependable generative surrogate that ensures precise statistical agreement with ground-truth data without memorizing the training set. While demonstrated in nuclear physics, this diagnostic framework is readily extensible to parameter generation and complex inverse problems across broad domains. Potential applications span medical imaging, astrophysics, semiconductor discovery, and quantitative finance, where high-fidelity simulation, rigorous inversion, and generative reliability are critical.