Generative Flow Networks for Model Adaptation in Digital Twins of Natural Systems

arXiv cs.LG / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper frames digital-twin model adaptation as a simulation-based inference problem where simulator parameters cannot be directly measured and observations are sparse and partial.
  • It proposes a GFlowNet-based method that treats full simulator configurations as a generative modeling target, enabling sampling of parameterizations according to how well simulations match observed behavior.
  • The approach uses a reward signal derived from the agreement between simulated and observed dynamics to guide generation toward plausible calibrations.
  • In a controlled agriculture case study using a mechanistic tomato simulator, the method recovers major regions of the adaptation landscape, identifies strong calibration hypotheses, and maintains multiple plausible configurations when uncertainty remains.
  • Overall, the work suggests GFlowNets can better handle non-identifiability by producing a distribution over compatible simulator parameterizations rather than a single “best” calibration.

Abstract

Digital twins of natural systems must remain aligned with physical systems that evolve over time, are only partially observed, and are typically modeled by mechanistic simulators whose parameters cannot be measured directly. In such settings, model adaptation is naturally posed as a simulation-based inference problem. However, sparse and indirect observations often fail to identify a unique and optimal calibration, leaving several simulator parameterizations compatible with the available evidence. This article presents a GFlowNet-based approach to model adaptation for digital twins of natural systems. We formulate adaptation as a generative modeling problem over complete simulator configurations, so that plausible parameterizations can be sampled with probability proportional to a reward derived from agreement between simulated and observed behavior. Using a controlled environment agriculture case study based on a mechanistic tomato model, we show that the learned policy recovers dominant regions of the adaptation landscape, retrieves strong calibration hypotheses, and preserves multiple plausible configurations under uncertainty.