A Continuous-Time Ensemble Kalman-Bucy Smoother for Causal Inference and Model Discovery

arXiv stat.ML / 4/29/2026

💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an ensemble Kalman-Bucy smoother (EnKBS) for continuous-time data assimilation that uses future observations to improve uncertainty and provide better retrospective state estimation than filtering alone.
  • EnKBS reconstructs smoother conditional distributions from ensemble moments, yielding a derivative-free approach that avoids explicit tangent-linear or adjoint model computations while converging to the exact smoother as ensemble size grows.
  • The method incorporates common high-dimensional DA stabilizers—such as covariance localization and inflation—to maintain performance in complex, nonlinear dynamical systems.
  • The authors demonstrate the technique on scientific applications, including Bayesian inference of causal relationships from dyadic trigger-feedback models and a causality-driven iterative learning algorithm for discovering structure and recovering hidden parameters in a nonlinear reduced-order model.
  • Both causal inference and model discovery tasks show effectiveness with relatively small ensembles (on the order of 10), suggesting EnKBS could enable near-instant, high-dimensional causal discovery over time.

Abstract

Data assimilation (DA) integrates observational information with model predictions to improve state estimation in complex systems. While filtering provides the basis for online forecasts by using only past and present observations, it can exhibit delays and biases when the underlying dynamics evolve rapidly or undergo regime transitions. Smoothing, which additionally incorporates future observations, provides a natural pipeline for hindcasting and reanalysis that yields an uncertainty reduction beyond the filter. This paper introduces an ensemble Kalman-Bucy smoother (EnKBS) for continuous-time DA of nonlinear dynamical systems, where the smoother's conditional distributions are reconstructed using ensemble moments. The result is a derivative-free framework that does not require explicit computation of tangent-linear or adjoint models, which converges to the exact smoother solution at the infinite-ensemble limit for a wide class of complex systems. Incorporating standard regularization techniques for high-dimensional systems, such as covariance localization and inflation, the skill of the EnKBS is demonstrated in various important scientific problems. By integrating future observations, which reveal the underlying causal mechanisms for retrospective state updates, the EnKBS is used for Bayesian-based inference of causal relationships and their temporal influence range in a dyadic trigger-feedback model and the development of a causality-driven iterative learning algorithm that identifies the structure and recovers the hidden parameters of a nonlinear reduced-order model mimicking midlatitude atmospheric circulation. Notably, both tasks remain effective with an ensemble size of O(10) under partial observations, suggesting that EnKBS can support the instantaneous discovery of high-dimensional complex systems over time.

A Continuous-Time Ensemble Kalman-Bucy Smoother for Causal Inference and Model Discovery | AI Navigate