AI Navigate

Conservative Continuous-Time Treatment Optimization

arXiv cs.LG / 3/18/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories.
  • It models patient dynamics as a controlled stochastic differential equation with treatment serving as the continuous-time control variable.
  • To curb extrapolation errors, a consistent signature-based MMD regularizer on path space is added to penalize plans whose induced trajectory distributions deviate from observed data, leading to a computable upper bound on the true cost.
  • Experiments on benchmark datasets show improved robustness and performance compared with non-conservative baselines.

Abstract

We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.