Iterative Model-Learning Scheme via Gaussian Processes for Nonlinear Model Predictive Control of (Semi-)Batch Processes

arXiv cs.LG / 4/27/2026

📰 NewsDeveloper Stack & InfrastructureIndustry & Market MovesModels & Research

共有:

Key Points

The paper proposes a GP-based model-learning nonlinear model predictive control scheme (GP-MLMPC) to enable NMPC for nonlinear, transient batch processes when dynamic models are costly or unavailable.
The method starts from data collected along a single initial trajectory (e.g., from a PI controller), then iteratively runs batches with GP-embedded NMPC while updating the Gaussian Process using new observations.
It uses Gaussian Process uncertainty quantification to formulate chance constraints, enforcing safe operation with specified confidence levels during control.
In silico results on a semi-batch polymerization reactor show fast convergence: tracking error improves by 83% after four batch iterations, and an economic objective yields a 17-fold increase in final product mass by iteration 8 compared with the initial trajectory.
The authors report that GP-MLMPC achieves performance comparable to full-model NMPC while remaining sample-efficient and not requiring mechanistic process knowledge.

Abstract

Batch processes are inherently transient and typically nonlinear, motivating nonlinear model predictive control (NMPC). However, adopting NMPC is hindered by the cost and unavailability of dynamic models. Thus, we propose to use Gaussian Processes (GP) in a model-learning NMPC scheme (GP-MLMPC) for batch processes. We initialize the GP-MLMPC using data from a single initial trajectory, e.g., from a PI controller. We iteratively apply the NMPC embedded with GPs to run batches and update the GP with new observations from each iteration, thereby achieving batch-wise improvements. Using uncertainty quantification from the GPs, we formulate chance constraints to enforce safe operation to the required confidence levels. We demonstrate our approach in \textit{silico} on a semi-batch polymerization reactor for tracking and economic objectives over durations of two hours, and the reactor temperature is constrained in a range of

\pm2^\circ C

around its setpoint. After only four batch iterations, tracking error from the GP-MLMPC scheme converged to a reduction of

83\%

, compared to the initial trajectory. Furthermore, under an economic objective, the GP-MLMPC resulted in a 17-fold increase in final product mass by iteration 8, compared to the initial trajectory. In both cases, the resulting GP-MLMPC performance is on par with the full-model NMPC, which shows that the optimal controller can be learned by the approach. By collecting samples around the optimal trajectory, the GP-MLMPC remains sample-efficient across iterations and achieves quick convergence. Thus, the proposed GP-MLMPC scheme presents a promising data-efficient approach for the control of nonlinear batch processes without mechanistic knowledge.