Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

arXiv cs.LG / 4/27/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study uses deep learning to estimate kinetic parameters for a model of itaconic acid production from real batch experiments spanning multiple agitation speeds and reactor scales.
  • It compares two approaches—direct deep learning (DDL) and generative conditional flow matching (CFM)—against nonlinear regression as a baseline reference.
  • CFM delivers consistently higher accuracy than DDL, producing concentration-time profiles that closely match those from nonlinear regression.
  • The advantage of CFM persists in scale-up experiments, where it shows better generalization and robustness across operating conditions.
  • Overall, the results suggest CFM is a flexible and relatively data-efficient framework for parameter estimation in dynamic bioprocess models.

Abstract

In this study, deep learning is used to estimate kinetic parameters for modeling itaconic acid production based on real batch experiments conducted at different agitation speeds and reactor scales. Two deep learning strategies, namely direct deep learning (DDL) and generative conditional flow matching (CFM) are compared and benchmarked against nonlinear regression as a reference method. Compared with DDL, CFM consistently yields more accurate results. The concentration profiles predicted by CFM closely match those obtained from nonlinear regression, whereas DDL results in larger deviations. Similar behavior is observed in the scale-up experiments, where the CFM model again generalizes better and is more robust than the direct approach. These findings demonstrate that CFM can reliably predict system behavior across different operating conditions and scales, offering a flexible and data-efficient framework for parameter estimation in dynamic bioprocess models.