An Integrative Genome-Scale Metabolic Modeling and Machine Learning Framework for Predicting and Optimizing Biofuel-Relevant Biomass Production in Saccharomyces cerevisiae
arXiv cs.LG / 3/27/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The study introduces a computational framework that integrates the Yeast9 genome-scale metabolic model with machine learning and optimization to predict and improve biomass flux in Saccharomyces cerevisiae under varying glucose, oxygen, and ammonium conditions.
- Using 2,000 simulated flux profiles, Random Forest and XGBoost regressors achieved very high predictive performance (R2 ≈ 0.9999 and 0.9990), enabling accurate flux prediction.
- A variational autoencoder identified four metabolic clusters, while SHAP-based interpretability highlighted glycolysis, the TCA cycle, and lipid biosynthesis as key determinants of biomass production.
- The authors report practical in silico optimization results: in silico overexpression improved biomass flux to 0.979 gDW/hr, and Bayesian optimization of nutrient constraints increased biomass flux up to about 12x (0.0858 to 1.041 gDW/hr).
- A generative adversarial network (GAN) further proposes new, stoichiometrically feasible flux configurations, combining simulation validity with generative search for novel metabolic states relevant to biofuel-relevant biomass engineering.
広告




