AutoSurrogate: An LLM-Driven Multi-Agent Framework for Autonomous Construction of Deep Learning Surrogate Models in Subsurface Flow

arXiv cs.AI / 4/15/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • AutoSurrogate is an LLM-driven multi-agent framework designed to let domain scientists build high-quality deep learning surrogate models for computationally expensive subsurface flow simulations using natural-language instructions instead of specialized ML expertise.
  • The system uses multiple specialized agents to perform end-to-end surrogate construction tasks including data profiling, selecting architectures from a model zoo, running Bayesian hyperparameter optimization, training models, and evaluating quality against user-defined thresholds.
  • It autonomously addresses common failure modes by restarting training with adjusted configurations when numerical instabilities occur and swapping architectures when accuracy does not meet target requirements.
  • In a 3D geological carbon storage modeling case (predicting pressure and CO2 saturation over 31 timesteps), AutoSurrogate can generate a deployment-ready surrogate from a single natural-language sentence with minimal human intervention.
  • The reported results show AutoSurrogate outperforms both expert-designed baselines and domain-agnostic AutoML approaches without manual tuning, indicating strong potential for practical adoption.

Abstract

High-fidelity numerical simulation of subsurface flow is computationally intensive, especially for many-query tasks such as uncertainty quantification and data assimilation. Deep learning (DL) surrogates can significantly accelerate forward simulations, yet constructing them requires substantial machine learning (ML) expertise - from architecture design to hyperparameter tuning - that most domain scientists do not possess. Furthermore, the process is predominantly manual and relies heavily on heuristic choices. This expertise gap remains a key barrier to the broader adoption of DL surrogate techniques. For this reason, we present AutoSurrogate, a large-language-model-driven multi-agent framework that enables practitioners without ML expertise to build high-quality surrogates for subsurface flow problems through natural-language instructions. Given simulation data and optional preferences, four specialized agents collaboratively execute data profiling, architecture selection from a model zoo, Bayesian hyperparameter optimization, model training, and quality assessment against user-specified thresholds. The system also handles common failure modes autonomously, including restarting training with adjusted configurations when numerical instabilities occur and switching to alternative architectures when predictive accuracy falls short of targets. In our setting, a single natural-language sentence can be sufficient to produce a deployment-ready surrogate model, with minimum human intervention required at any intermediate stage. We demonstrate the utility of AutoSurrogate on a 3D geological carbon storage modeling task, mapping permeability fields to pressure and CO_2 saturation fields over 31 timesteps. Without any manual tuning, AutoSurrogate is able to outperform expert-designed baselines and domain-agnostic AutoML methods, demonstrating strong potential for practical deployment.