SAGE: Subsurface AI-driven Geostatistical Extraction with proxy posterior

arXiv cs.LG / 4/2/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • SAGE is a new framework for generating statistically consistent subsurface velocity models from incomplete data, specifically combining sparse well logs with migrated seismic images during training.
  • It learns a proxy posterior over velocity models conditioned on both modalities, then at inference produces full-resolution velocity fields using only migrated images with well information encoded in the learned distribution.
  • The method aims to provide geologically plausible and statistically accurate velocity realizations despite limited observational constraints.
  • Validation on synthetic and real field datasets shows SAGE can capture complex subsurface variability and supports downstream workflows by providing samples for training other networks used in inversion.

Abstract

Recent advances in generative networks have enabled new approaches to subsurface velocity model synthesis, offering a compelling alternative to traditional methods such as Full Waveform Inversion. However, these approaches predominantly rely on the availability of large-scale datasets of high-quality, geologically realistic subsurface velocity models, which are often difficult to obtain in practice. We introduce SAGE, a novel framework for statistically consistent proxy velocity generation from incomplete observations, specifically sparse well logs and migrated seismic images. During training, SAGE learns a proxy posterior over velocity models conditioned on both modalities (wells and seismic); at inference, it produces full-resolution velocity fields conditioned solely on migrated images, with well information implicitly encoded in the learned distribution. This enables the generation of geologically plausible and statistically accurate velocity realizations. We validate SAGE on both synthetic and field datasets, demonstrating its ability to capture complex subsurface variability under limited observational constraints. Furthermore, samples drawn from the learned proxy distribution can be leveraged to train downstream networks, supporting inversion workflows. Overall, SAGE provides a scalable and data-efficient pathway toward learning geological proxy posterior for seismic imaging and inversion. Repo link: https://github.com/slimgroup/SAGE.