AI Navigate

End-to-end data-driven prediction of urban airflow and pollutant dispersion

arXiv cs.LG / 3/19/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study proposes an end-to-end data-driven framework to predict urban airflow and pollutant dispersion in a street canyon using LES data as the database.
  • It first obtains a reduced basis via spectral proper orthogonal decomposition (SPOD) and projects time-series data onto SPOD modes to derive temporal coefficients.
  • A nonlinear compression of temporal coefficients is performed with an autoencoder to further reduce dimensionality.
  • A reduced-order model is learned in the latent space with Long Short-Term Memory (LSTM) networks, and pollutant dispersion is estimated from predicted velocity fields through a convolutional neural network.
  • The results demonstrate the framework's ability to predict instantaneous and statistically stationary fields over long time horizons, enabling data-driven decision-making for mitigation.

Abstract

Climate change and the rapid growth of urban populations are intensifying environmental stresses within cities, making the behavior of urban atmospheric flows a critical factor in public health, energy use, and overall livability. This study targets to develop fast and accurate models of urban pollutant dispersion to support decision-makers, enabling them to implement mitigation measures in a timely and cost-effective manner. To reach this goal, an end-to-end data-driven approach is proposed to model and predict the airflow and pollutant dispersion in a street canyon in skimming flow regime. A series of time-resolved snapshots obtained from large eddy simulation (LES) serves as the database. The proposed framework is based on four fundamental steps. Firstly, a reduced basis is obtained by spectral proper orthogonal decomposition (SPOD) of the database. The projection of the time series snapshot data onto the SPOD modes (time-domain approach) provides the temporal coefficients of the dynamics. Secondly, a nonlinear compression of the temporal coefficients is performed by autoencoder to reduce further the dimensionality of the problem. Thirdly, a reduced-order model (ROM) is learned in the latent space using Long Short-Term Memory (LSTM) netowrks. Finally, the pollutant dispersion is estimated from the predicted velocity field through convolutional neural network that maps both fields. The results demonstrate the efficacy of the model in predicting the instantaneous as well as statistically stationary fields over long time horizon.