Unveiling Stochasticity: Universal Multi-modal Probabilistic Modeling for Traffic Forecasting

arXiv cs.LG / 4/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses traffic forecasting by moving from deterministic predictions to probabilistic modeling that explicitly captures uncertainty and stochasticity in spatio-temporal traffic dynamics.
  • It introduces a universal method that converts existing traffic models into probabilistic predictors by only replacing the final output layer with a Gaussian Mixture Model (GMM) layer.
  • The approach can be trained without changing the training pipeline, using only Negative Log-Likelihood (NLL) loss and no auxiliary losses or regularization terms.
  • Experiments across multiple datasets show the method maintains deterministic-level performance while providing more accurate and informative uncertainty estimates compared with unimodal and deterministic baselines.
  • The authors add an evaluation framework using cumulative distributions and confidence intervals and analyze how data quality and imperfect observations affect uncertainty quantification, releasing code publicly.

Abstract

Traffic forecasting is a challenging spatio-temporal modeling task and a critical component of urban transportation management. Current studies mainly focus on deterministic predictions, with limited considerations on the uncertainty and stochasticity in traffic dynamics. Therefore, this paper proposes an elegant yet universal approach that transforms existing models into probabilistic predictors by replacing only the final output layer with a novel Gaussian Mixture Model (GMM) layer. The modified model requires no changes to the training pipeline and can be trained using only the Negative Log-Likelihood (NLL) loss, without any auxiliary or regularization terms. Experiments on multiple traffic datasets show that our approach generalizes from classic to modern model architectures while preserving deterministic performance. Furthermore, we propose a systematic evaluation procedure based on cumulative distributions and confidence intervals, and demonstrate that our approach is considerably more accurate and informative than unimodal or deterministic baselines. Finally, a more detailed study on a real-world dense urban traffic network is presented to examine the impact of data quality on uncertainty quantification and to show the robustness of our approach under imperfect data conditions. Code available at https://github.com/Weijiang-Xiong/OpenSkyTraffic