CNN-based Surface Temperature Forecasts with Ensemble Numerical Weather Prediction

arXiv stat.ML / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a CNN-based post-processing approach that combines a CNN with an ensemble of 40-km NWP model outputs to generate 5-km surface temperature forecasts out to 132 hours (5.5 days).
  • It applies bias correction and spatial downscaling to each ensemble member to improve deterministic accuracy, then uses the member-wise CNN corrections across all 51 members to build a new high-resolution ensemble system.
  • The authors argue that the CNN member-wise correction improves probabilistic reliability and the spread-skill ratio in a way that differs from standard ensemble averaging, which mainly smooths spatial errors.
  • Experimental results suggest the method is practical and scalable for operational forecast centers that have limited computational resources.

Abstract

Due to limited computational resources, medium-range temperature forecasts typically rely on low-resolution numerical weather prediction (NWP) models, which are prone to systematic and random errors. We propose a method that integrates a convolutional neural network (CNN) with an ensemble of low-resolution NWP models (40-km horizontal resolution) to produce high-resolution (5-km) surface temperature forecasts with lead times extending up to 5.5 days (132 h). First, CNN-based post-processing (bias correction and spatial downscaling) is applied to individual ensemble members to reduce systematic errors and perform downscaling, which improves the deterministic forecast accuracy. Second, this member-wise correction is applied to all 51 ensemble members to construct a new high-resolution ensemble forecasting system with an improved probabilistic reliability and spread-skill ratio that differs from the simple error reduction mechanism of ensemble averaging. Whereas averaging reduces forecast errors by smoothing spatial fields, our member-wise CNN correction reduces error from noise while maintaining forecast information at a level comparable to that of other high-resolution forecasts. Experimental results indicate that the proposed method provides a practical and scalable solution for improving medium-range temperature forecasts, which is particularly valuable for use in operational centers with limited computational resources.