EnScale: Temporally-consistent multivariate generative downscaling via proper scoring rules

arXiv stat.ML / 4/13/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • EnScale is a generative machine-learning framework for climate downscaling that learns a full GCM-to-RCM conditional distribution rather than a single deterministic mapping.
  • The method uses two stages—correcting large-scale discrepancies between GCM outputs and coarsened RCM data, then applying a super-resolution step with sparse local stochastic layers to generate high-resolution fields efficiently.
  • EnScale trains both stages with the energy score, a proper scoring rule, aiming to improve probabilistic calibration and realistic uncertainty capture.
  • The paper reports about an order-of-magnitude computational cost reduction versus leading ML downscaling methods while jointly downscaling multiple variables (temperature, precipitation, solar radiation, wind) with spatial consistency over Central Europe.
  • It also introduces EnScale-t, a variant designed for temporal consistency, and evaluates performance using an extensive framework covering calibration, spatial/temporal structure, extremes, and multivariate dependencies.

Abstract

The practical use of future climate projections from global circulation models (GCMs) is often limited by their coarse spatial resolution, requiring downscaling to generate high-resolution data. Regional climate models (RCMs) provide this refinement, but are computationally expensive. To address this issue, machine learning (ML) models can learn the downscaling function, mapping coarse GCM outputs to high-resolution fields. Among these, generative approaches aim to capture the full conditional distribution of RCM data given coarse-scale GCM data, which is characterized by large variability and thus challenging to model accurately. We introduce EnScale, a generative ML framework emulating the full GCM-to-RCM map by training on multiple pairs of GCM and corresponding RCM data. It first adjusts large-scale mismatches between GCM and coarsened RCM data, followed by a super-resolution step to generate high-resolution fields. To efficiently model the high-dimensional output, the super-resolution step employs a novel class of sparse local stochastic layers. Both steps employ generative models optimized with the energy score, a proper scoring rule. Compared to state-of-the-art ML downscaling approaches, our setup reduces computational cost by about one order of magnitude. EnScale jointly emulates multiple variables -- temperature, precipitation, solar radiation, and wind -- spatially consistent over Central Europe. In addition, we propose a variant EnScale-t that enables temporally consistent downscaling. We establish a comprehensive evaluation framework across various categories including calibration, spatial and temporal structure, extremes, and multivariate dependencies. Comparison with diverse benchmarks demonstrates EnScale(-t)'s competitive performance and computational efficiency, offering a promising approach for accurate and temporally consistent RCM emulation.