SPLICE: Latent Diffusion over JEPA Embeddings for Conformal Time-Series Inpainting

arXiv cs.LG / 5/4/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • SPLICE is a new modular generative framework for time-series imputation that pairs latent diffusion with distribution-free, online-adaptive conformal prediction intervals for finite-sample reliability in power-system gap filling.
  • The method uses a JEPA encoder to map daily load segments into a 64-dimensional latent space, a conditional latent bridge with multiple sampling modes to generate candidate trajectories, and an hourly-conditioned decoder to reconstruct the signal.
  • ACI (Adaptive Conformal Inference) wraps the generated output with coverage-guaranteed prediction bands, improving empirical coverage to about 93–95% and addressing under-coverage that can reach 7.5 percentage points with static conformal methods.
  • SPLICE’s flow-matching variant achieves diffusion-quality results comparable to DDIM using only 5–10 ODE steps (reported as a 5–10× speedup), and it outperforms several baselines on 13 load datasets, including best CRPS and strongest gap-length performance.
  • A pooled JEPA encoder shows strong transfer across unseen domains, reaching performance comparable to per-dataset oracles with only quick bridge fine-tuning.

Abstract

Generative models for time-series imputation achieve strong reconstruction accuracy, yet provide no finite-sample reliability guarantees, a critical limitation in power systems where imputed values inform dispatch and planning. We introduce SPLICE (Self-supervised Predictive Latent Inpainting with Conformal Envelopes), a modular framework coupling latent generative imputation with distribution-free, online-adaptive prediction intervals. A JEPA encoder maps daily load segments into a 64-dimensional latent space; a conditional latent bridge with four sampling modes generates candidate gap trajectories; an hourly-conditioned decoder maps back to signal space; and Adaptive Conformal Inference (ACI) wraps the output with coverage-guaranteed prediction bands. The flow-matching variant achieves comparable quality to DDIM in 5--10 ODE steps (5-10x speedup). On thirteen load datasets (nine proprietary, three UCI Electricity, ETTh1), SPLICE achieves the lowest mean Load-only MSE (0.056), winning 9/12 non-degenerate datasets at 91-day gaps and 18/32 across all gap lengths vs. five established baselines, and produces the best CRPS (0.161, -18.3% vs. the strongest competitor). ACI delivers 93--95% empirical coverage, correcting under-coverage failures of up to 7.5 pp observed with static conformal prediction. A pooled JEPA encoder trained on nine feeds transfers to four unseen domains, matching or exceeding per-dataset oracles with only a quick bridge fine-tuning.