Dynamic Linear Coregionalization for Realistic Synthetic Multivariate Time Series

arXiv cs.LG / 4/8/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces DynLMC (Dynamic Linear Model of Coregionalization) to generate synthetic multivariate time series with time-varying, regime-switching correlations and realistic cross-channel lag dependencies.
  • It addresses limitations of existing synthetic data generators that assume static inter-channel correlations and often fail to capture true dependency structures.
  • Experiments show that synthetic datasets produced by DynLMC more closely match real-data correlation dynamics.
  • Fine-tuning three foundation models on DynLMC-generated data delivers consistent zero-shot forecasting gains across nine benchmarks.
  • The authors conclude that explicitly modeling dynamic inter-channel correlations improves foundation time series transferability, reinforcing the importance of data-centric pretraining.

Abstract

Synthetic data is essential for training foundation models for time series (FMTS), but most generators assume static correlations, and are typically missing realistic inter-channel dependencies. We introduce DynLMC, a Dynamic Linear Model of Coregionalization, that incorporates time-varying, regime-switching correlations and cross-channel lag structures. Our approach produces synthetic multivariate time series with correlation dynamics that closely resemble real data. Fine-tuning three foundational models on DynLMC-generated data yields consistent zero-shot forecasting improvements across nine benchmarks. Our results demonstrate that modeling dynamic inter-channel correlations enhances FMTS transferability, highlighting the importance of data-centric pretraining.