Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression
arXiv cs.LG / 3/12/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- STemDist is introduced as the first dataset distillation method specifically designed for spatio-temporal time series forecasting, addressing the limitation of prior methods that compressed only a single dimension.
- The method balances compression across both temporal and spatial dimensions and uses cluster-level distillation combined with a subset-based granular distillation to maintain forecasting performance while reducing cost.
- Evaluation on five real-world datasets shows that models trained on distilled data can be faster (up to 6x), more memory-efficient (up to 8x), and achieve lower prediction error (up to 12%).
- By enabling faster, cheaper training for large spatio-temporal models, STemDist could make large-scale forecasting workflows more practical in real-world applications like traffic and weather.
- The paper provides empirical evidence that distillation can outperform general and time-series-specific distillation methods in this domain.
Related Articles
How political censorship actually works inside Qwen, DeepSeek, GLM, and Yi: Ablation and behavioral results across 9 models
Reddit r/LocalLLaMA
Engenharia de Prompt: Por Que a Forma Como Você Pergunta Muda Tudo(Um guia introdutório)
Dev.to
The Obligor
Dev.to
The Markup
Dev.to
2026 年 AI 部落格變現完整攻略:從第一篇文章到月收入 $1000
Dev.to