TimeRFT: Stimulating Generalizable Time Series Forecasting for TSFMs via Reinforcement Finetuning

arXiv cs.AI / 5/4/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces TimeRFT, a reinforcement fine-tuning framework aimed at improving how Time Series Foundation Models (TSFMs) adapt to specific forecasting tasks.
  • It targets two key problems of supervised fine-tuning: temporal distribution shifts from non-stationary time series and overfitting that can reduce generalization.
  • TimeRFT uses a quality-based temporal reward mechanism that evaluates how each prediction step contributes to overall forecasting accuracy.
  • It also applies a difficulty-based data selection strategy to choose time series samples that contain generalizable patterns and useful training signals under varying data availability.
  • Experiments on multiple real-world forecasting tasks show TimeRFT consistently outperforms SFT-based adaptation across different training-data regimes, improving accuracy and robustness to distribution shifts.

Abstract

Time Series Foundation Models (TSFMs) advance generalization and data efficiency in time series forecasting by unified large-scale pretraining. But TSFMs remain lacking when adapting to specific downstream forecasting tasks for two reasons. First, the non-stationary and uncertain nature of time series data lead to inevitable temporal distribution shifts between historical training and future testing data, while current Supervised FineTuning (SFT)-based methods are prone to overfitting and may degrade generalization. Second, training data availability varies across forecasting tasks, requiring TSFMs to generalize well under diverse data regimes. To address these challenges, we introduce the Time series Reinforcement Finetuning (TimeRFT) paradigm for TSFM downstream adaptation, which consists of two task-specific training recipes: i) A forecasting quality-based temporal reward mechanism that conducts a multi-faceted evaluation of the contribution of each prediction step to overall forecasting accuracy. ii) A forecasting difficulty-based data selection strategy to identify time series samples with generalizable predictive patterns and informative training signals. Extensive experiments demonstrate TimeRFT can consistently outperform SFT-based adaptation methods across various real-world forecasting tasks and training data regimes, enhancing prediction accuracy and generalization against unforeseen distribution shifts.