Fine-tuning Timeseries Predictors Using Reinforcement Learning

arXiv cs.LG / 3/23/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • The article reviews three reinforcement learning algorithms used to fine-tune financial time-series predictors.
  • It proposes a plan to backpropagate the loss from a reinforcement learning task into a model initially trained by supervised learning for end-to-end fine-tuning.
  • It reports that fine-tuning improves predictive performance and shows transfer learning properties in the models, highlighting the benefits of the approach.
  • It outlines the tuning process and provides empirical results intended to guide practitioners in future implementations.

Abstract

This chapter presents three major reinforcement learning algorithms used for fine-tuning financial forecasters. We propose a clear implementation plan for backpropagating the loss of a reinforcement learning task to a model trained using supervised learning, and compare the performance before and after the fine-tuning. We find an increase in performance after fine-tuning, and transfer learning properties to the models, indicating the benefits of fine-tuning. We also highlight the tuning process and empirical results for future implementation by practitioners.