Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning Control
arXiv cs.LG / 3/20/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes an adaptive stock price prediction framework that identifies deviations from normal market conditions and routes data through specialized prediction pathways.
- It comprises three components: an autoencoder trained on normal conditions to detect anomalies via reconstruction error, dual node transformer networks for stable and event-driven regimes, and a Soft Actor-Critic controller that tunes regime thresholds and blending weights based on prediction feedback.
- In experiments on 20 S&P 500 stocks from 1982–2025, the method achieves 0.68% MAPE without the RL controller and 0.59% MAPE with the full adaptive system, with directional accuracy around 72% and robust performance during high volatility where baseline MAPE exceeds 1.5%.
- Ablation studies show each component meaningfully contributes to performance: autoencoder routing accounts for about 36% of relative MAPE degradation when removed, SAC controller about 15%, and the dual-path architecture about 7%.
- The work suggests regime-aware, RL-guided forecasting can enhance stability and accuracy in financial markets and may inform deployment in trading systems that adapt to changing market regimes.