Rethinking Adam for Time Series Forecasting: A Simple Heuristic to Improve Optimization under Distribution Shifts
arXiv cs.LG / 3/12/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- TS_Adam is a lightweight variant of Adam that removes the second-order bias correction to improve responsiveness to distributional drift in non-stationary time-series forecasting.
- The modification preserves the core optimizer structure and requires no additional hyperparameters.
- Empirical results on ETT datasets with the MICN model show average improvements of 12.8% in MSE and 5.7% in MAE over Adam.
- The approach is easy to integrate into existing models and code is available on GitHub.
- The work highlights that removing second-order corrections can enhance adaptability to shifting loss landscapes, offering a practical optimization strategy for real-world non-stationary data.



