Spectral Alignment in Forward-Backward Representations via Temporal Abstraction

arXiv cs.LG / 3/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper demonstrates that temporal abstraction acts as a low-pass filter on the transition operator's spectrum, mitigating the mismatch between high-rank dynamics and low-rank FB architectures.
  • It derives a formal bound on the resulting value function error, showing the spectral simplification preserves accuracy under the proposed framework.
  • Empirical results indicate that temporal abstraction improves stability of forward-backward learning, especially at high discount factors where bootstrapping is prone to error.
  • The findings suggest temporal abstraction as a principled mechanism to shape the spectral properties of the MDP, enabling more effective long-horizon representations in continuous control.

Abstract

Forward-backward (FB) representations provide a powerful framework for learning the successor representation (SR) in continuous spaces by enforcing a low-rank factorization. However, a fundamental spectral mismatch often exists between the high-rank transition dynamics of continuous environments and the low-rank bottleneck of the FB architecture, making accurate low-rank representation learning difficult. In this work, we analyze temporal abstraction as a mechanism to mitigate this mismatch. By characterizing the spectral properties of the transition operator, we show that temporal abstraction acts as a low-pass filter that suppresses high-frequency spectral components. This suppression reduces the effective rank of the induced SR while preserving a formal bound on the resulting value function error. Empirically, we show that this alignment is a key factor for stable FB learning, particularly at high discount factors where bootstrapping becomes error-prone. Our results identify temporal abstraction as a principled mechanism for shaping the spectral structure of the underlying MDP and enabling effective long-horizon representations in continuous control.