AI Navigate

Dissecting Chronos: Sparse Autoencoders Reveal Causal Feature Hierarchies in Time Series Foundation Models

arXiv cs.AI / 3/12/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The authors apply TopK Sparse Autoencoders to Chronos-T5-Large activations across six layers to study representations in time series foundation models.
  • Through 392 single-feature ablation experiments, they show every ablated feature leads to degradation in CRPS, indicating causal relevance of individual features.
  • They reveal a depth-dependent hierarchy: early layers encode low-level frequency features, mid layers detect abrupt changes with strong causal importance, and final layers compress a broader taxonomy with less causal impact.
  • The mid-encoder contains the most critical features (max Delta CRPS = 38.61), while ablation of the final encoder can unexpectedly improve forecast quality.
  • The results support mechanistic interpretability transfer to TSFMs and suggest Chronos-T5 relies on abrupt-dynamics detection rather than periodic pattern recognition.

Abstract

Time series foundation models (TSFMs) are increasingly deployed in high-stakes domains, yet their internal representations remain opaque. We present the first application of sparse autoencoders (SAEs) to a TSFM, training TopK SAEs on activations of Chronos-T5-Large (710M parameters) across six layers. Through 392 single-feature ablation experiments, we establish that every ablated feature produces a positive CRPS degradation, confirming causal relevance. Our analysis reveals a depth-dependent hierarchy: early encoder layers encode low-level frequency features, the mid-encoder concentrates causally critical change-detection features, and the final encoder compresses a rich but less causally important taxonomy of temporal concepts. The most critical features reside in the mid-encoder (max single-feature Delta CRPS = 38.61), not in the semantically richest final encoder layer, where progressive ablation paradoxically improves forecast quality. These findings demonstrate that mechanistic interpretability transfers effectively to TSFMs and that Chronos-T5 relies on abrupt-dynamics detection rather than periodic pattern recognition.