AI Navigate

From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space

arXiv cs.AI / 3/16/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper identifies a modality gap between textual information and numeric signals in time-series forecasting and introduces Temporal Evolution Semantic Space (TESS) to bridge the gap.
  • TESS extracts interpretable temporal primitives (mean shift, volatility, shape, lag) from text using structured prompting of an LLM and filters signals with confidence-aware gating.
  • Experiments on four real-world datasets and semi-synthetic tests show up to a 29% reduction in forecasting error over state-of-the-art unimodal and multimodal baselines.
  • The authors plan to release the code after acceptance, signaling ongoing publication and reproducibility efforts.

Abstract

Incorporating textual information into time-series forecasting holds promise for addressing event-driven non-stationarity; however, a fundamental modality gap hinders effective fusion: textual descriptions express temporal impacts implicitly and qualitatively, whereas forecasting models rely on explicit and quantitative signals. Through controlled semi-synthetic experiments, we show that existing methods over-attend to redundant tokens and struggle to reliably translate textual semantics into usable numerical cues. To bridge this gap, we propose TESS, which introduces a Temporal Evolution Semantic Space as an intermediate bottleneck between modalities. This space consists of interpretable, numerically grounded temporal primitives (mean shift, volatility, shape, and lag) extracted from text by an LLM via structured prompting and filtered through confidence-aware gating. Experiments on four real-world datasets demonstrate up to a 29 percent reduction in forecasting error compared to state-of-the-art unimodal and multimodal baselines. The code will be released after acceptance.