AI Navigate

Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

arXiv cs.CL / 3/17/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • Introduces Delta-Consistent Scoring (DCS), an annotation-free framework that maps frozen LLM representations to continuous hawkish–dovish stance scores for FOMC statements.
  • DCS jointly models absolute stance and relative inter-meeting shifts by using consecutive meetings as self-supervision to learn a temporally coherent stance trajectory.
  • It does not rely on manual hawkish–dovish labels, enabling scalable stance extraction without labeled data.
  • Across four LLM backbones, DCS outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level classification.
  • The resulting meeting-level scores correlate with inflation indicators and are associated with Treasury yield movements, suggesting the method captures economically meaningful policy signals.

Abstract

Federal Open Market Committee (FOMC) statements are a major source of monetary-policy information, and even subtle changes in their wording can move global financial markets. A central task is therefore to measure the hawkish--dovish stance conveyed in these texts. Existing approaches typically treat stance detection as a standard classification problem, labeling each statement in isolation. However, the interpretation of monetary-policy communication is inherently relative: market reactions depend not only on the tone of a statement, but also on how that tone shifts across meetings. We introduce Delta-Consistent Scoring (DCS), an annotation-free framework that maps frozen large language model (LLM) representations to continuous stance scores by jointly modeling absolute stance and relative inter-meeting shifts. Rather than relying on manual hawkish--dovish labels, DCS uses consecutive meetings as a source of self-supervision. It learns an absolute stance score for each statement and a relative shift score between consecutive statements. A delta-consistency objective encourages changes in absolute scores to align with the relative shifts. This allows DCS to recover a temporally coherent stance trajectory without manual labels. Across four LLM backbones, DCS consistently outperforms supervised probes and LLM-as-judge baselines, achieving up to 71.1% accuracy on sentence-level hawkish--dovish classification. The resulting meeting-level scores are also economically meaningful: they correlate strongly with inflation indicators and are significantly associated with Treasury yield movements. Overall, the results suggest that LLM representations encode monetary-policy signals that can be recovered through relative temporal structure.