AI Navigate

Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects

arXiv cs.CL / 3/13/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • Surprisal theory with LLMs underpredicts processing difficulty when structural ambiguity is violated, suggesting explicit ambiguity representations matter for sentence processing.
  • Particle filter models explicitly represent multiple structural hypotheses as particles, offering a concrete alternative to single-representation surprisal accounts.
  • The authors prove that particle-filter-based processing can amplify garden-path effects, making misinterpretations more pronounced under ambiguity.
  • Resampling in these particle filters creates real-time digging-in effects, causing disambiguation difficulty to grow with the length of the ambiguous region.
  • The digging-in effect scales inversely with particle count, with fully parallel (high-particle) setups predicted to show no digging-in.

Abstract

Under surprisal theory, linguistic representations affect processing difficulty only through the bottleneck of surprisal. Our best estimates of surprisal come from large language models, which have no explicit representation of structural ambiguity. While LLM surprisal robustly predicts reading times across languages, it systematically underpredicts difficulty when structural expectations are violated -- suggesting that representations of ambiguity are causally implicated in sentence processing. Particle filter models offer an alternative where structural hypotheses are explicitly represented as a finite set of particles. We prove several algorithmic consequences of particle filter models, including the amplification of garden-path effects. Most critically, we demonstrate that resampling, a common practice with these models, inherently produces real-time digging-in effects -- where disambiguation difficulty increases with ambiguous region length. Digging-in magnitude scales inversely with particle count: fully parallel models predict no such effect.