Expect the Unexpected? Testing the Surprisal of Salient Entities
arXiv cs.CL / 4/14/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper investigates how discourse entity salience affects surprisal, addressing a gap left by prior UID (Uniform Information Density) research that largely ignored participant salience.
- Using 70K manually annotated mentions across 16 English genres and a novel minimal-pair prompting method, the study finds globally salient entities have significantly higher surprisal than non-salient entities even after controlling for confounds like position and length.
- The authors also report that when salient entities are used as prompts, they systematically reduce surprisal for surrounding content, increasing overall document-level predictability.
- The magnitude of this prompt-driven predictability effect varies by genre, being strongest in topic-coherent texts and weakest in conversational contexts.
- Overall, the work refines the UID competing pressures framework by proposing global entity salience as a mechanism that shapes information distribution across discourse.
Related Articles

Don't forget, there is more than forgetting: new metrics for Continual Learning
Dev.to

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale
Dev.to
Bit of a strange question?
Reddit r/artificial

One URL for Your AI Agent: HTML, JSON, Markdown, and an A2A Card
Dev.to

One URL for Your AI Agent: HTML, JSON, Markdown, and an A2A Card
Dev.to