Continuous Interpretive Steering for Scalar Diversity

arXiv cs.CL / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that pragmatic inference in LLMs should be evaluated as a graded phenomenon, since different lexical items can produce scalar implicatures with different strengths (scalar diversity).
  • It introduces Continuous Interpretive Steering (CIS), which treats steering strength at the activation level as a continuous variable to probe graded interpretive effects beyond simple prompt manipulations.
  • To enable this evaluation, the authors release a new dataset, GraSD, designed to encode graded scalar diversity across scalar items.
  • Experiments on four LLMs show that uniform activation steering amplifies pragmatic interpretations overall but eliminates item-level variation, while graded activation steering preserves and aligns interpretive shifts with the dataset’s scalar diversity grades.
  • The results suggest graded sensitivity is encoded in model representations and can be systematically recovered via controlled interventions, offering a principled framework for studying graded pragmatic behavior in LLMs.

Abstract

Pragmatic inference is inherently graded. Different lexical items give rise to pragmatic enrichment to different degrees. Scalar implicature exemplifies this property through scalar diversity, where implicature strength varies across scalar items. However, evaluations of pragmatic inference in large language models (LLMs) often rely on prompt-based manipulations. Beyond prompt-level effects, this study introduces Continuous Interpretive Steering (CIS), a method that probes graded pragmatic interpretation by treating activation-level steering strength as a continuous experimental variable. To support this analysis, this study introduces a new dataset, GraSD, which encodes graded scalar diversity. Experiments on four LLMs show that uniform activation steering increases pragmatic interpretations globally but collapses item-level variation, whereas graded activation steering yields differentiated interpretive shifts aligned with scalar diversity grades. It indicates that graded sensitivity is encoded in the representation space and can be systematically recovered through controlled intervention. Together, CIS and GraSD provide a principled framework for evaluating graded pragmatic sensitivity in LLMs.