Consistency Analysis of Sentiment Predictions using Syntactic & Semantic Context Assessment Summarization (SSAS)

arXiv cs.AI / 4/20/2026

💬 OpinionModels & Research

Key Points

  • The paper addresses a key enterprise issue with LLM-based sentiment analytics: model stochasticity can make sentiment predictions inconsistent and too volatile for decision-making.
  • It proposes the SSAS (Syntactic & Semantic Context Assessment Summarization) framework to stabilize outputs by building context that constrains LLM attention via bounded, pre-processing-style guidance.
  • SSAS uses a hierarchical structure (Themes, Stories, Clusters) and an iterative Summary-of-Summaries (SoS) mechanism to compute context and generate higher-signal sentiment-focused prompts.
  • In experiments on three sentiment datasets (Amazon, Google Business, and Goodreads) using Gemini 2.0 Flash Lite, SSAS improved data quality by up to 30% compared with a direct-LLM baseline across multiple robustness scenarios.
  • The authors conclude that more consistent context estimation yields a steadier and more reliable evidentiary basis for enterprise decisions.

Abstract

The fundamental challenge of using Large Language Models (LLMs) for reliable, enterprise-grade analytics, such as sentiment prediction, is the conflict between the LLMs' inherent stochasticity (generative, non-deterministic nature) and the analytical requirement for consistency. The LLM inconsistency, coupled with the noisy nature of chaotic modern datasets, renders sentiment predictions too volatile for strategic business decisions. To resolve this, we present a Syntactic & Semantic Context Assessment Summarization (SSAS) framework for establishing context. Context established by SSAS functions as a sophisticated data pre-processing framework that enforces a bounded attention mechanism on LLMs. It achieves this by applying a hierarchical classification structure (Themes, Stories, Clusters) and an iterative Summary-of-Summaries (SoS) based context computation architecture. This endows the raw text with high-signal, sentiment-dense prompts, that effectively mitigate both irrelevant data and analytical variance. We empirically evaluated the efficacy of SSAS, using Gemini 2.0 Flash Lite, against a direct-LLM approach across three industry-standard datasets - Amazon Product Reviews, Google Business Reviews, Goodreads Book Reviews - and multiple robustness scenarios. Our results show that our SSAS framework is capable of significantly improving data quality, up to 30%, through a combination of noise removal and improvement in the estimation of sentiment prediction. Ultimately, consistency in our context-estimation capabilities provides a stable and reliable evidence base for decision-making.