A Modular LLM Framework for Explainable Price Outlier Detection

arXiv cs.CL / 3/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an agentic LLM framework for explainable detection of retail/e-commerce price outliers by grounding decisions in semantically related product attributes rather than simple thresholds.
  • The method works in three stages: relevance classification to find similar, price-relevant products; relative utility assessment across price-influencing dimensions; and reasoning-based aggregation to produce an outlier verdict with justifications.
  • Experiments report over 75% agreement with human auditors and improved performance versus zero-shot and retrieval-based LLM approaches on the test dataset.
  • Ablation results indicate the approach’s sensitivity to key hyper-parameters while also showing it can be adapted to different accuracy requirements and auditor-agreement targets.

Abstract

Detecting product price outliers is important for retail and e-commerce stores as erroneous or unexpectedly high prices adversely affect competitiveness, revenue, and consumer trust. Classical techniques offer simple thresholds while ignoring the rich semantic relationships among product attributes. We propose an agentic Large Language Model (LLM) framework that treats outlier price flagging as a reasoning task grounded in related product detection and comparison. The system processes the prices of target products in three stages: (i) relevance classification selects price-relevant similar products using product descriptions and attributes; (ii) relative utility assessment evaluates the target product against each similar product along price influencing dimensions (e.g., brand, size, features); (iii) reasoning-based decision aggregates these justifications into an explainable price outlier judgment. The framework attains over 75% agreement with human auditors on a test dataset, and outperforms zero-shot and retrieval based LLM techniques. Ablation studies show the sensitivity of the method to key hyper-parameters and testify on its flexibility to be applied to cases with different accuracy requirement and auditor agreements.