Reviewing the Reviewer: Graph-Enhanced LLMs for E-commerce Appeal Adjudication

arXiv cs.CL / 3/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces the Evidence-Action-Factor-Decision (EAFD) schema to ground reasoning in verifiable actions, reducing hallucinations in e-commerce appeal adjudication.
  • It presents a conflict-aware graph reasoning framework that builds EAFD graphs from Maker-Checker historical cases, aggregates them into a retrievable knowledge base, and enables top-down deductive reasoning using precedents.
  • The framework adds a Request More Information (RMI) feature that identifies unexecuted verification actions and generates targeted information requests when evidence is insufficient.
  • Empirical results show LLM-only baselines achieve 70.8% alignment with human experts, action modeling with RMI raises alignment to 87.5%, and adding the knowledge-graph retrieval yields 95.8% offline and 96.3% in production.
  • The study demonstrates strong real-world effectiveness for large-scale seller appeal adjudication in e-commerce.

Abstract

Hierarchical review workflows, where a second-tier reviewer (Checker) corrects first-tier (Maker) decisions, generate valuable correction signals that encode why initial judgments failed. However, learning from these signals is hindered by information asymmetry: corrections often depend on verification actions unavailable to Makers or automated systems. We address this challenge by introducing explicit action modeling as an inferential constraint that grounds reasoning in verifiable operations rather than unconstrained text generation. We propose the Evidence-Action-Factor-Decision (EAFD) schema, a minimal representation for adjudication reasoning that prevents hallucination through operational grounding and enables learning from correction signals via explicit conflict modeling. Building on this schema, we develop a conflict-aware graph reasoning framework that: (1) constructs EAFD graphs from historical cases capturing Maker-Checker disagreements, (2) aggregates them into a retrievable knowledge base, and (3) performs top-down deductive reasoning for new cases by projecting validated resolution paths from precedents. A distinctive capability is the Request More Information (RMI) outcome: when evidence is insufficient, the system identifies precisely which verification actions remain unexecuted and generates targeted information requests. We evaluate the framework in large-scale e-commerce seller appeal adjudication. While a standard LLM-only baseline achieves only 70.8% alignment with human experts, incorporating action modeling with RMI improves alignment to 87.5%. Augmenting this with the retrieval-based knowledge graph yields the best offline performance of 95.8%. Following online deployment, the framework maintains robust performance, achieving a 96.3% alignment rate in production, demonstrating its real-world effectiveness.