Let the Agent Steer: Closed-Loop Ranking Optimization via Influence Exchange
arXiv cs.AI / 2026/3/31
📰 ニュースIdeas & Deep AnalysisIndustry & Market MovesModels & Research
要点
- The paper argues that recommendation ranking is an influence-allocation problem, where offline proxy metrics can bias the mapping from influence reallocation to online business outcomes, and simple single-factor calibration may not fix asymmetric errors.
- It introduces Sortify, described as the first fully autonomous LLM-driven ranking optimization agent for production recommender systems, designed to close the loop from diagnosis to parameter deployment without human intervention.
- Sortify reframes optimization as continuous influence exchange using a dual-channel SEU-based framework (Belief channel for offline-online transfer correction and Preference channel for constraint penalty adjustment).
- The system uses an LLM meta-controller that tunes higher-level framework parameters (not low-level search variables) and a persistent Memory DB (7 relational tables) for cross-round learning.
- Deployment results include improved GMV performance in Country A (from -3.6% to +9.2% over 7 rounds, with peak orders +12.5%) and strong cold-start gains in Country B (7-day A/B test: +4.15% GMV/UU and +3.58% ads revenue), leading to full rollout.
