Beyond Relevance: Utility-Centric Retrieval in the LLM Era

arXiv cs.CL / 4/13/2026

💬 OpinionIdeas & Deep Analysis

Key Points

  • The article argues that traditional information retrieval, which optimizes for topical relevance, should be reframed around “utility,” meaning whether retrieved content helps users accomplish their actual task.
  • It explains how retrieval-augmented generation (RAG) changes evaluation: retrieved documents function as evidence for LLMs, so success should be measured by downstream generation quality rather than relevance-only ranking metrics.
  • It proposes a unified framework distinguishing LLM-agnostic vs. LLM-specific utility and context-independent vs. context-dependent utility.
  • It connects these utility concepts to LLM information needs and agentic RAG, outlining how retrieval objectives are shifting toward LLM-centric goals.
  • The piece presents both conceptual foundations and practical guidance for designing retrieval systems aligned with LLM-based information access requirements.

Abstract

Information retrieval systems have traditionally optimized for topical relevance-the degree to which retrieved documents match a query. However, relevance only approximates a deeper goal: utility, namely, whether retrieved information helps accomplish a user's underlying task. The emergence of retrieval-augmented generation (RAG) fundamentally changes this paradigm. Retrieved documents are no longer consumed directly by users but instead serve as evidence for large language models (LLMs) that produce answers. As a result, retrieval effectiveness must be evaluated by its contribution to generation quality rather than by relevance-based ranking metrics alone. This tutorial argues that retrieval objectives are evolving from relevance-centric optimization toward LLM-centric utility. We present a unified framework covering LLM-agnostic versus LLM-specific utility, context-independent versus context-dependent utility, and the connection with LLM information needs and agentic RAG. By synthesizing recent advances, the tutorial provides conceptual foundations and practical guidance for designing retrieval systems aligned with the requirements of LLM-based information access.