RouteProfile: Elucidating the Design Space of LLM Profiles for Routing

arXiv cs.CL / 5/4/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that, in LLM routing, not only the router mechanism matters: how LLM “profiles” (capturing model capabilities) are designed can significantly affect routing performance.
  • It introduces RouteProfile, a structured design space for LLM profiles defined by four dimensions: organizational form, representation type, aggregation depth, and learning configuration.
  • Experiments across three representative routers show that structured profiles reliably outperform flat profiles, improving routing effectiveness.
  • The authors find that query-level signals are more reliable than coarse domain-level signals, and that generalization to newly introduced models benefits most from structured profiles with trainable configurations.
  • Overall, the work positions LLM profile design as a key research direction and aims to enable clearer, fairer comparisons between routing approaches by separating profile design from router design.

Abstract

As the large language model (LLM) ecosystem expands, individual models exhibit varying capabilities across queries, benchmarks, and domains, motivating the development of LLM routing. While prior work has largely focused on router mechanism design, LLM profiles, which capture model capabilities, remain underexplored. In this work, we ask: How does LLM profile design affect routing performance across different routers? Addressing this question helps clarify the role of profiles in routing, disentangle profile design from router design, and enable fairer comparison and more principled development of routing systems. To this end, we view LLM profiling as a structured information integration problem over heterogeneous interaction histories. We develop a general design space of LLM profiles, named RouteProfile, along four key dimensions: organizational form, representation type, aggregation depth, and learning configuration. Through systematic evaluation across three representative routers under both standard and new-LLM generalization settings, we show that: (1) structured profiles consistently outperform flat ones; (2) query-level signals are more reliable than coarse domain-level signals; and (3) generalization to newly introduced models benefits most from structured profiles under trainable configurations. Overall, our work highlights LLM profile design as an important direction for future routing research.