DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

arXiv cs.LG / 3/25/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper proposes DAK-UCB, a contextual bandit approach for routing prompts to the best available LLM or generative model while accounting for both fidelity and output diversity.
It addresses limitations of prior selection methods that optimize only prompt-based fidelity scores (e.g., CLIP-Score) by explicitly incorporating diversity-related metrics into model choice.
DAK-UCB uses prompt-aware diversity score functions derived from two-sample expectations over prompt-output pairs from prior rounds, enabling online selection with diversity goals.
The authors demonstrate the method using joint kernel distance and kernel entropy as diversity measures, showing improved diversity-aware model selection without sacrificing fidelity across sequences of prompts.
The work is shared on arXiv and provides accompanying code via a public GitHub repository.

Abstract

The expansion of generative AI and LLM services underscores the growing need for adaptive mechanisms to select an appropriate available model to respond to a user's prompts. Recent works have proposed offline and online learning formulations to identify the optimal generative AI model for an input prompt, based solely on maximizing prompt-based fidelity evaluation scores, e.g., CLIP-Score in text-to-image generation. However, such fidelity-based selection methods overlook the diversity of generated outputs, and hence, they can fail to address potential diversity shortcomings in the generated responses. In this paper, we introduce the Diversity-Aware Kernelized Upper Confidence Bound (DAK-UCB) method as a contextual bandit algorithm for the online selection of generative models with diversity considerations. The proposed DAK-UCB method incorporates both fidelity and diversity-related metrics into the selection process. We design this framework based on prompt-aware diversity score functions that decompose to a two-sample-based expectation over prompt-output pairs in the previous generation rounds. Specifically, we illustrate the application of our framework using joint kernel distance and kernel entropy measures. Our experimental results demonstrate the effectiveness of DAK-UCB in promoting diversity-aware model selection while maintaining fidelity in the generations for a sequence of prompts. The code is available at https://github.com/Donya-Jafari/DAK-UCB.

Lemonade 10.0.1 improves setup process for using AMD Ryzen AI NPUs on Linux

Reddit r/artificial

The 2026 Developer Showdown: Claude Code vs. Google Antigravity

Dev.to

Google March 2026 Spam Update: SEO Impact and What to Do Now | MKDM

Dev.to

CRM Development That Drives Growth

Dev.to

Karpathy's Autoresearch: Improving Agentic Coding Skills

Dev.to

DAK-UCB: Diversity-Aware Prompt Routing for LLMs and Generative Models

Key Points

Abstract

Related Articles

Lemonade 10.0.1 improves setup process for using AMD Ryzen AI NPUs on Linux

The 2026 Developer Showdown: Claude Code vs. Google Antigravity

Google March 2026 Spam Update: SEO Impact and What to Do Now | MKDM

CRM Development That Drives Growth

Karpathy's Autoresearch: Improving Agentic Coding Skills

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer