MedRoute: RL-Based Dynamic Specialist Routing in Multi-Agent Medical Diagnosis

arXiv cs.LG / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • MedRoute は、各エージェントを「専門医」と見立てた動的なマルチエージェント LMM 診断フレームワークで、実臨床の“症状に応じた専門家の切り替え”を模倣することを目指しています。
  • 従来のように専門家を固定・事前定義するのではなく、General Practitioner に RL(強化学習)で訓練したルータを組み込み、状況の変化に応じて適切な専門家を動的に選択します。
  • 最終判断は Moderator が統合して出力し、各専門家の見解を協調的に扱える設計になっています。
  • テキスト/画像の医療データセットで広範に評価され、診断精度で既存の最先端ベースラインを上回ったと報告されています。
  • 研究の再現性のため、コードとモデルが GitHub(UCF-CRCV/MedRoute)で公開されています。

Abstract

Medical diagnosis using Large Multimodal Models (LMMs) has gained increasing attention due to capability of these models in providing precise diagnoses. These models generally combine medical questions with visual inputs to generate diagnoses or treatments. However, they are often overly general and unsuitable under the wide range of medical conditions in real-world healthcare. In clinical practice, diagnosis is performed by multiple specialists, each contributing domain-specific expertise. To emulate this process, a potential solution is to deploy a dynamic multi-agent LMM framework, where each agent functions as a medical specialist. Current approaches in this emerging area, typically relying on static or predefined selection of various specialists, cannot be adapted to the changing practical scenario. In this paper, we propose MedRoute, a flexible and dynamic multi-agent framework that comprises of a collaborative system of specialist LMM agents. Furthermore, we add a General Practitioner with an RL-trained router for dynamic specialist selection, and a Moderator that produces the final decision. In this way, our framework closely mirrors real clinical workflows. Extensive evaluations on text and image-based medical datasets demonstrate improved diagnostic accuracy, outperforming the state-of-the-art baselines. Our work lays a strong foundation for future research. Code and models are available at https://github.com/UCF-CRCV/MedRoute/.