Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases

arXiv cs.CL / 4/29/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces Dynamic Decision Learning (DDL), a method for improving abnormality “grounding” in rare diseases when labeled data are scarce and supervised fine-tuning is impractical.
DDL keeps large vision-language models (LVLMs) frozen while iteratively refining their decisions in both language and visual spaces by optimizing instructions and consolidating predictions under visual perturbations.
Experiments on brain imaging benchmarks (including a rare-disease dataset with 281 pathology types) show DDL can boost rare-disease localization performance (up to 105% mAP@75) and exceed adaptation and supervised fine-tuning baselines.
The framework also yields consensus-based reliability scores that better reflect localization accuracy, improving calibration especially under distribution shifts and as tasks become harder.
The authors provide code publicly to support reproduction and further exploration of the DDL approach.

Abstract

Clinical abnormality grounding for rare diseases is often hindered by data scarcity, making supervised fine-tuning impractical and single-pass inference highly unstable. We propose Dynamic Decision Learning (DDL), a framework that enables frozen large vision-language models (LVLMs) to refine their decisions across both language and visual spaces by optimizing instructions and consolidating predictions under visual perturbations. This process improves localization quality and produces a consensus-based reliability score that quantifies model confidence. Results on brain imaging benchmarks, including a rare-disease dataset with 281 pathology types across models ranging from 3B to 72B parameters, show that DDL improves mAP@75 by up to 105% on rare-disease cases and outperforms adaptation baselines and supervised fine-tuning. Furthermore, DDL demonstrates stronger calibration between reliability scores and localization accuracy under severe distribution shifts and increasing task difficulty. Code is available at: https://lijunrio.github.io/DDL/

LLMs will be a commodity

Reddit r/artificial

Indian Developers: How to Build AI Side Income with $0 Capital in 2026

Dev.to

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally

Reddit r/LocalLLaMA

Dex lands $5.3M to grow its AI-driven talent matching platform

Tech.eu

AI Citation Registry: Why Daily Updates Leave No Time for Data Structuring

Dev.to

Dynamic Decision Learning: Test-Time Evolution for Abnormality Grounding in Rare Diseases

Key Points

Abstract

Related Articles

LLMs will be a commodity

Indian Developers: How to Build AI Side Income with $0 Capital in 2026

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally

Dex lands $5.3M to grow its AI-driven talent matching platform

AI Citation Registry: Why Daily Updates Leave No Time for Data Structuring

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer