Thinking Like a Clinician: A Cognitive AI Agent for Clinical Diagnosis via Panoramic Profiling and Adversarial Debate

arXiv cs.AI / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that using LLMs for clinical decision support is hampered by tunnel vision and diagnostic hallucinations when processing unstructured EHR data.
  • It introduces DxChain, a chain-based clinical reasoning framework that models clinician cognition through iterative “Memory Anchoring,” “Navigation,” and “Verification” phases.
  • DxChain uses a Profile-Then-Plan approach to reduce cold-start hallucinations by first creating a panoramic baseline of the patient before proposing a plan.
  • It proposes a Medical Tree-of-Thoughts (Med-ToT) method for look-ahead planning with resource-aware navigation, plus an “Angel-Devil” adversarial debate to resolve conflicting evidence.
  • On two real-world benchmarks (MIMIC-IV-Ext Cardiac Disease and MIMIC-IV-Ext CDM), DxChain reports state-of-the-art diagnostic accuracy and improved logical consistency, with code made available.

Abstract

The application of large language models (LLMs) in clinical decision support faces significant challenges of "tunnel vision" and diagnostic hallucinations present in their processing unstructured electronic health records (EHRs). To address these challenges, we propose a novel chain-based clinical reasoning framework, called DxChain, which transforms the diagnostic workflow into an iterative process by mirroring a clinician's cognitive trajectory that consists of "Memory Anchoring", "Navigation" and "Verification" phases. DxChain introduces three key methodological innovations to elicit the potential of LLM: (i) a Profile-Then-Plan paradigm to mitigate cold-start hallucinations by establishing a panoramic patient baseline, (ii) a Medical Tree-of-Thoughts (Med-ToT) algorithm for strategic look ahead planning and resource aware navigation, and (iii) a Dialectical Diagnostic Verification procedure utilizing "Angel-Devil" adversarial debates to resolve complex evidence conflicts. Evaluated on two real world benchmarks, MIMIC-IV-Ext Cardiac Disease and MIMIC-IV-Ext CDM, DxChain achieves state-of-the-art performances in both diagnostic accuracy and logical consistency, offering a modular and reliable architecture for next-generation clinical AI. The code is at https://anonymous.4open.science/r/Dx-Chain.