DCG-Net: Dual Cross-Attention with Concept-Value Graph Reasoning for Interpretable Medical Diagnosis

arXiv cs.CV / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces DCG-Net, an end-to-end interpretable medical diagnosis framework aimed at making deep model decision processes more transparent.
  • DCG-Net uses a Dual Cross-Attention mechanism to align visual tokens with canonicalized textual concept-value prototypes, enabling spatially localized evidence attribution rather than relying on cosine similarity matching.
  • It incorporates a Parametric Concept Graph initialized with Positive Pointwise Mutual Information priors and refined via sparsity-controlled message passing to model dependencies among clinical concepts.
  • Experiments on white blood cell morphology and skin lesion diagnosis report state-of-the-art classification performance alongside clinically interpretable diagnostic explanations.

Abstract

Deep learning models have achieved strong performance in medical image analysis, but their internal decision processes remain difficult to interpret. Concept Bottleneck Models (CBMs) partially address this limitation by structuring predictions through human-interpretable clinical concepts. However, existing CBMs typically overlook the contextual dependencies among concepts. To address these issues, we propose an end-to-end interpretable framework \emph{DCG-Net} that integrates multimodal alignment with structured concept reasoning. DCG-Net introduces a Dual Cross-Attention module that replaces cosine similarity matching with bidirectional attention between visual tokens and canonicalized textual concept-value prototypes, enabling spatially localized evidence attribution. To capture the relational structure inherent to clinical concepts, we develop a Parametric Concept Graph initialized with Positive Pointwise Mutual Information priors and refined through sparsity-controlled message passing. This formulation models inter-concept dependencies in a manner consistent with clinical domain knowledge. Experiments on white blood cell morphology and skin lesion diagnosis demonstrate that DCG-Net achieves state-of-the-art classification performance while producing clinically interpretable diagnostic explanations.