Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis

arXiv cs.CV / 4/8/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper tackles canine pneumothorax diagnosis by addressing data scarcity and the need for more trustworthy, interpretable models through a new research dataset and diagnostic framework.
  • It introduces a publicly available, pixel-level annotated dataset and a diagnostic paradigm that combines lesion localization with spectral anomaly detection.
  • For localization, a Vision-Language Model (VLM) guides an iterative Flow Matching process to progressively refine segmentation masks for improved boundary accuracy.
  • For detection, the refined segmentation is used to isolate lesion features, which are then evaluated with Random Matrix Theory (RMT) to detect pneumothorax via statistically significant outlier eigenvalues rather than a conventional classifier.
  • The authors argue that high-fidelity mask purification is critical for RMT sensitivity and provide source code to support reproducibility.

Abstract

Automatic diagnosis of canine pneumothorax is challenged by data scarcity and the need for trustworthy models. To address this, we first introduce a public, pixel-level annotated dataset to facilitate research. We then propose a novel diagnostic paradigm that reframes the task as a synergistic process of signal localization and spectral detection. For localization, our method employs a Vision-Language Model (VLM) to guide an iterative Flow Matching process, which progressively refines segmentation masks to achieve superior boundary accuracy. For detection, the segmented mask is used to isolate features from the suspected lesion. We then apply Random Matrix Theory (RMT), a departure from traditional classifiers, to analyze these features. This approach models healthy tissue as predictable random noise and identifies pneumothorax by detecting statistically significant outlier eigenvalues that represent a non-random pathological signal. The high-fidelity localization from Flow Matching is crucial for purifying the signal, thus maximizing the sensitivity of our RMT detector. This synergy of generative segmentation and first-principles statistical analysis yields a highly accurate and interpretable diagnostic system (source code is available at: https://github.com/Pu-Wang-alt/Canine-pneumothorax).