AI Navigate

Nodule-Aligned Latent Space Learning with LLM-Driven Multimodal Diffusion for Lung Nodule Progression Prediction

arXiv cs.CV / 3/18/2026

📰 NewsModels & Research

Key Points

  • Introduces Nodule-Aligned Multimodal Diffusion (NAMD), a framework to predict lung nodule progression by generating 1-year follow-up CT images from baseline scans and the patient’s EHR data.
  • A nodule-aligned latent space is learned, where distances between latents correspond to changes in nodule attributes, coupled with an LLM-driven control mechanism to condition the diffusion backbone on patient information.
  • On the NLST dataset, NAMD achieves AUROC 0.805 and AUPRC 0.346 for malignancy prediction, outperforming baseline scans and state-of-the-art synthesis methods and approaching the performance of real follow-up scans (AUROC 0.819, AUPRC 0.393).
  • The results suggest NAMD can capture clinically relevant features of nodule progression, potentially enabling earlier and more accurate diagnosis, while noting the need for further validation before clinical deployment.

Abstract

Early diagnosis of lung cancer is challenging due to biological uncertainty and the limited understanding of the biological mechanisms driving nodule progression. To address this, we propose Nodule-Aligned Multimodal (Latent) Diffusion (NAMD), a novel framework that predicts lung nodule progression by generating 1-year follow-up nodule computed tomography images with baseline scans and the patient's and nodule's Electronic Health Record (EHR). NAMD introduces a nodule-aligned latent space, where distances between latents directly correspond to changes in nodule attributes, and utilizes an LLM-driven control mechanism to condition the diffusion backbone on patient data. On the National Lung Screening Trial (NLST) dataset, our method synthesizes follow-up nodule images that achieve an AUROC of 0.805 and an AUPRC of 0.346 for lung nodule malignancy prediction, significantly outperforming both baseline scans and state-of-the-art synthesis methods, while closely approaching the performance of real follow-up scans (AUROC: 0.819, AUPRC: 0.393). These results demonstrate that NAMD captures clinically relevant features of lung nodule progression, facilitating earlier and more accurate diagnosis.