Time-driven Survival Analysis from FDG-PET/CT in Non-Small Cell Lung Cancer

arXiv cs.CV / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • 提案された深層回帰フレームワークは、NSCLCのFDG-PET/CT画像に加えて「時間ホライズン(days)」を入力として与え、全生存期間(OS)を時間依存の形で予測する方式を示しています。
  • ResNet-50で組織単位の画像埋め込みを作成し、画像埋め込みと時間情報を結合してOS確率を計算する構成で、画像のみを扱うベースラインよりAUCが4.3%改善しました。
  • U-CANコホート(学習556例、評価292例)で評価され、臨床+IDP特徴の強固な性能に加えて、画像モデルと臨床+IDPモデルのアンサンブルが最良の総合性能(0.788)を達成しています。
  • リスクを高/低のカテゴリに層別化でき、サリエンシー解析のヒートマップでは予測に腫瘍領域が重要であることが示唆されています。
  • 本研究は、画像と表形式データのマルチモーダル統合により、時間をパラメータ化した生存予測と個別化治療計画への応用可能性を示しています。

Abstract

Purpose: Automated medical image-based prediction of clinical outcomes, such as overall survival (OS), has great potential in improving patient prognostics and personalized treatment planning. We developed a deep regression framework using tissue-wise FDG-PET/CT projections as input, along with a temporal input representing a scalar time horizon (in days) to predict OS in patients with Non-Small Cell Lung Cancer (NSCLC). Methods: The proposed framework employed a ResNet-50 backbone to process input images and generate corresponding image embeddings. The embeddings were then combined with temporal data to produce OS probabilities as a function of time, effectively parameterizing the predictions based on time. The overall framework was developed using the U-CAN cohort (n = 556) and evaluated by comparing with a baseline method on the test set (n = 292). The baseline utilized the ResNet-50 architecture, processing only the images as input and providing OS predictions at pre-specified intervals, such as 2- or 5-year. Results: The incorporation of temporal data with image embeddings demonstrated an advantage in predicting OS, outperforming the baseline method with an improvement in AUC of 4.3%. The proposed model using clinical + IDP features achieved strong performance, and an ensemble of imaging and clinical + IDP models achieved the best overall performance (0.788), highlighting the complementary value of multimodal inputs. The proposed method also enabled risk stratification of patients into distinct categories (high vs low risk). Heat maps from the saliency analysis highlighted tumor regions as key structures for the prediction. Conclusion: Our method provided an automated framework for predicting OS as a function of time and demonstrates the potential of combining imaging and tabular data for improved survival prediction.