Estimating heterogeneous treatment effects with survival outcomes via a deep survival learner

arXiv stat.ML / 4/14/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses how to estimate heterogeneous treatment effects in survival analysis with right-censoring and a time-varying estimand, where many existing methods only evaluate effects at a single prespecified time point.
  • It proposes a Deep Survival Learner (DSL) that uses a doubly robust pseudo-outcome to identify time-specific CATEs, remaining unbiased if either the outcome model or the treatment assignment model is correctly specified (with proper censoring handling).
  • DSL estimates a whole trajectory of treatment effects over a clinically relevant time spectrum by training a multi-output deep neural network with shared representations for joint estimation.
  • The authors provide theoretical error bounds for both pointwise and joint estimation across time, and argue that joint estimation can exploit temporal structure for more stable estimates under smoothness assumptions.
  • Simulations and an application to the Boston Lung Cancer Study show improved finite-sample performance, including under nuisance model misspecification, and uncover patient- and time-dependent heterogeneity in perioperative chemotherapy effects.

Abstract

Estimating heterogeneous treatment effects in survival settings is complicated by right censoring as well as the time-varying nature of the estimand. While the conditional average treatment effect (CATE) provides a natural target, most existing approaches focus on a single prespecified time point and do not account for the temporal trajectory, leading to instability in estimation. We propose a deep survival learner (DSL) for estimating heterogeneous treatment effects with right-censored outcomes. The method is based on a doubly robust pseudo-outcome whose conditional expectation identifies time-specific CATEs under standard assumptions. This construction remains unbiased if either the outcome model or the treatment assignment model is correctly specified, when properly accounting for censoring. To estimate CATEs over a clinically relevant time spectrum, DSL employs a multi-output deep neural network with shared representations, enabling joint estimation of treatment effect trajectories. From a theoretical perspective, we derive error bounds for both pointwise and joint estimation over time. We show that joint estimation can leverage temporal structure to control estimation error without incurring much additional approximation cost under smoothness conditions, leading to improved stability relative to separate estimation. Cross-fitting is incorporated to reduce overfitting and mitigate bias arising from flexible nuisance estimation. Simulation studies demonstrate favorable finite-sample performance, particularly under nuisance model misspecification. Applied to the Boston Lung Cancer Study, DSL reveals heterogeneity in the effects of perioperative chemotherapy across patient characteristics and over time.