ISAAC: Auditing Causal Reasoning in Deep Models for Drug-Target Interaction

arXiv cs.LG / 5/6/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • ISAAC is a post-hoc auditing framework designed to test whether deep learning models for drug–target interaction (DTI) rely on mechanistically meaningful molecular signals rather than spurious correlations.
  • It probes frozen models using matched input interventions that separately target mechanistic vs. spurious structural features, and it does so independently of predictive accuracy metrics.
  • Experiments on the Davis benchmark across three sequence-based DTI architectures show ~25% relative differences in “reasoning scores” between models even when AUROC is similar (within ~3%), indicating accuracy alone can miss causal-reasoning gaps.
  • The observed discrepancies are stable across different training runs and intervention random seeds and hold under two distinct perturbation operators, supporting robustness of the auditing approach.
  • The work argues that structural causal auditing should complement standard accuracy-based evaluation in scientific machine learning for molecular modeling.

Abstract

Deep learning models for drug--target interaction (DTI) prediction often achieve strong benchmark performance without necessarily relying on mechanistically meaningful molecular features, a limitation that standard accuracy-based evaluation cannot detect. We introduce ISAAC (Intervention-based Structural Auditing Approach for Causal Reasoning), a post-hoc framework that evaluates prior-relative structural sensitivity by probing frozen models through matched mechanistic and spurious input-level interventions, independently of predictive accuracy. Applied to three sequence-based DTI architectures on the Davis benchmark, ISAAC reveals approximately 25\% relative differences in reasoning scores across models with comparable AUROC (within around 3\%), stable across training and intervention seeds and two distinct perturbation operators. These discrepancies, undetectable under conventional accuracy metrics, motivate the use of post-hoc structural auditing as a complement to standard performance evaluation in scientific machine learning for molecular modeling.