Intervention-Aware Multiscale Representation Learning from Imaging Phenomics and Perturbation Transcriptomics
arXiv cs.CV / 4/28/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes an intervention-aware multiscale representation learning method that uses perturbational transcriptomics to guide microscopy-based phenotypic profiling for drug discovery.
- It introduces a transcriptome-conditioned teacher that conditions on gene expression plus intervention metadata, producing soft distributions over a chemistry-aware codebook to capture drug similarity.
- A fine-tuned single-cell foundation model is used to encode cell-type context and disentangle dose effects, addressing weaknesses of prior multimodal approaches that rely on simple identity matching.
- An image-only student model distills the teacher’s mechanistic knowledge so it can run independently at test time, improving generalization to unseen interventions.
- Experiments on Cell Painting and RxRx with L1000 show significant gains in one-shot transfer to unseen interventions and drug-target gene discovery over self-supervised and alignment baselines, alongside theoretical risk-bound guarantees.


