A Multimodal Foundation Model of Spatial Transcriptomics and Histology for Biological Discovery and Clinical Prediction
arXiv cs.AI / 4/7/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- STORM is introduced as a multimodal foundation model that learns from 1.2M spatial transcriptomics profiles matched to H&E histology across 18 organs to bridge imaging and molecular omics.
- The model uses a hierarchical architecture combining morphology, gene expression, and spatial context to generate robust molecular–morphological representations for spatial domain discovery.
- STORM reportedly improves the prediction of spatial gene expression from H&E images across 11 tumor types compared with existing approaches.
- The approach is described as platform-agnostic, with consistent performance across major spatial transcriptomics platforms (Visium, Xenium, Visium HD, CosMx).
- In 23 independent patient cohorts (7,245 patients), STORM is claimed to significantly improve immunotherapy response prediction and prognostication beyond established biomarkers, supporting scalable clinical precision medicine use.
Related Articles

Black Hat Asia
AI Business

Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual shareholder letter
TechCrunch

Why Anthropic’s new model has cybersecurity experts rattled
Reddit r/artificial
Does the AI 2027 paper still hold any legitimacy?
Reddit r/artificial

Why Most Productivity Systems Fail (And What to Do Instead)
Dev.to