Cross-Slice Knowledge Transfer via Masked Multi-Modal Heterogeneous Graph Contrastive Learning for Spatial Gene Expression Inference

arXiv cs.CV / 3/25/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes SpaHGC, a multi-modal heterogeneous graph contrastive learning framework to infer spatial gene expression from pathology images by modeling both within-slide and cross-slide relationships.
  • It uses cross-slide spot-spot similarities derived from embeddings produced by a pathology foundation model to enable knowledge transfer from reference slides to target slides.
  • SpaHGC incorporates masked graph contrastive learning to strengthen feature representations and better capture complex spatial dependencies relevant to gene expression.
  • Across seven matched histology–ST datasets spanning multiple platforms, tissues, and cancer subtypes, SpaHGC reportedly outperforms nine state-of-the-art baselines on all evaluation metrics.
  • The inferred gene-expression predictions show enrichment in multiple cancer-related pathways, supporting the biological relevance and potential applicability of the approach.

Abstract

While spatial transcriptomics (ST) has advanced our understanding of gene expression in tissue context, its high experimental cost limits its large-scale application. Predicting ST from pathology images is a promising, cost-effective alternative, but existing methods struggle to capture complex cross-slide spatial relationships. To address the challenge, we propose SpaHGC, a multi-modal heterogeneous graph-based model that captures both intra-slice and inter-slice spot-spot relationships from histology images. It integrates local spatial context within the target slide and cross-slide similarities computed from image embeddings extracted by a pathology foundation model. These embeddings enable inter-slice knowledge transfer, and SpaHGC further incorporates Masked Graph Contrastive Learning to enhance feature representation and transfer spatial gene expression knowledge from reference to target slides, enabling it to model complex spatial dependencies and significantly improve prediction accuracy. We conducted comprehensive benchmarking on seven matched histology-ST datasets from different platforms, tissues, and cancer subtypes. The results demonstrate that SpaHGC significantly outperforms the existing nine state-of-the-art methods across all evaluation metrics. Additionally, the predictions are significantly enriched in multiple cancer-related pathways, thereby highlighting its strong biological relevance and application potential.