A Hierarchical Sheaf Spectral Embedding Framework for Single-Cell RNA-seq Analysis

arXiv cs.LG / 3/31/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a hierarchical sheaf spectral embedding (HSSE) method to generate cell-level representations for single-cell RNA-seq that capture heterogeneous local structure across multiple resolutions.
  • HSSE builds data-driven cellular sheaves for cell neighborhoods at multiple scales and uses persistent sheaf Laplacians over filtration intervals to extract spectral statistics describing how local relationships evolve.
  • Spectral descriptors across scales are aggregated into a unified feature vector per cell and are designed to be directly usable for downstream learning without requiring additional model training.
  • Experiments on twelve benchmark single-cell RNA-seq datasets show HSSE achieves competitive or improved classification performance versus existing multiscale and classical embedding approaches under a consistent evaluation protocol.
  • The authors argue the resulting sheaf-based representations are both robust and interpretable, offering a structured way to encode local relational geometry in single-cell data.

Abstract

Single-cell RNA-seq data analysis typically requires representations that capture heterogeneous local structure across multiple scales while remaining stable and interpretable. In this work, we propose a hierarchical sheaf spectral embedding (HSSE) framework that constructs informative cell-level features based on persistent sheaf Laplacian analysis. Starting from scale-dependent low-dimensional embeddings, we define cell-centered local neighborhoods at multiple resolutions. For each local neighborhood, we construct a data-driven cellular sheaf that encodes local relationships among cells. We then compute persistent sheaf Laplacians over sampled filtration intervals and extract spectral statistics that summarize the evolution of local relational structure across scales. These spectral descriptors are aggregated into a unified feature vector for each cell and can be directly used in downstream learning tasks without additional model training. We evaluate HSSE on twelve benchmark single-cell RNA-seq datasets covering diverse biological systems and data scales. Under a consistent classification protocol, HSSE achieves competitive or improved performance compared with existing multiscale and classical embedding-based methods across multiple evaluation metrics. The results demonstrate that sheaf spectral representations provide a robust and interpretable approach for single-cell RNA-seq data representation learning.