MIOFlow 2.0: A unified framework for inferring cellular stochastic dynamics from single cell and spatial transcriptomics data

arXiv cs.LG / 3/25/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • MIOFlow 2.0は、単一細胞および空間トランスクリプトミクスの離散スナップショットから、連続的な細胞分化・再生などの確率的軌跡を推定する統一フレームワークを提案します。
  • その中核として、分岐をNeural Stochastic Differential Equationsで表現し、増殖による非保存的な集団変化を成長率モデル(unbalanced optimal transport初期化)で学習し、環境(ニッチ)影響を遺伝子発現と空間特徴を統合する潜在表現で扱います。
  • PHATE距離に基づくマッチングオートエンコーダの潜在空間で動作させることで、推定軌跡がデータの固有の幾何に沿うよう設計されています。
  • 合成データ、embryoid body分化、axolotl脳の空間的再生で検証され、既存の生成モデル(simulation-free flow matching等)より軌跡精度が向上し、シグナルニッチのような潜在的な駆動因子を明らかにできたと報告されています。

Abstract

Understanding cellular trajectories via time-resolved single-cell transcriptomics is vital for studying development, regeneration, and disease. A key challenge is inferring continuous trajectories from discrete snapshots. Biological complexity stems from stochastic cell fate decisions, temporal proliferation changes, and spatial environmental influences. Current methods often use deterministic interpolations treating cells in isolation, failing to capture the probabilistic branching, population shifts, and niche-dependent signaling driving real biological processes. We introduce Manifold Interpolating Optimal-Transport Flow (MIOFlow) 2.0. This framework learns biologically informed cellular trajectories by integrating manifold learning, optimal transport, and neural differential equations. It models three core processes: (1) stochasticity and branching via Neural Stochastic Differential Equations; (2) non-conservative population changes using a learned growth-rate model initialized with unbalanced optimal transport; and (3) environmental influence through a joint latent space unifying gene expression with spatial features like local cell type composition and signaling. By operating in a PHATE-distance matching autoencoder latent space, MIOFlow 2.0 ensures trajectories respect the data's intrinsic geometry. Empirical comparisons show expressive trajectory learning via neural differential equations outperforms existing generative models, including simulation-free flow matching. Validated on synthetic datasets, embryoid body differentiation, and spatially resolved axolotl brain regeneration, MIOFlow 2.0 improves trajectory accuracy and reveals hidden drivers of cellular transitions, like specific signaling niches. MIOFlow 2.0 thus bridges single-cell and spatial transcriptomics to uncover tissue-scale trajectories.