Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells
arXiv cs.AI / 3/31/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- Lingshu-Cell is presented as a generative cellular “world model” that learns the distribution of single-cell transcriptomic states rather than producing only static embeddings.
- The approach uses a masked discrete diffusion model operating in a discrete token space suited to sparse, non-sequential single-cell transcriptomics, enabling simulation under perturbations.
- It is designed to model transcriptome-wide gene dependencies across ~18,000 genes without relying on common preprocessing such as filtering high-variability genes or ranking by expression.
- Across multiple tissues and species, the model reproduces transcriptomic distributions, marker-gene patterns, and cell-subtype proportions, indicating it captures cellular heterogeneity.
- By conditioning on cell type or donor identity together with perturbation, Lingshu-Cell can predict whole-transcriptome expression changes for novel identity–perturbation combinations and achieves strong results on Virtual Cell Challenge H1 and cytokine-response prediction benchmarks.
Related Articles
Why AI agent teams are just hoping their agents behave
Dev.to

Harness as Code: Treating AI Workflows Like Infrastructure
Dev.to

How to Make Claude Code Better at One-Shotting Implementations
Towards Data Science

The Crypto AI Agent Stack That Costs $0/Month to Run
Dev.to

Bag of Freebies for Training Object Detection Neural Networks
Dev.to