Intermediate Layers Encode Optimal Biological Representations in Single-Cell Foundation Models
arXiv cs.AI / 4/17/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The study challenges the common single-cell foundation model practice of using only final-layer embeddings as optimal feature representations for downstream tasks.
- It evaluates layer-wise representations from scFoundation (100M) and Tahoe-X1 (1.3B) for trajectory inference and perturbation response prediction, showing that optimal layers vary by task.
- For trajectory inference, the best layer occurs around 60% depth (about 31% above the final-layer choice), indicating a nontrivial relationship between depth and biological signal quality.
- For perturbation response prediction, optimal extraction layers shift widely (0–96%) depending on T cell activation context, highlighting strong context dependence.
- The results also find that first-layer embeddings can outperform deeper layers in quiescent cells, suggesting that “hierarchical abstraction” assumptions may not universally hold.
Related Articles
langchain-anthropic==1.4.1
LangChain Releases

🚀 Anti-Gravity Meets Cloud AI: The Future of Effortless Development
Dev.to

Talk to Your Favorite Game Characters! Mantella Brings AI to Skyrim and Fallout 4 NPCs
Dev.to

AI Will Run Companies. Here's Why That Should Excite You, Not Scare You.
Dev.to

The problem with Big Tech AI pricing (and why 8 countries can't afford to compete)
Dev.to