Exhaustive Circuit Mapping of a Single-Cell Foundation Model Reveals Massive Redundancy, Heavy-Tailed Hub Architecture, and Layer-Dependent Differentiation Control
arXiv cs.LG / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Exhaustive circuit tracing of all 4065 active sparse autoencoder features at layer 5 in Geneformer reveals 1,393,850 significant downstream edges, a 27-fold expansion over prior selective sampling.
- The results show a heavy-tailed hub distribution, with 1.8% of features accounting for the majority of connectivity and 40% of the top 20 hubs lacking biological annotation, indicating bias in previous analyses.
- Three-way combinatorial ablation across eight feature triplets shows redundancy deepening with interaction order (three-way ratio 0.59 vs. pairwise 0.74) and zero synergy, indicating subadditivity.
- Trajectory-guided feature steering establishes a causal link between layer position and differentiation direction, with late-layer features at L17 pushing toward maturity (fraction positive = 1.0) while early/mid-layer features at L0 and L11 push away from maturity (0.00–0.58).
- Taken together, these findings provide causal evidence for layer-dependent control of cell state in a single-cell foundation model, moving beyond previous correlational analyses.
Related Articles
Is AI becoming a bubble, and could it end like the dot-com crash?
Reddit r/artificial

Externalizing State
Dev.to

I made a 'benchmark' where LLMs write code controlling units in a 1v1 RTS game.
Dev.to

My AI Does Not Have a Clock
Dev.to
How to settle on a coding LLM ? What parameters to watch out for ?
Reddit r/LocalLLaMA