MAT-Cell: A Multi-Agent Tree-Structured Reasoning Framework for Batch-Level Single-Cell Annotation

arXiv cs.AI / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces MAT-Cell, a neuro-symbolic multi-agent tree-structured reasoning framework for batch-level single-cell annotation that targets failures of supervised models and LLMs under out-of-distribution cell states and noisy transcriptomic signals.
  • MAT-Cell shifts from black-box classification to constructive, verifiable proof generation by injecting symbolic constraints via adaptive Retrieval-Augmented Generation (RAG) grounded in biological axioms.
  • It uses a dialectic verification mechanism with homogeneous rebuttal agents to audit and prune reasoning paths, producing syllogistic derivation trees that enforce logical consistency.
  • Experiments on large-scale, cross-species benchmarks show MAT-Cell outperforming state-of-the-art methods and maintaining robustness in difficult settings where baseline approaches degrade sharply.
  • The authors provide an open-source code repository to enable replication and further experimentation with the framework.

Abstract

Automated cellular reasoning faces a core dichotomy: supervised methods fall into the Reference Trap and fail to generalize to out-of-distribution cell states, while large language models (LLMs), without grounded biological priors, suffer from a Signal-to-Noise Paradox that produces spurious associations. We propose MAT-Cell, a neuro-symbolic reasoning framework that reframes single-cell analysis from black-box classification into constructive, verifiable proof generation. MAT-Cell injects symbolic constraints through adaptive Retrieval-Augmented Generation (RAG) to ground neural reasoning in biological axioms and reduce transcriptomic noise. It further employs a dialectic verification process with homogeneous rebuttal agents to audit and prune reasoning paths, forming syllogistic derivation trees that enforce logical consistency.Across large-scale and cross-species benchmarks, MAT-Cell significantly outperforms state-of-the-art (SOTA) models and maintains robust per-formance in challenging scenarios where baselinemethods severely degrade. Code is available at https://gith ub.com/jiangliu91/MAT-Cell-A-Mul ti-Agent-Tree-Structured-Reasoni ng-Framework-for-Batch-Level-Sin gle-Cell-Annotation.