Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees
arXiv cs.LG / 4/22/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces DSR (Decompose, Structure, and Repair), a neuro-symbolic framework for statement autoformalization that restructures the task into a modular pipeline rather than an end-to-end flat sequence approach.
- DSR decomposes natural-language mathematical statements into logical components and represents them as structured operator trees to localize mistakes and refine only the problematic sub-trees.
- It proposes PRIME, a newly created benchmark of 156 undergraduate and graduate theorems from canonical textbooks, annotated in Lean 4 by experts.
- Experimental results claim DSR achieves new state-of-the-art performance, outperforming existing baselines under the same computational budgets.
- The authors state that the datasets, model, and code will be released publicly soon.


