FormalScience: Scalable Human-in-the-Loop Autoformalisation of Science with Agentic Code Generation in Lean
arXiv cs.AI / 4/28/2026
📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
Key Points
- The paper introduces FormalScience, a domain-agnostic human-in-the-loop, agentic pipeline that helps experts convert informal scientific/mathematical reasoning into syntactically correct and semantically aligned Lean formal proofs at low economic cost.
- It demonstrates the approach in physics by building FormalPhysics, a dataset of 200 university-level physics problems and solutions (mostly quantum mechanics and electromagnetism) paired with Lean4 formalizations.
- The authors evaluate both open-source models and proprietary systems for statement autoformalisation using zero-shot prompting, self-refinement with error feedback, and a new multi-stage agentic method.
- They provide a systematic analysis of “semantic drift” in physics autoformalisation, identifying issues like notational collapse and abstraction elevation to explain what the formal system verifies when full semantic preservation fails.
- The work releases a codebase and an interactive UI-based FormalScience system to support autoformalisation and theorem proving in scientific domains beyond physics.
Related Articles

Black Hat USA
AI Business
LLMs will be a commodity
Reddit r/artificial

Indian Developers: How to Build AI Side Income with $0 Capital in 2026
Dev.to

HubSpot Just Legitimized AEO: What It Means for Your Brand AI Visibility
Dev.to

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
Reddit r/LocalLLaMA