Formula-One Prompting: Equation-First Reasoning For Applied Mathematics

arXiv cs.CL / 3/30/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces Formula-One Prompting (F-1), a two-phase, single-call prompting method that first formulates governing mathematical equations from a problem description before solving.
After equation formalization, the model selects an appropriate solution strategy (e.g., CoT, PoT, or direct computation) based on the structure of the equations rather than using explicit routing rules.
Experiments across five LLMs and four benchmarks show F-1 outperforms Chain-of-Thought (CoT) by +5.76% and Program-of-Thought (PoT) by +8.42% on average.
F-1 also achieves 53 wins out of 60 benchmark-model comparisons (88.3%), with the biggest improvements in applied domains such as FinanceMath (+13.30% over CoT).
The authors’ per-problem analysis indicates that the main performance driver is the equation formalization stage itself, including stronger gains for physics-style questions within OlympiadBench.

Abstract

LLMs encode vast mathematical knowledge including governing equations from pretraining on equation-rich corpora, yet existing prompting methods, including Chain-of-Thought (CoT) and Program-of-Thought (PoT), do not explicitly elicit equation formulation as a reasoning stage. We propose Formula-One Prompting (F-1), a single-call, two-phase approach that fills this equation gap by using mathematical equations as an intermediate representation before solving through natural flow reasoning. F-1 first formulates governing equations from problem descriptions; the model then naturally selects a solving strategy among CoT, PoT, or direct computation based on the formalized equation structure, without explicit routing rules. Results across five models and four benchmarks show F-1 outperforms CoT by +5.76% and PoT by +8.42% on average, winning 53 out of 60 benchmark-model comparisons (88.3%). Gains are largest in applied domains: +13.30% on FinanceMath over CoT, and within OlympiadBench, larger gains on physics (+2.55%) than pure math (+0.44%). Per-problem analysis confirms equation formalization is the primary driver.

Black Hat Asia

AI Business

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Simon Willison's Blog

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

Dev.to

I missed the "fun" part in software development

Dev.to

The Billion Dollar Tax on AI Agents

Dev.to

Formula-One Prompting: Equation-First Reasoning For Applied Mathematics

Key Points

Abstract

Related Articles

Black Hat Asia

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

I missed the "fun" part in software development

The Billion Dollar Tax on AI Agents

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer