Formula-One Prompting: Equation-First Reasoning For Applied Mathematics

arXiv cs.CL / 3/30/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Formula-One Prompting (F-1), a two-phase, single-call prompting method that first formulates governing mathematical equations from a problem description before solving.
  • After equation formalization, the model selects an appropriate solution strategy (e.g., CoT, PoT, or direct computation) based on the structure of the equations rather than using explicit routing rules.
  • Experiments across five LLMs and four benchmarks show F-1 outperforms Chain-of-Thought (CoT) by +5.76% and Program-of-Thought (PoT) by +8.42% on average.
  • F-1 also achieves 53 wins out of 60 benchmark-model comparisons (88.3%), with the biggest improvements in applied domains such as FinanceMath (+13.30% over CoT).
  • The authors’ per-problem analysis indicates that the main performance driver is the equation formalization stage itself, including stronger gains for physics-style questions within OlympiadBench.

Abstract

LLMs encode vast mathematical knowledge including governing equations from pretraining on equation-rich corpora, yet existing prompting methods, including Chain-of-Thought (CoT) and Program-of-Thought (PoT), do not explicitly elicit equation formulation as a reasoning stage. We propose Formula-One Prompting (F-1), a single-call, two-phase approach that fills this equation gap by using mathematical equations as an intermediate representation before solving through natural flow reasoning. F-1 first formulates governing equations from problem descriptions; the model then naturally selects a solving strategy among CoT, PoT, or direct computation based on the formalized equation structure, without explicit routing rules. Results across five models and four benchmarks show F-1 outperforms CoT by +5.76% and PoT by +8.42% on average, winning 53 out of 60 benchmark-model comparisons (88.3%). Gains are largest in applied domains: +13.30% on FinanceMath over CoT, and within OlympiadBench, larger gains on physics (+2.55%) than pure math (+0.44%). Per-problem analysis confirms equation formalization is the primary driver.