Data-driven Progressive Discovery of Physical Laws

arXiv cs.LG / 3/17/2026

Key Points

  • The paper introduces Chain of Symbolic Regression (CoSR), a framework that models the discovery of physical laws as a chain of symbolic knowledge units, progressively combining them to yield interpretable laws from data.
  • It argues that conventional end-to-end symbolic regression often produces lengthy, physically meaningless expressions and poor generalization because it bypasses the progressive discovery path.
  • CoSR reproduces the historical progression from Kepler's third law to the law of universal gravitation. It is also applied to turbulent Rayleigh-Bénard convection, viscous flow in a circular pipe, and laser–metal interaction, where it improves on classical scaling theories.
  • The approach also shows potential for discovering new knowledge in complex engineering problems, such as aerodynamic coefficient scaling across different aircraft.

Abstract

Symbolic regression is a powerful tool for knowledge discovery, enabling the extraction of interpretable mathematical expressions directly from data. However, conventional symbolic discovery typically follows an end-to-end, "one-step" process, which often generates lengthy and physically meaningless expressions when dealing with real physical systems, leading to poor model generalization. This limitation fundamentally stems from its deviation from the basic path of scientific discovery: physical laws do not exist in a single form but follow a hierarchical and progressive pattern from simplicity to complexity. Motivated by this principle, we propose Chain of Symbolic Regression (CoSR), a novel framework that models the discovery of physical laws as a chain of symbolic knowledge. This knowledge chain is formed by progressively combining multiple knowledge units with clear physical meanings along a specific logic, ultimately enabling the precise discovery of the underlying physical laws from data. CoSR fully recapitulates the progressive discovery path from Kepler's third law to the law of universal gravitation in classical mechanics, and is applied to three types of problems: turbulent Rayleigh-Bénard convection, viscous flows in a circular pipe, and laser-metal interaction, demonstrating its ability to improve classical scaling theories. Finally, CoSR showcases its capability to discover new knowledge in the complex engineering problem of aerodynamic coefficient scaling across different aircraft.
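
To make the idea of a progressive chain concrete, the Kepler-to-gravitation path mentioned above can be sketched in two small steps. This is not the CoSR algorithm, whose details the abstract does not give. It is a minimal illustration in which an ordinary log-log power-law fit stands in for symbolic regression. The helper name `fit_power_law`, the choice of six planets, and the circular-orbit approximation are assumptions made for this sketch.

```python
import numpy as np

# Semi-major axis (AU) and orbital period (years) for Mercury, Venus, Earth,
# Mars, Jupiter, and Saturn (standard textbook values).
a = np.array([0.387, 0.723, 1.000, 1.524, 5.203, 9.537])
T = np.array([0.2408, 0.6152, 1.0000, 1.8808, 11.862, 29.457])


def fit_power_law(x, y):
    """Fit y = c * x**p by least squares in log-log space; return (c, p).

    Hypothetical helper for this sketch: a stand-in for a real
    symbolic-regression step, restricted to a single power-law form.
    """
    p, log_c = np.polyfit(np.log(x), np.log(y), 1)
    return np.exp(log_c), p


# Link 1: a simple knowledge unit relating period to orbit size
# (Kepler's third law, T proportional to a**1.5).
c1, p1 = fit_power_law(a, T)

# Link 2: combine link 1 with a second physical unit, the centripetal
# acceleration of a circular orbit, acc = 4*pi^2 * a / T^2 (an assumption:
# real orbits are ellipses), then fit how acc scales with distance.
acc = 4 * np.pi**2 * a / T**2
c2, p2 = fit_power_law(a, acc)

print(f"T   ~ a^{p1:.3f}")
print(f"acc ~ a^{p2:.3f}")
```

The first fit gives an exponent close to 1.5, and the second gives an exponent close to -2, the inverse-square dependence at the heart of universal gravitation. Each link is a short, physically meaningful expression, which is the contrast the abstract draws with one-step regression over the raw data.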