PhysInOne: Visual Physics Learning and Reasoning in One Suite

arXiv cs.RO / 4/13/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • PhysInOne is introduced as a large-scale synthetic dataset designed to address the lack of physically grounded training data for AI systems.
  • The dataset contains 2 million videos spanning 153,810 dynamic 3D scenes across 71 mechanics/optics/fluid dynamics/magnetism phenomena, with extensive ground-truth annotations (3D geometry, semantics, motion, physical properties, and text).
  • The authors highlight multi-object interactions and complex backgrounds to move beyond simpler, smaller-scale physical datasets.
  • PhysInOne is evaluated on four application areas—physics-aware video generation, future frame prediction, physical property estimation, and motion transfer—and fine-tuning foundation models improves physical plausibility.
  • Experiments also reveal limitations in current models for complex dynamics and intrinsic property estimation, positioning PhysInOne as a new benchmark for physics-grounded world models.

Abstract

We present PhysInOne, a large-scale synthetic dataset addressing the critical scarcity of physically-grounded training data for AI systems. Unlike existing datasets limited to merely hundreds or thousands of examples, PhysInOne provides 2 million videos across 153,810 dynamic 3D scenes, covering 71 basic physical phenomena in mechanics, optics, fluid dynamics, and magnetism. Distinct from previous works, our scenes feature multiobject interactions against complex backgrounds, with comprehensive ground-truth annotations including 3D geometry, semantics, dynamic motion, physical properties, and text descriptions. We demonstrate PhysInOne's efficacy across four emerging applications: physics-aware video generation, long-/short-term future frame prediction, physical property estimation, and motion transfer. Experiments show that fine-tuning foundation models on PhysInOne significantly enhances physical plausibility, while also exposing critical gaps in modeling complex physical dynamics and estimating intrinsic properties. As the largest dataset of its kind, orders of magnitude beyond prior works, PhysInOne establishes a new benchmark for advancing physics-grounded world models in generation, simulation, and embodied AI.