PhysInOne: Visual Physics Learning and Reasoning in One Suite

arXiv cs.RO / 4/13/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

PhysInOne is introduced as a large-scale synthetic dataset designed to address the lack of physically grounded training data for AI systems.
The dataset contains 2 million videos spanning 153,810 dynamic 3D scenes across 71 mechanics/optics/fluid dynamics/magnetism phenomena, with extensive ground-truth annotations (3D geometry, semantics, motion, physical properties, and text).
The authors highlight multi-object interactions and complex backgrounds to move beyond simpler, smaller-scale physical datasets.
PhysInOne is evaluated on four application areas—physics-aware video generation, future frame prediction, physical property estimation, and motion transfer—and fine-tuning foundation models improves physical plausibility.
Experiments also reveal limitations in current models for complex dynamics and intrinsic property estimation, positioning PhysInOne as a new benchmark for physics-grounded world models.

Abstract

We present PhysInOne, a large-scale synthetic dataset addressing the critical scarcity of physically-grounded training data for AI systems. Unlike existing datasets limited to merely hundreds or thousands of examples, PhysInOne provides 2 million videos across 153,810 dynamic 3D scenes, covering 71 basic physical phenomena in mechanics, optics, fluid dynamics, and magnetism. Distinct from previous works, our scenes feature multiobject interactions against complex backgrounds, with comprehensive ground-truth annotations including 3D geometry, semantics, dynamic motion, physical properties, and text descriptions. We demonstrate PhysInOne's efficacy across four emerging applications: physics-aware video generation, long-/short-term future frame prediction, physical property estimation, and motion transfer. Experiments show that fine-tuning foundation models on PhysInOne significantly enhances physical plausibility, while also exposing critical gaps in modeling complex physical dynamics and estimating intrinsic properties. As the largest dataset of its kind, orders of magnitude beyond prior works, PhysInOne establishes a new benchmark for advancing physics-grounded world models in generation, simulation, and embodied AI.

Black Hat Asia

AI Business

I built the missing piece of the MCP ecosystem

Dev.to

When Agents Go Wrong: AI Accountability and the Payment Audit Trail

Dev.to

Google Gemma 4 Review 2026: The Open Model That Runs Locally and Beats Closed APIs

Dev.to

OpenClaw Deep Dive Guide: Self-Host Your Own AI Agent on Any VPS (2026)

Dev.to

PhysInOne: Visual Physics Learning and Reasoning in One Suite

Key Points

Abstract

Related Articles

Black Hat Asia

I built the missing piece of the MCP ecosystem

When Agents Go Wrong: AI Accountability and the Payment Audit Trail

Google Gemma 4 Review 2026: The Open Model That Runs Locally and Beats Closed APIs

OpenClaw Deep Dive Guide: Self-Host Your Own AI Agent on Any VPS (2026)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer