PhysVideo: Physically Plausible Video Generation with Cross-View Geometry Guidance
arXiv cs.CV / 3/20/2026
📰 NewsModels & Research
Key Points
- The paper introduces PhysVideo, a two-stage framework for physically plausible video generation, with Phys4View for physics-aware foreground video generation and VideoSyn for background-aware synthesis.
- Phys4View uses physics-aware attention, geometry-enhanced cross-view attention, and temporal attention to better capture 3D dynamics from multiple orthogonal viewpoints.
- The authors build PhysMV, a dataset of 40,000 scenes (four orthogonal viewpoints each, totaling 160,000 sequences) to train and evaluate physics-informed video generation.
- Experiments show PhysVideo improves physical realism and spatio-temporal coherence compared with existing video generation methods, enabling more controllable video synthesis in context with background dynamics.
Related Articles

My AI Does Not Have a Clock
Dev.to
How to settle on a coding LLM ? What parameters to watch out for ?
Reddit r/LocalLLaMA

Andrej Karpathy's autonomous AI research agent ran 700 experiments in 2 days and gave a glimpse of where AI is heading
Reddit r/artificial
Data Augmentation Using GANs
Dev.to
Zero Shot Deformation Reconstruction for Soft Robots Using a Flexible Sensor Array and Cage Based 3D Gaussian Modeling
arXiv cs.RO