Representation Learning for Spatiotemporal Physical Systems
arXiv cs.LG / 3/16/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that next-frame predictive emulators for spatiotemporal systems are computationally expensive and prone to error accumulation, motivating focus on downstream tasks.
- It evaluates physics-grounded representations by their usefulness in downstream tasks like estimating governing physical parameters, rather than just predicting the next frame.
- The findings show that general self-supervised methods can be competitive for these tasks, and latent-space approaches (joint embedding predictive architectures, or JEPAs) outperform pixel-level prediction objectives.
- The authors provide code at https://github.com/helenqu/physical-representation-learning for reproducing and extending the results.




