AI Navigate

Representation Learning for Spatiotemporal Physical Systems

arXiv cs.LG / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that next-frame predictive emulators for spatiotemporal systems are computationally expensive and prone to error accumulation, motivating focus on downstream tasks.
  • It evaluates physics-grounded representations by their usefulness in downstream tasks like estimating governing physical parameters, rather than just predicting the next frame.
  • The findings show that general self-supervised methods can be competitive for these tasks, and latent-space approaches (joint embedding predictive architectures, or JEPAs) outperform pixel-level prediction objectives.
  • The authors provide code at https://github.com/helenqu/physical-representation-learning for reproducing and extending the results.

Abstract

Machine learning approaches to spatiotemporal physical systems have primarily focused on next-frame prediction, with the goal of learning an accurate emulator for the system's evolution in time. However, these emulators are computationally expensive to train and are subject to performance pitfalls, such as compounding errors during autoregressive rollout. In this work, we take a different perspective and look at scientific tasks further downstream of predicting the next frame, such as estimation of a system's governing physical parameters. Accuracy on these tasks offers a uniquely quantifiable glimpse into the physical relevance of the representations of these models. We evaluate the effectiveness of general-purpose self-supervised methods in learning physics-grounded representations that are useful for downstream scientific tasks. Surprisingly, we find that not all methods designed for physical modeling outperform generic self-supervised learning methods on these tasks, and methods that learn in the latent space (e.g., joint embedding predictive architectures, or JEPAs) outperform those optimizing pixel-level prediction objectives. Code is available at https://github.com/helenqu/physical-representation-learning.