SLAT-Phys: Fast Material Property Field Prediction from Structured 3D Latents

arXiv cs.CV / 3/26/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces SLAT-Phys, an end-to-end method that predicts spatially varying material property fields (Young’s modulus, density, and Poisson’s ratio) from a single RGB image.
  • Instead of performing explicit 3D reconstruction, it uses spatially organized latent features drawn from a pretrained 3D asset generation model as a geometry/semantic prior, followed by a lightweight neural decoder.
  • By relying on the latent representation’s coarse volumetric layout and semantic cues, SLAT-Phys achieves competitive continuous material-parameter estimation accuracy versus prior approaches.
  • The approach substantially reduces compute and preprocessing needs by avoiding reconstruction and voxelization, reporting ~9.9 seconds per object on an NVIDIA RTX A5000 and a stated 120× speedup.
  • The method is positioned as enabling faster material property estimation for physics-based simulation, robotics, and digital twin generation.

Abstract

Estimating the material property field of 3D assets is critical for physics-based simulation, robotics, and digital twin generation. Existing vision-based approaches are either too expensive and slow or rely on 3D information. We present SLAT-Phys, an end-to-end method that predicts spatially varying material property fields of 3D assets directly from a single RGB image without explicit 3D reconstruction. Our approach leverages spatially organised latent features from a pretrained 3D asset generation model that encodes rich geometry and semantic prior, and trains a lightweight neural decoder to estimate Young's modulus, density, and Poisson's ratio. The coarse volumetric layout and semantic cues of the latent representation about object geometry and appearance enable accurate material estimation. Our experiments demonstrate that our method provides competitive accuracy in predicting continuous material parameters when compared against prior approaches, while significantly reducing computation time. In particular, SLAT-Phys requires only 9.9 seconds per object on an NVIDIA RTXA5000 GPU and avoids reconstruction and voxelization preprocessing. This results in 120x speedup compared to prior methods and enables faster material property estimation from a single image.