PhysMem: Scaling Test-time Physical Memory for Robot Manipulation
arXiv cs.RO / 4/22/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces PhysMem, a memory framework that helps VLM-based robot planners learn object-specific physical behavior during test time without updating model parameters.
- PhysMem stores interaction experiences, generates candidate physical hypotheses, and validates them through targeted experiments before using the knowledge for future planning.
- The key design principle is “verification before application,” which reduces over-reliance on previously retrieved experiences when friction, stability, or other conditions shift.
- Experiments on three real-world manipulation tasks and multiple simulation benchmarks across four VLM backbones show large gains, including 76% success on a brick insertion task versus 23% for direct experience retrieval.
- Real-robot deployments demonstrate consistent improvement over 30-minute deployment sessions, indicating practical effectiveness of the test-time interaction loop.


