Physics-Aware Video Instance Removal Benchmark
arXiv cs.CV / 4/8/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Physics-Aware Video Instance Removal (PVIR), a new benchmark for removing target objects from videos while preserving background physical realism (e.g., reflections and illumination interactions).
- PVIR contains 95 high-quality videos with instance-accurate masks and removal prompts, split into Simple and Hard subsets to explicitly test complex physical side effects.
- The authors evaluate four methods (PISCO-Removal, UniVideo, DiffuEraser, and CoCoCo) using a decoupled human evaluation protocol that separates semantic, visual, and spatial failures.
- Results indicate PISCO-Removal and UniVideo reach state-of-the-art performance, while DiffuEraser often adds blurring artifacts and CoCoCo performs poorly on instruction following.
- The persistent performance gap on the Hard subset underscores that recovering physically triggered artifacts (like lingering shadows) remains an open challenge for current approaches.
Related Articles

Black Hat Asia
AI Business
Meta's latest model is as open as Zuckerberg's private school
The Register

AI fuels global trade growth as China-US flows shift, McKinsey finds
SCMP Tech
Why multi-agent AI security is broken (and the identity patterns that actually work)
Dev.to
BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.
Reddit r/artificial