PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing
arXiv cs.CV / 4/9/2026
Key Points
- The paper introduces PhyEdit, a physically grounded image editing framework for more precise real-world object manipulation. It targets the scaling and positioning errors that arise when editing methods lack a mechanism for modeling 3D geometry.
- PhyEdit improves manipulation accuracy with a plug-and-play 3D prior that uses explicit geometric simulation as 3D-aware guidance, combined with joint 2D–3D supervision.
- The authors release RealManip-10K, a real-world dataset containing paired images and depth annotations to support 3D-aware object manipulation research and evaluation.
- They also propose ManipEval, a benchmark with multi-dimensional metrics to assess 3D spatial control and geometric consistency.
- Experiments indicate PhyEdit outperforms prior approaches, including strong closed-source models, on both 3D geometric accuracy and manipulation consistency.
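
To make the scaling and positioning problem in the key points concrete, the following is a minimal sketch of standard pinhole-camera geometry. It is not PhyEdit's implementation. The `Pinhole` class, the `apparent_scale` helper, and the intrinsics values are assumptions chosen for illustration. It shows why moving an object in depth must change both its on-screen size and its pixel position, which an editor without 3D reasoning can easily get wrong.

```python
from dataclasses import dataclass


@dataclass
class Pinhole:
    """Ideal pinhole camera (no lens distortion); intrinsics in pixels."""
    fx: float
    fy: float
    cx: float
    cy: float

    def project(self, x: float, y: float, z: float) -> tuple[float, float]:
        """Project a camera-frame 3D point (metres) to pixel coordinates."""
        if z <= 0:
            raise ValueError("point must be in front of the camera (z > 0)")
        return (self.fx * x / z + self.cx, self.fy * y / z + self.cy)


def apparent_scale(z_old: float, z_new: float) -> float:
    """Factor by which an object's image size changes when its depth changes.

    Under a pinhole model, image size is inversely proportional to depth.
    """
    if z_old <= 0 or z_new <= 0:
        raise ValueError("depths must be positive")
    return z_old / z_new


# Hypothetical intrinsics for a 640x480 image.
cam = Pinhole(fx=500.0, fy=500.0, cx=320.0, cy=240.0)

# Push an object from 2 m to 4 m along the viewing direction.
u_near, v_near = cam.project(0.2, 0.0, 2.0)
u_far, v_far = cam.project(0.2, 0.0, 4.0)
scale = apparent_scale(2.0, 4.0)

print(f"near pixel: ({u_near:.1f}, {v_near:.1f})")
print(f"far pixel:  ({u_far:.1f}, {v_far:.1f})")
print(f"size scale: {scale:.2f}")
```

In this sketch, doubling the depth halves the object's apparent size and pulls its projection toward the principal point. A depth annotation per image, like those the dataset is described as providing, is the kind of signal needed to check such relationships.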