Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing
arXiv cs.CV / 3/19/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Edit-As-Act introduces a goal-regressive planning framework for open-vocabulary 3D indoor scene editing, aiming to realize a user instruction with minimal, targeted changes.
- It defines EditLang, a PDDL-inspired action language with explicit preconditions and effects to encode geometric relations such as contact and collision.
- A language-driven planner proposes actions and a validator enforces goal-directedness, monotonicity, and physical feasibility to ensure interpretable and physically coherent edits.
- By separating reasoning from low-level generation, Edit-As-Act improves instruction fidelity, semantic consistency, and physical plausibility compared with prior open-vocabulary approaches.
- On the E2A-Bench benchmark (63 tasks, 9 environments), the method significantly outperforms prior approaches across all edit types and scene categories.
Related Articles

I let an AI agent loose on my codebase. It tried to read my .env file in 30 seconds.
Dev.to
Alex Chenglin Wu of DeepWisdom On The Future Of Artificial Intelligence | by Chad Silverstein | Authority Magazine | Mar, 2026
Reddit r/artificial
The Exit
Dev.to

Chip Smuggling Arrests, OpenClaw Is 'The Next ChatGPT,' and 81K People on AI
Dev.to
The Crucible
Dev.to