Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing
arXiv cs.CV / 3/19/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Edit-As-Act introduces a goal-regressive planning framework for open-vocabulary 3D indoor scene editing, aiming to realize a user instruction with minimal, targeted changes.
- It defines EditLang, a PDDL-inspired action language with explicit preconditions and effects to encode geometric relations such as contact and collision.
- A language-driven planner proposes actions and a validator enforces goal-directedness, monotonicity, and physical feasibility to ensure interpretable and physically coherent edits.
- By separating reasoning from low-level generation, Edit-As-Act improves instruction fidelity, semantic consistency, and physical plausibility compared with prior open-vocabulary approaches.
- On the E2A-Bench benchmark (63 tasks, 9 environments), the method significantly outperforms prior approaches across all edit types and scene categories.
Related Articles
The programming passion is melting
Dev.to
Maximize Developer Revenue with Monetzly's Innovative API for AI Conversations
Dev.to
Co-Activation Pattern Detection for Prompt Injection: A Mechanistic Interpretability Approach Using Sparse Autoencoders
Reddit r/LocalLLaMA

How to Train Custom Language Models: Fine-Tuning vs Training From Scratch (2026)
Dev.to

KoboldCpp 1.110 - 3 YR Anniversary Edition, native music gen, qwen3tts voice cloning and more
Reddit r/LocalLLaMA