Orchestrating Spatial Semantics via a Zone-Graph Paradigm for Intricate Indoor Scene Generation

arXiv cs.RO / 5/5/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • The paper introduces ZoneMaestro, a framework for autonomous 3D indoor scene generation that addresses failures in non-convex rooms with tightly coupled spatial constraints.
  • Instead of object-centric synthesis, ZoneMaestro uses a Zone-Graph orchestration approach to convert high-level semantic intent into functional zones and explicit topological constraints.
  • The authors release Zone-Scene-10K, a large dataset with zone-graph annotations, and propose an Alternating Alignment Strategy combining internal reasoning with Zone-Aware Group Relative Policy Optimization (Z-GRPO).
  • They also define the Intricate Spatial Orchestration task and release SCALE, a stress-test benchmark for irregular indoor scenes with complex, dense spatial relations.
  • Experiments indicate ZoneMaestro improves both structural coherence and intent adherence, resolving a reported density-versus-safety tradeoff compared with existing state-of-the-art baselines.

Abstract

Autonomous 3D indoor scene synthesis breaks down in non-convex rooms with tightly coupled spatial constraints. Data-driven generators lack topological priors for long-horizon planning, while iterative agents fragment semantics and become geometrically brittle. We present ZoneMaestro, a unified framework that shifts the paradigm from object-centric synthesis to Zone-Graph Orchestration. By internalizing a novel zone-based logic, ZoneMaestro translates high-level semantic intent into functional zones and topological constraints, enabling robust adaptation to diverse architectural forms. To support this, we construct Zone-Scene-10K, a large-scale dataset enriched with explicit Zone-Graph annotations. We further introduce an Alternating Alignment Strategy that cycles between reasoning internalization and Zone-Aware Group Relative Policy Optimization (Z-GRPO), effectively reconciling the tension between semantic richness and geometric validity without relying on external physics engines. To rigorously evaluate spatial intelligence beyond convex primitives, we formally define the task of Intricate Spatial Orchestration and release SCALE, a stress-test benchmark for irregular indoor scenarios with complex, dense spatial relations. Extensive experiments demonstrate that ZoneMaestro resolves the density-safety dichotomy, significantly outperforming state-of-the-art baselines in both structural coherence and intent adherence.

Orchestrating Spatial Semantics via a Zone-Graph Paradigm for Intricate Indoor Scene Generation | AI Navigate