AeroScene: Progressive Scene Synthesis for Aerial Robotics

arXiv cs.RO / 3/25/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces AeroScene, a hierarchical diffusion model designed for progressive 3D scene synthesis specifically aimed at aerial robotics use cases like navigation, landing, and perching.
  • AeroScene uses hierarchy-aware tokenization and multi-branch feature extraction to jointly model global scene layouts and local details while enforcing physical plausibility and semantic consistency.
  • Experiments on a newly collected dataset and a public benchmark indicate AeroScene significantly outperforms prior approaches to 3D scene generation.
  • The authors generated a large-scale set of 1,000+ physics-ready, high-fidelity 3D scenes that can be directly imported into NVIDIA Isaac Sim.
  • They further validate the practical value of the synthesized environments by demonstrating improved performance on downstream drone navigation tasks, and provide code and dataset publicly.

Abstract

Generative models have shown substantial impact across multiple domains, their potential for scene synthesis remains underexplored in robotics. This gap is more evident in drone simulators, where simulation environments still rely heavily on manual efforts, which are time-consuming to create and difficult to scale. In this work, we introduce AeroScene, a hierarchical diffusion model for progressive 3D scene synthesis. Our approach leverages hierarchy-aware tokenization and multi-branch feature extraction to reason across both global layouts and local details, ensuring physical plausibility and semantic consistency. This makes AeroScene particularly suited for generating realistic scenes for aerial robotics tasks such as navigation, landing, and perching. We demonstrate its effectiveness through extensive experiments on our newly collected dataset and a public benchmark, showing that AeroScene significantly outperforms prior methods. Furthermore, we use AeroScene to generate a large-scale dataset of over 1,000 physics-ready, high fidelity 3D scenes that can be directly integrated into NVIDIA Isaac Sim. Finally, we illustrate the utility of these generated environments on downstream drone navigation tasks. Our code and dataset are publicly available at aioz-ai.github.io/AeroScene/