InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts
arXiv cs.RO / 4/29/2026
💬 OpinionSignals & Early TrendsModels & Research
Key Points
- The paper introduces InternScenes, a new large-scale indoor 3D scene dataset designed to support Embodied AI with more diversity and more realistic layouts than existing datasets.
- InternScenes reportedly contains about 40,000 diverse scenes, totaling 1.96M 3D objects across 15 scene types and 288 object classes, with an emphasis on preserving many small items and reducing unrealistic omissions.
- The dataset includes a full processing pipeline that produces real-to-sim replicas for real-world scans, adds interactive objects to improve interactivity, and uses physical simulation to resolve object collisions.
- The authors validate the dataset via two benchmarks—scene layout generation and point-goal navigation—and show that the more complex, realistic layouts create new challenges and enable scaling model training.
- The team plans to open-source the dataset, models, and benchmarks to support broader community research and development.
Related Articles

How I Use AI Agents to Maintain a Living Knowledge Base for My Team
Dev.to
IK_LLAMA now supports Qwen3.5 MTP Support :O
Reddit r/LocalLLaMA
OpenAI models, Codex, and Managed Agents come to AWS
Dev.to

Indian Developers: How to Build AI Side Income with $0 Capital in 2026
Dev.to

Automatic Error Recovery in AI Agent Networks
Dev.to