AmaraSpatial-10K: A Spatially and Semantically Aligned 3D Dataset for Spatial Computing and Embodied AI
arXiv cs.CV / 4/28/2026
📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research
Key Points
- Web-scale 3D asset datasets are common but often not deployment-ready due to issues like incorrect metric scale, misaligned axes, brittle geometry, and relighting-incompatible textures.
- AmaraSpatial-10K provides 10,000+ synthetic, deployment-oriented 3D assets packaged as metric-scaled, semantically anchored .glb files with separated PBR maps, convex collision hulls, reference images, and rich multi-sentence text metadata.
- The dataset uses a unified spatial convention and covers categories including indoor objects, vehicles, architecture, creatures, and props for spatial computing and embodied AI use cases.
- An accompanying evaluation suite introduces metrics such as Scale Plausibility Score (with an LLM-as-Judge protocol), LLM Concept Density, anchor-error, and cross-modal CLIP coherence to audit 3D asset banks.
- Compared with Objaverse-derived assets, AmaraSpatial-10K significantly improves text-based retrieval (CLIP Recall@5: 0.612 vs 0.181; 3.4x improvement, with median rank dropping from 267 to 3), and is publicly available on Hugging Face.
Related Articles

Black Hat USA
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
How I Automate My Dev Workflow with Claude Code Hooks
Dev.to

Same Agent, Different Risk | How Microsoft 365 Copilot Grounding Changes the Security Model | Rahsi Framework™
Dev.to

Claude Haiku for Low-Cost AI Inference: Patterns from a Horse Racing Prediction System
Dev.to