Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search
arXiv cs.AI / 3/23/2026
💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- EvoNAS provides an efficient distributed approach to multi-objective evolutionary architecture search, reducing candidate evaluation cost while preserving Pareto-optimal accuracy-efficiency trade-offs.
- It uses a hybrid supernet that combines Vision State Space (VSS) blocks with Vision Transformer (ViT) modules and introduces Cross-Architecture Dual-Domain Knowledge Distillation (CA-DDKD) to boost shared representational capacity and ranking consistency.
- A Distributed Multi-Model Parallel Evaluation (DMMPE) framework with GPU pooling and asynchronous scheduling further speeds up large-scale validation, achieving over 70% efficiency gains versus traditional data-parallel methods.
- Experiments on COCO, ADE20K, KITTI, and NYU-Depth v2 show EvoNets achieve Pareto-optimal trade-offs with lower inference latency and higher throughput under fixed budgets, while maintaining strong generalization on downstream tasks like novel view synthesis.
- The work provides code at GitHub, enabling replication and adoption of EvoNets in resource-constrained deployment scenarios.