EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation
arXiv cs.CV / 3/20/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper introduces EdgeCrafter, a unified compact ViT framework for edge dense prediction to address the performance-gap of small-scale ViTs on resource-constrained devices.
- It centers on ECDet, a detection model built from a distilled compact backbone and an edge-friendly encoder-decoder design to enable efficient object detection, instance segmentation, and pose estimation.
- On COCO, ECDet-S achieves 51.7 AP with fewer than 10M parameters using only COCO annotations, and ECInsSeg reaches performance comparable to RF-DETR with substantially fewer parameters; ECPose-X attains 74.8 AP, outperforming YOLO26Pose-X despite less extensive pretraining.
- The results imply that compact ViTs paired with task-specific distillation and edge-aware design can be a practical and competitive option for edge dense prediction, with code released for community use.
Related Articles

ベテランの若手育成負担を減らせ、PLC制御の「ラダー図」をAIで生成
日経XTECH

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

Windsurf’s New Pricing Explained: Simpler AI Coding or Hidden Trade-Offs?
Dev.to

Building Production RAG Systems with PostgreSQL: Complete Implementation Guide
Dev.to