SAGE: Training-Free Semantic Evidence Composition for Edge-Cloud Inference under Hard Uplink Budgets
arXiv cs.LG / 4/22/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper argues that collaborative edge-cloud inference under strict per-request uplink bit budgets cannot rely on attention-based importance selection alone, since that approach has inherent limitations.
- It shows that substituting high-importance segments with lower-importance but complementary ones improves server accuracy, indicating that coverage diversity—not individual importance—drives performance.
- It further finds that spatially uniform evidence selection without content information can reach competitive accuracy at moderate budgets, highlighting the independent value of spatial coverage.
- Based on these insights, the authors propose SAGE (Semantic Attention-Guided Evidence), a training-free method that blends importance filtering with embedding-diversity sampling, achieving about 93% of server-side ceiling accuracy while sending less than half the evidence units on ImageNet-1K.
- The results demonstrate a substantial improvement over importance-only composition in offloaded accuracy versus transmitted data volume trade-offs.
Related Articles

Enterprise AI Governance Has Shifted from Policy to Execution
Dev.to

Rethinking CNN Models for Audio Classification
Dev.to
v0.20.0rc1
vLLM Releases

Build-in-Public: What I Learned Building an AI Image SaaS
Dev.to
I built my own event bus for a sustainability app — here's what I learned about agent automation using OpenClaw
Dev.to