Learning Where to Embed: Noise-Aware Positional Embedding for Query Retrieval in Small-Object Detection
arXiv cs.CV / 4/17/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper addresses inefficiency and background-induced “query noise” in transformer-based small-object detectors by refining where positional information is embedded.
- It proposes HELP (Heatmap-guided Embedding Learning Paradigm), which selectively preserves positional encodings in foreground-salient regions while suppressing background clutter.
- The core method, Heatmap-guided Positional Embedding (HPE), fuses positional and semantic information in both encoder and decoder, using a gradient-based mask filter to improve query retrieval.
- To handle sparse features in small, complex targets, the approach integrates Linear-Snake Convolution to enrich retrieval-relevant representations.
- Experiments show substantial model compression—cutting decoder layers from 8 to 3 and reducing parameters by 59.4%—while maintaining accuracy improvements with no extra training-time gradient cost at inference.
Related Articles
langchain-anthropic==1.4.1
LangChain Releases

Stop burning tokens on DOM noise: a Playwright MCP optimizer layer
Dev.to

Talk to Your Favorite Game Characters! Mantella Brings AI to Skyrim and Fallout 4 NPCs
Dev.to

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users
Dev.to

How Data Science Is Used to Predict User BeReducing Human Error in Compliance With AI Technology havior
Dev.to