FSDETR: Frequency-Spatial Feature Enhancement for Small Object Detection
arXiv cs.CV / 4/17/2026
📰 NewsModels & Research
Key Points
- Small object detection is difficult because downsampling degrades features, dense scenes cause mutual occlusion, and complex backgrounds interfere with recognition.
- The paper introduces FSDETR, a frequency–spatial feature enhancement framework built on the RT-DETR baseline, aiming to better preserve complementary structural information.
- FSDETR uses a Spatial Hierarchical Attention Block (SHAB) to capture both local details and global dependencies for stronger semantic representation.
- To address occlusion and dense-scene challenges, it adds a Deformable Attention-based Intra-scale Feature Interaction (DA-AIFI) that performs dynamic sampling of informative regions.
- It also proposes a Frequency-Spatial Feature Pyramid Network (FSFPN) with a Cross-domain Frequency-Spatial Block (CFSB) that combines frequency filtering with spatial edge extraction, achieving strong small-object results with only 14.7M parameters.
Related Articles
langchain-anthropic==1.4.1
LangChain Releases
Talk to Your Favorite Game Characters! Mantella Brings AI to Skyrim and Fallout 4 NPCs
Dev.to

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users
Dev.to
1.14.2
CrewAI Releases

Should my enterprise AI agent do that? NanoClaw and Vercel launch easier agentic policy setting and approval dialogs across 15 messaging apps
VentureBeat