A3-FPN: Asymptotic Content-Aware Pyramid Attention Network for Dense Visual Prediction
arXiv cs.CV / 4/14/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces A3-FPN, an Asymptotic Content-Aware Pyramid Attention Network designed to improve dense visual prediction by better capturing discriminative multi-scale features, especially for small objects.
- A3-FPN uses a horizontally spread column network with an asymptotically disentangled framework to enable asymptotically global feature interaction and disentangle each pyramid level from hierarchical representations.
- For feature fusion, it introduces content-aware attention that collects supplementary adjacent-level context to compute position-wise offsets/weights for context-aware resampling and applies deep context reweighting to enhance intra-category similarity.
- For feature reassembly, it strengthens intra-scale discriminative learning and reassembles redundant features using information content and spatial variation of feature maps.
- Experiments on MS COCO, VisDrone2019-DET, and Cityscapes show that A3-FPN can be plugged into both CNN and Transformer-based SOTA architectures, reporting strong results such as 49.6 mask AP with OneFormer + Swin-L and 85.6 mIoU on Cityscapes.



