Uncertainty-Aware Pedestrian Attribute Recognition via Evidential Deep Learning

arXiv cs.CV / 4/30/2026

📰 NewsModels & Research

Key Points

  • The paper introduces UAPAR, a pedestrian attribute recognition framework that explicitly estimates uncertainty to improve robustness on low-quality or complex real-world inputs.
  • UAPAR is built by integrating Evidential Deep Learning (EDL) into a CLIP-based architecture, using a region-aware evidence reasoning module to capture fine-grained local features for attribute-wise epistemic uncertainty.
  • It adds an uncertainty-guided dual-stage curriculum learning strategy to reduce the negative impact of severe label noise during training.
  • Experiments on PA100K, PETA, RAPv1, and RAPv2 show that UAPAR delivers competitive or better recognition performance, and qualitative results indicate its uncertainty estimates correlate with difficult or erroneous samples.
  • The work claims to be the first EDL-based uncertainty-aware approach for pedestrian attribute recognition, differentiating it from prior deterministic methods that cannot judge prediction reliability.

Abstract

We propose UAPAR, an Uncertainty-Aware Pedestrian Attribute Recognition framework. To the best of our knowledge, this is the first EDL-based uncertainty-aware framework for pedestrian attribute recognition (PAR). Unlike conventional deterministic methods, which fail to assess prediction reliability on low-quality samples, UAPAR effectively identifies unreliable predictions and thus enhances system robustness in complex real-world scenarios. To achieve this, UAPAR incorporates Evidential Deep Learning (EDL) into a CLIP-based architecture. Specifically, a Region-Aware Evidence Reasoning module employs cross-attention and spatial prior masks to capture fine-grained local features, which are further processed by an evidence head to estimate attribute-wise epistemic uncertainty. To further enhance training robustness, we develop an uncertainty-guided dual-stage curriculum learning strategy to alleviate the adverse effects of severe label noise during training. Extensive experiments on the PA100K, PETA, RAPv1, and RAPv2 datasets demonstrate that UAPAR achieves competitive or superior performance. Furthermore, qualitative results confirm that the proposed framework generates uncertainty estimates that are predictive of challenging or erroneous samples.