Geometry-Guided Self-Supervision for Ultra-Fine-Grained Recognition with Limited Data

arXiv cs.CV / 4/22/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a new self-supervised framework, the Geometric Attribute Exploration Network (GAEor), aimed at ultra-fine-grained visual recognition when labeled data is limited.
  • GAEor focuses on extracting intrinsic geometric cues from highly similar objects by generating geometric attributes tied to geometric patterns in the data (e.g., leaf vein structures).
  • It claims that each category has distinct geometric descriptors that can remain informative even when visual appearance varies very little, addressing a gap in prior research.
  • The method amplifies geometry-relevant details through backbone-driven visual feedback and encodes them using relative polar coordinates in the final representation.
  • Experiments report state-of-the-art performance on five widely used Ultra-FGVC benchmarks, indicating strong practical impact for data-limited settings.

Abstract

This paper investigates the intrinsic geometrical features of highly similar objects and introduces a general self-supervised framework called the Geometric Attribute Exploration Network (GAEor), which is designed to address the ultra-fine-grained visual categorization (Ultra-FGVC) task in data-limited scenarios. Unlike prior work that often captures subtle yet critical distinctions, GAEor generates geometric attributes as novel alternative recognition cues. These attributes are determined by various details within the object, aligned with its geometric patterns, such as the intricate vein structures in soybean leaves. Crucially, each category exhibits distinct geometric descriptors that serve as powerful cues, even among objects with minimal visual variation -- a factor largely overlooked in recent research. GAEor discovers these geometric attributes by first amplifying geometry-relevant details via visual feedback from a backbone network, then embedding the relative polar coordinates of these details into the final representation. Extensive experiments demonstrate that GAEor significantly sets new state-of-the-art records in five widely-used Ultra-FGVC benchmarks.