H-SemiS: Hierarchical Fusion of Semi and Self-Supervised Learning for Knee Osteoarthritis Severity Grading

arXiv cs.CV / 4/28/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces H-SemiS, a hierarchical semi-/self-supervised learning framework to grade knee osteoarthritis severity from knee X-rays using limited labeled data.
  • By reframing multi-class severity grading into a sequence of binary sub-tasks within a teacher–student architecture, H-SemiS reduces sensitivity to class imbalance and noisy clinical annotations.
  • It improves representation learning from unlabeled images via an adversarial self-supervised reconstruction module that targets robust anatomical features.
  • A teacher–student setup with quantum-inspired feature mixing is used to strengthen decision boundaries between adjacent severity grades, particularly under noisy pseudo-labels.
  • Experiments on multiple multi-class and binary datasets show H-SemiS outperforms several baselines and state-of-the-art methods, and the code is available on GitHub.

Abstract

Knee osteoarthritis (KOA) is a degenerative joint disease that can lead to chronic pain, reduced mobility, and long-term disability. Automated severity grading from knee radiographs can support early assessment, but current methods heavily depend on large labeled datasets and remain sensitive to class imbalance, noisy samples, and variability in clinical annotations. To alleviate these limitations, we propose a Hierarchical fusion of Semi-Supervised framework with Self-Supervision (H-SemiS) for KOA severity grading in knee X-ray samples using limited annotated data. Rather than treating severity grading as a flat multi-class problem, H-SemiS decomposes the task into a sequence of binary sub-tasks within a semi-supervised teacher-student architecture, directly mitigating the impact of class imbalance. To further enhance feature learning from unlabeled data, the framework integrates an adversarial self-supervised reconstruction module that encourages the network to capture robust anatomical structures. In parallel, a teacher-student design with quantum-inspired feature mixing improves discrimination boundaries between adjacent grades when pseudo-labels are noisy. We comprehensively evaluate H-SemiS on two challenging multi-class datasets and assess its generalizability on two binary-class datasets. Our experimental results demonstrate the superiority of the proposed H-SemiS framework across multiple evaluation metrics, consistently outperforming several competing baselines and state-of-the-art methods. The code is publicly available at https://github.com/chandravardhan-singh-raghaw/H-SemiS.