AI Navigate

MDS-VQA: Model-Informed Data Selection for Video Quality Assessment

arXiv cs.CV / 3/13/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • MDS-VQA proposes a model-informed data selection mechanism to curate unlabeled videos that are both difficult for the base VQA model and diverse in content.
  • Difficulty is estimated by a failure predictor trained with a ranking objective, and diversity is measured using deep semantic video features, with a greedy procedure balancing the two under a constrained labeling budget.
  • Experiments across multiple VQA datasets show that using only 5% of labeled samples yields a meaningful boost, improving mean SRCC from 0.651 to 0.722 and achieving the top gMAD rank.
  • The work demonstrates the value of data-centric selection for active fine-tuning, highlighting a practical approach to improving adaptation and generalization in video quality assessment.

Abstract

Learning-based video quality assessment (VQA) has advanced rapidly, yet progress is increasingly constrained by a disconnect between model design and dataset curation. Model-centric approaches often iterate on fixed benchmarks, while data-centric efforts collect new human labels without systematically targeting the weaknesses of existing VQA models. Here, we describe MDS-VQA, a model-informed data selection mechanism for curating unlabeled videos that are both difficult for the base VQA model and diverse in content. Difficulty is estimated by a failure predictor trained with a ranking objective, and diversity is measured using deep semantic video features, with a greedy procedure balancing the two under a constrained labeling budget. Experiments across multiple VQA datasets and models demonstrate that MDS-VQA identifies diverse, challenging samples that are particularly informative for active fine-tuning. With only a 5% selected subset per target domain, the fine-tuned model improves mean SRCC from 0.651 to 0.722 and achieves the top gMAD rank, indicating strong adaptation and generalization.