GameScope: A Multi-Attribute, Multi-Codec Benchmark Dataset for Gaming Video Quality Assessment

arXiv cs.CV / 5/5/2026

📰 NewsSignals & Early TrendsTools & Practical UsageModels & Research

Key Points

  • GameScope is a new benchmark dataset for gaming video quality assessment designed to provide consistent evaluation across multiple streaming codecs used by platforms like YouTube and Twitch.
  • The dataset combines user-generated content (UGC) and professional-generated content (PGC), offering extensive visual diversity to reflect real-world gaming footage.
  • It includes 4,048 video samples spanning the most common codecs (H.264, H.265, and AV1), with each sample labeled by an average of 37 mean opinion score (MOS) ratings.
  • Beyond overall quality, GameScope also collects coarse-grained visual quality attributes, helping models learn which perceptual factors drive perceived quality.
  • The authors benchmark leading video quality assessment methods on the dataset and report that a vision-language model achieves top performance, and they make the dataset publicly available.

Abstract

The development of video game streaming has grown rapidly, with major platforms such as YouTube and Twitch using different codecs. To support quality assessment models that work consistently across any codec, it is necessary to have access to large, diverse subjective gaming quality datasets. Currently, there are only a few available, each having limitations. To address this gap, we present the largest gaming video quality dataset to date, incorporating both user-generated content (UGC) and professional-generated content (PGC) with extensive visual diversity. Our dataset covers the most widely used codecs - H.264, H.265, and AV1 - and consists of 4,048 video samples, each annotated by an average of 37 mean opinion score (MOS) ratings. In addition to overall quality scores, we collect coarse-grained quality attributes, enabling a better understanding of perceptual factors. We study the performance of leading video quality assessment methods on this dataset, including a vision language model that outperforms all the benchmarks. To the best of our knowledge, this is the first dataset that comprehensively addresses gaming video quality assessment across multiple codecs and content types with quality attributes. Our dataset is publicly available at https://rajeshsureddi.github.io/GameScope/.