Aes3D: Aesthetic Assessment in 3D Gaussian Splatting

arXiv cs.CV / 5/7/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces Aes3D, a systematic framework for evaluating the aesthetics of 3D neural rendering scenes using 3D Gaussian Splatting (3DGS), moving beyond existing metrics focused mainly on reconstruction fidelity and realism.
  • It addresses key gaps in prior work by proposing Aesthetic3D, the first dataset specifically annotated for aesthetic assessment of 3D scenes, alongside an annotation strategy tailored to aesthetic attributes.
  • The authors present Aes3DGSNet, a lightweight model that predicts scene-level aesthetic scores directly from 3D Gaussian primitives, avoiding the need to render multi-view images and thereby lowering computational and hardware costs.
  • Using aesthetics-supervised learning on multi-view 3DGS scene representations, Aes3DGSNet captures higher-level aesthetic signals and regresses aesthetic scores with strong performance.
  • The work establishes a new benchmark for 3D scene aesthetic assessment, with code and datasets planned for release in a future version.

Abstract

As 3D Gaussian Splatting (3DGS) gains attention in immersive media and digital content creation, assessing the aesthetics of 3D scenes becomes important in helping creators build more visually compelling 3D content. However, existing evaluation methods for 3D scenes primarily emphasize reconstruction fidelity and perceptual realism, largely overlooking higher-level aesthetic attributes such as composition, harmony, and visual appeal. This limitation comes from two key challenges: (1) the absence of general 3DGS datasets with aesthetic annotations, and (2) the intrinsic nature of 3DGS as a low-level primitive representation, which makes it difficult to capture high-level aesthetic features. To address these challenges, we propose Aes3D, the first systematic framework for assessing the aesthetics of 3D neural rendering scenes. Aes3D includes Aesthetic3D, the first dataset dedicated to 3D scene aesthetic assessment, built on our proposed annotation strategy for 3D scene aesthetics. In addition, we present Aes3DGSNet, a lightweight model that directly predicts scene-level aesthetic scores from 3DGS representations. Notably, our model operates solely on 3D Gaussian primitives, eliminating the need for rendering multi-view images and thus reducing computational cost and hardware requirements. Through aesthetics-supervised learning on multi-view 3DGS scene representations, Aes3DGSNet effectively captures high-level aesthetic cues and accurately regresses aesthetic scores. Experimental results demonstrate that our approach achieves strong performance while maintaining a lightweight design, establishing a new benchmark for 3D scene aesthetic assessment. Code and datasets will be made available in a future version.