A Computational Model of Message Sensation Value in Short Video Multimodal Features that Predicts Sensory and Behavioral Engagement

arXiv cs.CV / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study introduces a computational model based on Message Sensation Value (MSV) to analyze how combined multimodal features in sensational short videos influence viewer engagement.
  • Using human evaluations of 1,200 short videos, the model predicts sensory and behavioral engagement more effectively than treating individual features in isolation.
  • The approach was validated on two additional unseen datasets spanning three platforms, totaling 14,492 videos, demonstrating robustness across different sources.
  • Results show a positive link between MSV and sensory engagement, but an inverted U-shaped pattern for behavioral engagement, where moderate MSV best drives viewers’ behavior.
  • The work contributes both to theory on short-video engagement and to practical research tooling for analyzing multimodal content effects at scale.

Abstract

The contemporary media landscape is characterized by sensational short videos. While prior research examines the effects of individual multimodal features, the collective impact of multimodal features on viewer engagement with short videos remains unknown. Grounded in the theoretical framework of Message Sensation Value (MSV), this study develops and tests a computational model of MSV with multimodal feature analysis and human evaluation of 1,200 short videos. This model that predicts sensory and behavioral engagement was further validated across two unseen datasets from three short video platforms (combined N = 14,492). While MSV is positively associated with sensory engagement, it shows an inverted U-shaped relationship with behavioral engagement: Higher MSV elicits stronger sensory stimulation, but moderate MSV optimizes behavioral engagement. This research advances the theoretical understanding of short video engagement and introduces a robust computational tool for short video research.