A Computational Model of Message Sensation Value in Short Video Multimodal Features that Predicts Sensory and Behavioral Engagement
arXiv cs.CV / 4/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study introduces a computational model based on Message Sensation Value (MSV) to analyze how combined multimodal features in sensational short videos influence viewer engagement.
- Using human evaluations of 1,200 short videos, the model predicts sensory and behavioral engagement more effectively than treating individual features in isolation.
- The approach was validated on two additional unseen datasets spanning three platforms, totaling 14,492 videos, demonstrating robustness across different sources.
- Results show a positive link between MSV and sensory engagement, but an inverted U-shaped pattern for behavioral engagement, where moderate MSV best drives viewers’ behavior.
- The work contributes both to theory on short-video engagement and to practical research tooling for analyzing multimodal content effects at scale.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

Why use an AI gateway at all?
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to