SACRED: A Faithful Annotated Multimedia Multimodal Multilingual Dataset for Classifying Connectedness Types in Online Spirituality
arXiv cs.CL / 3/31/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces SACRED, a faithful, annotated, multimedia multimodal multilingual dataset aimed at classifying types of connectedness in online spirituality communication.
- The authors report that SACRED was developed in collaboration with social scientists to address the lack of high-quality datasets accessible online for this research area.
- Using SACRED, they benchmark 13 popular LLMs alongside rule-based and fine-tuned approaches, finding DeepSeek-V3 performs strongly on the Quora test set (79.19% accuracy).
- For vision-related tasks, GPT-4o-mini achieves the best overall performance among compared models (63.99% F1 score).
- The study also identifies a newly observed connectedness type, intended to support further communication science research.
Related Articles
[D] How does distributed proof of work computing handle the coordination needs of neural network training?
Reddit r/MachineLearning

BYOK is not just a pricing model: why it changes AI product trust
Dev.to

AI Citation Registries and Identity Persistence Across Records
Dev.to

Building Real-Time AI Voice Agents with Google Gemini 3.1 Flash Live and VideoSDK
Dev.to

Your Knowledge, Your Model: A Method for Deterministic Knowledge Externalization
Dev.to