Detecting Basic Values in A Noisy Russian Social Media Text Data: A Multi-Stage Classification Framework
arXiv cs.CL / 3/20/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study presents a multi-stage pipeline for detecting Schwartz's basic human values in noisy Russian social media text, combining spam filtering, targeted post selection, LLM-based annotation, and transformer-based multi-label classification.
- It treats expert annotations as interpretative benchmarks with uncertainty and aggregates multiple LLM judgments into soft labels to reflect varying levels of agreement.
- The best model, XLM-RoBERTa large, achieves an F1 macro of 0.83 and an F1 of 0.71 on held-out test data, demonstrating effective value detection and handling of annotation subjectivity.
- The work adds to understanding cultural variation in value expression on social platforms and publicly releases all models for further research.
Related Articles

Attacks On Data Centers, Qwen3.5 In All Sizes, DeepSeek’s Huawei Play, Apple’s Multimodal Tokenizer
The Batch

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

**Core Allocation Optimization for Energy‑Efficient Multi‑Core Scheduling in ARINC650 Systems**
Dev.to

KI in der amtlichen Recherche beim DPMA: Was Patentanwälte bei Neuanmeldungen jetzt beachten sollten (Stand: März 2026)
Dev.to