CNSocialDepress: A Chinese Social Media Dataset for Depression Risk Detection and Structured Analysis
arXiv cs.CL / 3/27/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CNSocialDepress, a Chinese-language social media benchmark dataset aimed at depression risk detection and analysis.
- The dataset includes 44,178 posts from 233 users, with psychological experts annotating 10,306 depression-related segments.
- Unlike binary-only resources, CNSocialDepress provides both binary risk labels and structured, multidimensional psychological attributes for more interpretable, fine-grained signal analysis.
- Experiments show the dataset supports multiple NLP tasks, including structured psychological profiling and fine-tuning large language models for depression detection.
- The authors position CNSocialDepress as a practical resource for mental-health applications tailored to Chinese-speaking populations, addressing a gap in publicly available resources.
Related Articles

GDPR and AI Training Data: What You Need to Know Before Training on Personal Data
Dev.to
Edge-to-Cloud Swarm Coordination for heritage language revitalization programs with embodied agent feedback loops
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

AI Crawler Management: The Definitive Guide to robots.txt for AI Bots
Dev.to

Data Sovereignty Rules and Enterprise AI
Dev.to