Why Are We Lonely? Leveraging LLMs to Measure and Understand Loneliness in Caregivers and Non-caregivers

arXiv cs.CL / 4/10/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an LLM-driven pipeline to construct and label diverse social-media datasets for measuring loneliness in caregivers versus non-caregivers using an expert-developed evaluation framework and typology of loneliness causes.
  • It applies multiple LLMs (GPT-4o, GPT-5-nano, and GPT-5) with a human-validated data processing workflow to build a high-quality Reddit corpus for analysis.
  • The loneliness classifier achieved average accuracies of 76.09% for caregivers and 79.78% for non-caregivers, indicating reasonably strong performance across populations.
  • The cause categorization component reached micro-aggregate F1 scores of 0.825 (caregivers) and 0.80 (non-caregivers), enabling analysis of which underlying reasons differ most between groups.
  • Results show caregivers’ loneliness is more often tied to caregiving roles, identity recognition, and feelings of abandonment, and demographic extraction supports Reddit as a viable source for diverse caregiver-focused loneliness datasets.

Abstract

This paper presents an LLM-driven approach for constructing diverse social media datasets to measure and compare loneliness in the caregiver and non-caregiver populations. We introduce an expert-developed loneliness evaluation framework and an expert-informed typology for categorizing causes of loneliness for analyzing social media text. Using a human-validated data processing pipeline, we apply GPT-4o, GPT-5-nano, and GPT-5 to build a high-quality Reddit corpus and analyze loneliness across both populations. The loneliness evaluation framework achieved average accuracies of 76.09% and 79.78% for caregivers and non-caregivers, respectively. The cause categorization framework achieved micro-aggregate F1 scores of 0.825 and 0.80 for caregivers and non-caregivers, respectively. Across populations, we observe substantial differences in the distribution of types of causes of loneliness. Caregivers' loneliness were predominantly linked to caregiving roles, identity recognition, and feelings of abandonment, indicating distinct loneliness experiences between the two groups. Demographic extraction further demonstrates the viability of Reddit for building a diverse caregiver loneliness dataset. Overall, this work establishes an LLM-based pipeline for creating high quality social media datasets for studying loneliness and demonstrates its effectiveness in analyzing population-level differences in the manifestation of loneliness.