A Human-Centered Workflow for Using Large Language Models in Content Analysis

arXiv cs.AI / 3/23/2026

💬 OpinionTools & Practical UsageModels & Research

共有:

Key Points

LLMs should be leveraged via APIs rather than chat interfaces, and a three-task workflow is proposed for content analysis: annotation, summarization, and information extraction.
The workflow is explicitly human-centered, with researchers designing, supervising, and validating each stage to ensure rigor and transparency.
The approach synthesizes insights from multiple disciplines and provides validation procedures and best practices to address limitations such as black-box behavior, prompt sensitivity, and hallucinations.
For practical adoption, the authors supply supplementary materials including a prompt library and Python code in Jupyter Notebook format with detailed usage instructions.

Abstract

While many researchers use Large Language Models (LLMs) through chat-based access, their real potential lies in leveraging LLMs via application programming interfaces (APIs). This paper conceptualizes LLMs as universal text processing machines and presents a comprehensive workflow for employing LLMs in three qualitative and quantitative content analysis tasks: (1) annotation (an umbrella term for qualitative coding, labeling and text classification), (2) summarization, and (3) information extraction. The workflow is explicitly human-centered. Researchers design, supervise, and validate each stage of the LLM process to ensure rigor and transparency. Our approach synthesizes insights from extensive methodological literature across multiple disciplines: political science, sociology, computer science, psychology, and management. We outline validation procedures and best practices to address key limitations of LLMs, such as their black-box nature, prompt sensitivity, and tendency to hallucinate. To facilitate practical implementation, we provide supplementary materials, including a prompt library and Python code in Jupyter Notebook format, accompanied by detailed usage instructions.

The Security Gap in MCP Tool Servers (And What I Built to Fix It)

Dev.to

I made a new programming language to get better coding with less tokens.

Dev.to

RSA Conference 2026: The Week Vibe Coding Security Became Impossible to Ignore

Dev.to

Adversarial AI framework reveals mechanisms behind impaired consciousness and a potential therapy

Reddit r/artificial

Why I Switched From GPT-4 to Small Language Models for Two of My Products

Dev.to

A Human-Centered Workflow for Using Large Language Models in Content Analysis

Key Points

Abstract

Related Articles

The Security Gap in MCP Tool Servers (And What I Built to Fix It)

I made a new programming language to get better coding with less tokens.

RSA Conference 2026: The Week Vibe Coding Security Became Impossible to Ignore

Adversarial AI framework reveals mechanisms behind impaired consciousness and a potential therapy

Why I Switched From GPT-4 to Small Language Models for Two of My Products

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer