Measuring What Cannot Be Surveyed: LLMs as Instruments for Latent Cognitive Variables in Labor Economics
arXiv cs.CL / 4/6/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes a framework for using LLMs as valid measurement instruments for latent economic variables, particularly the cognitive content of occupational tasks at fine granularity beyond traditional surveys.
- It formalizes four validity conditions—semantic exogeneity, construct relevance, monotonicity, and model invariance—to justify when LLM-generated scores can serve as instruments.
- The authors apply the method to construct an Augmented Human Capital Index (AHC_o) from 18,796 O*NET task statements scored by Claude Haiku 4.5, and report strong convergent validity against six existing AI exposure indices.
- Statistical checks—including discriminant validity, PCA revealing two AI-related dimensions (augmentation vs substitution), and inter-model reliability (Pearson r and Krippendorff’s alpha)—support the index’s measurement quality.
- The study also finds robust task rankings under prompt variation and shows that ORIV estimation corrects measurement error attenuation compared with OLS, with the approach intended to generalize to other domains requiring scalable semantic quantification.
Related Articles

Black Hat Asia
AI Business

Оказывается, эта нейросеть рисует бесплатно. Я узнал случайно.
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
Three-Layer Memory Governance: Core, Provisional, Private
Dev.to

I Researched AI Prompting So You Don’t Have To
Dev.to