Modeling the human lexicon under temperature variations: linguistic factors, diversity and typicality in LLM word associations
arXiv cs.CL / 3/20/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study compares human and LLM-generated word associations using the SWOW dataset and three LLMs (Mistral-7B, Llama-3.1-8B, Qwen-2.5-32B) across multiple temperature settings.
- It examines how lexical factors such as word frequency and concreteness influence cue-response pairs in both humans and models.
- Results show all models mirror human trends for frequency and concreteness but differ in response variability and typicality, with larger models emitting highly typical but less variable responses.
- Temperature settings modulate this trade-off by increasing variability while reducing typicality, highlighting how sampling temperature shapes lexical representations.
- The work underscores the importance of considering model size and temperature when probing LLM lexical representations and comparing to human data.
Related Articles
ADICはどの種類の革新なのか ―― ドリフト監査デモで見る「事後説明」から「通過条件」への移行**
Qiita
Complete Guide: How To Make Money With Ai
Dev.to
Built a small free iOS app to reduce LLM answer uncertainty with multiple models
Dev.to
Without Valid Data, AI Transformation Is Flying Blind – Why We Need to “Grasp” Work Again
Dev.to
How We Used Hindsight Memory to Build an AI That Knows Your Weaknesses
Dev.to