Modeling Human-Like Color Naming Behavior in Context

arXiv cs.CL / 4/29/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper investigates how to model human-like color naming behavior in computational systems using neural agents trained via supervised learning and reinforcement learning in referential games.
  • It highlights a key limitation of prior work (NeLLCom-Lex): the resulting color lexicons systematically diverge from human categories by forming highly non-convex regions in color space.
  • To improve alignment with human-like categorization, the authors introduce two interventions: upsampling rare color terms during supervised learning and using multi-listener reinforcement learning interactions.
  • They use a convexity-based measure to quantify the geometric coherence of the learned color categories and show that moderate upsampling plus multiple listeners yields lexicons most similar to human systems.
  • The results suggest that both learning from human data (with adjusted sampling) and richer communication setups (many-listener RL) are important for achieving more human-like pragmatic color lexicons.

Abstract

Modeling the emergence of human-like lexicons in computational systems has advanced through the use of interacting neural agents, which simulate both learning and communicative pressures. The NeLLCom-Lex framework (Zhang et al., 2025) allows neural agents to develop pragmatic color naming behavior and human-like lexicons through supervised learning (SL) from human data and reinforcement learning (RL) in referential games. Despite these successes, the lexicons that emerge diverge systematically from human color categories, producing highly non-convex regions in color space, which contrast with the convexity typical of human categories. To address this, we introduce two factors, upsampling rare color terms during SL and multi-listener RL interactions, and adopt a convexity measure to quantify geometric coherence. We find that upsampling improves lexical diversity and system-level informativeness of the color lexicon, while many-listener setups promote more convex color categories. The combination of moderate upsampling and multiple listeners produces lexicons most similar to human systems.