AI Navigate

NormGenesis: Multicultural Dialogue Generation via Exemplar-Guided Social Norm Modeling and Violation Recovery

arXiv cs.CL / 3/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • NormGenesis introduces a multicultural framework for generating socially grounded dialogues across English, Chinese, and Korean, including the new Violation-to-Resolution (V2R) dialogue type to model post-violation repair.
  • The approach uses exemplar-based iterative refinement early in dialogue synthesis to align with linguistic, emotional, and sociocultural expectations before full generation.
  • The authors construct a dataset of 10,800 multi-turn dialogues annotated for norm adherence, speaker intent, and emotional response, and show improvements in refinement quality, naturalness, and generalization over existing datasets.
  • Models trained on V2R-augmented data exhibit improved pragmatic competence in ethically sensitive contexts and establish a new benchmark for culturally adaptive dialogue modeling across languages.

Abstract

Social norms govern culturally appropriate behavior in communication, enabling dialogue systems to produce responses that are not only coherent but also socially acceptable. We present NormGenesis, a multicultural framework for generating and annotating socially grounded dialogues across English, Chinese, and Korean. To model the dynamics of social interaction beyond static norm classification, we propose a novel dialogue type, Violation-to-Resolution (V2R), which models the progression of conversations following norm violations through recognition and socially appropriate repair. To improve pragmatic consistency in underrepresented languages, we implement an exemplar-based iterative refinement early in the dialogue synthesis process. This design introduces alignment with linguistic, emotional, and sociocultural expectations before full dialogue generation begins. Using this framework, we construct a dataset of 10,800 multi-turn dialogues annotated at the turn level for norm adherence, speaker intent, and emotional response. Human and LLM-based evaluations demonstrate that NormGenesis significantly outperforms existing datasets in refinement quality, dialogue naturalness, and generalization performance. We show that models trained on our V2R-augmented data exhibit improved pragmatic competence in ethically sensitive contexts. Our work establishes a new benchmark for culturally adaptive dialogue modeling and provides a scalable methodology for norm-aware generation across linguistically and culturally diverse languages.