Representing expertise accelerates learning from pedagogical interaction data

arXiv cs.CL / 4/15/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper investigates why interaction traces between a novice and an expert can improve learning, focusing on which interaction features drive the gains versus expert-only demonstrations.
  • It uses a controlled spatial navigation paradigm and generates synthetic datasets of pedagogical (expert–novice) interactions alongside expert-only behavior.
  • Transformer models trained on pedagogical interaction data outperform models trained solely on expert demonstrations and remain more robust across different scenarios.
  • The study also finds that explicitly representing epistemically distinct agents enables expert-like behavior even when expert actions appear infrequently in the data.

Abstract

Work in cognitive science and artificial intelligence has suggested that exposing learning agents to traces of interaction between multiple individuals can improve performance in a variety of settings, yet it remains unknown which features of interactions contribute to this improvement. We examined the factors that support the effectiveness of interaction data, using a controlled paradigm that allowed us to precisely operationalize key distinctions between interaction and an expert acting alone. We generated synthetic datasets of simple interactions between an expert and a novice in a spatial navigation task, and then trained transformer models on those datasets, evaluating performance after exposure to different datasets. Our experiments showed that models trained on pedagogical interactions were more robust across a variety of scenarios compared to models trained only on expert demonstrations, and that having the ability to represent epistemically distinct agents led to expert-like behavior even when expert behavior was rarely observed.