AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs

arXiv cs.CL / 4/15/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces AgenticAI-DialogGen, a modular, agent-based framework that generates persona-grounded, topic-guided dialogues while avoiding human supervision.
  • It builds long- and short-term memory representations by extracting knowledge graphs, identifying topics, constructing speaker personas, and simulating topic continuity across conversations.
  • A QA module creates memory-grounded question–answer pairs sourced from both short- and long-term conversational histories to support evaluation of “memory” capabilities.
  • The authors release a new dataset, TopicGuidedChat (TGC), encoding long-term memory as speaker-specific knowledge graphs and short-term memory as newly generated topic-guided conversations.
  • Experimental results indicate that AgenticAI-DialogGen improves conversational quality and that fine-tuning LLMs on TGC boosts performance on memory-grounded QA tasks.

Abstract

Recent advancements in Large Language Models (LLMs) have improved their ability to process extended conversational contexts, yet fine-tuning and evaluating short- and long-term memories remain difficult due to the absence of datasets that encode both short- and long-term conversational history. Existing conversational datasets lack memory grounding, overlook topic continuity, or rely on costly human annotation. To address these gaps, we introduce AgenticAI-DialogGen, a modular agent-based framework that generates persona-grounded and topic-guided conversations without human supervision. The framework uses LLM agents to extract knowledge graphs, identify topics, build speaker personas, and simulate topic-guided conversations from unstructured conversations. A QA module generates memory-grounded Question Answer (QA) pairs drawn from short- and long-term conversational histories. We also generated a new dataset entitled, TopicGuidedChat (TGC), where long-term memory is encoded as speaker-specific knowledge graphs and short-term memory as newly generated topic-guided conversations. Evaluations depict that AgenticAI-DialogGen yields higher conversational quality and LLMs fine-tuned on TGC dataset achieve improved performance on memory-grounded QA tasks.