Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems

arXiv cs.CL / 3/25/2026


Key Points

  • The paper investigates whether DeepFakeDeLiBot, a deliberation-enhancing chatbot, can help groups detect deepfake text generated by machine learning models.
  • Results show that collaborative, group-based problem solving improves deepfake text detection accuracy compared with individuals working alone.
  • Overall performance gains from interacting with DeepFakeDeLiBot are limited, but the chatbot positively affects group interaction by increasing engagement, consensus-building, and the amount and variety of reasoning-focused dialogue.
  • Participants who believed group collaboration was more effective benefited more from the chatbot, suggesting perceived collaboration quality moderates the tool’s impact.
  • The study proposes deliberative chat systems as a way to support both accurate detection and healthier, more productive group dynamics, with dataset and code planned for release after acceptance.

Abstract

The proliferation of generative models has presented significant challenges in distinguishing authentic human-authored content from deepfake content. Collaborative human efforts, augmented by AI tools, present a promising solution. In this study, we explore the potential of DeepFakeDeLiBot, a deliberation-enhancing chatbot, to support groups in detecting deepfake text. Our findings reveal that group-based problem-solving significantly improves the accuracy of identifying machine-generated paragraphs compared to individual efforts. While engagement with DeepFakeDeLiBot does not yield substantial performance gains overall, it enhances group dynamics by fostering greater participant engagement, consensus building, and the frequency and diversity of reasoning-based utterances. Additionally, participants with higher perceived effectiveness of group collaboration exhibited performance benefits from DeepFakeDeLiBot. These findings underscore the potential of deliberative chatbots in fostering interactive and productive group dynamics while ensuring accuracy in collaborative deepfake text detection. Dataset and source code used in this study will be made publicly available upon acceptance of the manuscript.