AI Navigate

Automatic detection of Gen-AI texts: A comparative framework of neural models

arXiv cs.CL / 3/20/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper analyzes AI-generated text detection, addressing the challenge posed by rapid LLM proliferation across domains.
  • It designs and evaluates four neural detectors: Multilayer Perceptron, 1D-CNN, MobileNet-based CNN, and a Transformer model.
  • The detectors are benchmarked against widely used online tools such as ZeroGPT, GPTZero, QuillBot, Originality.AI, Sapling, IsGen, Rephrase, and Writer.
  • Experiments use the COLING Multilingual Dataset (English and Italian) and an in-house dataset focused on Art and Mental Health to assess cross-language and domain robustness.
  • The results show supervised detectors offer more stable and robust performance than commercial tools across languages and domains, highlighting current strengths and limitations of detection approaches.

Abstract

The rapid proliferation of Large Language Models has significantly increased the difficulty of distinguishing between human-written and AI generated texts, raising critical issues across academic, editorial, and social domains. This paper investigates the problem of AI generated text detection through the design, implementation, and comparative evaluation of multiple machine learning based detectors. Four neural architectures are developed and analyzed: a Multilayer Perceptron, a one-dimensional Convolutional Neural Network, a MobileNet-based CNN, and a Transformer model. The proposed models are benchmarked against widely used online detectors, including ZeroGPT, GPTZero, QuillBot, Originality.AI, Sapling, IsGen, Rephrase, and Writer. Experiments are conducted on the COLING Multilingual Dataset, considering both English and Italian configurations, as well as on an original thematic dataset focused on Art and Mental Health. Results show that supervised detectors achieve more stable and robust performance than commercial tools across different languages and domains, highlighting key strengths and limitations of current detection strategies.