HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists

arXiv cs.CL / 4/30/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical UsageModels & Research

Key Points

  • HalluCiteChecker is a lightweight toolkit designed to detect and verify hallucinated citations in scientific papers that do not match any existing work.
  • The authors frame hallucinated citation detection as an NLP task and provide an accompanying toolkit meant to serve as a practical foundation for addressing the problem.
  • The package can verify citations in seconds on a standard laptop, runs fully offline, and is optimized to use only CPUs.
  • The project aims to reduce reviewer and author burden by enabling systematic pre-review and publication checks, and it is released as Apache 2.0 open source on GitHub and distributed via PyPI.
  • A demo video is also provided on YouTube, supporting adoption and evaluation by researchers and developers.

Abstract

We introduce HalluCiteChecker, a toolkit for detecting and verifying hallucinated citations in scientific papers. While AI assistant technologies have transformed the academic writing process, including citation recommendation, they have also led to the emergence of hallucinated citations that do not correspond to any existing work. Such citations not only undermine the credibility of scientific papers but also impose an additional burden on reviewers and authors, who must manually verify their validity during the review process. In this study, we formalize hallucinated citation detection as an NLP task and provide a corresponding toolkit as a practical foundation for addressing this problem. Our package is lightweight and can perform verification in seconds on a standard laptop. It can also be executed entirely offline and runs efficiently using only CPUs. We hope that HalluCiteChecker will help reduce reviewer workload and support organizers by enabling systematic pre-review and publication checks. Our code is released under the Apache 2.0 license on GitHub and is distributed as an installable package via PyPI. A demonstration video is available on YouTube.