Who Wrote the Book? Detecting and Attributing LLM Ghostwriters
arXiv cs.CL / 3/31/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces GhostWriteBench, a new dataset for LLM authorship attribution using long-form (50K+ words) book-length texts generated by frontier models to evaluate generalization across multiple OOD dimensions.
- It also proposes TRACE, an interpretable and lightweight fingerprinting approach that infers token-level transition patterns (such as word-rank changes) using a separate lightweight language model.
- TRACE is designed to work with both open- and closed-source LLMs, addressing a key challenge in real-world attribution where model access may be restricted.
- Experiments reported on GhostWriteBench indicate TRACE delivers state-of-the-art results, maintains robustness under OOD conditions, and performs effectively with limited training data.
Related Articles

Black Hat Asia
AI Business
[D] How does distributed proof of work computing handle the coordination needs of neural network training?
Reddit r/MachineLearning

Claude Code's Entire Source Code Was Just Leaked via npm Source Maps — Here's What's Inside
Dev.to

BYOK is not just a pricing model: why it changes AI product trust
Dev.to

AI Citation Registries and Identity Persistence Across Records
Dev.to