From Flat Facts to Sharp Hallucinations: Detecting Stubborn Errors via Gradient Sensitivity
arXiv cs.LG / 5/5/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that existing hallucination-detection methods struggle with “stubborn hallucinations,” where LLMs remain confidently wrong.
- It proposes a geometric approach called Embedding-Perturbed Gradient Sensitivity (EPGS) that distinguishes stable factual knowledge from brittle memorization.
- EPGS works by adding Gaussian noise to input embeddings and measuring the resulting increase in gradient magnitude, using this spike as a proxy for the Hessian spectrum.
- Experiments on hallucination detection show EPGS significantly outperforms entropy-based and representation-based baselines, improving detection of high-confidence factual errors.
Related Articles

Why Retail Chargeback Recovery Could Be AgentHansa's First Real PMF
Dev.to

Why B2B Revenue-Recovery Casework Looks Like AgentHansa's Best Early PMF
Dev.to

10 Ways AI Has Become Your Invisible Daily Companion in 2026
Dev.to

When a Bottling Line Stops at 2 A.M., the Agent That Wins Is the One That Finds the Right Replacement Part
Dev.to

My ‘Busy’ Button Is a Chat Window: 8 Hours of Sorting & Broccoli Poetry
Dev.to