Detecting Hallucinations in SpeechLLMs at Inference Time Using Attention Maps

arXiv cs.CL / 4/22/2026

📰 News · Models & Research

Key Points

  • The paper proposes inference-time hallucination detection for SpeechLLMs without needing costly gold-standard outputs by using attention-derived metrics tailored to audio inputs.
  • It introduces four attention-based features—AUDIORATIO, AUDIOCONSISTENCY, AUDIOENTROPY, and TEXTENTROPY—and trains lightweight logistic regression classifiers to flag likely hallucinations efficiently.
  • Experiments on ASR and speech-to-text translation using Qwen-2-Audio and Voxtral-3B show the method outperforms uncertainty- and prior-attention-based baselines on in-domain data, with up to +0.23 PR-AUC improvements.
  • The approach also generalizes to out-of-domain ASR, and strong results can be achieved with roughly 100 attention heads rather than all of them, which improves out-of-domain generalization in some settings.
  • Effectiveness depends on the specific model and task, and the classifier requires task-specific training, but the study demonstrates attention patterns as a practical signal for SpeechLLM hallucination detection.

Abstract

Hallucinations in Speech Large Language Models (SpeechLLMs) pose significant risks, yet existing detection methods typically rely on gold-standard outputs that are costly or impractical to obtain. Moreover, hallucination detection methods developed for text-based LLMs do not directly capture audio-specific signals. We investigate four attention-derived metrics: AUDIORATIO, AUDIOCONSISTENCY, AUDIOENTROPY, and TEXTENTROPY, designed to capture pathological attention patterns associated with hallucination, and train lightweight logistic regression classifiers on these features for efficient inference-time detection. Across automatic speech recognition and speech-to-text translation tasks, evaluations on Qwen-2-Audio and Voxtral-3B show that our approach outperforms uncertainty-based and prior attention-based baselines on in-domain data, achieving improvements of up to +0.23 PR-AUC, and generalises to out-of-domain ASR settings. We further find that strong performance can be achieved with approximately 100 attention heads, improving out-of-domain generalisation compared to using all heads. While effectiveness is model-dependent and task-specific training is required, our results demonstrate that attention patterns provide a valuable tool for hallucination detection in SpeechLLMs.
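To make the pipeline concrete, here is a minimal sketch of the two-stage recipe the abstract describes: summarize an attention map into the four scalar features, then fit a lightweight logistic regression to flag hallucinations. The feature formulas below (attention mass on the audio span, entropy over each span, and a cosine-similarity proxy for consistency) and the synthetic data are illustrative assumptions, not the paper's exact definitions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

AUDIO_LEN, TEXT_LEN = 10, 6  # assumed input layout: audio tokens first, then text

def attention_features(attn, audio_len=AUDIO_LEN):
    """Reduce one head's attention map (generated tokens x input tokens) to the
    four features named in the paper; formulas are illustrative guesses."""
    eps = 1e-12
    audio, text = attn[:, :audio_len], attn[:, audio_len:]
    audio_ratio = audio.sum(axis=1).mean()                 # AUDIORATIO: mass on audio span
    pa = audio / (audio.sum(axis=1, keepdims=True) + eps)  # renormalized over audio tokens
    pt = text / (text.sum(axis=1, keepdims=True) + eps)
    audio_entropy = -(pa * np.log(pa + eps)).sum(axis=1).mean()  # AUDIOENTROPY
    text_entropy = -(pt * np.log(pt + eps)).sum(axis=1).mean()   # TEXTENTROPY
    # AUDIOCONSISTENCY proxy: cosine similarity of consecutive audio-attention rows
    n = np.linalg.norm(audio, axis=1) + eps
    audio_consistency = ((audio[:-1] * audio[1:]).sum(axis=1) / (n[:-1] * n[1:])).mean()
    return np.array([audio_ratio, audio_consistency, audio_entropy, text_entropy])

def synthetic_attention(audio_bias, rng, n_tokens=8):
    """Toy attention map: a positive bias pushes mass onto the audio span,
    mimicking a faithful output; a negative bias mimics attention drift."""
    logits = rng.normal(size=(n_tokens, AUDIO_LEN + TEXT_LEN))
    logits[:, :AUDIO_LEN] += audio_bias
    e = np.exp(logits - logits.max(axis=1, keepdims=True))  # row-wise softmax
    return e / e.sum(axis=1, keepdims=True)

rng = np.random.default_rng(0)
# Label 1 = hallucinated (attention drifts off the audio), label 0 = faithful.
X = np.stack([attention_features(synthetic_attention(b, rng))
              for b in [2.0] * 20 + [-2.0] * 20])
y = np.array([0] * 20 + [1] * 20)
clf = LogisticRegression().fit(X, y)
train_acc = clf.score(X, y)
```

In practice the features would be extracted per attention head from the SpeechLLM's forward pass at inference time, and (per the paper's finding) restricting the classifier to a subset of roughly 100 informative heads rather than all of them can help out-of-domain generalization.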