AI Navigate

Huntington Disease Automatic Speech Recognition with Biomarker Supervision

arXiv cs.LG / 3/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies automatic speech recognition for Huntington's disease using a high-fidelity clinical speech corpus and compares multiple ASR families under a unified evaluation.
  • HD speech induces architecture-specific error regimes, with Parakeet-TDT outperforming encoder–decoder and CTC baselines.
  • HD-specific adaptation reduces word error rate (WER) from 6.99% to 4.95% on the HD corpus.
  • The authors propose biomarker-based auxiliary supervision and analyze how error behavior changes with disease severity rather than yielding uniform improvements.
  • All code and models are open-sourced to enable replication and further research.

Abstract

Automatic speech recognition (ASR) for pathological speech remains underexplored, especially for Huntington's disease (HD), where irregular timing, unstable phonation, and articulatory distortion challenge current models. We present a systematic HD-ASR study using a high-fidelity clinical speech corpus not previously used for end-to-end ASR training. We compare multiple ASR families under a unified evaluation, analyzing WER as well as substitution, deletion, and insertion patterns. HD speech induces architecture-specific error regimes, with Parakeet-TDT outperforming encoder-decoder and CTC baselines. HD-specific adaptation reduces WER from 6.99% to 4.95% and we also propose a method for using biomarker-based auxiliary supervision and analyze how error behavior is reshaped in severity-dependent ways rather than uniformly improving WER. We open-source all code and models.