DWTSumm: Discrete Wavelet Transform for Document Summarization

arXiv cs.LG / 4/24/2026

💬 OpinionModels & Research

Key Points

  • The paper tackles the difficulty of summarizing long, domain-specific documents with LLMs by proposing a Discrete Wavelet Transform (DWT)-based multi-resolution framework.
  • It views text embeddings as a semantic signal and decomposes them into global “approximation” and local “detail” components to produce compact representations that preserve structure and domain-critical details.
  • The approach can be used directly as summaries or to steer LLM generation, aiming to reduce information loss and hallucinations.
  • Experiments on clinical and legal benchmarks show competitive ROUGE-L results versus strong baselines, with GPT-4o comparisons indicating improvements in semantic similarity and grounding (e.g., BERTScore, Semantic Fidelity, and factual consistency in legal tasks).
  • Across multiple embedding models, Fidelity reaches up to 97%, suggesting DWT functions as a semantic denoising mechanism that strengthens factual grounding and supports reliable long-document summarization.

Abstract

Summarizing long, domain-specific documents with large language models (LLMs) remains challenging due to context limitations, information loss, and hallucinations, particularly in clinical and legal settings. We propose a Discrete Wavelet Transform (DWT)-based multi-resolution framework that treats text as a semantic signal and decomposes it into global (approximation) and local (detail) components. Applied to sentence- or word-level embeddings, DWT yields compact representations that preserve overall structure and critical domain-specific details, which are used directly as summaries or to guide LLM generation. Experiments on clinical and legal benchmarks demonstrate comparable ROUGE-L scores. Compared to a GPT-4o baseline, the DWT based summarization consistently improve semantic similarity and grounding, achieving gains of over 2% in BERTScore, more than 4\% in Semantic Fidelity, factual consistency in legal tasks, and large METEOR improvements indicative of preserved domain-specific semantics. Across multiple embedding models, Fidelity reaches up to 97%, suggesting that DWT acts as a semantic denoising mechanism that reduces hallucinations and strengthens factual grounding. Overall, DWT provides a lightweight, generalizable method for reliable long-document and domain-specific summarization with LLMs.