SpectralLoRA: Is Low-Frequency Structure Sufficient for LoRA Adaptation? A Spectral Analysis of Weight Updates

arXiv cs.LG / 4/14/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper conducts a systematic spectral analysis of LoRA weight updates using 2D DCT on adaptation matrices trained for BERT-base and RoBERTa-base across four GLUE tasks.
  • It finds LoRA updates are consistently dominated by low-frequency components, with 33% of DCT coefficients capturing 90% of spectral energy on average.
  • By retaining only 10% of frequency coefficients, the method can reduce adapter storage by about 10× while incurring a relatively small performance drop of 1.95 percentage points on SST-2.
  • Frequency masking at around 50% can outperform full LoRA for 3 of 8 model–task pairs, implying high-frequency components may contribute more noise than signal.
  • The study shows RoBERTa-base is more spectrally compressible than BERT-base, and that task complexity affects required spectral budget (e.g., NLI needing more high-frequency capacity than sentiment tasks).

Abstract

We present a systematic empirical study of the spectral structure of LoRA weight updates. Through 2D Discrete Cosine Transform (DCT) analysis of trained adaptation matrices across BERT-base and RoBERTa-base on four GLUE benchmarks (SST-2, MNLI, CoLA, QQP), we establish that LoRA updates are universally dominated by low-frequency components: on average, just 33% of DCT coefficients capture 90% of total spectral energy. Retaining only 10% of frequency coefficients reduces adapter storage by 10x while sacrificing only 1.95pp on SST-2. Notably, frequency masking at k=50% improves over full LoRA on 3 of 8 model-task pairs, suggesting high-frequency components act as adaptation noise. We further discover that RoBERTa-base is systematically more spectrally compressible than BERT-base across all tasks, and that task complexity governs spectral sensitivity -- NLI tasks require more frequency budget than sentiment classification. These findings motivate a new design principle for PEFT: spectral sparsity in adaptation.