IMPACT: Importance-Aware Activation Space Reconstruction
arXiv stat.ML / 4/22/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that weight low-rank compression often fails for LLMs because the low-rank assumption for weights may not hold.
- It proposes IMPACT, which instead reconstructs and compresses using activation low-rank structure to better match how LLMs behave in practice.
- IMPACT introduces an importance-aware optimization that weights activation reconstruction by gradient-based importance, producing a closed-form solution based on an importance-weighted activation covariance matrix.
- Experiments across multiple models and tasks show IMPACT can reduce model size significantly (up to 55.4%) while keeping accuracy comparable to or better than existing compression baselines.
- Overall, the method directly connects compression choices to expected performance impact, aiming to improve deployability in resource-constrained environments.


