Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation
arXiv cs.CV / 3/18/2026
Key Points
- The paper introduces Locate-Then-Sparsify for Feature Steering (LTS-FS), a plug-and-play framework that applies layer-wise, attribution-guided feature steering to mitigate visual hallucinations in large vision-language models (LVLMs).
- It develops an attribution method based on causal interventions to quantify each layer's relevance to hallucinations, using a synthetic dataset with token-level and sentence-level hallucination cases.
- The approach converts layer attribution scores into per-layer steering intensities, enabling targeted adjustments only on hallucination-relevant layers to avoid degrading non-hallucination tasks.
- The authors report that experiments across multiple LVLMs and benchmarks show reduced hallucination while general task performance is preserved.
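
The locate-then-sparsify idea in the key points can be illustrated with a small toy sketch. This is not the paper's implementation: the summary gives neither the attribution formula, the sparsification rule, nor the steering vectors, so everything below is an assumption made for illustration. The function names (`attribute_layers`, `sparsify_to_intensity`, `steer`), the `keep_ratio` and `max_alpha` parameters, and the linear toy scorer are all hypothetical. The sketch patches one layer at a time with a reference activation to estimate its causal effect on a hallucination score, keeps only the highest-scoring layers, and steers those layers in proportion to their score.

```python
import numpy as np

def attribute_layers(score_fn, activations, reference):
    """Toy causal-intervention attribution (assumed form, not the paper's).

    Replace one layer's activation with a reference activation and record
    how much the hallucination score drops relative to the unpatched run.
    """
    base = score_fn(activations)
    scores = np.zeros(len(activations))
    for layer in range(len(activations)):
        patched = list(activations)
        patched[layer] = reference[layer]
        scores[layer] = base - score_fn(patched)
    return scores

def sparsify_to_intensity(scores, keep_ratio=0.5, max_alpha=1.0):
    """Map attribution scores to per-layer steering intensities.

    Hypothetical rule: keep the top `keep_ratio` fraction of layers, scale
    their intensity linearly with the score, and set all others to zero.
    """
    scores = np.clip(np.asarray(scores, dtype=float), 0.0, None)
    k = max(1, int(np.ceil(keep_ratio * len(scores))))
    keep = np.argsort(scores)[-k:]          # indices of the k largest scores
    alphas = np.zeros_like(scores)
    top = scores[keep].max()
    if top > 0:
        alphas[keep] = max_alpha * scores[keep] / top
    return alphas

def steer(activations, directions, alphas):
    """Add alpha-scaled steering directions; alpha == 0 leaves a layer as-is."""
    return [h + a * d for h, d, a in zip(activations, directions, alphas)]

# Toy setup: 4 layers, 3-dim activations, and a linear "hallucination score"
# that reads the first coordinate of each layer with a per-layer weight.
weights = np.array([0.0, 2.0, 0.5, 1.0])
score_fn = lambda acts: float(sum(w * h[0] for w, h in zip(weights, acts)))

activations = [np.ones(3) for _ in range(4)]
reference = [np.zeros(3) for _ in range(4)]
directions = [np.array([-1.0, 0.0, 0.0]) for _ in range(4)]

scores = attribute_layers(score_fn, activations, reference)
alphas = sparsify_to_intensity(scores, keep_ratio=0.5, max_alpha=1.0)
steered = steer(activations, directions, alphas)

print("attribution:", scores)
print("intensity:  ", alphas)
print("score before/after:", score_fn(activations), score_fn(steered))
```

In this toy run only the two layers with the largest attribution receive a nonzero intensity, so the remaining layers are left untouched. That is the property the key points credit with avoiding degradation on non-hallucination tasks.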




