SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models
arXiv cs.CV / 3/20/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Sparse Embedding Modulation (SEM), a post-hoc, zero-shot debiasing framework for vision-language models like CLIP that operates in a Sparse Autoencoder latent space.
- SEM disentangles bias- and query-relevant features by decomposing CLIP text embeddings into sparse components and modulating bias-relevant neurons.
- The approach enables non-linear debiasing interventions and demonstrates substantial fairness gains in retrieval and zero-shot classification across four benchmark datasets and two CLIP backbones.
- Overall, the results indicate that sparse latent representations can provide an effective foundation for debiasing vision-language models without sacrificing semantic fidelity.
Related Articles

Attacks On Data Centers, Qwen3.5 In All Sizes, DeepSeek’s Huawei Play, Apple’s Multimodal Tokenizer
The Batch

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

**Core Allocation Optimization for Energy‑Efficient Multi‑Core Scheduling in ARINC650 Systems**
Dev.to

KI in der amtlichen Recherche beim DPMA: Was Patentanwälte bei Neuanmeldungen jetzt beachten sollten (Stand: März 2026)
Dev.to