Suppressing Non-Semantic Noise in Masked Image Modeling Representations
arXiv cs.CV / 4/2/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- Masked Image Modeling (MIM) representations can unintentionally preserve non-semantic “noise,” which degrades inference performance.
- The paper proposes a model-agnostic semantic-invariance scoring approach using PCA on real and synthetic non-semantic images.
- It introduces Semantically Orthogonal Artifact Projection (SOAP), a post-hoc method that suppresses non-semantic information in patch representations without additional training.
- SOAP is designed to be plug-and-play: it can be attached to different MIM-based models as a single linear head and yields consistent gains in zero-shot performance.
Related Articles
v5.5.0
Transformers(HuggingFace)Releases
Bonsai (PrismML's 1 bit version of Qwen3 8B 4B 1.7B) was not an aprils fools joke
Reddit r/LocalLLaMA

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Inference Engines - A visual deep dive into the layers of an LLM
Dev.to
Surprised by how capable Qwen3.5 9B is in agentic flows (CodeMode)
Reddit r/LocalLLaMA