Air-Know: Arbiter-Calibrated Knowledge-Internalizing Robust Network for Composed Image Retrieval
arXiv cs.CV / 4/22/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that Composed Image Retrieval (CIR) is strongly limited by the Noisy Triplet Correspondence (NTC) problem, where semantic ambiguity breaks the common “small loss hypothesis” used by robust learning methods.
- It proposes Air-Know (ArbIteR calibrated Knowledge iNternalizing rObust netWork), an approach that avoids an adversarial feedback loop between the learner and an arbiter that can cause catastrophic representation pollution.
- Air-Know uses an “Expert-Proxy-Diversion” decoupling framework with three modules: External Prior Arbitration (EPA) using multimodal LLMs to build a high-precision anchor dataset, Expert Knowledge Internalization (EKI) to train a lightweight proxy arbiter, and Dual Stream Reconciliation (DSR) to divert data based on matching confidence.
- Experiments on multiple CIR benchmarks show Air-Know substantially improves state-of-the-art performance specifically under the NTC noise setting, while remaining competitive in standard (non-NTC) CIR.
- The work highlights a practical strategy for robust retrieval training: calibrating learning using offline expert knowledge and separating arbitration from representation learning via confidence-based data routing.
![AI TikTok Marketing for Pet Brands [2026 Guide]](/_next/image?url=https%3A%2F%2Fmedia2.dev.to%2Fdynamic%2Fimage%2Fwidth%3D1200%2Cheight%3D627%2Cfit%3Dcover%2Cgravity%3Dauto%2Cformat%3Dauto%2Fhttps%253A%252F%252Fdev-to-uploads.s3.amazonaws.com%252Fuploads%252Farticles%252Fj35r9qm34d68qf2gq7no.png&w=3840&q=75)


