MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv cs.CV / 4/1/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses two common issues in Composed Image Retrieval (CIR): frequency bias that causes rare modification semantics to be neglected, and instability of similarity scores due to hard negative samples and noise.
- It introduces MELT, a Modification frEquentation-rarity baLance neTwork that increases attention to rare modification semantics in multimodal (reference image + text) settings.
- To improve robustness against hard negatives, MELT uses diffusion-based denoising to reduce the influence of hard negative samples with high similarity scores.
- Experiments on two CIR benchmarks reportedly show that MELT achieves superior performance compared with existing CIR approaches.
- The authors provide implementation code at the linked GitHub repository, enabling reproducibility and further experimentation.
Related Articles

Show HN: 1-Bit Bonsai, the First Commercially Viable 1-Bit LLMs
Dev.to

I Built an AI Agent That Can Write Its Own Tools When It Gets Stuck
Dev.to

Agent Self-Discovery: How AI Agents Find Their Own Wallets
Dev.to
[P] Federated Adversarial Learning
Reddit r/MachineLearning

The Inversion Error: Why Safe AGI Requires an Enactive Floor and State-Space Reversibility
Towards Data Science