HIFICL: High-Fidelity In-Context Learning for Multimodal Tasks
arXiv cs.CV / 3/16/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper notes that In-Context Learning for large multimodal models is sensitive to demonstration configurations and computationally expensive.
- It introduces High-Fidelity In-Context Learning (HIFICL) with virtual key-value pairs as learnable context to more faithfully model the ICL mechanism.
- HIFICL uses a low-rank factorization for stable, regularized training and frames the approach as context-aware parameter-efficient fine-tuning.
- Extensive experiments on multimodal benchmarks show HIFICL consistently outperforms existing approximation methods, and the code is publicly available.




