Sharing a model I've been working on: ColQwen3.5-v1, a 4.5B param model built on Qwen3.5-4B using the ColPali late-interaction approach.
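For anyone new to late interaction, here is a minimal sketch of the MaxSim scoring idea behind ColPali-style models. It is pure Python with toy 2-d vectors. The function name `maxsim_score` and the example vectors are illustrative assumptions, not the model's actual code or the colpali library's API.

```python
def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction (MaxSim) score between one query and one page.

    query_vecs: list of query token embeddings (each a list of floats)
    doc_vecs:   list of page patch embeddings (same dimension)
    For every query vector, keep its best dot product against all
    page vectors, then sum those maxima.
    """
    return sum(
        max(sum(q * d for q, d in zip(qv, dv)) for dv in doc_vecs)
        for qv in query_vecs
    )


# Toy, hand-made embeddings (hypothetical values for illustration only).
query = [[1.0, 0.0], [0.0, 1.0]]
page_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
page_b = [[0.5, 0.5]]

# Rank pages by descending MaxSim score.
scores = {"page_a": maxsim_score(query, page_a),
          "page_b": maxsim_score(query, page_b)}
ranking = sorted(scores, key=scores.get, reverse=True)
```

Because each query token is matched to its own best patch, a page only needs some region that matches each token, which is the usual argument for why this works well on visually dense documents.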
Currently #1 on ViDoRe V1 (nDCG@5 0.917) & competitive on ViDoRe V3. Trained across 4 phases including hard negative mining and domain specialization on finance/table docs.
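For context on the metric, here is a rough sketch of how nDCG@5 can be computed for a single query. It assumes linear gain and that `relevances` holds the relevance grades of all judged documents in the order the model ranked them. The helper names are hypothetical, and the benchmark's official evaluation code may differ in details.

```python
import math


def dcg_at_k(relevances, k):
    # Discounted cumulative gain: each grade is divided by log2(rank + 1),
    # with ranks starting at 1 (enumerate starts at 0, hence the + 2).
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))


def ndcg_at_k(relevances, k=5):
    # Normalize by the DCG of the ideal ordering (grades sorted descending).
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical query with one relevant page, retrieved at rank 2.
example = ndcg_at_k([0, 1, 0, 0, 0])
```

A benchmark score is then the mean of this value over all queries.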
Apache 2.0 licensed, with weights on HF: https://huggingface.co/athrael-soju/colqwen3.5-v1. I've also opened a PR to merge it into https://github.com/illuin-tech/colpali
Working on v2 to simplify the training recipe and cover more domains, with the aim of reaching #1 on ViDoRe V3 soon.
Let me know if you try it out!


