AI Navigate

[P] ColQwen3.5-v3 release + Case study

Reddit r/MachineLearning / 3/18/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • ColQwen3.5-4.5B-v3 is being released with 4.5B parameters and is reported to be #1 (avg) on the MTEB ViDoRe leaderboard with a mean score of 75.67 (pending release).
  • The model claims substantially higher efficiency, described as ~half the parameters and memory footprint, and ~13x fewer embedding dimensions than the previous #1.
  • A full evaluation trail is public, including result files for every candidate and a detailed blog post outlining the training methodology and case study.
  • It is officially supported by colpali-engine and vLLM (ROCm + CUDA) for immediate use, and the author is already training a 9B variant.

Happy to share the latest colqwen3.5-4.5B model in the series.

ColQwen3.5-4.5B-v3 is #1 (avg) on the MTEB ViDoRe leaderboard (Pending release) at 75.67 mean, ~half the params, ~13x fewer embedding dims, ~half the memory footprint of the previous #1 model.

Thoughts: V3 edges out v2 on V3 English u@5 (0.6034 vs 0.6023), a marginal gain for substantially more compute. The real win was the V2 benchmark jump and surpassing 8B models on V3. That's where I decided to draw the line between further optimization and accepting the limitations of the model and training data.

The full evaluation trail is public, with result files covering every candidate tried.

Links:

ColQwen3.5-4.5B-v3 is already officially supported by colpali-engine and vLLM (ROCm + CUDA), so you can actually use the thing.

License: Apache 2.0

I'm now training the 9B variant with a much simpler setup and will post once that's done.

submitted by /u/madkimchi
[link] [comments]