Hey everyone,
A while ago I shared my fine-tuned Qwen3.5-2B OCR model. Since then I kept working on the pipeline and just released a new version based on Qwen3.5-0.8B.
This one uses improved training samples and better output formatting, and it’s outperforming my previous 2B release on English archival and document OCR tasks.
It’s trained for markdown-first OCR output with HTML tables, LaTeX for formulas, [image] tags for figures/images, and [chart: ...] extraction for chart content. It also does a better job preserving reading order and more complex layouts.
Model link: loay/English-Document-OCR-Qwen3.5-0.8B
I’m planning to release versions for other languages soon as well, including Arabic and broader RTL document OCR support.
If you test it on messy scans or edge cases, I’d love to hear how it performs.
[link] [comments]


