| Hi LocalLLaMA, I created a post a few weeks ago, but this time this project has become more reliable and easier to use. This is a manga translator that can also be used to translate any image. It uses a combination of object detection, visual LLM-based OCR, layout analysis, and fine-tuned inpainting models. I believe it is the most performant and easy-to-use pipeline for manga translation. For the LLM part, I have integrated llama.cpp into this application; it supports the Gemma 4 family and the Qwen3.5 family, and also includes uncensored and fine-tuned models. It also supports OpenAPI-compatible API, so you can use LM Studio or OpenRouter, etc. I think the demo video explains the workflow a lot, basiclly you just click a button and it will run the pipeline for you. You can also proofread and edit the result, changing the font, size, color, etc. It's a mini Photoshop editor. For who may have interest on this, it's fully open-source: https://github.com/mayocream/koharu [link] [comments] |
Local manga translator with LLM build-in, written in Rust with llama.cpp integration
Reddit r/LocalLLaMA / 4/22/2026
📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research
Key Points
- A new open-source manga translator named Koharu converts manga (and other images) into translated text using a pipeline that combines object detection, visual LLM-based OCR, layout analysis, and fine-tuned inpainting models.
- The project integrates llama.cpp in a Rust application, with support for Gemma 4 and Qwen3.5 model families, including uncensored and fine-tuned variants.
- It offers an OpenAPI-compatible API so users can connect via tools like LM Studio or OpenRouter.
- The workflow is designed to be mostly one-click, while users can proofread and edit outputs (e.g., changing font, size, and color) like a mini Photoshop editor.
- The author claims the pipeline is both high-performing and easy to use, and provides the code on GitHub for the community.
Related Articles

Black Hat USA
AI Business
Autoencoders and Representation Learning in Vision
Dev.to
Every AI finance app wants your data. I didn’t trust that — so I built my own. Offline.
Dev.to
Control Claude with Just a URL. The Chrome Extension "Send to Claude" Is Incredibly Useful
Dev.to
Google Stitch 2.0: Senior-Level UI in Seconds, But Editing Still Breaks
Dev.to