MUNIChus: Multilingual News Image Captioning Benchmark
arXiv cs.CL / 3/12/2026
📰 NewsModels & Research
Key Points
- MUNIChus is introduced as the first multilingual benchmark for news image captioning, spanning 9 languages including Sinhala and Urdu.
- The dataset addresses the shortage of multilingual resources in this field and enables cross-lingual evaluation.
- The benchmark evaluates several state-of-the-art neural models and confirms that multilingual news image captioning remains challenging.
- The authors publicly release MUNIChus with benchmarking results for over 20 models, facilitating further research and benchmarking.
- This release opens new avenues for advancing multilingual news image captioning research and its evaluation.
Related Articles

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.
Reddit r/LocalLLaMA
QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!
Reddit r/LocalLLaMA
acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan
Reddit r/LocalLLaMA

**Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding**
Hugging Face Blog

Newest GPU server in the lab! 72gb ampere vram!
Reddit r/LocalLLaMA