The benchmarks look really impressive for such small models. Even in general, they hold up well. Gemma 4 31B ranks (among all tested models):
- 3rd on Dutch
- 2nd on Danish
- 3rd on English
- 1st on Finnish
- 2nd on French
- 5th on German
- 2nd on Italian
- 3rd on Swedish

Curious whether real-world experience matches that.
Gemma 4 is a huge improvement in many European languages, including Danish, Dutch, French and Italian
Reddit r/LocalLLaMA / 4/7/2026
💬 Opinion / Signals & Early Trends / Models & Research
Key Points
- Reddit post claims Gemma 4 31B shows strong benchmark performance across multiple European languages, ranking near the top on several leaderboards.
- Reported results include 1st on Finnish and 2nd on Danish, French, and Italian, along with 3rd on Dutch, English, and Swedish and 5th on German.
- The post emphasizes the impressiveness of these results for a relatively small model size while acknowledging uncertainty about how well benchmarks will translate to real-world usage.
- The claims cite the EuroEval leaderboards as the source for the ranking information.
- The underlying takeaway is that Gemma 4 appears to be meaningfully improving multilingual capabilities for European languages, relevant to local or region-specific deployment decisions.
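
To make "ranking near the top" a bit more concrete, here is a minimal Python sketch that summarizes the eight ranks quoted in the post. The variable names are illustrative, and the numbers are only the positions reported in the post, not raw EuroEval scores, so treat this as a rough aggregate rather than an official metric.

```python
from statistics import mean, median

# Ranks as reported in the Reddit post (position among all tested models).
# These are self-reported leaderboard positions, not benchmark scores.
ranks = {
    "Dutch": 3,
    "Danish": 2,
    "English": 3,
    "Finnish": 1,
    "French": 2,
    "German": 5,
    "Italian": 2,
    "Swedish": 3,
}

mean_rank = mean(ranks.values())      # average position across languages
median_rank = median(ranks.values())  # middle position, less sensitive to outliers
top3 = sorted(lang for lang, r in ranks.items() if r <= 3)
weakest = max(ranks, key=ranks.get)   # language with the lowest placement

print(f"Mean rank: {mean_rank}")
print(f"Median rank: {median_rank}")
print(f"Top-3 placements: {len(top3)} of {len(ranks)} languages")
print(f"Weakest placement: {weakest} (rank {ranks[weakest]})")
```

Averaging ranks hides how close the underlying scores are, so a 5th place could still be within noise of a 1st place. The leaderboards themselves remain the better source for that level of detail.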