AI Navigate

Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments

arXiv cs.CL / 3/12/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • The Multilingual Reasoning Gym extends Reasoning Gym to procedurally generate verifiable reasoning problems across 14 languages.
  • It translates 94 task templates with native-speaker validation in 10 languages and adapts code and templates to ensure linguistic naturalness.
  • It preserves core benefits such as virtually unlimited problem instances, adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation.
  • Problems are parallel across languages, enabling massive-scale cross-language data generation due to the procedural nature, and the authors release the implementation to support multilingual reasoning research.

Abstract

We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025), that procedurally generates verifiable reasoning problems across 14 languages. We translate templates for 94 tasks with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the procedural generation approach used in the original Reasoning Gym, such as virtually unlimited problem instance generation and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation settings. Problems in the Multilingual Reasoning Gym are parallel across languages, enabling crosslingually parallel data generation at massive scale due to the procedural nature of the environments. We release our implementation to support research into multilingual reasoning models.