IRIS: Interleaved Reinforcement with Incremental Staged Curriculum for Cross-Lingual Mathematical Reasoning
arXiv cs.CL / 4/28/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- The paper introduces IRIS, a two-axis curriculum framework for improving cross-lingual mathematical reasoning by combining progressively harder supervised fine-tuning with reverse-curriculum reinforcement learning to lessen dependence on step-by-step prompts.
- IRIS uses a composite reward that covers answer correctness, step-level alignment, continuity of reasoning, and numeric incentives, optimized with Group Relative Policy Optimization (GRPO).
- The authors release CL-Math, a dataset of 29k multilingual math problems with step-level annotations in English, Hindi, and Marathi, intended to support multilingual reasoning research.
- Experiments across benchmarks and curated multilingual test sets show consistent performance gains, particularly for low-resource and bilingual settings, with smaller improvements for high-resource languages.
Related Articles
LLMs will be a commodity
Reddit r/artificial
HubSpot Just Legitimized AEO: What It Means for Your Brand AI Visibility
Dev.to

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
Reddit r/LocalLLaMA

From Fault Codes to Smart Fixes: How Google Cloud NEXT ’26 Inspired My AI Mechanic Assistant
Dev.to

Dex lands $5.3M to grow its AI-driven talent matching platform
Tech.eu