Indirect Question Answering in English, German and Bavarian: A Challenging Task for High- and Low-Resource Languages Alike
arXiv cs.CL / 3/17/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper presents two multilingual IQA corpora, InQA+ and GenIQA, covering English, Standard German, and Bavarian, with InQA+ being hand-annotated and GenIQA generated via GPT-4o-mini.
- It shows IQA is pragmatically hard, with low performance even in English and signs of severe overfitting, indicating that data quality and size are critical.
- Experiments with multilingual transformers (mBERT, XLM-R, mDeBERTa) reveal that label ambiguity, label-set choices, and dataset size strongly influence results.
- The authors offer recommendations to address these challenges and highlight that larger training data improves IQA performance, while GPT-4o-mini data may not yield high-quality IQA data.
Related Articles

Attacks On Data Centers, Qwen3.5 In All Sizes, DeepSeek’s Huawei Play, Apple’s Multimodal Tokenizer
The Batch

Your AI generated code is "almost right", and that is actually WORSE than it being "wrong".
Dev.to

Lessons from Academic Plagiarism Tools for SaaS Product Development
Dev.to

**Core Allocation Optimization for Energy‑Efficient Multi‑Core Scheduling in ARINC650 Systems**
Dev.to

KI in der amtlichen Recherche beim DPMA: Was Patentanwälte bei Neuanmeldungen jetzt beachten sollten (Stand: März 2026)
Dev.to