SwissGov-RSD: A Human-annotated, Cross-lingual Benchmark for Token-level Recognition of Semantic Differences Between Related Documents
arXiv cs.CL / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- SwissGov-RSD is introduced as a naturalistic, document-level cross-lingual benchmark for token-level recognition of semantic differences across related documents.
- It covers 224 multi-parallel English-German, English-French, and English-Italian documents with human-annotated token-level difference labels, enabling cross-language evaluation.
- The work evaluates a range of open-source and closed-source LLMs and encoder models under various fine-tuning settings, revealing substantial gaps relative to monolingual or synthetic benchmarks.
- The authors release code and datasets publicly to support replication and further research.
Related Articles

The programming passion is melting
Dev.to

Maximize Developer Revenue with Monetzly's Innovative API for AI Conversations
Dev.to
Co-Activation Pattern Detection for Prompt Injection: A Mechanistic Interpretability Approach Using Sparse Autoencoders
Reddit r/LocalLLaMA

How to Train Custom Language Models: Fine-Tuning vs Training From Scratch (2026)
Dev.to

KoboldCpp 1.110 - 3 YR Anniversary Edition, native music gen, qwen3tts voice cloning and more
Reddit r/LocalLLaMA