WorkRB: A Community-Driven Evaluation Framework for AI in the Work Domain
arXiv cs.AI / 4/16/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- WorkRB is introduced as an open-source, community-driven benchmark aimed specifically at evaluating AI systems in the work/labor domain, where research has been fragmented and hard to compare.
- The framework unifies 13 diverse work-related tasks across 7 task groups into standardized recommendation and NLP task formats, including job/skill and candidate recommendation as well as skill extraction and normalization.
- WorkRB supports both monolingual and cross-lingual evaluation by dynamically loading multilingual ontologies, helping address the mismatch caused by using different labor taxonomies across studies.
- It is designed to improve reproducibility while mitigating employment-data sensitivity, with a modular architecture that allows integration of proprietary tasks without exposing sensitive datasets.
- WorkRB is released under the Apache 2.0 license and is made available via a public GitHub repository, enabling ongoing community contributions.




