CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language
arXiv cs.AI / 4/27/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CNSL-bench, the first comprehensive benchmark for evaluating multimodal large language models’ (MLLMs) understanding of Chinese National Sign Language (CNSL).
- CNSL-bench is grounded in the officially standardized National Common Sign Language Dictionary to reduce ambiguity from regional or non-canonical sign variants.
- It covers multiple modalities—aligned text descriptions, images, and sign language videos—and includes articulatory diversity such as air-writing, finger-spelling, and the Chinese manual alphabet.
- The authors benchmark 21 up-to-date open-source and proprietary MLLMs and find they are still far behind human performance, with systematic gaps varying by modality and manual articulatory form.
- Diagnostic analyses indicate that key performance limitations remain even as reasoning improves, and instruction-following robustness differs significantly across models.
Related Articles

Subagents: The Building Block of Agentic AI
Dev.to

DeepSeek-V4 Models Could Change Global AI Race
AI Business

Got OpenAI's privacy filter model running on-device via ExecuTorch
Reddit r/LocalLLaMA

The Agent-Skill Illusion: Why Prompt-Based Control Fails in Multi-Agent Business Consulting Systems
Dev.to

We Built a Voice AI Receptionist in 8 Weeks — Every Decision We Made and Why
Dev.to