Aesthetic Assessment of Chinese Handwritings Based on Vision Language Models

arXiv cs.CL / 3/31/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that prior automated assessments of Chinese handwriting that only output a numeric score are less helpful for learners because they provide limited actionable guidance.
  • It proposes using vision-language models (VLMs) to perform aesthetic assessment of handwritten Chinese characters and produce multi-level feedback rather than score-only outputs.
  • Two feedback-generation tasks are explored: simple grade feedback and richer descriptive feedback aimed at being more instructive for improvement.
  • The authors investigate methods to incorporate handwriting aesthetic assessment knowledge into VLMs, including LoRA-based fine-tuning and in-context learning.
  • Experiments report state-of-the-art performance on multiple evaluation tracks from the CCL 2025 workshop on evaluating handwritten Chinese character quality.

Abstract

The handwriting of Chinese characters is a fundamental aspect of learning the Chinese language. Previous automated assessment methods often framed scoring as a regression problem. However, this score-only feedback lacks actionable guidance, which limits its effectiveness in helping learners improve their handwriting skills. In this paper, we leverage vision-language models (VLMs) to analyze the quality of handwritten Chinese characters and generate multi-level feedback. Specifically, we investigate two feedback generation tasks: simple grade feedback (Task 1) and enriched, descriptive feedback (Task 2). We explore both low-rank adaptation (LoRA)-based fine-tuning strategies and in-context learning methods to integrate aesthetic assessment knowledge into VLMs. Experimental results show that our approach achieves state-of-the-art performances across multiple evaluation tracks in the CCL 2025 workshop on evaluation of handwritten Chinese character quality.