AI Navigate

Scribe Verification in Chinese manuscripts using Siamese, Triplet, and Vision Transformer Neural Networks

arXiv cs.LG / 3/17/2026

📰 NewsModels & Research

Key Points

  • The paper analyzes deep metric learning models, including Siamese, Triplet, and Vision Transformer architectures, for scribe verification in Chinese manuscripts.
  • It uses two datasets—Tsinghua Bamboo Slips and a subset of the Multi-Attribute Chinese Calligraphy Dataset—focusing on calligraphers with many samples.
  • The study implements both convolutional and Transformer-based backbones, including MobileNetV3+ Custom Siamese, for comparison.
  • The MobileNetV3+ Custom Siamese model trained with contrastive loss achieves the best or second-best accuracy and AUC on both datasets.
  • This work advances automatic authorship verification in historical manuscripts and demonstrates the effectiveness of deep metric learning for script analysis.

Abstract

The paper examines deep learning models for scribe verification in Chinese manuscripts. That is, to automatically determine whether two manuscript fragments were written by the same scribe using deep metric learning methods. Two datasets were used: the Tsinghua Bamboo Slips Dataset and a selected subset of the Multi-Attribute Chinese Calligraphy Dataset, focusing on the calligraphers with a large number of samples. Siamese and Triplet neural network architectures are implemented, including convolutional and Transformer-based models. The experimental results show that the MobileNetV3+ Custom Siamese model trained with contrastive loss achieves either the best or the second-best overall accuracy and area under the Receiver Operating Characteristic Curve on both datasets.