Open-Set Vein Biometric Recognition with Deep Metric Learning

arXiv cs.CV / 4/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses the limitation of closed-set vein recognition by evaluating deep metric learning for open-set scenarios where new users can be handled without full retraining.
  • It studies how data scarcity and domain shift affect recognition accuracy under strict open-set constraints and designs an embedding + prototype matching approach with a calibrated similarity threshold for rejecting unknown impostors.
  • Experiments use a strict subject-disjoint protocol across four vein datasets (finger, wrist, and dorsal hand) to test both recognition of enrolled users and discrimination against unseen subjects.
  • On the large-scale MMCBNU 6000 benchmark, the best model (ResNet50-CBAM) reports OSCR 0.9945, AUROC 0.9974, EER 1.57%, and 99.6% Rank-1 accuracy while still strongly rejecting unknowns.
  • Ablation results show that triplet-based objectives with a simple 1-NN classifier provide a strong accuracy–efficiency balance, supporting real-time deployment on commodity hardware.

Abstract

Most state-of-the-art vein recognition methods rely on closed-set classification, which inherently limits their scalability and prevents the adaptive enrollment of new users without complete model retraining. We rigorously evaluate the computational boundaries of Deep Metric Learning (DML) under strict open-set constraints. Unlike standard closed-set approaches, we analyze the impact of data scarcity and domain shift on recognition performance. Our approach learns discriminative L2-normalised embeddings and employs prototype-based matching with a calibrated similarity threshold to effectively distinguish between enrolled users and unseen impostors. We evaluate the framework under a strict subject-disjoint protocol across four diverse datasets covering finger, wrist, and dorsal hand veins (MMCBNU 6000, UTFVP, FYO, and a dorsal hand-vein dataset). On the large-scale MMCBNU 6000 benchmark, our best model (ResNet50-CBAM) achieves an OSCR of 0.9945, AUROC of 0.9974, and EER of 1.57%, maintaining high identification accuracy (99.6% Rank-1) while robustly rejecting unknown subjects. Cross-dataset experiments evaluate the framework's generalisation across different acquisition setups, confirming that while the model handles large-scale data robustly, performance remains sensitive to domain shifts in low-data regimes. Ablation studies demonstrate that triplet-based objectives combined with a simple 1-NN classifier offer an optimal trade-off between accuracy and efficiency, enabling real-time deployment on commodity hardware.