Abstract
In the digital era, digital signatures are crucial for online processes and transactions. Their combination of security and convenience has spurred a range of verification methods, including machine learning and deep learning models. Vision Transformers (ViT) have recently gained traction for image tasks because of their effectiveness. This study presents a signature verification method that combines a custom ResNet-50 Convolutional Neural Network (CNN) with a Transformer. The combined model leverages ResNet-50’s image feature extraction capabilities and the Transformer’s attention mechanism. We tuned the learning rate to improve the model's performance. Our dataset comprised 534 personal signatures collected from students, split into 128 for training and 406 for testing. We used machine learning for feature extraction and classification, and assessed genuineness with cosine similarity. We evaluated several CNNs, ViT, and our combined model. Paired with XGBoost, our model achieved 97.7% precision, 98.45% recall, and a 97.5% F1 score. It also performed well on signature verification metrics, with a False Non-Match Rate (FNMR) of 0.090, a False Match Rate (FMR) of 0.059, and an Equal Error Rate (EER) of 0.075. The method shows promise for advancing signature verification and suggests directions for future research.
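As a rough illustration of the evaluation terms used in the abstract, the sketch below computes cosine similarity between two feature vectors and derives FNMR, FMR, and an approximate EER from lists of similarity scores. It is not the authors' implementation: the function names, the decision rule (accept when the score is at or above a threshold), and the EER approximation (the mean of FNMR and FMR at the threshold where they are closest) are assumptions made for this example, and the scores in the usage lines are invented.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def error_rates(genuine_scores, impostor_scores, threshold):
    """Return (FNMR, FMR) for an assumed 'accept if score >= threshold' rule."""
    # FNMR: fraction of genuine comparisons wrongly rejected.
    fnmr = sum(s < threshold for s in genuine_scores) / len(genuine_scores)
    # FMR: fraction of impostor (forged) comparisons wrongly accepted.
    fmr = sum(s >= threshold for s in impostor_scores) / len(impostor_scores)
    return fnmr, fmr


def equal_error_rate(genuine_scores, impostor_scores):
    """Approximate the EER by scanning every observed score as a threshold.

    Returns (eer, threshold), where eer is the mean of FNMR and FMR at the
    threshold that makes the two rates closest to each other.
    """
    best = None
    for t in sorted(set(genuine_scores) | set(impostor_scores)):
        fnmr, fmr = error_rates(genuine_scores, impostor_scores, t)
        gap = abs(fnmr - fmr)
        if best is None or gap < best[0]:
            best = (gap, (fnmr + fmr) / 2, t)
    return best[1], best[2]


# Hypothetical similarity scores, invented for illustration only.
genuine = [0.9, 0.8, 0.4]
impostor = [0.5, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine, impostor)
print(f"EER ~ {eer:.3f} at threshold {threshold}")
```

In practice the two vectors passed to `cosine_similarity` would be embeddings produced by the feature extractor for a reference signature and a questioned signature; a lower EER indicates better separation between genuine and forged comparisons.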
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Do Thanh, T., Nguyen, C.T., Phung, N.H., Minh, N.H., Nguyen, VH. (2023). ViT-SigNet: Combining Deep CNN and Vision Transformer for Enhanced Signature Verification. In: Nghia, P.T., Thai, V.D., Thuy, N.T., Son, L.H., Huynh, VN. (eds) Advances in Information and Communication Technology. ICTA 2023. Lecture Notes in Networks and Systems, vol 847. Springer, Cham. https://doi.org/10.1007/978-3-031-49529-8_23
DOI: https://doi.org/10.1007/978-3-031-49529-8_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-49528-1
Online ISBN: 978-3-031-49529-8
eBook Packages: Intelligent Technologies and Robotics (R0)