Abstract
Chinese calligraphy style classification plays a significant role in Chinese calligraphy study. It is a fine-grained classification problem since the difference among different styles is extremely subtle. We propose a novel convolutional neural network equipped with the cross-layer interaction module to address the issue of Chinese calligraphy style classification in this paper. In our proposed network, a multi-scale attention mechanism is first presented, with which the input image can be characterized at multiple levels. Then we model the interaction between any two layers in the network using Hadamard product. In addition, for each input image, we generate its profile image, which is fed to the network together with the input image. In order to evaluate the effectiveness of the proposed network, we conduct extensive experiments on two datasets. The results show that modeling cross-layer interaction is beneficial for the fine-grained Chinese calligraphy style classification task. The multi-scale attention mechanism can highlight the informative part of the image at multiple scales, which can boost the classification performance. Since the profile image can give clues about the stroke compactness of the characters, it is useful in capturing the subtle difference among different styles. The proposed network achieves the accuracies of \(98.62\%\) and \(95.92\%\) on the two datasets respectively, which compares favorably with state-of-the-art methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang, J., Guo, M., Fan, J.: A novel CNN structure for fine-grained classification of Chinese calligraphy styles. Int. J. Doc. Anal. Recogn. 22(2), 177–88 (2019)
Peng, Y., He, X., Zhao, J.: Object-part attention model for fine-grained image classification. IEEE Trans. Image Process. 27(3), 1487–1500 (2017)
He, M., Cheng, Q., Qi, G.: Weakly supervised semantic and attentive data mixing augmentation for fine-grained visual categorization. IEEE Access 10, 35814–35823 (2022)
Guang, J., Liang, J.: CMSEA: compound model scaling with efficient attention for fine-grained image classification. IEEE Access 10, 18222–18232 (2022)
Melnyk, P., You, Z., Li, K.: A high-performance CNN method for offline handwritten Chinese character recognition and visualization. Soft. Comput. 24(11), 7977–7987 (2020)
Zhang, X., Nagy, G.: Style comparisons in calligraphy. In: Document Recognition and Retrieval XIX, pp. 177–186 (2012)
Zhang, Y., Liu, Y., He, J., Zhang, J.: Recognition of calligraphy style based on global feature descriptor. In: Proceedings of the International Conference on Multimedia and Expo (ICME), pp. 1–6 (2013)
Dai, F., Tang, C., Lv, J.: Classification of calligraphy style based on convolutional neural network. In: Proceedings of the International Conference on Neural Information Processing (ICONIP), pp. 359–370 (2018)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2018)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Zhang, J., Yu, W., Wang, Z., Li, J., Pan, Z.: Attention-enhanced CNN for Chinese calligraphy styles classification. In: Proceedings of the International Conference on Virtual Reality (ICVR), pp. 352–358 (2021)
Liu, L., et al.: Multi-loss Siamese convolutional neural network for Chinese calligraphy style classification. In: Proceedings of the International Conference on Neural Information Processing (ICONIP), pp. 425–432 (2021)
Li, X., Wang, J., Zhang, H., Huang, Y., Huang, H.: SwordNet: Chinese character font style recognition network. IEEE Access 10, 8388–8398 (2022)
Chen, J., Mu, S., Xu, S., Ding, Y.: HENet: forcing a network to think more for font recognition. In: Proceedings of the International Conference on Advanced Information Science and System (AISS), pp. 1–5 (2021)
Yu, C., Zhao, X., Zheng, Q., Zhang, P., You, X.: Hierarchical bilinear pooling for fine-grained visual recognition. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11220, pp. 595–610. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01270-0_35
Tan, M., Wang, G., Zhou, J., Peng, Z., Zheng, M.: Fine-grained classification via hierarchical bilinear pooling with aggregated slack mask. IEEE Access 7, 117944–117953 (2019)
Gao, Y., Han, X., Wang, X., Huang, W., Scott, M.: Channel interaction networks for fine-grained image categorization. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 10818–10825 (2020)
Ruan, X., Lin, G., Long, C., Lu, S.: Few-shot fine-grained classification with spatial attentive comparison. Knowl.-Based Syst. 218, 106840 (2021)
Chen, J., Hu, J., Li, S.: Learning to locate for fine-grained image recognition. Comput. Vis. Image Underst. 206, 103184 (2021)
He, G., Li, F., Wang, Q., Bai, Z., Xu, Y.: A hierarchical sampling based triplet network for fine-grained image classification. Pattern Recogn. 115, 107889 (2021)
Wang, L., He, K., Feng, X., Ma, X.: Multilayer feature fusion with parallel convolutional block for fine-grained image classification. Appl. Intell. 52(3), 2872–2883 (2022)
Li, X., Wang, W., Hu, X., Yang, J.: Selective kernel networks. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 510–519 (2019)
Howard, A., et al.: Searching for MobileNetV3. In: Proceedings of the International Conference on Computer Vision (ICCV), pp. 1314–1324 (2019)
Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11531–11539 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Li, Z., Liu, L., Qiu, T., Lu, Y., Suen, C.Y. (2023). Modeling Cross-layer Interaction for Chinese Calligraphy Style Classification. In: Fink, G.A., Jain, R., Kise, K., Zanibbi, R. (eds) Document Analysis and Recognition - ICDAR 2023. ICDAR 2023. Lecture Notes in Computer Science, vol 14190. Springer, Cham. https://doi.org/10.1007/978-3-031-41685-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-41685-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-41684-2
Online ISBN: 978-3-031-41685-9
eBook Packages: Computer ScienceComputer Science (R0)