Abstract
Different artists have unique painting styles that ordinary people without professional training can hardly recognize. How to analyze such artistic styles intelligently via their underlying features remains a challenging research problem. In this paper, we propose a novel multi-task feature fusion architecture (MTFFNet) for the cognitive classification of traditional Chinese paintings. Specifically, taking full advantage of a pre-trained DenseNet as the backbone, MTFFNet benefits from the fusion of two different types of feature information: semantic features and brush-stroke features. For the first time, these features are learned jointly, in an end-to-end manner, from the RGB images and an auxiliary gray-level co-occurrence matrix (GLCM), which enhances their discriminative power. Extensive experiments demonstrate that MTFFNet achieves significantly better classification performance than many state-of-the-art approaches.
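To make the auxiliary GLCM input concrete, the following is a minimal, generic sketch of how a gray-level co-occurrence matrix is computed for one pixel offset. It is an illustration of the texture statistic the brush-stroke branch consumes, not the authors' implementation; the function name, the number of gray levels, and the toy image are all illustrative.

```python
import numpy as np

def glcm(gray, levels=8, dx=1, dy=0):
    """Gray-level co-occurrence matrix for a single pixel offset (dx, dy).

    gray: 2-D array of integer gray levels in [0, levels).
    Returns a (levels, levels) matrix of normalized co-occurrence
    frequencies: entry (i, j) is how often level i is followed by
    level j at the given offset.
    """
    counts = np.zeros((levels, levels), dtype=np.float64)
    h, w = gray.shape
    for y in range(h - dy):
        for x in range(w - dx):
            counts[gray[y, x], gray[y + dy, x + dx]] += 1
    total = counts.sum()
    return counts / total if total > 0 else counts

# Toy 4x4 "painting patch" quantized to 4 gray levels.
img = np.array([[0, 0, 1, 1],
                [0, 0, 1, 1],
                [0, 2, 2, 2],
                [2, 2, 3, 3]])
m = glcm(img, levels=4)  # rows sum to a probability distribution overall
```

In practice one would aggregate GLCMs over several offsets and orientations before feeding the result to the brush-stroke branch.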
In this paper, an end-to-end multi-task feature fusion method for Chinese painting classification is proposed. The proposed model, MTFFNet, is composed of two branches: one learns top-level RGB (semantic) features and the other learns low-level brush-stroke features. The semantic branch takes the original image of a traditional Chinese painting as input and extracts its color and semantic information, while the brush-stroke branch takes the GLCM feature map as input and extracts texture and edge information. A multi-kernel learning SVM (support vector machine) serves as the final classifier. Experimental evaluation shows that this method improves the accuracy of Chinese painting classification and strengthens generalization. By adopting end-to-end multi-task feature fusion, MTFFNet extracts richer semantic features and texture information from the image. Compared with state-of-the-art Chinese painting classification methods, the proposed method achieves much higher accuracy on the proposed datasets without sacrificing speed or efficiency. It thus provides an effective solution for the cognitive classification of Chinese ink painting, and its accuracy and efficiency have been fully validated.
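The final classifier combines the two branches' features through multiple kernels. The sketch below shows the generic idea behind such kernel fusion: a convex combination of one kernel over semantic features and one over brush-stroke features. This is not the authors' exact multi-kernel learning formulation; the function names, the RBF choice, the weight `w`, and the gamma values are assumptions for illustration.

```python
import numpy as np

def rbf_kernel(A, B, gamma):
    # Pairwise squared Euclidean distances, then the RBF map.
    d2 = (np.sum(A**2, axis=1)[:, None]
          + np.sum(B**2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    return np.exp(-gamma * d2)

def fused_kernel(sem_A, sem_B, tex_A, tex_B,
                 w=0.5, gamma_sem=0.1, gamma_tex=0.5):
    """Convex combination of a semantic-feature kernel and a
    brush-stroke (texture) kernel; in multi-kernel learning the
    weight w would itself be optimized rather than fixed."""
    K_sem = rbf_kernel(sem_A, sem_B, gamma_sem)
    K_tex = rbf_kernel(tex_A, tex_B, gamma_tex)
    return w * K_sem + (1.0 - w) * K_tex

rng = np.random.default_rng(0)
sem = rng.normal(size=(6, 16))  # stand-ins for DenseNet semantic features
tex = rng.normal(size=(6, 8))   # stand-ins for GLCM-branch features
K = fused_kernel(sem, sem, tex, tex)  # 6x6 symmetric Gram matrix
```

A Gram matrix built this way could, for example, be passed to an SVM that accepts precomputed kernels (such as scikit-learn's `SVC(kernel="precomputed")`).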
Funding
This research work was financially supported by the National Natural Science Foundation of China under grant nos. 61772360 and 61876125.
Ethics declarations
Conflict of Interest
The authors declare no competing interests.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Cite this article
Jiang, W., Wang, X., Ren, J. et al. MTFFNet: a Multi-task Feature Fusion Framework for Chinese Painting Classification. Cogn Comput 13, 1287–1296 (2021). https://doi.org/10.1007/s12559-021-09896-9