Abstract
A great number of algorithms have been proposed for multi-label learning, and these algorithms usually divide the labels with an optimal threshold according to their relevances to an unseen instance. However, it may easily cause misclassification to directly determine whether an unseen instance has the label with relevance close to the threshold. The label with relevance close to the threshold has a high uncertainty. Three-way decisions theory is an efficient method to solve the uncertainty problem. Therefore, based on three-way decisions theory, a multi-label learning algorithm with label dependency is proposed in this paper. Label dependency is an inherent property in multi-label data. The labels with high uncertainty are further handled with a label dependency model, which is represented by the logistic regression in this paper. The experimental results show that this algorithm performs better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook, pp. 667–685. Springer, New York (2010)
Tsoumakas, G., Katakis, I.: Multi label classification: an overview. Int. J. Data Warehouse. Min. 3(3), 1–13 (2007)
Zhang, M.L., Zhou, Z.H.: A review on multi-label learning algorithms. IEEE Trans. Knowl. Data Eng. 26(8), 1819–1837 (2014)
Yu, Y., Pedrycz, W., Miao, D.Q.: Neighborhood rough sets based multi-label classification for automatic image annotation. Int. J. Approximate Reason. 54(9), 1373–1387 (2013)
Schapire, R.E., Singer, Y.: BoosTexter: a boosting-based system for text categorization. Mach. Learn. 39(2), 135–168 (2000)
Pavlidis, P., Weston, J., Cai, J., Grundy, W.N.: Combining microarray expression data and phylogenetic profiles to learn functional categories using support vector machines. In: Proceedings of the Fifth Annual International Conference on Computational Biology, Montreal, Canada, pp. 242–248 (2001)
Snoek, C.G.M., Worring, M., Van Gemert, J.C., et al.: The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the 14th Annual ACM International Conference on Multimedia, pp. 421–430 (2006)
Yao, Y.: An outline of a theory of three-way decisions. In: Yao, J.T., Yang, Y., Słowiński, R., Greco, S., Li, H., Mitra, S., Polkowski, L. (eds.) RSCTC 2012. LNCS, vol. 7413, pp. 1–17. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32115-3_1
Pawlak, Z.: Rough sets. Int. J. Parallel Prog. 11(5), 341–356 (1982)
Zhang, M.L., Zhang, K.: Multi-label learning by exploiting label dependency. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 999–1008. ACM, New York (2010)
Kang, F., Jin, R., Sukthankar, R.: Correlated label propagation with application to multi-label learning. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1719–1726 (2006)
Yu, Y., Predrycz, W., Miao, D.Q.: Multi-label classification by exploiting label correlations. Expert Syst. Appl. 41(6), 2989–3004 (2014)
Boutell, M.R., Luo, J., Shen, X., et al.: Learning multi-label scene classification. Pattern Recogn. 37(9), 1757–1771 (2004)
Hllermeier, E., Frnkranz, J., Cheng, W., et al.: Label ranking by learning pairwise preferences. Artif. Intell. 172(16), 1897–1916 (2008)
Tsoumakas, G., Vlahavas, I.P.: Random k-labelsets: an ensemble method for multilabel classification. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 406–417. Springer, Heidelberg (2007). doi:10.1007/978-3-540-74958-5_38
Clare, A., King, R.D.: Knowledge discovery in multi-label phenotype data. In: Raedt, L., Siebes, A. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, pp. 42–53. Springer, Heidelberg (2001). doi:10.1007/3-540-44794-6_4
Elisseeff, A., Weston, J.: A kernel method for multi-labelled classification. In: Advances in Neural Information Processing Systems, vol. 14, pp. 681–687 (2001)
Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans. Knowl. Data Eng. 18(10), 1338–1351 (2006)
Yao, Y.Y.: Three-way decisions with probabilistic rough sets. Inf. Sci. 180(3), 341–353 (2010)
Tsoumakas, G., Spyromitros-Xiousfis, E., Vilcek, I.V.J.: Mulan: a Java library for multi-label learning. J. Mach. Learn. Res. 12(7), 2411–2414 (2011)
Pestian, J., Brew, C., Matykiewicz, P., et al.: A shared task involving multi-label classification of clinical free text. In: Proceedings of the Workshop on BioNLp 2007, pp. 97–104. Association for Computational Linguistics, Stroudsburg (2007)
UC Berkeley Enron Email Analysis Project. http://bailando.sims.berkeley.edu/enron_email.html
Turnbull, D., Barrington, L., Torres, D., et al.: Semantic annotation and retrieval of music and sound effects. IEEE Trans. Audio Speech Lang. Process. 16(2), 467–476 (2008)
Zhang, M.L., Zhou, Z.H.: ML-kNN: a lazy learning approach to multi-label learning. Pattern Recogn. 40(7), 2038–2048 (2007)
Acknowledgments
The work is partially supported by the National Natural Science Foundation of China (Nos. 61273304, 61573259), the Specialized Research Fund for the Doctoral Program of Higher Education of China (No. 20130072130004), and the program of Further Accelerating the Development of Chinese Medicine Three Year Action of Shanghai (2014–2016) (No. ZY3-CCCX-3-6002).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Li, F., Miao, D., Zhang, W. (2016). Three-Way Decisions Based Multi-label Learning Algorithm with Label Dependency. In: Flores, V., et al. Rough Sets. IJCRS 2016. Lecture Notes in Computer Science(), vol 9920. Springer, Cham. https://doi.org/10.1007/978-3-319-47160-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-319-47160-0_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47159-4
Online ISBN: 978-3-319-47160-0
eBook Packages: Computer ScienceComputer Science (R0)