Abstract
One of the most popular approaches to solve hierarchical multi-label classification problem is to induce Support Vector Machine (SVM) for each class in the hierarchy independently and employ them in a top-down fashion. This approach always suffers from error propagation and yields such a poor performance of classifiers at the lower levels since no label correlation is considered during the construction. In this paper, we present a novel method called “label correction”, which takes label correlation into consideration and corrects the results of unusual prediction patterns. In the experiment, our method does not only improve prediction accuracy on data in hierarchical domains, but it also contributes such a significant impact on data in multi-label domains.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Boser, B.E., Guyon, I., Vapnik, V.: A Training Algorithm for Optimal Margin Classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)
Cortes, C., Vapnik, V.: Support-Vector Network. Machine Learning 20, 273–297 (1995)
Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining Multi-Label Data. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., pp. 667–686. Springer (2010a)
Brinker, K., Fürnkranz, J., Hüllermeier, E.: A Unified Model for Multilabel Classification and Ranking. In: Proceeding of the 17th European Conference on Artificial Intelligence, pp. 489–493 (2006)
Boutell, M.R., et al.: Learning Multi-Label Scene Classification. Pattern Recognition 37(9), 1757–1771 (2004)
Katakis, I., Tsoumakas, G., Vlahavas, I.: Multilabel Text Classification for Automated Tag Suggestion. In: Proceedings of the ECML/PKDD 2008 Discovery Challenge, Antwerp, Belgium (2008)
Elisseeff, A., Weston, J.: A Kernel Method for Multi-Labelled Classification. Advances in Neural Information Processing Systems 14, 681–687 (2001)
Briggs, F., et al.: New Methods for Acoustic Classification of Multiple Simultaneous Bird Species in a Noisy Environment. In: IEEE International Workshop on Machine Learning for Signal Processing, pp. 1–8 (2013)
Snoek, C.G.M., et al.: The Challenge Problem for Automated Detection of 101 Semantic Concepts in Multimedia. In: Proceedings of the ACM International Conference on Multimedia, pp. 421–430 (2006)
Barutcuoglu, Z., et al.: Hierarchical Multi-Label Prediction of Gene Function. Bioinformatics 22, 830–836 (2006)
Dimitrovski, I., et al.: Hierarchical Annotation of Medical Images. Pattern Recognition 44, 2436–2449 (2011)
Klimt, B., Yang, Y.: The Enron Corpus: A New Dataset for Email Classification Research. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 217–226. Springer, Heidelberg (2004)
Zhang, M., Zhou, Z.: ML-KNN: A Lazy Learning Approach to Multi-Label Learning. Pattern Recognition 40(7), 2038–2048 (2007)
Van Rijsbergen, C.J.: Information Retrieval, 2nd edn. (1979)
Yiming, Y.: An Evaluation of Statistical Approaches to Text Categorization. Information Retrieval 1, 69–90 (1999)
Kiritchenko, S., Matwin, S., Nock, R., Famili, A.F.: Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization. In: Lamontagne, L., Marchand, M. (eds.) Canadian AI 2006. LNCS (LNAI), vol. 4013, pp. 395–406. Springer, Heidelberg (2006)
Vateekul, P., Kubat, M., Sarinnapakorn, K.: Top-Down Optimized SVMs for Hierarchical Multi-Label Classification: A Case Study in Gene Function Prediction. In: Intelligent Data Analysis (in press)
Mulan Multi-Label Dataset, http://mulan.sourceforge.net/datasets.html
Schietgat, L., et al.: Predicting Gene Function using Hierarchical Multi-Label Decision Tree Ensembles. BMC Bioinformatics (2010)
Dragi, K.: Tree Ensembles for Predicting Structured Outputs. Pattern Recognition 46, 817–833 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ananpiriyakul, T., Poomsirivilai, P., Vateekul, P. (2014). Label Correction Strategy on Hierarchical Multi-Label Classification. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2014. Lecture Notes in Computer Science(), vol 8556. Springer, Cham. https://doi.org/10.1007/978-3-319-08979-9_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-08979-9_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08978-2
Online ISBN: 978-3-319-08979-9
eBook Packages: Computer ScienceComputer Science (R0)