Skip to main content

Label Correction Strategy on Hierarchical Multi-Label Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8556))

Abstract

One of the most popular approaches to solve hierarchical multi-label classification problem is to induce Support Vector Machine (SVM) for each class in the hierarchy independently and employ them in a top-down fashion. This approach always suffers from error propagation and yields such a poor performance of classifiers at the lower levels since no label correlation is considered during the construction. In this paper, we present a novel method called “label correction”, which takes label correlation into consideration and corrects the results of unusual prediction patterns. In the experiment, our method does not only improve prediction accuracy on data in hierarchical domains, but it also contributes such a significant impact on data in multi-label domains.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Boser, B.E., Guyon, I., Vapnik, V.: A Training Algorithm for Optimal Margin Classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)

    Google Scholar 

  2. Cortes, C., Vapnik, V.: Support-Vector Network. Machine Learning 20, 273–297 (1995)

    MATH  Google Scholar 

  3. Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining Multi-Label Data. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., pp. 667–686. Springer (2010a)

    Google Scholar 

  4. Brinker, K., Fürnkranz, J., Hüllermeier, E.: A Unified Model for Multilabel Classification and Ranking. In: Proceeding of the 17th European Conference on Artificial Intelligence, pp. 489–493 (2006)

    Google Scholar 

  5. Boutell, M.R., et al.: Learning Multi-Label Scene Classification. Pattern Recognition 37(9), 1757–1771 (2004)

    Article  Google Scholar 

  6. Katakis, I., Tsoumakas, G., Vlahavas, I.: Multilabel Text Classification for Automated Tag Suggestion. In: Proceedings of the ECML/PKDD 2008 Discovery Challenge, Antwerp, Belgium (2008)

    Google Scholar 

  7. Elisseeff, A., Weston, J.: A Kernel Method for Multi-Labelled Classification. Advances in Neural Information Processing Systems 14, 681–687 (2001)

    Google Scholar 

  8. Briggs, F., et al.: New Methods for Acoustic Classification of Multiple Simultaneous Bird Species in a Noisy Environment. In: IEEE International Workshop on Machine Learning for Signal Processing, pp. 1–8 (2013)

    Google Scholar 

  9. Snoek, C.G.M., et al.: The Challenge Problem for Automated Detection of 101 Semantic Concepts in Multimedia. In: Proceedings of the ACM International Conference on Multimedia, pp. 421–430 (2006)

    Google Scholar 

  10. Barutcuoglu, Z., et al.: Hierarchical Multi-Label Prediction of Gene Function. Bioinformatics 22, 830–836 (2006)

    Article  Google Scholar 

  11. Dimitrovski, I., et al.: Hierarchical Annotation of Medical Images. Pattern Recognition 44, 2436–2449 (2011)

    Article  Google Scholar 

  12. Klimt, B., Yang, Y.: The Enron Corpus: A New Dataset for Email Classification Research. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 217–226. Springer, Heidelberg (2004)

    Google Scholar 

  13. Zhang, M., Zhou, Z.: ML-KNN: A Lazy Learning Approach to Multi-Label Learning. Pattern Recognition 40(7), 2038–2048 (2007)

    Article  MATH  Google Scholar 

  14. Van Rijsbergen, C.J.: Information Retrieval, 2nd edn. (1979)

    Google Scholar 

  15. Yiming, Y.: An Evaluation of Statistical Approaches to Text Categorization. Information Retrieval 1, 69–90 (1999)

    Article  Google Scholar 

  16. Kiritchenko, S., Matwin, S., Nock, R., Famili, A.F.: Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization. In: Lamontagne, L., Marchand, M. (eds.) Canadian AI 2006. LNCS (LNAI), vol. 4013, pp. 395–406. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  17. Vateekul, P., Kubat, M., Sarinnapakorn, K.: Top-Down Optimized SVMs for Hierarchical Multi-Label Classification: A Case Study in Gene Function Prediction. In: Intelligent Data Analysis (in press)

    Google Scholar 

  18. Mulan Multi-Label Dataset, http://mulan.sourceforge.net/datasets.html

  19. Schietgat, L., et al.: Predicting Gene Function using Hierarchical Multi-Label Decision Tree Ensembles. BMC Bioinformatics (2010)

    Google Scholar 

  20. Dragi, K.: Tree Ensembles for Predicting Structured Outputs. Pattern Recognition 46, 817–833 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Ananpiriyakul, T., Poomsirivilai, P., Vateekul, P. (2014). Label Correction Strategy on Hierarchical Multi-Label Classification. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2014. Lecture Notes in Computer Science(), vol 8556. Springer, Cham. https://doi.org/10.1007/978-3-319-08979-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08979-9_17

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08978-2

  • Online ISBN: 978-3-319-08979-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics