International Conference on Artificial Intelligence and Soft Computing

ICAISC 2016: Artificial Intelligence and Soft Computing pp 667-680

Comparison of SVM and Ontology-Based Text Classification Methods

  • Krzysztof Wróbel
  • Maciej Wielgosz
  • Aleksander Smywiński-Pohl
  • Marcin Pietron
Conference paper

DOI: 10.1007/978-3-319-39378-0_57

Volume 9692 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Wróbel K., Wielgosz M., Smywiński-Pohl A., Pietron M. (2016) Comparison of SVM and Ontology-Based Text Classification Methods. In: Rutkowski L., Korytkowski M., Scherer R., Tadeusiewicz R., Zadeh L., Zurada J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2016. Lecture Notes in Computer Science, vol 9692. Springer, Cham

Abstract

This work addresses the challenging task of text categorization. The main goal is the comparison of two different approaches, i.e. Vector Space Model and ontology-based solutions. The authors compare and contrast them with respect to accuracy and processing flow, which affect the classification results. The ontology-based method outperforms its counter-part when it comes to category resolution, i.e. the number of categories which can be processed. On the other hand, the SVM-based module is much faster and performs well when trained on an appropriately-structured learning set. The authors performed a series of tests to compare the methods and, as expected, the ontology-based solution outperformed the SVM classifier. It reached a micro averaged F1-score of 0.90 with 2.8 million Wikipedia articles, whereas the SVM-based module did not exceed 0.86 with the same data set. The macro averaged F1-score of both solutions was inferior to the micro one and reached values of 0.75 and 0.57, for ontology and SVM-based solutions respectively.

Keywords

Text classificationVector space modelOntology-based methodsSupport vector machineWikipedia

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Krzysztof Wróbel
    • 3
  • Maciej Wielgosz
    • 1
  • Aleksander Smywiński-Pohl
    • 1
    • 3
  • Marcin Pietron
    • 2
  1. 1.AGH University of Science and TechnologyKrakowPoland
  2. 2.ACK Cyfronet AGHKrakowPoland
  3. 3.Jagiellonian UniversityKrakowPoland