Abstract
Text mining saves the necessity to sift through vast amount of documents manually to find relevant information. This paper focuses on text categorization, one of the tasks under text mining. This paper introduces fuzzy grammar as a technique for building text classifier and investigates the performance of fuzzy grammar against other machine learning methods such as decision table, support vector machine, statistic, nearest neighbor and boosting. Incidents dataset was used where the focus was given on classifying the incidents events. Results have shown that fuzzy grammar has gotten promising results among the other benchmark machine learning methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Schapire, R.E., Singer, Y.: BoosTexter: A Boosting-based System for Text Categorization. In: Machine Learning, pp. 135–168 (2000)
Sebastiani, F., Sperduti, A., Valdambrini, N.: An Improved Boosting Algorithm and its Application to Text Categorization. Informatica (2000)
Ying-Wei, L., Zheng-Tao, Y., Xiang-Yan, M., Wen-Gang, C., Cun-Li, M.: Question Classification Based on Incremental Modified Bayes. In: 2008 Second International Conference on Future Generation Communication and Networking, vol. 2, pp. 149–152 (December 2008)
Denoyer, L.: Bayesian network model for semi-structured document classification. Information Processing & Management 40(5), 807–827 (2004)
Baoli, L., Shiwen, Y., Qin, L.: An Improved k-Nearest Neighbor Algorithm. In: International Conference on Computer Processing of Oriental Languages (2003)
Han, E.-H., Karypis, G., Kumar, V.: Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD 2001. LNCS (LNAI), vol. 2035, pp. 53–65. Springer, Heidelberg (2001)
Apte, C., Damerau, F., Weiss, S.: Text Mining with Decision Rules and Decision Trees. In: Conference on Automated Learning and Discovery (June 1998)
Johnson, D.E., Oles, F.J., Zhang, T., Goetz, T.: A decision-tree-based symbolic rule induction system for text categorization. IBM Systems Journal 41(3), 428–437 (2002)
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. Machine Learning, 2–7
Sharef, N.M., Shen, Y.: Text Fragment Extraction using Incremental Evolving Fuzzy Grammar Fragments Learner. In: World Congress on Computational Intelligence, pp. 18–23 (2010)
Sharef, N.M.: Location Recognition with Fuzzy Grammar. In: Proceedings of the 3rd Semantic Technology and Knowledge Engineering Conference, pp. 75–83 (2011)
Sharef, N.M., Martin, T., Shen, Y.: Order Independent Incremental Evolving Fuzzy Grammar Fragment Learner. In: Ninth International Conference on Intelligent Systems Design and Applications, pp. 1221–1226 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mohd Sharef, N., Kasmiran, K.A. (2012). Examining Text Categorization Methods for Incidents Analysis. In: Chau, M., Wang, G.A., Yue, W.T., Chen, H. (eds) Intelligence and Security Informatics. PAISI 2012. Lecture Notes in Computer Science, vol 7299. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30428-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-30428-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30427-9
Online ISBN: 978-3-642-30428-6
eBook Packages: Computer ScienceComputer Science (R0)