Skip to main content

Examining Text Categorization Methods for Incidents Analysis

  • Conference paper
Intelligence and Security Informatics (PAISI 2012)

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 7299))

Included in the following conference series:

Abstract

Text mining saves the necessity to sift through vast amount of documents manually to find relevant information. This paper focuses on text categorization, one of the tasks under text mining. This paper introduces fuzzy grammar as a technique for building text classifier and investigates the performance of fuzzy grammar against other machine learning methods such as decision table, support vector machine, statistic, nearest neighbor and boosting. Incidents dataset was used where the focus was given on classifying the incidents events. Results have shown that fuzzy grammar has gotten promising results among the other benchmark machine learning methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Schapire, R.E., Singer, Y.: BoosTexter: A Boosting-based System for Text Categorization. In: Machine Learning, pp. 135–168 (2000)

    Google Scholar 

  2. Sebastiani, F., Sperduti, A., Valdambrini, N.: An Improved Boosting Algorithm and its Application to Text Categorization. Informatica (2000)

    Google Scholar 

  3. Ying-Wei, L., Zheng-Tao, Y., Xiang-Yan, M., Wen-Gang, C., Cun-Li, M.: Question Classification Based on Incremental Modified Bayes. In: 2008 Second International Conference on Future Generation Communication and Networking, vol. 2, pp. 149–152 (December 2008)

    Google Scholar 

  4. Denoyer, L.: Bayesian network model for semi-structured document classification. Information Processing & Management 40(5), 807–827 (2004)

    Article  Google Scholar 

  5. Baoli, L., Shiwen, Y., Qin, L.: An Improved k-Nearest Neighbor Algorithm. In: International Conference on Computer Processing of Oriental Languages (2003)

    Google Scholar 

  6. Han, E.-H., Karypis, G., Kumar, V.: Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD 2001. LNCS (LNAI), vol. 2035, pp. 53–65. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  7. Apte, C., Damerau, F., Weiss, S.: Text Mining with Decision Rules and Decision Trees. In: Conference on Automated Learning and Discovery (June 1998)

    Google Scholar 

  8. Johnson, D.E., Oles, F.J., Zhang, T., Goetz, T.: A decision-tree-based symbolic rule induction system for text categorization. IBM Systems Journal 41(3), 428–437 (2002)

    Article  Google Scholar 

  9. Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. Machine Learning, 2–7

    Google Scholar 

  10. Sharef, N.M., Shen, Y.: Text Fragment Extraction using Incremental Evolving Fuzzy Grammar Fragments Learner. In: World Congress on Computational Intelligence, pp. 18–23 (2010)

    Google Scholar 

  11. Sharef, N.M.: Location Recognition with Fuzzy Grammar. In: Proceedings of the 3rd Semantic Technology and Knowledge Engineering Conference, pp. 75–83 (2011)

    Google Scholar 

  12. Sharef, N.M., Martin, T., Shen, Y.: Order Independent Incremental Evolving Fuzzy Grammar Fragment Learner. In: Ninth International Conference on Intelligent Systems Design and Applications, pp. 1221–1226 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mohd Sharef, N., Kasmiran, K.A. (2012). Examining Text Categorization Methods for Incidents Analysis. In: Chau, M., Wang, G.A., Yue, W.T., Chen, H. (eds) Intelligence and Security Informatics. PAISI 2012. Lecture Notes in Computer Science, vol 7299. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30428-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-30428-6_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-30427-9

  • Online ISBN: 978-3-642-30428-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics