Skip to main content

Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction

  • Conference paper
  • First Online:
Congress on Intelligent Systems (CIS 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1335))

Included in the following conference series:

Abstract

With the generation of enormous data day by day, the need of feature reduction has tremendously increased in the field of text classification. In this direction, this paper presents two text classification systems, called concept-based mining model using threshold (CMMT) and fuzzy similarity-based concept mining model using feature clustering (FSCMM-FC). Both systems aim to classify the English text documents into pre-defined mutually exclusive categories. These systems preprocess the documents at the sentence, document, and integrated corpora levels; apply feature extraction and reduction; train the classifier; and finally, classify the documents using support vector machine. CMMT cuts off the less frequent features by applying threshold on the extracted features, whereas FSCMM-FC reduces the features by finding the feature points using fuzzy C-means. The experimental results obtained 95.8% and 94.695% feature reduction in CMMT and FSCMM-FC, respectively, and also the 85.41% and 93.43% classification accuracy in CMMT and FSCMM-FC, respectively. Therefore, these results state that FSCMM-FC outperformed CMMT greatly with effective memory usage and efficient classification accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Puri, S.: A fuzzy similarity based concept mining model for text classification. Int. J. Adv. Comput. Sci. Appl. 2, 115–121 (2011)

    Google Scholar 

  2. Puri, S., Kaushik, S.: An enhanced fuzzy similarity based concept mining model for text classification using feature clustering. In: IEEE Students’ Conference on Engineering and Systems, pp. 1–6. IEEE Press (2012)

    Google Scholar 

  3. Puri, S., Kaushik, S.: A technical study and analysis on fuzzy similarity based models for text classification. Int. J. Data Mining Knowl. Manage. Process 2(2), 1–15 (2012)

    Article  Google Scholar 

  4. Chen, Y., Han, B., Hou, P.: New feature selection methods based on context similarity for text categorization. In: 11th International Conference on Fuzzy Systems and Knowledge Discovery, pp. 598–604. IEEE Press (2014)

    Google Scholar 

  5. Sheeba, J.I., Vivekanandan, K.: A fuzzy logic based improved keyword extraction from meeting transcripts. Int. J. Comput. Sci. Eng. 6, 287–299 (2014)

    Google Scholar 

  6. Modarresi, K.: Unsupervised feature extraction using singular value decomposition. Proc. Comput. Sci. 51, 2417–2425 (2015)

    Article  Google Scholar 

  7. Maji, P., Garai, P.: IT2 Fuzzy-rough sets and max relevance-max significance criterion for attribute selection. IEEE Trans. Cybern. 45(8), 1657–1668 (2015)

    Article  Google Scholar 

  8. Arumugam, P., Jose, P.: Recent advances on kernel fuzzy support vector machine model for supervised learning. In: International Conference on Circuits, Power and Computing Technologies, pp. 1–5. IEEE Press (2015)

    Google Scholar 

  9. Almasi, O.N., Rouhani, M.: Fast and de-noise support vector machine training method based on fuzzy clustering method for large real world datasets. Turkish J. Electr. Eng. Comput. Sci. 24, 219–233 (2016)

    Article  Google Scholar 

  10. Bano, S., Bandhekar, S.: Fuzzy clustering based data reduction for improvement in classification. Int. J. Innov. Res. Comput. Commun. Eng. 4(5), 1111–1118 (2016)

    Google Scholar 

  11. Zobeidi, S., Naderan, M., Alavi, S. E.: Effective text classification using multi-level fuzzy neural network. In: 5th Iranian Joint Congress on Fuzzy and Intelligent Systems, pp. 91–96. IEEE Press (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Puri, S. (2021). Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction. In: Sharma, H., Saraswat, M., Yadav, A., Kim, J.H., Bansal, J.C. (eds) Congress on Intelligent Systems. CIS 2020. Advances in Intelligent Systems and Computing, vol 1335. Springer, Singapore. https://doi.org/10.1007/978-981-33-6984-9_28

Download citation

Publish with us

Policies and ethics