Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction

Puri, Shalini

doi:10.1007/978-981-33-6984-9_28

Shalini Puri ORCID: orcid.org/0000-0002-1700-1865

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1335)

Included in the following conference series:

Congress on Intelligent Systems

Abstract

With the volume of data growing daily, the need for feature reduction in text classification has increased sharply. This paper presents two text classification systems: the concept-based mining model using threshold (CMMT) and the fuzzy similarity-based concept mining model using feature clustering (FSCMM-FC). Both systems classify English text documents into predefined, mutually exclusive categories. They preprocess the documents at the sentence, document, and integrated corpora levels; apply feature extraction and reduction; train the classifier; and finally classify the documents using a support vector machine. CMMT cuts off the less frequent features by applying a threshold to the extracted features, whereas FSCMM-FC reduces the features by finding feature points using fuzzy C-means. In the experiments, CMMT and FSCMM-FC achieved feature reductions of 95.8% and 94.695% and classification accuracies of 85.41% and 93.43%, respectively. These results indicate that FSCMM-FC substantially outperformed CMMT, combining effective memory usage with higher classification accuracy.
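The threshold-based cut-off attributed to CMMT can be illustrated with a minimal sketch. It is not the chapter's implementation: the whitespace tokenization, the threshold value of 2, the toy corpus, and the function names `term_frequencies`, `reduce_by_threshold`, and `reduction_percent` are all assumptions made for illustration. It only shows how dropping infrequent features yields a feature-reduction percentage of the kind reported in the abstract.

```python
from collections import Counter


def term_frequencies(documents):
    """Count each lowercase, whitespace-separated token across the corpus.

    Assumption: plain whitespace tokenization stands in for the
    sentence/document/corpus-level preprocessing the abstract mentions.
    """
    counts = Counter()
    for doc in documents:
        counts.update(doc.lower().split())
    return counts


def reduce_by_threshold(counts, min_count):
    """Keep only features whose corpus frequency is at least min_count."""
    return {term for term, n in counts.items() if n >= min_count}


def reduction_percent(n_original, n_kept):
    """Percentage of the original features that were removed."""
    return 100.0 * (n_original - n_kept) / n_original


# Hypothetical toy corpus; the chapter's dataset is not reproduced here.
docs = [
    "fuzzy clustering reduces text features",
    "support vector machine classifies text documents",
    "fuzzy similarity groups text features",
]

counts = term_frequencies(docs)                  # 12 distinct features
kept = reduce_by_threshold(counts, min_count=2)  # threshold value is an assumption
print(sorted(kept))                              # ['features', 'fuzzy', 'text']
print(reduction_percent(len(counts), len(kept))) # 75.0
```

In a full pipeline the retained features would define the document vectors passed to the classifier; that step, and the fuzzy C-means variant used by FSCMM-FC, are outside this sketch.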

Author information

Authors and Affiliations

Poornima College of Engineering, Jaipur, Rajasthan, India
Shalini Puri

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Rajasthan Technical University, Kota, Rajasthan, India
Harish Sharma
Department of Computer Science and Engineering, Jaypee Institute of Information Technology, Noida, Uttar Pradesh, India
Mukesh Saraswat
National Institute of Technology, Jalandhar, Punjab, India
Anupam Yadav
Korea University, Seoul, Korea (Republic of)
Joong Hoon Kim
South Asian University, New Delhi, Delhi, India
Jagdish Chand Bansal

About this paper

Cite this paper

Puri, S. (2021). Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction. In: Sharma, H., Saraswat, M., Yadav, A., Kim, J.H., Bansal, J.C. (eds) Congress on Intelligent Systems. CIS 2020. Advances in Intelligent Systems and Computing, vol 1335. Springer, Singapore. https://doi.org/10.1007/978-981-33-6984-9_28

DOI: https://doi.org/10.1007/978-981-33-6984-9_28
Published: 02 June 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6983-2
Online ISBN: 978-981-33-6984-9
eBook Packages: Intelligent Technologies and Robotics (R0)
