Abstract
With the generation of enormous data day by day, the need of feature reduction has tremendously increased in the field of text classification. In this direction, this paper presents two text classification systems, called concept-based mining model using threshold (CMMT) and fuzzy similarity-based concept mining model using feature clustering (FSCMM-FC). Both systems aim to classify the English text documents into pre-defined mutually exclusive categories. These systems preprocess the documents at the sentence, document, and integrated corpora levels; apply feature extraction and reduction; train the classifier; and finally, classify the documents using support vector machine. CMMT cuts off the less frequent features by applying threshold on the extracted features, whereas FSCMM-FC reduces the features by finding the feature points using fuzzy C-means. The experimental results obtained 95.8% and 94.695% feature reduction in CMMT and FSCMM-FC, respectively, and also the 85.41% and 93.43% classification accuracy in CMMT and FSCMM-FC, respectively. Therefore, these results state that FSCMM-FC outperformed CMMT greatly with effective memory usage and efficient classification accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Puri, S.: A fuzzy similarity based concept mining model for text classification. Int. J. Adv. Comput. Sci. Appl. 2, 115–121 (2011)
Puri, S., Kaushik, S.: An enhanced fuzzy similarity based concept mining model for text classification using feature clustering. In: IEEE Students’ Conference on Engineering and Systems, pp. 1–6. IEEE Press (2012)
Puri, S., Kaushik, S.: A technical study and analysis on fuzzy similarity based models for text classification. Int. J. Data Mining Knowl. Manage. Process 2(2), 1–15 (2012)
Chen, Y., Han, B., Hou, P.: New feature selection methods based on context similarity for text categorization. In: 11th International Conference on Fuzzy Systems and Knowledge Discovery, pp. 598–604. IEEE Press (2014)
Sheeba, J.I., Vivekanandan, K.: A fuzzy logic based improved keyword extraction from meeting transcripts. Int. J. Comput. Sci. Eng. 6, 287–299 (2014)
Modarresi, K.: Unsupervised feature extraction using singular value decomposition. Proc. Comput. Sci. 51, 2417–2425 (2015)
Maji, P., Garai, P.: IT2 Fuzzy-rough sets and max relevance-max significance criterion for attribute selection. IEEE Trans. Cybern. 45(8), 1657–1668 (2015)
Arumugam, P., Jose, P.: Recent advances on kernel fuzzy support vector machine model for supervised learning. In: International Conference on Circuits, Power and Computing Technologies, pp. 1–5. IEEE Press (2015)
Almasi, O.N., Rouhani, M.: Fast and de-noise support vector machine training method based on fuzzy clustering method for large real world datasets. Turkish J. Electr. Eng. Comput. Sci. 24, 219–233 (2016)
Bano, S., Bandhekar, S.: Fuzzy clustering based data reduction for improvement in classification. Int. J. Innov. Res. Comput. Commun. Eng. 4(5), 1111–1118 (2016)
Zobeidi, S., Naderan, M., Alavi, S. E.: Effective text classification using multi-level fuzzy neural network. In: 5th Iranian Joint Congress on Fuzzy and Intelligent Systems, pp. 91–96. IEEE Press (2017)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Puri, S. (2021). Efficient Fuzzy Similarity-Based Text Classification with SVM and Feature Reduction. In: Sharma, H., Saraswat, M., Yadav, A., Kim, J.H., Bansal, J.C. (eds) Congress on Intelligent Systems. CIS 2020. Advances in Intelligent Systems and Computing, vol 1335. Springer, Singapore. https://doi.org/10.1007/978-981-33-6984-9_28
Download citation
DOI: https://doi.org/10.1007/978-981-33-6984-9_28
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6983-2
Online ISBN: 978-981-33-6984-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)