Abstract
This paper presents a new rough fuzzy classification approach for class imbalanced data. Here, interval type-2 fuzzy granulation of input features is formulated, various combinations of rough set extension-based methods are used to perform class imbalance learning, and K-nearest neighbor (KNN) classifier is used for data classification. The experimental results on the UCI data sets are reported to demonstrate the effectiveness of the proposed rough fuzzy classification model. Performance evaluation measures viz F-measure and geometric mean (G-mean) are used for analyzing classifier’s performance and suitability of the developed model for class imbalance learning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley, New York (2001)
GarcÃa, V., Mollineda, R., Sánchez, J.: On the KNN performance in a challenging scenario of imbalance and overlapping. Pattern Anal. App. 11, 269–280 (2008)
Chawla, N.V.: Data mining for imbalanced datasets: an overview. In: Data Mining and Knowledge Discovery Handbook, pp. 875–886 (2010)
Weiss, G.: Mining with rarity: a unifying framework. SIGKDD Explor. 6(1), 7–19 (2004)
Drummond, C., Holte, R.C.: C4.5, class imbalance, and cost sensitivity: why under-sampling beats over-sampling. In: Chawla , N. et al. (eds.) Proceedings of ICML’03, Washington DC, USA (2003)
Pal, S.K., Skowron, A.: Rough-Fuzzy Hybridization: A New Trend in Decision Making. Springer, Singapore (1999)
Pal, S.K., Mitra, S.: Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 3, 683–697 (1992)
Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 11, 341–356 (1982)
Lin, T.Y.: Granulation and nearest neighborhoods: rough set approach. In: Pedrycz, W. (ed.) Granular Computing: An Emerging Paradigm, pp. 125–142. Physica-Verlag, Heidelberg (2001)
Hu, Q., Yu, D., Liu, J., Wu, C.: Neighborhood rough set based heterogeneous feature subset selection. Inf. Sci. 178, 3577–3594 (2008)
Mazumder, R.U., Begum, S.A., Biswas, D.: An Exponential kernel based fuzzy rough sets model for feature selection. Int. J. Comput. Appl. (IJCA) 81(6), 24–31 (2013)
Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht (1991)
Pawlak, Z., Skowron, A.: Rough sets: some extensions. Inf. Sci. 177(1), 28–40 (2007)
Genton, M.: Classes of kernels for machine learning: a statistics perspective. J. Mach. Learn. Res. 22, 99–312 (2001)
Zadeh, L.A.: Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets and Syst. 90, 111–127 (1997)
Liang, Q., Mendel, J.M.: Interval type-2 fuzzy logic systems: theory and design. IEEE Trans. Fuzzy Syst. 8(5), 535–550 (2000)
Blake, C., Merz, C., Hettich, S., Newman, D.J.: UCI repository of machine learning databases. University of California, School of Information and Computer Sciences, Irvine, CA (1998)
Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority oversampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)
Gu, Q., Zhu, L., Cai, Z.: Evaluation measures of the classification performance of imbalanced data sets. In: Cai, Z. et al. (eds.) Computational Intelligence and Intelligent Systems. Commun. Comput. Inf. Sci. 51, 461–471 (2009)
Buckland, M., Gey, F.: The Relationship between recall and precision. J. Am. Soc. Inf. Sci. 45(1), 12–19 (1994)
Kubat, M., Holte, R.C., Matwin, S.: Machine learning for the detection of oil spills in satellite radar images. Mach. Learn. 30, 195–215 (1998)
Liu, J., Hu, Q., Yu, D.: A weighted rough set based method developed for class imbalance learning. Inf. Sci. 178, 1235–1256 (2008)
Acknowledgment
Author (RUM) gratefully acknowledges UGC for granting Maulana Azad National Fellowship for the research work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer India
About this paper
Cite this paper
Mazumder, R.U., Begum, S.A., Biswas, D. (2015). Rough Fuzzy Classification for Class Imbalanced Data. In: Das, K., Deep, K., Pant, M., Bansal, J., Nagar, A. (eds) Proceedings of Fourth International Conference on Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 335. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2217-0_14
Download citation
DOI: https://doi.org/10.1007/978-81-322-2217-0_14
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2216-3
Online ISBN: 978-81-322-2217-0
eBook Packages: EngineeringEngineering (R0)