Abstract
Machine learning has emerged as one of the most indispensable fields of this era in mining stream data. The stream data have the tendency to change their characteristics over time. Mining imbalanced stream is a research demanding subfield of this area. In imbalanced data, one of the target classes has much less instances than other class. The imbalanced data may differ either in their ratio between majority and minority class or dimension or the number of classes. The performance of the classifier is affected by variety of imbalanced data set used for training the classifier. Because, the learning of classifier is different for different types of data sets. Imbalanced data can result due to rare events which can have negative impact on society. Most traditional data mining algorithm misclassifies the minority class of the imbalance data sets or considers it as noise. Therefore, the decision is biased toward majority class and hence reduces the accuracy and overall performance of the algorithm. The algorithms for classifying imbalanced data sets thus demand high adaptability to changes in the majority and minority class ratios. The performance of traditional machine learning algorithms is enhanced by applying ensemble method and deep learning approaches. Purpose: This paper proposes a framework based on deep ensemble learning for classifying imbalanced stream of data. Methodology: The deep ensemble methods ensemble multiple base learners. While deep learning approach is applied to improve the performance by extracting lower-level features and feeding them forward for the next layer to identify higher level features. Results: In this method, the effect of highly imbalanced classes is reduced by uniting the ensemble method with deep learning. The accuracy of the classifier is improved in terms of not only accuracy but also categorical accuracy. Conclusion: In addition to accuracy, other performance measures like categorical prediction accuracy, training accuracy and prediction accuracy have also been compared which are crucial metrics for evaluating imbalance stream of data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zyblewski, P., Ksieniewicz, P., Woźniak, M.: Classifier selection for highly imbalanced data streams with minority driven ensemble. In: International Conference on Artificial Intelligence and Soft Computing, pp. 626–635. Springer, Cham (2019)
Klikowski, J., Woźniak, M.: Multi sampling random subspace ensemble for imbalanced data stream classification. In: International Conference on Computer Recognition Systems, pp. 360–369. Springer, Cham (2019)
Sun, B., Chen, H., Wang, J., Xie, H.: Evolutionary under-sampling based bagging ensemble method for imbalanced data classification. Front. Comp. Sci. 12(2), 331–350 (2018)
Zhang, Y., Liu, B., Cai, J., Zhang, S.: Ensemble weighted extreme learning machine for imbalanced data classification based on differential evolution. Neural Comput. Appl. 28(1), 259–267 (2017)
Yijing, L., Haixiang, G., Xiao, L., Yanan, L., Jinling, L.: Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl. Based Syst. 94, 88–104 (2016)
Arabmakki, E., Kantardzic, M., Sethi, T.S.: Ensemble classifier for imbalanced streaming data using partial labeling. In: 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), pp. 257–260. IEEE (2016)
Wang, S., Minku, L.L., Yao, X.: A multi-objective ensemble method for online class imbalance learning. In: 2014 International Joint Conference on Neural Networks (IJCNN), pp. 3311–3318. IEEE (2014)
Wang, S., Minku, L.L., Yao, X.: Resampling-based ensemble methods for online class imbalance learning. IEEE Trans. Knowl. Data Eng. 27(5), 1356–1368 (2014)
Ghazikhani, A., Monsefi, R., Yazdi, H.S.: Ensemble of online neural networks for non-stationary and imbalanced data streams. Neurocomputing 122, 535–544 (2013)
Shi, H., Li, H., Zhang, D., Cheng, C., Cao, X.: An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification. Comput. Netw. 132, 81–98 (2018)
Dhote, S., Vichoray, C., Pais, R., Baskar, S., Shakeel, P.M.: Hybrid geometric sampling and AdaBoost based deep learning approach for data imbalance in E-commerce. Electron. Commerce Res. 1–16 (2019)
Hu, Z., Jiang, P.: An imbalance modified deep neural network with dynamical incremental learning for chemical fault diagnosis. IEEE Trans. Industr. Electron. 66(1), 540–550 (2018)
Zhang, Y., Yu, J., Liu, W., Ota, K.: Ensemble classification for skewed data streams based on neural network. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 26(05), 839–853 (2018)
Pouyanfar, S., Tao, Y., Mohan, A., Tian, H., Kaseb, A.S., Gauen, K., et al.: Dynamic sampling in convolutional neural networks for imbalanced data classification. In: 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), pp. 112–117. IEEE (2018)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Arya, M., Hanumat Sastry, G. (2022). A Novel Deep Ensemble Learning Framework for Classifying Imbalanced Data Stream. In: Senjyu, T., Mahalle, P., Perumal, T., Joshi, A. (eds) IOT with Smart Systems. Smart Innovation, Systems and Technologies, vol 251. Springer, Singapore. https://doi.org/10.1007/978-981-16-3945-6_60
Download citation
DOI: https://doi.org/10.1007/978-981-16-3945-6_60
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3944-9
Online ISBN: 978-981-16-3945-6
eBook Packages: EngineeringEngineering (R0)