Skip to main content

A Novel Deep Ensemble Learning Framework for Classifying Imbalanced Data Stream

  • Conference paper
  • First Online:
IOT with Smart Systems

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 251))

Abstract

Machine learning has emerged as one of the most indispensable fields of this era in mining stream data. The stream data have the tendency to change their characteristics over time. Mining imbalanced stream is a research demanding subfield of this area. In imbalanced data, one of the target classes has much less instances than other class. The imbalanced data may differ either in their ratio between majority and minority class or dimension or the number of classes. The performance of the classifier is affected by variety of imbalanced data set used for training the classifier. Because, the learning of classifier is different for different types of data sets. Imbalanced data can result due to rare events which can have negative impact on society. Most traditional data mining algorithm misclassifies the minority class of the imbalance data sets or considers it as noise. Therefore, the decision is biased toward majority class and hence reduces the accuracy and overall performance of the algorithm. The algorithms for classifying imbalanced data sets thus demand high adaptability to changes in the majority and minority class ratios. The performance of traditional machine learning algorithms is enhanced by applying ensemble method and deep learning approaches. Purpose: This paper proposes a framework based on deep ensemble learning for classifying imbalanced stream of data. Methodology: The deep ensemble methods ensemble multiple base learners. While deep learning approach is applied to improve the performance by extracting lower-level features and feeding them forward for the next layer to identify higher level features. Results: In this method, the effect of highly imbalanced classes is reduced by uniting the ensemble method with deep learning. The accuracy of the classifier is improved in terms of not only accuracy but also categorical accuracy. Conclusion: In addition to accuracy, other performance measures like categorical prediction accuracy, training accuracy and prediction accuracy have also been compared which are crucial metrics for evaluating imbalance stream of data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zyblewski, P., Ksieniewicz, P., Woźniak, M.: Classifier selection for highly imbalanced data streams with minority driven ensemble. In: International Conference on Artificial Intelligence and Soft Computing, pp. 626–635. Springer, Cham (2019)

    Google Scholar 

  2. Klikowski, J., Woźniak, M.: Multi sampling random subspace ensemble for imbalanced data stream classification. In: International Conference on Computer Recognition Systems, pp. 360–369. Springer, Cham (2019)

    Google Scholar 

  3. Sun, B., Chen, H., Wang, J., Xie, H.: Evolutionary under-sampling based bagging ensemble method for imbalanced data classification. Front. Comp. Sci. 12(2), 331–350 (2018)

    Article  Google Scholar 

  4. Zhang, Y., Liu, B., Cai, J., Zhang, S.: Ensemble weighted extreme learning machine for imbalanced data classification based on differential evolution. Neural Comput. Appl. 28(1), 259–267 (2017)

    Article  Google Scholar 

  5. Yijing, L., Haixiang, G., Xiao, L., Yanan, L., Jinling, L.: Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl. Based Syst. 94, 88–104 (2016)

    Article  Google Scholar 

  6. Arabmakki, E., Kantardzic, M., Sethi, T.S.: Ensemble classifier for imbalanced streaming data using partial labeling. In: 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), pp. 257–260. IEEE (2016)

    Google Scholar 

  7. Wang, S., Minku, L.L., Yao, X.: A multi-objective ensemble method for online class imbalance learning. In: 2014 International Joint Conference on Neural Networks (IJCNN), pp. 3311–3318. IEEE (2014)

    Google Scholar 

  8. Wang, S., Minku, L.L., Yao, X.: Resampling-based ensemble methods for online class imbalance learning. IEEE Trans. Knowl. Data Eng. 27(5), 1356–1368 (2014)

    Article  Google Scholar 

  9. Ghazikhani, A., Monsefi, R., Yazdi, H.S.: Ensemble of online neural networks for non-stationary and imbalanced data streams. Neurocomputing 122, 535–544 (2013)

    Article  Google Scholar 

  10. Shi, H., Li, H., Zhang, D., Cheng, C., Cao, X.: An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification. Comput. Netw. 132, 81–98 (2018)

    Article  Google Scholar 

  11. Dhote, S., Vichoray, C., Pais, R., Baskar, S., Shakeel, P.M.: Hybrid geometric sampling and AdaBoost based deep learning approach for data imbalance in E-commerce. Electron. Commerce Res. 1–16 (2019)

    Google Scholar 

  12. Hu, Z., Jiang, P.: An imbalance modified deep neural network with dynamical incremental learning for chemical fault diagnosis. IEEE Trans. Industr. Electron. 66(1), 540–550 (2018)

    Article  Google Scholar 

  13. Zhang, Y., Yu, J., Liu, W., Ota, K.: Ensemble classification for skewed data streams based on neural network. Int. J. Uncertain. Fuzziness Knowl. Based Syst. 26(05), 839–853 (2018)

    Article  Google Scholar 

  14. Pouyanfar, S., Tao, Y., Mohan, A., Tian, H., Kaseb, A.S., Gauen, K., et al.: Dynamic sampling in convolutional neural networks for imbalanced data classification. In: 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), pp. 112–117. IEEE (2018)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Arya, M., Hanumat Sastry, G. (2022). A Novel Deep Ensemble Learning Framework for Classifying Imbalanced Data Stream. In: Senjyu, T., Mahalle, P., Perumal, T., Joshi, A. (eds) IOT with Smart Systems. Smart Innovation, Systems and Technologies, vol 251. Springer, Singapore. https://doi.org/10.1007/978-981-16-3945-6_60

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-3945-6_60

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-3944-9

  • Online ISBN: 978-981-16-3945-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics