Skip to main content
Log in

Concept drift detection methods based on different weighting strategies

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

The distribution of data often evolves over time, necessitating classifiers to adjust in order to maintain optimal classification accuracy. This phenomenon, termed “concept drift”, poses a significant challenge. Detectors specifically designed for identifying concept drift are typically integrated to bolster classifier performance. In this study, we introduce two innovative methodologies designed to address the prevalent issues of high miss detection, excessive false alarms, and prolonged detection latencies encountered in many contemporary concept drift detection algorithms. The first approach, termed the Hybrid Weighting-based Concept Drift Detection Method (HW_DDM), incorporates both linear and exponential weighting for long and short windows, respectively, within a composite window model. Subsequently, concept drift is detected by calculating the weighted mean value within the window, leveraging the Hoeffding and McDirmid inequality thresholds. The second strategy, named the Dynamic Weighting-based Hoeffding Drift Detection Method (DW_HDDM), employs a mechanism that dynamically adjusts the Hoeffding threshold and dynamically weights classification prediction outcomes, thereby catering to the drift and augmenting detection efficacy. Comparative evaluations using both synthetic and real-world datasets against leading-edge algorithms are presented. The empirical results underscore that HW_DDM exhibits the lowest false detection rate with negligible miss detections on synthetic datasets. In contrast, DW_HDDM shines with minimal detection delay, reduced miss detection rates, and a diminished false detection rate, demonstrating superior classification accuracy on real-world datasets when pitted against benchmark algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig.7
Algorithm 1
Algorithm 2
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Availability of data and material

Data will be available on reasonable request.

References

  1. Jian D, Men H, Juan Li (2016) Review of concept drift data streams mining techniques. Computer Science 43(12):24–29

    Google Scholar 

  2. Xiulin Z, Peipei L, Xindong W (2022) Data stream classification based on extreme learning machine: a review. Big Data Res 30:100356

    Article  Google Scholar 

  3. Abbasi A, Javed AR, Chakraborty C et al (2021) ElStream: an ensemble learning approach for concept drift detection in dynamic social big data stream learning. Piscat IEEE Access 9:66408–66419

    Article  Google Scholar 

  4. Hang Yu, Liu Weixu Lu, Jie, et al (2023) Detecting group concept drift from multiple data streams. Pattern Recogn 134:109113

    Article  Google Scholar 

  5. Gama J, Žliobaitė I, Bifet A et al (2014) A survey on concept drift adaptation. ACM Comput Surv (CSUR) 46(4):1–37

    Article  Google Scholar 

  6. Page ES (1954) Continuous inspection schemes. Biometrika 41(1/2):100

    Article  MathSciNet  Google Scholar 

  7. Roberts SW (2000) Control chart test s based on geometric moving averages. Technometrics 42(1):97–101

    Article  Google Scholar 

  8. Gama J, Medas P et al (2004) Learning with drift detection. Adv Artif Intell 3171:286–295

    Google Scholar 

  9. Baena M, Del Campo J, Fidalgo R et al (2006) Early drift detection method. Proceedings of the 2016 International Workshop on Knowledge Discovery from Data Streams. Porto Citeseer 6:77–86

    Google Scholar 

  10. Barros RS, Cabral DR, Gonçalves PM, Jr, et al (2017) RDDM: reactive drift detection method. Expert Syst Appl 90:344–355

    Article  Google Scholar 

  11. Bifet A, Gavaldá R (2007) Learning from time-changing data with adaptive windowing. Proceedings of the Seventh SIAM International Conference on Data Mining. Minneapolis: SIAM 443–448.

  12. Frias-blanco I, Campo-avila JD, Ramos-jimenez G et al (2015) Online and non-parametric drift detection methods based on hoeffding bounds. IEEE Trans Knowl Data Eng 27(3):810–823

    Article  Google Scholar 

  13. Pesaranghader A, Viktor HL (2016) Fast hoeffding drift detection method for evolving data streams. In: Frasconi P, Landwehr N, Manco G, Vreeken J (eds) Proceedings of 2016 Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, Cham, pp 96–111

    Google Scholar 

  14. Pesaranghader A, Viktor HL (2018) Mcdiarmid drift detection methods for evolving data streams. Proceedings of 2018 International Joint Confer-ence on Neural Networks. Piscataway: IEEE 1–9.

  15. Pesaranghader A, Viktor H, Paquet E (2018) Reservoir of diverse adaptive learners and stacking fast hoeffd-ing drift detection methods for evolving data streams. Mach Learn 107(11):1711–1743

    Article  MathSciNet  Google Scholar 

  16. Goel K, Batra S (2021) Adaptive online learning for classification under concept drift. Int J Comput Sci Eng 24(2):128–135

    Google Scholar 

  17. Qingyan Xu, Li He, Hongxi Z (2020) Improved detection method of concept drift based on the hoeffding inequality. Comput Eng Appl 56(19):55–61

    Google Scholar 

  18. Chen Z, Han M, Wu H et al (2022) A multi-level weighted concept drift detection method. J Supercomput 79(5):5154–5180

    Article  Google Scholar 

  19. Nishida K, Yamauchi K (2007) Detecting Concept Drift Using Statistical Testing. Springer, Cham, pp 264–269

    Google Scholar 

  20. Huang D T J, Koh Y S, Dobbie G, et al (2014) Detecting volatility shift in data streams. Proceedings of 2014 IEEE International Conference on Data Mining(ICDM). Shenzhen: IEEE. 863–868

  21. Baidari I, Honnikoll N (2021) Bhattacharyya distance based concept drift detection method for evolving data stream. Expert Syst Appl 6:115303

    Article  Google Scholar 

  22. Ali P, Herna V, Eric P (2017) Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data Streams. Mach Learn 3:1–33

    Google Scholar 

  23. Guo HS, Li H, Ren QY et al (2022) Concept drift type identification based on multi-sliding windows. Inf Sci 585:1–23

    Article  Google Scholar 

  24. Valiant LG (1984) A theory of the learnable. Commun ACM 27(11):1134–1142

    Article  Google Scholar 

  25. Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) Moa: massive online analysis. J Mach Learn Res 11:1601–1604

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Nature Science Foundation of China (62062004), the Ningxia Natural Science Foundation Project (2022AAC03279).

Author information

Authors and Affiliations

Authors

Contributions

Dongliang Mu are responsible for conceptualization, methodology, software, writing, all tables and figures. Meng Han are responsible for writing, review, editing, supervision and funding acquisition. Ang Li are responsible for data curation, investigation. Shujuan Liu are responsible for formal analysis, project administration. Zhihui Gao are responsible for validation.

Corresponding author

Correspondence to Meng Han.

Ethics declarations

Confict of interest

We declare that we do not have any commercial or associative interest that represents. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Research involving human and animal rights

This article does not contain any studies with human or animal subjects performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Han, M., Mu, D., Li, A. et al. Concept drift detection methods based on different weighting strategies. Int. J. Mach. Learn. & Cyber. (2024). https://doi.org/10.1007/s13042-024-02186-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13042-024-02186-4

Keywords

Navigation