Abstract
The distribution of data often evolves over time, necessitating classifiers to adjust in order to maintain optimal classification accuracy. This phenomenon, termed “concept drift”, poses a significant challenge. Detectors specifically designed for identifying concept drift are typically integrated to bolster classifier performance. In this study, we introduce two innovative methodologies designed to address the prevalent issues of high miss detection, excessive false alarms, and prolonged detection latencies encountered in many contemporary concept drift detection algorithms. The first approach, termed the Hybrid Weighting-based Concept Drift Detection Method (HW_DDM), incorporates both linear and exponential weighting for long and short windows, respectively, within a composite window model. Subsequently, concept drift is detected by calculating the weighted mean value within the window, leveraging the Hoeffding and McDirmid inequality thresholds. The second strategy, named the Dynamic Weighting-based Hoeffding Drift Detection Method (DW_HDDM), employs a mechanism that dynamically adjusts the Hoeffding threshold and dynamically weights classification prediction outcomes, thereby catering to the drift and augmenting detection efficacy. Comparative evaluations using both synthetic and real-world datasets against leading-edge algorithms are presented. The empirical results underscore that HW_DDM exhibits the lowest false detection rate with negligible miss detections on synthetic datasets. In contrast, DW_HDDM shines with minimal detection delay, reduced miss detection rates, and a diminished false detection rate, demonstrating superior classification accuracy on real-world datasets when pitted against benchmark algorithms.
Similar content being viewed by others
Availability of data and material
Data will be available on reasonable request.
References
Jian D, Men H, Juan Li (2016) Review of concept drift data streams mining techniques. Computer Science 43(12):24–29
Xiulin Z, Peipei L, Xindong W (2022) Data stream classification based on extreme learning machine: a review. Big Data Res 30:100356
Abbasi A, Javed AR, Chakraborty C et al (2021) ElStream: an ensemble learning approach for concept drift detection in dynamic social big data stream learning. Piscat IEEE Access 9:66408–66419
Hang Yu, Liu Weixu Lu, Jie, et al (2023) Detecting group concept drift from multiple data streams. Pattern Recogn 134:109113
Gama J, Žliobaitė I, Bifet A et al (2014) A survey on concept drift adaptation. ACM Comput Surv (CSUR) 46(4):1–37
Page ES (1954) Continuous inspection schemes. Biometrika 41(1/2):100
Roberts SW (2000) Control chart test s based on geometric moving averages. Technometrics 42(1):97–101
Gama J, Medas P et al (2004) Learning with drift detection. Adv Artif Intell 3171:286–295
Baena M, Del Campo J, Fidalgo R et al (2006) Early drift detection method. Proceedings of the 2016 International Workshop on Knowledge Discovery from Data Streams. Porto Citeseer 6:77–86
Barros RS, Cabral DR, Gonçalves PM, Jr, et al (2017) RDDM: reactive drift detection method. Expert Syst Appl 90:344–355
Bifet A, Gavaldá R (2007) Learning from time-changing data with adaptive windowing. Proceedings of the Seventh SIAM International Conference on Data Mining. Minneapolis: SIAM 443–448.
Frias-blanco I, Campo-avila JD, Ramos-jimenez G et al (2015) Online and non-parametric drift detection methods based on hoeffding bounds. IEEE Trans Knowl Data Eng 27(3):810–823
Pesaranghader A, Viktor HL (2016) Fast hoeffding drift detection method for evolving data streams. In: Frasconi P, Landwehr N, Manco G, Vreeken J (eds) Proceedings of 2016 Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, Cham, pp 96–111
Pesaranghader A, Viktor HL (2018) Mcdiarmid drift detection methods for evolving data streams. Proceedings of 2018 International Joint Confer-ence on Neural Networks. Piscataway: IEEE 1–9.
Pesaranghader A, Viktor H, Paquet E (2018) Reservoir of diverse adaptive learners and stacking fast hoeffd-ing drift detection methods for evolving data streams. Mach Learn 107(11):1711–1743
Goel K, Batra S (2021) Adaptive online learning for classification under concept drift. Int J Comput Sci Eng 24(2):128–135
Qingyan Xu, Li He, Hongxi Z (2020) Improved detection method of concept drift based on the hoeffding inequality. Comput Eng Appl 56(19):55–61
Chen Z, Han M, Wu H et al (2022) A multi-level weighted concept drift detection method. J Supercomput 79(5):5154–5180
Nishida K, Yamauchi K (2007) Detecting Concept Drift Using Statistical Testing. Springer, Cham, pp 264–269
Huang D T J, Koh Y S, Dobbie G, et al (2014) Detecting volatility shift in data streams. Proceedings of 2014 IEEE International Conference on Data Mining(ICDM). Shenzhen: IEEE. 863–868
Baidari I, Honnikoll N (2021) Bhattacharyya distance based concept drift detection method for evolving data stream. Expert Syst Appl 6:115303
Ali P, Herna V, Eric P (2017) Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data Streams. Mach Learn 3:1–33
Guo HS, Li H, Ren QY et al (2022) Concept drift type identification based on multi-sliding windows. Inf Sci 585:1–23
Valiant LG (1984) A theory of the learnable. Commun ACM 27(11):1134–1142
Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) Moa: massive online analysis. J Mach Learn Res 11:1601–1604
Acknowledgements
This work was supported by the National Nature Science Foundation of China (62062004), the Ningxia Natural Science Foundation Project (2022AAC03279).
Author information
Authors and Affiliations
Contributions
Dongliang Mu are responsible for conceptualization, methodology, software, writing, all tables and figures. Meng Han are responsible for writing, review, editing, supervision and funding acquisition. Ang Li are responsible for data curation, investigation. Shujuan Liu are responsible for formal analysis, project administration. Zhihui Gao are responsible for validation.
Corresponding author
Ethics declarations
Confict of interest
We declare that we do not have any commercial or associative interest that represents. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Research involving human and animal rights
This article does not contain any studies with human or animal subjects performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Han, M., Mu, D., Li, A. et al. Concept drift detection methods based on different weighting strategies. Int. J. Mach. Learn. & Cyber. (2024). https://doi.org/10.1007/s13042-024-02186-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13042-024-02186-4