Concept drift detection methods based on different weighting strategies

Han, Meng; Mu, Dongliang; Li, Ang; Liu, Shujuan; Gao, Zhihui

doi:10.1007/s13042-024-02186-4

Concept drift detection methods based on different weighting strategies

Original Article
Published: 13 May 2024

(2024)
Cite this article

International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Meng Han ORCID: orcid.org/0000-0002-8994-1274¹,
Dongliang Mu¹,
Ang Li¹,
Shujuan Liu¹ &
…
Zhihui Gao¹

63 Accesses
Explore all metrics

Abstract

The distribution of data often evolves over time, necessitating classifiers to adjust in order to maintain optimal classification accuracy. This phenomenon, termed “concept drift”, poses a significant challenge. Detectors specifically designed for identifying concept drift are typically integrated to bolster classifier performance. In this study, we introduce two innovative methodologies designed to address the prevalent issues of high miss detection, excessive false alarms, and prolonged detection latencies encountered in many contemporary concept drift detection algorithms. The first approach, termed the Hybrid Weighting-based Concept Drift Detection Method (HW_DDM), incorporates both linear and exponential weighting for long and short windows, respectively, within a composite window model. Subsequently, concept drift is detected by calculating the weighted mean value within the window, leveraging the Hoeffding and McDirmid inequality thresholds. The second strategy, named the Dynamic Weighting-based Hoeffding Drift Detection Method (DW_HDDM), employs a mechanism that dynamically adjusts the Hoeffding threshold and dynamically weights classification prediction outcomes, thereby catering to the drift and augmenting detection efficacy. Comparative evaluations using both synthetic and real-world datasets against leading-edge algorithms are presented. The empirical results underscore that HW_DDM exhibits the lowest false detection rate with negligible miss detections on synthetic datasets. In contrast, DW_HDDM shines with minimal detection delay, reduced miss detection rates, and a diminished false detection rate, demonstrating superior classification accuracy on real-world datasets when pitted against benchmark algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 2

Fig. 8

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A survey on imbalanced learning: latest research, applications and future directions

Article Open access 09 May 2024

Introduction to Machine Learning

Availability of data and material

Data will be available on reasonable request.

References

Jian D, Men H, Juan Li (2016) Review of concept drift data streams mining techniques. Computer Science 43(12):24–29
Google Scholar
Xiulin Z, Peipei L, Xindong W (2022) Data stream classification based on extreme learning machine: a review. Big Data Res 30:100356
Article Google Scholar
Abbasi A, Javed AR, Chakraborty C et al (2021) ElStream: an ensemble learning approach for concept drift detection in dynamic social big data stream learning. Piscat IEEE Access 9:66408–66419
Article Google Scholar
Hang Yu, Liu Weixu Lu, Jie, et al (2023) Detecting group concept drift from multiple data streams. Pattern Recogn 134:109113
Article Google Scholar
Gama J, Žliobaitė I, Bifet A et al (2014) A survey on concept drift adaptation. ACM Comput Surv (CSUR) 46(4):1–37
Article Google Scholar
Page ES (1954) Continuous inspection schemes. Biometrika 41(1/2):100
Article MathSciNet Google Scholar
Roberts SW (2000) Control chart test s based on geometric moving averages. Technometrics 42(1):97–101
Article Google Scholar
Gama J, Medas P et al (2004) Learning with drift detection. Adv Artif Intell 3171:286–295
Google Scholar
Baena M, Del Campo J, Fidalgo R et al (2006) Early drift detection method. Proceedings of the 2016 International Workshop on Knowledge Discovery from Data Streams. Porto Citeseer 6:77–86
Google Scholar
Barros RS, Cabral DR, Gonçalves PM, Jr, et al (2017) RDDM: reactive drift detection method. Expert Syst Appl 90:344–355
Article Google Scholar
Bifet A, Gavaldá R (2007) Learning from time-changing data with adaptive windowing. Proceedings of the Seventh SIAM International Conference on Data Mining. Minneapolis: SIAM 443–448.
Frias-blanco I, Campo-avila JD, Ramos-jimenez G et al (2015) Online and non-parametric drift detection methods based on hoeffding bounds. IEEE Trans Knowl Data Eng 27(3):810–823
Article Google Scholar
Pesaranghader A, Viktor HL (2016) Fast hoeffding drift detection method for evolving data streams. In: Frasconi P, Landwehr N, Manco G, Vreeken J (eds) Proceedings of 2016 Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, Cham, pp 96–111
Google Scholar
Pesaranghader A, Viktor HL (2018) Mcdiarmid drift detection methods for evolving data streams. Proceedings of 2018 International Joint Confer-ence on Neural Networks. Piscataway: IEEE 1–9.
Pesaranghader A, Viktor H, Paquet E (2018) Reservoir of diverse adaptive learners and stacking fast hoeffd-ing drift detection methods for evolving data streams. Mach Learn 107(11):1711–1743
Article MathSciNet Google Scholar
Goel K, Batra S (2021) Adaptive online learning for classification under concept drift. Int J Comput Sci Eng 24(2):128–135
Google Scholar
Qingyan Xu, Li He, Hongxi Z (2020) Improved detection method of concept drift based on the hoeffding inequality. Comput Eng Appl 56(19):55–61
Google Scholar
Chen Z, Han M, Wu H et al (2022) A multi-level weighted concept drift detection method. J Supercomput 79(5):5154–5180
Article Google Scholar
Nishida K, Yamauchi K (2007) Detecting Concept Drift Using Statistical Testing. Springer, Cham, pp 264–269
Google Scholar
Huang D T J, Koh Y S, Dobbie G, et al (2014) Detecting volatility shift in data streams. Proceedings of 2014 IEEE International Conference on Data Mining(ICDM). Shenzhen: IEEE. 863–868
Baidari I, Honnikoll N (2021) Bhattacharyya distance based concept drift detection method for evolving data stream. Expert Syst Appl 6:115303
Article Google Scholar
Ali P, Herna V, Eric P (2017) Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data Streams. Mach Learn 3:1–33
Google Scholar
Guo HS, Li H, Ren QY et al (2022) Concept drift type identification based on multi-sliding windows. Inf Sci 585:1–23
Article Google Scholar
Valiant LG (1984) A theory of the learnable. Commun ACM 27(11):1134–1142
Article Google Scholar
Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) Moa: massive online analysis. J Mach Learn Res 11:1601–1604
Google Scholar

Download references

Acknowledgements

This work was supported by the National Nature Science Foundation of China (62062004), the Ningxia Natural Science Foundation Project (2022AAC03279).

Author information

Authors and Affiliations

School of Computer Science and Engineering, North Minzu University, Yinchuan, 750021, China
Meng Han, Dongliang Mu, Ang Li, Shujuan Liu & Zhihui Gao

Authors

Meng Han
View author publications
You can also search for this author in PubMed Google Scholar
Dongliang Mu
View author publications
You can also search for this author in PubMed Google Scholar
Ang Li
View author publications
You can also search for this author in PubMed Google Scholar
Shujuan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhihui Gao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Dongliang Mu are responsible for conceptualization, methodology, software, writing, all tables and figures. Meng Han are responsible for writing, review, editing, supervision and funding acquisition. Ang Li are responsible for data curation, investigation. Shujuan Liu are responsible for formal analysis, project administration. Zhihui Gao are responsible for validation.

Corresponding author

Correspondence to Meng Han.

Ethics declarations

Confict of interest

We declare that we do not have any commercial or associative interest that represents. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Research involving human and animal rights

This article does not contain any studies with human or animal subjects performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Han, M., Mu, D., Li, A. et al. Concept drift detection methods based on different weighting strategies. Int. J. Mach. Learn. & Cyber. (2024). https://doi.org/10.1007/s13042-024-02186-4

Download citation

Received: 20 April 2023
Accepted: 15 April 2024
Published: 13 May 2024
DOI: https://doi.org/10.1007/s13042-024-02186-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Concept drift detection methods based on different weighting strategies

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A survey on imbalanced learning: latest research, applications and future directions

Introduction to Machine Learning

Availability of data and material

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Confict of interest

Informed consent

Consent to participate

Consent for publication

Research involving human and animal rights

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Concept drift detection methods based on different weighting strategies

Abstract

Access this article

Similar content being viewed by others

A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science

A survey on imbalanced learning: latest research, applications and future directions

Introduction to Machine Learning

Availability of data and material

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Confict of interest

Informed consent

Consent to participate

Consent for publication

Research involving human and animal rights

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation