Drifting Concepts as Hidden Factors in Clinical Studies

Kukar, Matjaž

doi:10.1007/978-3-540-39907-0_49

Matjaž Kukar

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2780)

Included in the following conference series:

Conference on Artificial Intelligence in Medicine in Europe

Abstract

Most statistical, Machine Learning and Data Mining algorithms assume that the data they use are a random sample drawn from a stationary distribution. Unfortunately, many of the databases available for mining today violate this assumption. They were gathered over months or years, and the underlying processes generating them may have changed during this time, sometimes radically (a phenomenon known as concept drift). In clinical institutions, where patients' data are regularly stored in a central computer database, similar situations may occur. Expert physicians may easily, even unconsciously, adapt to the changed environment, whereas Machine Learning and Data Mining tools may fail because of their underlying assumptions. It is therefore important to detect and adapt to the changed situation. In this paper we review several techniques for dealing with concept drift in Machine Learning and Data Mining frameworks and evaluate their use in clinical studies with a case study of coronary artery disease diagnostics.
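
To make the idea of detecting a changed situation concrete, the following is a minimal sketch of one generic approach: monitoring the error rate of a frozen model over consecutive time windows. It is not taken from the paper and is not claimed to be any of the techniques it reviews. The function names, the window size, the tolerance, and the synthetic data stream are all hypothetical choices made for illustration.

```python
from typing import List, Optional


def windowed_error_rates(errors: List[int], window: int) -> List[float]:
    """Error rate of a fixed model over consecutive, non-overlapping windows."""
    if window <= 0:
        raise ValueError("window must be positive")
    rates = []
    # Trailing cases that do not fill a whole window are ignored.
    for start in range(0, len(errors) - window + 1, window):
        chunk = errors[start:start + window]
        rates.append(sum(chunk) / window)
    return rates


def first_drift_window(errors: List[int], window: int,
                       tolerance: float) -> Optional[int]:
    """Index of the first window whose error rate exceeds the first
    window's rate by more than `tolerance`, or None if there is none."""
    rates = windowed_error_rates(errors, window)
    if not rates:
        return None
    baseline = rates[0]
    for i, rate in enumerate(rates[1:], start=1):
        if rate - baseline > tolerance:
            return i
    return None


# Synthetic stream (an assumption, not clinical data): the concept is
# "x > 5" for the first 40 cases, then silently changes to "x > 2".
xs = [i % 10 for i in range(80)]
true_labels = [(x > 5) if t < 40 else (x > 2) for t, x in enumerate(xs)]

# A model frozen on the old concept keeps predicting "x > 5".
predictions = [x > 5 for x in xs]
errors = [int(p != y) for p, y in zip(predictions, true_labels)]

rates = windowed_error_rates(errors, window=20)
drift_at = first_drift_window(errors, window=20, tolerance=0.2)
print(rates, drift_at)
```

In this toy stream the frozen model is error-free until the concept changes, after which its windowed error rate rises and the change is flagged. A real clinical data set would need a principled choice of window size and tolerance, and a rising error rate alone does not say what changed or how to adapt to it.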

Author information

Authors and Affiliations

Faculty of Computer and Information Science, University of Ljubljana, Tržaška 25, SI-1001, Ljubljana, Slovenia
Matjaž Kukar

Editor information

Editors and Affiliations

INSERM U836-UJF-CEA-CHU (Grenoble Institute of Neuroscience),
Michel Dojat
Department of Computer Science, University of Cyprus, P.O.Box 20537, CY-1678, Nicosia, Cyprus
Elpida T. Keravnou
Centro de Inteligência Artificial, Departamento de Informática, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal
Pedro Barahona

About this paper

Cite this paper

Kukar, M. (2003). Drifting Concepts as Hidden Factors in Clinical Studies. In: Dojat, M., Keravnou, E.T., Barahona, P. (eds) Artificial Intelligence in Medicine. AIME 2003. Lecture Notes in Computer Science, vol 2780. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39907-0_49

DOI: https://doi.org/10.1007/978-3-540-39907-0_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20129-8
Online ISBN: 978-3-540-39907-0
eBook Packages: Springer Book Archive
