Skip to main content

Tracking Drifting Concepts by Time Window Optimisation

  • Conference paper
Research and Development in Intelligent Systems XXII (SGAI 2005)

Abstract

This paper addresses the task of learning concept descriptions from streams of data. As new data are obtained the concept description has to be updated regularly to include the new data. In this case we can face the problem that the concept changes over time. Hence the old data become irrelevant to the current concept and have to be removed from the training dataset. This problem is known in the area of machine learning as concept drift. We develop a mechanism that tracks changing concepts using an adaptive time window. The method uses a significance test to detect concept drift and then optimizes the size of the time window, aiming to maximise the classification accuracy on recent data. The method presented is general in nature and can be used with any learning algorithm. The method is tested with three standard learning algorithms (kNN, ID3 and NBC). Three datasets have been used in these experiments. The experimental results provide evidence that the suggested forgetting mechanism is able significantly to improve predictive accuracy on changing concepts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aha, D., Kibler, D. and Albert, M.: Instance-Based Learning Algorithms. Machine Learning 6, (1991)37–66

    Google Scholar 

  2. Burghes, D. and Graham, A.: Introduction to Control Theory including Optimal Control: Ellis Horwood Series Mathematics and its Applications. John Wiley & Sons (1980)

    Google Scholar 

  3. Delany, SJ., Cunningham. P., Tsymbal, A. and Coyle, L.: A Case-Based Technique for Tracking Concept Drift in Spam Filtering. In: Macintosh, A., Ellis, R. & Allen T. (eds.) Applications and Innovations in Intelligent Systems XII, Proceedings of AI2004, Lecture Notes in Computer Science, Springer (2004) 3–16

    Google Scholar 

  4. Gama, J., Medas, P., Castillo, G. and Rodrigues, P.: Learning with Drift Detection. In: Ana, C, Bazzan, S. and Labidi (Eds.): Proceedings of the 17th Brazilian Symposium on Artificial Intelligence. Lecture Notes in Computer Science, Vol. 3171, Springer, (2004) 286–295

    Google Scholar 

  5. Harries, M. and Sammut, C.: Extracting Hidden Context. Machine Learning 32 (1998) 101–126

    Article  MATH  Google Scholar 

  6. Koychev, I.: Gradual Forgetting for Adaptation to Concept Drift. Proceedings of ECAI 2000 Workshop on Current Issues in Spatio-Temporal Reasoning, Berlin, (2000) 101–107

    Google Scholar 

  7. Koychev, I.: Tracking Changing User Interests through Prior-Learning of Context. In: de Bra, P., Brusilovsky, P., Conejo, R. (eds.): Adaptive Hypermedia and Adaptive Web Based Systems. Lecture Notes in Computer Science, Vol. 2347, Springer-Verlag (2002) 223–232

    Google Scholar 

  8. Klinkenberg, R.: Learning Drifting Concepts: Example Selection vs. Example Weighting. In Intelligent Data Analysis, Special Issue on Incremental Learning Systems Capable of Dealing with Concept Drift, Vol. 8, No. 3, (2004) 281–300

    Google Scholar 

  9. Kukar, M.: Drifting Concepts as Hidden Factors in Clinical Studies. In Dojat, D., Elpida T. Keravnou, Pedro Barahona (Eds.): Proceedings of 9th Conference on Artificial Intelligence in Medicine in Europe, AIME 2003, Protaras, Cyprus, October 18–22, 2003, Lecture Notes in Computer Science, Vol. 2780, Springer-Verlag (2003) 355–364

    Google Scholar 

  10. Chu, F.and Zaniolo, C: Fast and light boosting for adaptive mining of data streams. In: Proc. of the 8th Pacific-Asia Conference on Knowledge Discovery and Data Mining. Lec ture Notes in Computer Science, Vol.3056, Springer-Verlag, (2004) 282–292

    Google Scholar 

  11. Lazarescu, M., Venkatesh, S. and Bui H. H.: Using Multiple Windows to Track Concept Drift. In the Intelligent Data Analysis Journal, Vol 8(1), (2004) 29–59

    Google Scholar 

  12. Maloof, M. and Michalski, R.: Selecting examples for partial memory learning. Machine Learning 41 (2000) 27–52

    Article  Google Scholar 

  13. Mitchell, T., Caruana, R., Freitag, D., McDermott, J. and Zabowski, D.: Experience with a Learning Personal Assistant. Communications of the ACM 37(7) (1994) 81–91

    Article  Google Scholar 

  14. Mitchell T. Machine Learning. McGraw-Hill (1997)

    Google Scholar 

  15. Quintan, R.: Induction of Decision Trees. Machine Learning 1 (1986) 81–106

    Google Scholar 

  16. Schlimmer, J. and Granger, R.: Incremental Learning from Noisy Data. Machine Learning 3, (1986), 317–357

    Google Scholar 

  17. Widmer, G.: Tracking Changes through Meta-Leaming. Machine Learning 27 (1997) 256–286

    Article  Google Scholar 

  18. Widmer, G. and Kubat, M.: Learning in the presence of concept drift and hidden contexts: Machine Learning 23 (1996) 69–101

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag London Limited

About this paper

Cite this paper

Koychev, I., Lothian, R. (2006). Tracking Drifting Concepts by Time Window Optimisation. In: Bramer, M., Coenen, F., Allen, T. (eds) Research and Development in Intelligent Systems XXII. SGAI 2005. Springer, London. https://doi.org/10.1007/978-1-84628-226-3_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-84628-226-3_5

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84628-225-6

  • Online ISBN: 978-1-84628-226-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics