Skip to main content
Log in

Improved concept drift handling in surgery prediction and other applications

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

The article presents a new algorithm for handling concept drift: the Trigger-based Ensemble (TBE) is designed to handle concept drift in surgery prediction but it is shown to perform well for other classification problems as well. At the primary care, queries about the need for surgical treatment are referred to a surgeon specialist. At the secondary care, referrals are reviewed by a team of specialists. The possible outcomes of this review are that the referral: (i) is canceled, (ii) needs to be complemented, or (iii) is predicted to lead to surgery. In the third case, the referred patient is scheduled for an appointment with a surgeon specialist. This article focuses on the binary prediction of case three (surgery prediction). The guidelines for the referral and the review of the referral are changed due to, e.g., scientific developments and clinical practices. Existing decision support is based on the expert systems approach, which usually requires manual updates when changes in clinical practice occur. In order to automatically revise decision rules, the occurrence of concept drift (CD) must be detected and handled. The existing CD handling techniques are often specialized; it is challenging to develop a more generic technique that performs well regardless of CD type. Experiments are conducted to measure the impact of CD on prediction performance and to reduce CD impact. The experiments evaluate and compare TBE to three existing CD handling methods (AWE, Active Classifier, and Learn++) on one real-world dataset and one artificial dataset. TBA significantly outperforms the other algorithms on both datasets but is less accurate on noisy synthetic variations of the real-world dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. The UCI machine learning repository, http://archive.ics.uci.edu/ml/.

References

  1. Alippi C, Boracchi G, Roveri M (2011) An effective just-in-time adaptive classifier for gradual concept drifts. In: International joint conference on neural networks, pp 1675–1682

  2. Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. Mach Learn Res 7:1–30

    MATH  MathSciNet  Google Scholar 

  3. Elwell R, Polikar R (2011) Incremental learning of concept drift in nonstationary environments. IEEE Trans Neural Netw 22(10):1517–1531

    Article  Google Scholar 

  4. Gama J, Medas P, Castillo G, Rodrigues P (2004) Learning with drift detection. In: Advances in artificial intelligence, vol 3171. Springer, pp 286–295

  5. Jose MBG, Campo-Avila JD, Fidalgo R, Bifet A, Gavalda R, Morales-bueno R (2006) Early drift detection method. In: ECML/PKDD workshop on knowledge discovery from data streams, ACM Press, pp 77–86

  6. Magoulas GD, Prentza A (2001) Machine learning in medical applications. Machine learning and its applications, advanced lectures, vol 2049. Springer, Berlin, pp 300–307

  7. Markou M, Singh S (2003) Novelty detection: a review-part 1: statistical approaches. Signal Process 83(12):2481–2497

  8. Masud MM, Chen Q, Khan L, Aggarwal C, Gao J, Han J, Thuraisingham B (2010) Addressing concept-evolution in concept-drifting data streams. In: IEEE international conference on data mining, IEEE Press, pp 929–934

  9. McAllester DA (1998) Some PAC-Bayesian theorems. In: Annual conference on computational learning theory, ACM Press, pp 230–234

  10. Minku L, White A, Yao X (2010) The impact of diversity on online ensemble learning in the presence of concept drift. IEEE Trans Knowl Data Eng 22(5):730–742

    Article  Google Scholar 

  11. Mitchell TM (1997) Machine learning. McGraw-Hill, New York

    MATH  Google Scholar 

  12. Nishida K, Yamauchi K (2007) Detecting concept drift using statistical testing. In: International conference on discovery science, vol 4755. Springer, pp 264–269

  13. Ouyang Z, Zhao Z, Gao Y, Wang T (2011) Study on the classification of data streams with concept drift. International conference on fuzzy systems and knowledge discovery, pp 1673–1677

  14. Oza NC, Russell S (2001) Experimental comparisons of online and batch versions of bagging and boosting. In: ACM SIGKDD international conference on knowledge discovery and data mining, ACM Press, pp 359–364

  15. Persson M, Lavesson N (2009) Identification of surgery indicators by mining hospital data: a preliminary study. In: Int conf database expert syst appl, pp 323–327

  16. Schlimmer J, Granger R (1986) Beyond incremental processing: tracking concept drift. In: National conference on artificial intelligence, Morgan Kaufmann, pp 502–507

  17. Sobhani P, Beigy H (2011) New drift detection method for data streams. In: International conference on adaptive and intelligent systems, vol 6943. Springer, pp 88–97

  18. Street WN, Kim Y (2001) A streaming ensemble algorithm (sea) for large-scale classification. In: ACM SIGKDD International conference on knowledge discovery and data mining, ACM Press, pp 377–382

  19. Tsymbal A (2004) The problem of concept drift: Definitions and related work. Computer Science Department, Trinity College Dublin

  20. Tsymbal A, Pechenizkiy M, Cunningham P, Puuronen S (2006) Handling local concept drift with dynamic integration of classifiers: Domain of antibiotic resistance in nosocomial infections. In: IEEE symposium on computer-based medical systems, IEEE Computer Society, pp 679–684

  21. Wang H, Fan W, Yu PS, Han J (2003) Mining concept-drifting data streams using ensemble classifiers. In: ACM SIGKDD international conference on knowledge discovery and data mining, ACM Press, pp 226–235

  22. Wang S, Schlobach S, Klein M (2011) Concept drift and how to identify it. Web Semantics Science Services and Agents on the World Wide Web 9(3):247–265

    Article  Google Scholar 

  23. Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23:69–101

    Google Scholar 

  24. Xiang C, Chen M, Wang H (2009) An ensemble method for medicine best selling prediction. Int Conf Fuzzy Syst Knowl Discov 1:100–103

    Google Scholar 

  25. Yang Y, Wu X, Zhu X (2006) Mining in anticipation for concept change: proactive–reactive prediction in data streams. Data Min Knowl Discov 13(3):261–289

    Article  MathSciNet  Google Scholar 

  26. Zhao QL, Jiang YH, Xu M (2009) A fast ensemble pruning algorithm based on pattern mining process. Data Min Knowl Discov 19(2):277–292

    Article  MathSciNet  Google Scholar 

  27. Zliobaite I, Bifet A, Pfahringer B, Holmes G (2011) Active learning with evolving streaming data. In: European conference on machine learning and knowledge discovery in databases, vol 6913. Springer, pp 597–612

Download references

Acknowledgments

The conducted research is funded by a grant from Blekinge county research council. The authors would like to thank Elisabeth Håkansson at Blekinge hospital for extracting and compiling patient records as well as the corresponding decisions concerning surgery to enable the generation of the real-world hip-replacement dataset.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niklas Lavesson.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Beyene, A.A., Welemariam, T., Persson, M. et al. Improved concept drift handling in surgery prediction and other applications. Knowl Inf Syst 44, 177–196 (2015). https://doi.org/10.1007/s10115-014-0756-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-014-0756-9

Keywords

Navigation