Abstract
The article presents a new algorithm for handling concept drift: the Trigger-based Ensemble (TBE) is designed to handle concept drift in surgery prediction but it is shown to perform well for other classification problems as well. At the primary care, queries about the need for surgical treatment are referred to a surgeon specialist. At the secondary care, referrals are reviewed by a team of specialists. The possible outcomes of this review are that the referral: (i) is canceled, (ii) needs to be complemented, or (iii) is predicted to lead to surgery. In the third case, the referred patient is scheduled for an appointment with a surgeon specialist. This article focuses on the binary prediction of case three (surgery prediction). The guidelines for the referral and the review of the referral are changed due to, e.g., scientific developments and clinical practices. Existing decision support is based on the expert systems approach, which usually requires manual updates when changes in clinical practice occur. In order to automatically revise decision rules, the occurrence of concept drift (CD) must be detected and handled. The existing CD handling techniques are often specialized; it is challenging to develop a more generic technique that performs well regardless of CD type. Experiments are conducted to measure the impact of CD on prediction performance and to reduce CD impact. The experiments evaluate and compare TBE to three existing CD handling methods (AWE, Active Classifier, and Learn++) on one real-world dataset and one artificial dataset. TBA significantly outperforms the other algorithms on both datasets but is less accurate on noisy synthetic variations of the real-world dataset.
Similar content being viewed by others
Notes
The UCI machine learning repository, http://archive.ics.uci.edu/ml/.
References
Alippi C, Boracchi G, Roveri M (2011) An effective just-in-time adaptive classifier for gradual concept drifts. In: International joint conference on neural networks, pp 1675–1682
Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. Mach Learn Res 7:1–30
Elwell R, Polikar R (2011) Incremental learning of concept drift in nonstationary environments. IEEE Trans Neural Netw 22(10):1517–1531
Gama J, Medas P, Castillo G, Rodrigues P (2004) Learning with drift detection. In: Advances in artificial intelligence, vol 3171. Springer, pp 286–295
Jose MBG, Campo-Avila JD, Fidalgo R, Bifet A, Gavalda R, Morales-bueno R (2006) Early drift detection method. In: ECML/PKDD workshop on knowledge discovery from data streams, ACM Press, pp 77–86
Magoulas GD, Prentza A (2001) Machine learning in medical applications. Machine learning and its applications, advanced lectures, vol 2049. Springer, Berlin, pp 300–307
Markou M, Singh S (2003) Novelty detection: a review-part 1: statistical approaches. Signal Process 83(12):2481–2497
Masud MM, Chen Q, Khan L, Aggarwal C, Gao J, Han J, Thuraisingham B (2010) Addressing concept-evolution in concept-drifting data streams. In: IEEE international conference on data mining, IEEE Press, pp 929–934
McAllester DA (1998) Some PAC-Bayesian theorems. In: Annual conference on computational learning theory, ACM Press, pp 230–234
Minku L, White A, Yao X (2010) The impact of diversity on online ensemble learning in the presence of concept drift. IEEE Trans Knowl Data Eng 22(5):730–742
Mitchell TM (1997) Machine learning. McGraw-Hill, New York
Nishida K, Yamauchi K (2007) Detecting concept drift using statistical testing. In: International conference on discovery science, vol 4755. Springer, pp 264–269
Ouyang Z, Zhao Z, Gao Y, Wang T (2011) Study on the classification of data streams with concept drift. International conference on fuzzy systems and knowledge discovery, pp 1673–1677
Oza NC, Russell S (2001) Experimental comparisons of online and batch versions of bagging and boosting. In: ACM SIGKDD international conference on knowledge discovery and data mining, ACM Press, pp 359–364
Persson M, Lavesson N (2009) Identification of surgery indicators by mining hospital data: a preliminary study. In: Int conf database expert syst appl, pp 323–327
Schlimmer J, Granger R (1986) Beyond incremental processing: tracking concept drift. In: National conference on artificial intelligence, Morgan Kaufmann, pp 502–507
Sobhani P, Beigy H (2011) New drift detection method for data streams. In: International conference on adaptive and intelligent systems, vol 6943. Springer, pp 88–97
Street WN, Kim Y (2001) A streaming ensemble algorithm (sea) for large-scale classification. In: ACM SIGKDD International conference on knowledge discovery and data mining, ACM Press, pp 377–382
Tsymbal A (2004) The problem of concept drift: Definitions and related work. Computer Science Department, Trinity College Dublin
Tsymbal A, Pechenizkiy M, Cunningham P, Puuronen S (2006) Handling local concept drift with dynamic integration of classifiers: Domain of antibiotic resistance in nosocomial infections. In: IEEE symposium on computer-based medical systems, IEEE Computer Society, pp 679–684
Wang H, Fan W, Yu PS, Han J (2003) Mining concept-drifting data streams using ensemble classifiers. In: ACM SIGKDD international conference on knowledge discovery and data mining, ACM Press, pp 226–235
Wang S, Schlobach S, Klein M (2011) Concept drift and how to identify it. Web Semantics Science Services and Agents on the World Wide Web 9(3):247–265
Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23:69–101
Xiang C, Chen M, Wang H (2009) An ensemble method for medicine best selling prediction. Int Conf Fuzzy Syst Knowl Discov 1:100–103
Yang Y, Wu X, Zhu X (2006) Mining in anticipation for concept change: proactive–reactive prediction in data streams. Data Min Knowl Discov 13(3):261–289
Zhao QL, Jiang YH, Xu M (2009) A fast ensemble pruning algorithm based on pattern mining process. Data Min Knowl Discov 19(2):277–292
Zliobaite I, Bifet A, Pfahringer B, Holmes G (2011) Active learning with evolving streaming data. In: European conference on machine learning and knowledge discovery in databases, vol 6913. Springer, pp 597–612
Acknowledgments
The conducted research is funded by a grant from Blekinge county research council. The authors would like to thank Elisabeth Håkansson at Blekinge hospital for extracting and compiling patient records as well as the corresponding decisions concerning surgery to enable the generation of the real-world hip-replacement dataset.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Beyene, A.A., Welemariam, T., Persson, M. et al. Improved concept drift handling in surgery prediction and other applications. Knowl Inf Syst 44, 177–196 (2015). https://doi.org/10.1007/s10115-014-0756-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-014-0756-9