Abstract
In predictive business process monitoring current and historical process data from event logs is used to predict the evolvement of running process instances. A wide number of machine learning approaches, especially different types of artificial neural networks, are successfully applied for this task. Nevertheless, experimental studies revealed that the resulting predictive models are not able to properly predict non-frequent activities. In this paper we investigate the usefulness of the concept of cost-sensitive learning, which introduces a cost model for different activities to better represent them in the training phase. An evaluation of this concept applied to common predictive monitoring approaches on various real life event logs shows encouraging results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The source code can be accessed at https://github.com/mkaep/.
- 2.
- 3.
i.e., it can be calculated from true positives (TP), true negatives (TN), false positives (FP), and true positive (TP).
References
van der Aalst, W.: Process Mining - Discovery, Conformance and Enhancement of Business Processes. Springer, Heidelberg (2011)
Al-Jebrni, A.H., Cai, H., Jiang, L.: Predicting the next process event using convolutional neural networks. In: IEEE International Conference on PIC, pp. 332–338 (2018)
Breiman, L., Friedman, J., Stone, C., Olshen, R.: Classification and Regression Trees. Taylor & Francis (1984)
Camargo, M., Dumas, M., González-Rojas, O.: Learning accurate LSTM models of business processes. In: Hildebrandt, T., van Dongen, B.F., Röglinger, M., Mendling, J. (eds.) BPM 2019. LNCS, vol. 11675, pp. 286–302. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-26619-6_19
Di Mauro, N., Appice, A., Basile, T.M.A.: Activity prediction of business process instances with inception CNN models. In: Alviano, M., Greco, G., Scarcello, F. (eds.) AI*IA 2019. LNCS (LNAI), vol. 11946, pp. 348–361. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-35166-3_25
Elkan, C.: The foundations of cost-sensitive learning. In: Proceedings of IJCAI 2001, pp. 973–978. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Evermann, J., Rehse, J.R., Fettke, P.: Predicting process behaviour using deep learning. Decis. Support Syst. 100, 129–140 (2017)
Fernández, A., García, S., Galar, M., Prati, R.C., Krawczyk, B., Herrera, F.: Learning from Imbalanced Data Sets. Springer, Heidelberg (2018)
He, H., Ma, Y.: Imbalanced Learning: Foundations, Algorithms, and Applications, 1st edn. Wiley-IEEE Press (2013)
Hinkka, M., Lehto, T., Heljanko, K.: Exploiting event log event attributes in RNN based prediction. In: Welzer, T., et al. (eds.) ADBIS 2019. CCIS, vol. 1064, pp. 405–416. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-30278-8_40
Japkowicz, N., Stephen, S.: The class imbalance problem: a systematic study. Intell. Data Anal. 6(5), 429–449 (2002)
Käppel, M.: Evaluating predictive business process monitoring approaches on small event logs. arXiv (2021)
Lin, L., Wen, L., Wang, J.: A deep predictive model for multi-attribute event sequence. In: Proceedings of the International Conference on Data Mining 2019, pp. 118–126 (2019)
Mehdiyev, N., Evermann, J., Fettke, P.: A multi-stage deep learning approach for business process event prediction. In: IEEE 19th CBI, pp. 119–128 (2017)
Ortigosa-Hernández, J., Inza, I., Lozano, J.A.: Measuring the class-imbalance extent of multi-class problems. Pattern Recogn. Lett. 98, 32–38 (2017)
Pasquadibisceglie, V., Appice, A., Castellano, G., Malerba, D.: Using convolutional neural networks for predictive process analytics. In: Proceedings of ICPM 2019 (2019)
Schönig, S., Jasinski, R., Ackermann, L., Jablonski, S.: Deep learning process prediction with discrete and continuous data features. In: Proceedings of ENASE 2018 (2018)
Sun, Y., Kamel, M.S., Wong, A.K., Wang, Y.: Cost-sensitive boosting for classification of imbalanced data. Pattern Recogn. 40(12), 3358–3378 (2007)
Tax, N., Verenich, I., La Rosa, M., Dumas, M.: Predictive business process monitoring with LSTM neural networks. In: Dubois, E., Pohl, K. (eds.) CAiSE 2017. LNCS, vol. 10253, pp. 477–492. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59536-8_30
Teinemaa, I., Dumas, M., Rosa, M.L., Maggi, F.M.: Outcome-oriented predictive process monitoring: review and benchmark. TKDD 13(2) (2019)
Theis, J., Darabi, H.: Decay replay mining to predict next process events. IEEE Access 7, 119787–119803 (2019). https://doi.org/10.1109/ACCESS.2019.2937085
Wu, J., Wan, L., Xu, Z.: Algorithms to discover complete frequent episodes in sequences. In: Cao, L., Huang, J.Z., Bailey, J., Koh, Y.S., Luo, J. (eds.) PAKDD 2011. LNCS (LNAI), vol. 7104, pp. 267–278. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28320-8_23
Zhang, X., Hu, B.: A new strategy of cost-free learning in the class imbalance problem. IEEE Trans. Knowl. Data Eng. 26(12), 2872–2885 (2014)
Zhou, Z.-H., Liu, X.-Y.: Training cost-sensitive NN with methods addressing the class imbalance problem. IEEE Trans. Knowl. Data Eng. 63–77 (2006)
Zhou, Z.H., Liu, X.Y.: On multi-class cost-sensitive learning. In: Proceedings of the 21st National Conference on AI, pp. 567–572. AAAI Press (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Käppel, M., Jablonski, S., Schönig, S. (2021). Cost-Sensitive Predictive Business Process Monitoring. In: Bellatreche, L., et al. New Trends in Database and Information Systems. ADBIS 2021. Communications in Computer and Information Science, vol 1450. Springer, Cham. https://doi.org/10.1007/978-3-030-85082-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-85082-1_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-85081-4
Online ISBN: 978-3-030-85082-1
eBook Packages: Computer ScienceComputer Science (R0)