Outlier Detection Techniques for Process Mining Applications

Ghionna, Lucantonio; Greco, Gianluigi; Guzzo, Antonella; Pontieri, Luigi

doi:10.1007/978-3-540-68123-6_17

Lucantonio Ghionna¹,
Gianluigi Greco¹,
Antonella Guzzo² &
…
Luigi Pontieri³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4994))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

1303 Accesses
26 Citations

Abstract

Classical outlier detection approaches may hardly fit process mining applications, since in these settings anomalies emerge not only as deviations from the sequence of events most often registered in the log, but also as deviations from the behavior prescribed by some (possibly unknown) process model. These issues have been faced in the paper via an approach for singling out anomalous evolutions within a set of process traces, which takes into account both statistical properties of the log and the constraints associated with the process model. The approach combines the discovery of frequent execution patterns with a cluster-based anomaly detection procedure; notably, this procedure is suited to deal with categorical data and is, hence, interesting in its own, given that outlier detection has mainly been studied on numerical domains in the literature. All the algorithms presented in the paper have been implemented and integrated into a system prototype that has been thoroughly tested to assess its scalability and effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Apostolico, A., Bock, M.E., Lonardi, S., Xu, X.: Efficient detection of unusual words. Journal of Computational Biology 7(1/2), 71–94 (2000)
Article Google Scholar
Dhillon, I.S., Mallela, S., Modha, D.S.: Information-theoretic co-clustering. In: Proc. 9th ACM SIGKDD Conf. on Knowledge Discovery and Data Mining (KDD 2003), pp. 89–98 (2003)
Google Scholar
Dustdar, S., Hoffmann, T., van der Aalst, W.M.P.: Mining of ad-hoc business processes with teamlog. Data and Knowledge Engineering 55(2), 129–158 (2005)
Article Google Scholar
Enright, A.J., Van Dongen, S., Ouzounis, C.A.: An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30(7), 1575–1584 (2002)
Article Google Scholar
Fawcett, T.E., Provost, F.: Fraud detection. In: Handbook of data mining and knowledge discovery, pp. 726–731. Oxford University Press, Oxford (2002)
Google Scholar
Greco, G., Guzzo, A., Pontieri, L., Saccà, D.: Discovering expressive process models by clustering log traces. IEEE Trans. on Knowledge and Data Engin. 18(8), 1010–1027 (2006)
Article Google Scholar
He, Z., Xu, Z., Huang, J.Z., Deng, S.: Fp-outlier: Frequent pattern based outlier detection. In: Hao, Y., Liu, J., Wang, Y.-P., Cheung, Y.-m., Yin, H., Jiao, L., Ma, J., Jiao, Y.-C. (eds.) CIS 2005. LNCS (LNAI), vol. 3801, pp. 735–740. Springer, Heidelberg (2005)
Chapter Google Scholar
Jaing, M.F., Tseng, S.S., Su, C.M.: Two-phase clustering process for outliers detection. Pattern Recogn. Lett. 22(6-7), 691–700 (2001)
Article Google Scholar
Jiang, S., Song, X., Wang, H., Han, J.-J., Li, Q.-H.: A clustering-based method for unsupervised intrusion detections. Pattern Recogn. Lett. 27(7), 802–810 (2006)
Article Google Scholar
Maruster, L., Weijters, A.J.M.M., van der Aalst, W.M.P., van den Bosch, A.: A rule-based approach for process discovery: Dealing with noise and imbalance in process logs. Data Mining and Knowledge Discovery 13(1), 67–87 (2006)
Article MathSciNet Google Scholar
Motahari Nezhad, H.R., Saint-Paul, R., Benatallah, B., Casati, F.: Protocol discovery from imperfect service interaction logs. In: Proc. of ICDE 2007, pp. 1405–1409 (2007)
Google Scholar
van der Aalst, W.M.P., van Dongen, B.F., Herbst, J., Maruster, L., Schimm, G., Weijters, A.: Workflow mining: a survey of issues and approaches. Data & Know. Engin. 47(2), 237–267 (2003)
Article Google Scholar
Yu, D., Sheikholeslami, G., Zhang, A.: Findout: finding outliers in very large datasets. Knowledge Information Systems 4(4), 387–412 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Mathematics, UNICAL, Via P. Bucci 30B, 87036, Rende, Italy
Lucantonio Ghionna & Gianluigi Greco
DEIS, UNICAL, Via P. Bucci 30B, 87036, Rende, Italy
Antonella Guzzo
ICAR-CNR, Via P. Bucci 41C, 87036, Rende, Italy
Luigi Pontieri

Authors

Lucantonio Ghionna
View author publications
You can also search for this author in PubMed Google Scholar
Gianluigi Greco
View author publications
You can also search for this author in PubMed Google Scholar
Antonella Guzzo
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Pontieri
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Aijun An Stan Matwin Zbigniew W. Raś Dominik Ślęzak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ghionna, L., Greco, G., Guzzo, A., Pontieri, L. (2008). Outlier Detection Techniques for Process Mining Applications. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds) Foundations of Intelligent Systems. ISMIS 2008. Lecture Notes in Computer Science(), vol 4994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68123-6_17

Download citation

DOI: https://doi.org/10.1007/978-3-540-68123-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68122-9
Online ISBN: 978-3-540-68123-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics