A Unified Approach for Measuring Precision and Generalization Based on Anti-alignments
The holy grail in process mining is an algorithm that, given an event log, produces fitting, precise, properly generalizing and simple process models. While there is consensus on the existence of solid metrics for fitness and simplicity, current metrics for precision and generalization have important flaws, which hamper their applicability in a general setting. In this paper, a novel approach to measure precision and generalization is presented, which relies on the notion of anti-alignments. An anti-alignment describes highly deviating model traces with respect to observed behavior. We propose metrics for precision and generalization that resemble the leave-one-out cross-validation techniques, where individual traces of the log are removed and the computed anti-alignment assess the model’s capability to describe precisely or generalize the observed behavior. The metrics have been implemented in ProM and tested on several examples.
KeywordsProcess Mining Edit Distance Single Trace Abstraction Mechanism Precision Metrics
This work has been partially supported by funds from the Spanish Ministry for Economy and Competitiveness (MINECO), the European Union (FEDER funds) under grant COMMAS (ref. TIN2013-46181-C2-1-R).
- 3.Adriansyah, A.: Aligning observed and modeled behavior. Ph.D. thesis, Eindhoven (2014)Google Scholar
- 4.Munoz-Gama, J.: Conformance checking and diagnosis in process mining. Ph.D. thesis, Universitat Politecnica de Catalunya (2014)Google Scholar
- 8.Rozinat, A.: Process mining: conformance and extension. Ph.D. thesis (2010)Google Scholar
- 9.van der Aalst, W.M.P., Adriansyah, A., van Dongen, B.F.: Replaying history on process models for conformance checking and performance analysis. Wiley Interdisc. Rev.: Data Min. Knowl. Disc. 2(2), 182–192 (2012)Google Scholar