Skip to main content
Log in

Mining process models with non-free-choice constructs

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Process mining aims at extracting information from event logs to capture the business process as it is being executed. Process mining is particularly useful in situations where events are recorded but there is no system enforcing people to work in a particular way. Consider for example a hospital where the diagnosis and treatment activities are recorded in the hospital information system, but where health-care professionals determine the “careflow.” Many process mining approaches have been proposed in recent years. However, in spite of many researchers’ persistent efforts, there are still several challenging problems to be solved. In this paper, we focus on mining non-free-choice constructs, i.e., situations where there is a mixture of choice and synchronization. Although most real-life processes exhibit non-free-choice behavior, existing algorithms are unable to adequately deal with such constructs. Using a Petri-net-based representation, we will show that there are two kinds of causal dependencies between tasks, i.e., explicit and implicit ones. We propose an algorithm that is able to deal with both kinds of dependencies. The algorithm has been implemented in the ProM framework and experimental results shows that the algorithm indeed significantly improves existing process mining techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • van der Aalst WMP (1998) The application of petri nets to workflow management. J Circuits Syst Comp 8(1):21–66

    Article  Google Scholar 

  • van der Aalst WMP (2004a) Business alignment: using process mining as a tool for delta analysis. In: Grundspenkis J, Kirikova M (eds) Proceedings of the 5th workshop on business process modeling, development and support (BPMDS’04), volume 2 of Caise’04 workshops. Riga Technical University, Latvia, pp 138–145

  • van der Aalst WMP (2004b) Business Process management demystified: a tutorial on models, systems and standards for workflow management. In Desel J, Reisig W, Rozenberg G (eds) Lectures on concurrency and Petri nets, vol 3098 of Lecture Notes in Computer Science. Springer-Verlag, Berlin, pp 1–65

  • van der Aalst WMP, van Hee KM (2002) Workflow management: models, methods, and systems. MIT press, Cambridge, MA

    Google Scholar 

  • van der Aalst WMP and ter Hofstede AHM (2005) YAWL: yet another workflow language. Inform Syst 30(4):245–275

    Article  Google Scholar 

  • van der Aalst WMP, de Medeiros AKA (2004) Process mining and security: detecting anomalous process executions and checking process conformance. In: Busi N, Gorrieri R, Martinelli F (eds) Second international workshop on security issues with petri nets and other computational models (WISP 2004). STAR, Servizio Tipografico Area della Ricerca, CNR Pisa, Italy, pp 69–84

  • van der Aalst WMP, Song M (2004) Mining social networks: uncovering interaction patterns in business processes. In: Desel J, Pernici B, Weske M (eds) International conference on business process management (BPM 2004), vol 3080 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 244–260

  • van der Aalst WMP, Weijters AJMM (2004) (eds) Process mining, special issue of computers in industry, vol 53, number 3. Elsevier Science Publishers, Amsterdam

  • van der Aalst WMP, van Dongen BF, Herbst J, Maruster L, Schimm G, Weijters AJMM (2003) Workflow mining: a survey of issues and approaches. Data Knowl Eng 47(2):237–267

    Article  Google Scholar 

  • van der Aalst WMP, Weijters AJMM, Maruster L (2004) Workflow mining: discovering process models from event logs. IEEE Trans Knowl Data Eng 16(9):1128–1142

    Article  Google Scholar 

  • van der Aalst WMP, Dumas M, Ouyang C, Rozinat A, Verbeek HMW (2005a) Choreography conformance checking: an approach based on BPEL and Petri nets (extended version). BPM Center Report BPM-05-25, BPMcenter.org.

  • van der Aalst WMP, Alves de Medeiros AK, Weijters AJMM (2005b) Genetic process mining. In: Ciardo G, Darondeau P (eds) Applications and theory of Petri nets 2005, vol 3536 of Lecture notes in computer science. Springer-Verlag, Berlin

  • van der Aalst WMP, Weske M, Grünbauer D (2005c) Case handling: a new paradigm for business process support. Data Knowl Eng 53(2):129–162

    Article  Google Scholar 

  • van der Aalst WMP, Alves de Medeiros AK, Weijters AJMM (2006) Process equivalence: comparing two process models based on observed behavior. In: Dustdar S, Faideiro JL, Sheth A (eds) International conference on business process management (BPM 2006), vol 4102 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 129–144

  • Agrawal R, Gunopulos D, Leymann F (1998) Mining process models from workflow logs. In: Sixth international conference on extending database technology, pp 469–483

  • Andrews T, Curbera F, Dholakia H, Goland Y, Klein J, Leymann F, Liu K, Roller D, Smith D, Thatte S, Trickovic I, Weerawarana S (2003) Business process execution language for web services, version 1.1. Standards proposal by BEA Systems, International Business Machines Corporation, and Microsoft Corporation

  • Biermann AW, Feldman JA (1972b) A survey of results in grammatical inference. In: Watanabe S (eds) Frontiers of pattern recognition. Academic Press, pp 31–54

  • Biermann AW, Feldman JA (1972a) On the synthesis of finite-state machines from samples of their behavior. IEEE Trans Comput 21:592–597

    Article  MATH  MathSciNet  Google Scholar 

  • Cook JE, Du Z (2005) Discovering thread interactions in a concurrent system. J Syst Software 77(3):285–297

    Article  Google Scholar 

  • Cook JE, Wolf AL (1998) Discovering models of software processes from event-based data. ACM Trans Software Eng Method 7(3):215–249

    Article  Google Scholar 

  • Cook JE, Du Z, Liu C, Wolf AL (2004) Discovering models of behavior for concurrent workflows. Comput Indus 53(3):297–319

    Article  Google Scholar 

  • CPN Group, University of Aarhus, Denmark. CPN Tools Home Page. http://wiki.daimi.au.dk/cpntools/.

  • Datta A (1998) Automating the discovery of as-is business process models: probabilistic and algorithmic approaches. Inform Sys Res 9(3):275–301

    Google Scholar 

  • Desel J, Esparza J (1995) Free choice petri nets, vol 40 of Cambridge tracts in theoretical computer science. Cambridge University Press, Cambridge, UK

    Google Scholar 

  • Desel J, Reisig W, Rozenberg G (eds) (2004) Lectures on concurrency and petri nets, vol 3098 of Lecture notes in computer science. Springer-Verlag, Berlin

  • van Dongen BF, van der Aalst WMP (2005) A meta model for process mining data. In: Casto J, Teniente E (eds) Proceedings of the CAiSE’05 workshops (EMOI-INTEROP Workshop), vol 2. FEUP, Porto, Portugal, pp 309–320

  • van Dongen B, Alves de Medeiros AK, Verbeek HMW, Weijters AJMM, van der Aalst WMP (2005) The ProM framework: a new era in process mining tool support. In: Ciardo G, Darondeau P (eds) Application and theory of Petri nets 2005, vol 3536 of Lecture notes in computer science, Springer-Verlag, Berlin, pp 444–454

  • Dumas M, van der Aalst WMP, ter Hofstede AHM (2005) Process-aware information systems: bridging people and software through process technology. Wiley & Sons

  • Ehrenfeucht A, Rozenberg G (1989) Partial (set) 2-structures – part 1 and part 2. Acta Inform 27(4):315–368

    Article  MathSciNet  Google Scholar 

  • Greco G, Guzzo A, Pontieri L, Saccá D (2004) Mining expressive process models by clustering workflow traces. In: Proc of advances in kowledge discovery and data mining, 8th Pacific-Asia conference (PAKDD 2004), Sydney, Australia. Springer-Verlag, Berlin, pp 52–62

  • Grigori D, Casati F, Dayal U, Shan MC (2001) Improving business process quality through exception understanding, prediction, and prevention. In: Apers P, Atzeni P, Ceri S, Paraboschi S, Ramamohanarao K, Snodgrass R (eds) Proceedings of 27th international conference on very large data bases (VLDB’01), Roma, Italy. Morgan Kaufmann, pp 159–168

  • Grigori D, Casati F, Castellanos M, Dayal U, Sayal M, Shan MC (2004) Business process intelligence. Comput Indus 53(3):321–343

    Article  Google Scholar 

  • Harel D, Kugler H, Pnueli A (2005) Synthesis revisited: generating statechart models from scenario-based requirements. In: Formal methods in software and systems modeling, vol 3393 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 309–324

  • Herbst J (2000) A machine learning approach to workflow management. In: Proceedings 11th european conference on machine learning, vol 1810 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 183–194

  • IDS Scheer (2002) ARIS process performance manager (ARIS PPM): measure, analyze and optimize your business process performance (whitepaper). IDS Scheer, Saarbruecken, Gemany, http://www.ids-scheer.com

  • Jablonski S, Bussler C (1996) Workflow management: modeling concepts, architecture, and implementation. International Thomson Computer Press, London, UK

  • Keller G, Nüttgens M, Scheer AW (1992) Semantische Processmodellierung auf der Grundlage Ereignisgesteuerter Processketten (EPK). Veröffentlichungen des Instituts für Wirtschaftsinformatik, Heft 89 (in German), University of Saarland, Saarbrücken

  • Leymann F, Roller D (1999) Production workflow: concepts and techniques. Prentice-Hall PTR, Upper Saddle River, New Jersey, USA

  • Liang H, Dingel J, Diskin Z (2006) A comparative survey of scenario-based to state-based model synthesis approaches. In: Proceedings of the 2006 international workshop on scenarios and state machines: models, algorithms, and tools (SCESM06). ACM Press, New York, USA, pp 5–12

  • Alves de Medeiros AK, Guenther CW (2005) Process mining: using cpn tools to create test logs for mining algorithms. In: Jensen K (ed) Proceedings of the sixth workshop on the practical use of coloured Petri nets and CPN tools (CPN 2005), vol 576 of DAIMI, Aarhus, Denmark, October 2005. University of Aarhus, pp 177–190

  • de Medeiros AKA, van der Aalst WMP, Weijters AJMM (2003) Workflow mining: current status and future directions. In: Meersman R, Tari Z, Schmidt DC (eds). On the move to meaningful internet systems 2003: CoopIS, DOA, and ODBASE, vol 2888 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 389–406

  • de Medeiros AKA, van Dongen BF, van der Aalst WMP, Weijters AJMM (2004) Process mining for ubiquitous mobile systems: an overview and a concrete algorithm. In: Baresi L, Dustdar S, Gall H, Matera M (eds) Ubiquitous mobile information and collaboration systems (UMICS 2004), vol 3272 of Lecture notes in computer science, Springer-Verlag, Berlin, pp 154–168

  • zur Mühlen M, Rosemann M (2000) Workflow-based process monitoring and controlling - technical and organizational issues. In: Sprague R (ed) Proceedings of the 33rd Hawaii international conference on system science (HICSS-33). IEEE Computer Society Press, Los Alamitos, California, pp 1–10

  • Pallas Athena (2004) Protos user manual. Pallas Athena BV, Plasmolen, The Netherlands

  • Parekh R, Honavar V (1996) An incremental interactive algorithm for regular grammar inference. In: International colloquium on grammatical inference: learning syntax from sentences (ICGI 1996), ol 1147 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 238–249

  • Parekh R, Honavar VG (2001) Learning DFA from simple examples. Machine Learning 44(1-2): 9–35

    Article  MATH  Google Scholar 

  • Rozinat A, van der Aalst WMP (2005) Conformance testing: measuring the fit and appropriateness of event logs and process models. In: Castellanos M, Weijters T (eds) First international on business process intelligence (BPI’05), Nancy, France, September 2005. Springer-Verlag, Berlin, pp 1–12

  • Rozinat A, van der Aalst WMP (2006) Conformance testing: measuring the fit and appropriateness of event logs and process models. In: Bussler C et al (eds) BPM 2005 workshops, vol 3812 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 163–176

  • Sarbanes P, Oxley G et al. Sarbanes-Oxley Act of 2002

  • Sayal M, Casati F, Dayal U, Shan MC (2002) Business process cockpit. In: Proceedings of 28th international conference on very large data bases (VLDB’02), Hong Kong, China. Morgan Kaufmann, pp 880–883

  • Scheer AW (2000) ARIS: business process modelling. Springer-Verlag, Berlin

    Google Scholar 

  • Scott J (1992) Social network analysis. Sage, Newbury Park CA

  • TIBCO (2005) TIBCO staffware process monitor (SPM). http://www.tibco.com

  • Wasserman S, Faust K (1994) Social network analysis: methods and applications. Cambridge University Press, Cambridge

    Google Scholar 

  • Weijters AJMM, van der Aalst WMP (2002) Workflow mining: discovering workflow models from event-based data. In: Dousson C, Höppner F, Quiniou R (eds) Proceedings of the ECAI workshop on knowledge discovery and spatial data, Lyon, France. IOS Press, pp 78–84

  • Weijters AJMM, van der Aalst WMP (2003) Rediscovering workflow models from event-based data using little thumb. Integr comput-Aided Eng 10(2):151–162

    Google Scholar 

  • Wen L, Wang J, van der Aalst WMP, Wang Z, Sun J (2004) A novel approach for process mining based on event types. BETA Working Paper Series, WP 118, Eindhoven University of Technology, Eindhoven

  • Wen L, Wang J, Sun J (2006) Detecting implicit dependencies between tasks from event logs. In: Zhou X, Lin X, Lu H et al. (eds) The 8th Asia-Pacific web conference (APWeb 2006), vol 3841 of Lecture notes in computer science. Springer-Verlag, Berlin, pp 591–603

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lijie Wen.

Additional information

Responsible editor: Eamonn Keogh.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wen, L., van der Aalst, W.M.P., Wang, J. et al. Mining process models with non-free-choice constructs. Data Min Knowl Disc 15, 145–180 (2007). https://doi.org/10.1007/s10618-007-0065-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-007-0065-y

Keywords

Navigation