Mining Reference Process Models from Large Instance Data

  • Jana-Rebecca Rehse
  • Peter Fettke
Conference paper
Part of the Lecture Notes in Business Information Processing book series (LNBIP, volume 281)


Reference models provide generic blueprints of process models that are common in a certain industry. When designing a reference model, stakeholders have to cope with the so-called ‘dilemma of reference modeling’, viz., balancing generality against market specificity. In principle, the more details a reference model contains, the fewer situations it applies to. To overcome this dilemma, this paper presents a novel approach to mining a reference model hierarchy from large instance-level data such as execution logs. The approach combines an execution-semantic technique for reference model development with hierarchical-agglomerative cluster analysis and ideas from Process Mining. The result is a reference model hierarchy in which the lower a model is located, the smaller its scope and the higher its level of detail. The approach is implemented as a proof of concept and applied in an extensive case study using the data from the 2015 BPI Challenge.
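The hierarchy idea can be illustrated with a minimal sketch. It is not the authors' implementation: the Jaccard distance over activity sets, the average linkage, the toy traces, and all function names are assumptions chosen for illustration, and the "model" at each node is reduced to the set of activities shared by all traces below it.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Distance between two traces, compared as sets of activities (assumed measure)."""
    sa, sb = set(a), set(b)
    union = sa | sb
    if not union:
        return 0.0
    return 1.0 - len(sa & sb) / len(union)


def average_linkage(c1, c2, traces):
    """Mean pairwise distance between the traces of two clusters."""
    total = sum(jaccard_distance(traces[i], traces[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def build_hierarchy(traces):
    """Repeatedly merge the two closest clusters; return the root of the merge tree.

    Each node is a dict holding the trace indices it covers and its child nodes.
    """
    nodes = [{"members": (i,), "children": []} for i in range(len(traces))]
    while len(nodes) > 1:
        a, b = min(
            combinations(range(len(nodes)), 2),
            key=lambda p: average_linkage(
                nodes[p[0]]["members"], nodes[p[1]]["members"], traces
            ),
        )
        merged = {
            "members": tuple(sorted(nodes[a]["members"] + nodes[b]["members"])),
            "children": [nodes[a], nodes[b]],
        }
        nodes = [n for k, n in enumerate(nodes) if k not in (a, b)] + [merged]
    return nodes[0]


def common_activities(node, traces):
    """Activities shared by every trace under a node (crude stand-in for a model)."""
    return set.intersection(*(set(traces[i]) for i in node["members"]))


# Hypothetical toy log: two variants of an approval path, two of a rejection path.
traces = [
    ["register", "check", "approve"],
    ["register", "check", "approve", "notify"],
    ["register", "inspect", "reject"],
    ["register", "inspect", "reject", "appeal"],
]
root = build_hierarchy(traces)
for child in root["children"]:
    print(child["members"], sorted(common_activities(child, traces)))
print(root["members"], sorted(common_activities(root, traces)))
```

In this toy log the root covers all four traces but retains only the single shared activity, while each child covers two traces and retains three. This mirrors the stated trade-off: a smaller scope lower in the hierarchy comes with a higher level of detail.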


Reference Model Mining · Dilemma of reference modeling · Reference model hierarchy · Inductive reference model development · Trace clustering



The research described in this paper was partly supported by a grant from the German Research Foundation (DFG), project name: Konzeptionelle, methodische und technische Grundlagen zur induktiven Erstellung von Referenzmodellen (Reference Model Mining), support code GZ LO 752/5-1.



Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. Institute for Information Systems (IWi), German Research Center for Artificial Intelligence (DFKI GmbH) and Saarland University, Saarbruecken, Germany
