Skip to main content

Workflow Clustering Method Based on Process Similarity

  • Conference paper
Computational Science and Its Applications - ICCSA 2006 (ICCSA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3981))

Included in the following conference series:

Abstract

Process-centric information systems have been accumulating a mount of process models. Process designers continue to create new process models and they long for process analysis tools in various viewpoints. This paper proposes a novel approach of process analysis. Workflow clustering facilitates to analyze accumulated workflow process models and classify them into characteristic groups. The framework consists of two phases: domain classification and pattern analysis. Domain classification exploits an activity similarity measure, while pattern analysis does a transition similarity measure. Process models are represented as weighted complete dependency graphs, and then similarities among their graph vectors are estimated in consideration of relative frequency of each activity and transition. Finally, the models are clustered based on the similarities by a hierarchical clustering algorithm. We implemented the methodology and experimented sets of synthetic processes. Workflow clustering is adaptable to various process analyses, such as workflow recommendation, workflow mining, and process patterns analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • van der Aalst, W.M.P., Dongen, B.F., Herbst, J., Maruster, L., Schimm, G., Weijters, A.J.M.: Workflow mining: A survey of issues and approaches. Data & knowledge engineering 47(2), 237–267 (2003)

    Article  Google Scholar 

  • van der Aalst, W.M.P., Weijters, A.J.M.M.: Process Mining: a Research Agenda. Computers in industry 53(3), 231–244 (2004)

    Article  Google Scholar 

  • Bae, J., Bae, H., Kang, S.-H., Kim, Y.: Automatic Control of Workflow Processes Using ECA Rules. IEEE Trans. Knowl. Data Eng. 16(8), 1010–1023 (2004)

    Article  Google Scholar 

  • Bunke, H., Shearer, K.: A Graph Distance Metric based on the Maximal Common Subgraph. Pattern Recognition Letters 19, 255–259 (1998)

    Article  MATH  Google Scholar 

  • Cardoso, J.: How to Measure the Control-flow Complexity of Web processes and Workflows. In: Fischer, L. (ed.) Workflow Handbook 2005, WfMC, Lighthouse Point, pp. 199–212 (2005)

    Google Scholar 

  • Ha, B., Bae, J., Park, Y.T., Kang, S.-H.: Development of process execution rules for workload balancing on agents. Data & Knowl. Eng. 56(1), 64–84 (2006)

    Article  Google Scholar 

  • Hammouda, K.M., Kamel, M.S.: Efficient Phrase-Based Document Indexing for Web Document Clustering. IEEE Trans. on Knowledge and Data Engineering 16(10), 1279–1296 (2004)

    Article  Google Scholar 

  • Hur, W., Bae, H., Kang, S.: Customizable Workflow Monitoring. Concurrent Engineering Research and Applications 11(4), 313–326 (2003)

    Article  Google Scholar 

  • Jung, J., Hur, W., Kang, S., Kim, H.: Business Process Choreography for B2B Collaboration. IEEE Internet Computing 8(1), 37–45 (2004)

    Article  Google Scholar 

  • Kim, Y., Kang, S., Kim, D., Bae, J., Ju, K.: WW-Flow: Web-Based Workflow Management with Runtime Encapsulation. IEEE Internet Computing 4(3), 55–64 (2000)

    Article  Google Scholar 

  • Lian, W., Cheung, W.W., Mamoulis, N., Yiu, S.: An Efficient and Scalable Algorithm for Clustering XML Documents by Structure. IEEE Transactions on Knowledge and Data Engineering 16(1), 82–96 (2004)

    Article  Google Scholar 

  • Malone, T.W., Crowston, K., Herman, G.A.: Organizing Business Knowledge: The MIT Process Handbook. The MIT Press, Cambridge (2003)

    Google Scholar 

  • Reijers, H.A., Vanderfeesten, I.T.P.: Cohesion and Coupling Metrics for Workflow Process Design. In: Desel, J., Pernici, B., Weske, M. (eds.) BPM 2004. LNCS, vol. 3080, pp. 290–305. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  • Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. Advanced Computer Science Series. McGraw-Hill, Auckland (1983)

    MATH  Google Scholar 

  • Simitsis, A., Vassiliadis, P., Sellis, T.: State-Space Optimization of ETL Workflows. IEEE Trans. on Knowledge and Data Engineering 17(10), 1404–1419 (2005)

    Article  Google Scholar 

  • Zamir, O., Etzioni, O.: Web Document Clustering: A Feasibility Demonstration. In: Proc. 21th Int. ACM SIGIR Conference, pp. 46–54 (1998)

    Google Scholar 

  • Zhang, K., Shasha, D.: Simple Fast Algorithms for the Editing Distance between Trees and Related Problems. SIAM Journal of Computing 18(6), 1245–1262 (1989)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jung, JY., Bae, J. (2006). Workflow Clustering Method Based on Process Similarity. In: Gavrilova, M.L., et al. Computational Science and Its Applications - ICCSA 2006. ICCSA 2006. Lecture Notes in Computer Science, vol 3981. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11751588_40

Download citation

  • DOI: https://doi.org/10.1007/11751588_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34072-0

  • Online ISBN: 978-3-540-34074-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics