Abstract
In order to conduct analyses on genome data, different calculation steps have to be done in a specific order, which constitutes a genome data processing pipeline. Still a lot of research is in process, in order to find faster and more reliable ways to do various analyses, so single steps or the whole sequence of the pipelines might be subject to change.Amodular and flexible way to configure pipelines could simplify their use and the sharing of pipelines between researchers. With a possibility to configure pipelines without altering source code, bioinformaticians and technicians would be relieved of the task to rewrite a pipeline every time a single algorithm changes. This contribution proposes to use common process modeling tools for the abstract representation of genome data processing pipelines. The benefits and drawbacks of different process model notations are examined with special focus on the possibilities to specify execution semantics. As a prototype, a system for the parsing and execution of genome data processing pipelines specified in business process model and notation, is introduced.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
van der Aalst WMP (1998) The Application of Petri Nets toWorkflow Management. The Journal of Circuits, Systems and Computers 8(1):21–66
Booch G, Rumbaugh J, Jacobson I (1998) The Unified Modeling Language User Guide, 1st edn. Addison-Wesley, Reading, MA
Dijkman RM, Gorp PV (2011) BPMN 2.0 Execution Semantics Formalized as Graph Rewrite Rules. In: Business Process Modeling Notation, Lecture Notes in Business Information Processing, vol 67, Springer, pp 16–30
Flicek P et al. (2012) Ensembl 2013. Nucleic Acids Research 41(D1):D48–D55
Goecks J, Nekrutenko A, Taylor J (2010) Galaxy: A Comprehensive Approach for Supporting Accessible, Reproducible, and Transparent Computational Research in the Life Sciences. Genome Biology 11(8)
Huser V et al. (2011) Implementation of Workflow Engine Technology to Deliver Basic Clinical Decision Support Functionality. BMC Medical Research Methodology 11(1):43
Ko RKL, Lee SSG, Lee EW (2009) Business Process Management (BPM) Standards: A Survey. Business Process Management Journal 15(5):744–791
Marshall C (1999) Enterprise Modeling with UML – Designing Successful Software Through Business Analysis. Addison-Wesley
Object Management Group (2011) Business Process Model and Notation (BPMN). http://www.omg.org/spec/BPMN/2.0/PDF/. Accessed Sep 23, 2013
Rumbaugh J, Jacobson I, Booch G (2004) Unified Modeling Language Reference Manual, The (2nd Edition). Pearson Higher Education
Russell N et al. (2006) On the Suitability of UML 2.0 Activity Diagrams for Business Process Modelling. In: Proceedings of the 3rd Asia-Pacific Conference on Conceptual Modelling, Australian Computer Society, Hobart, Australia, vol 53, pp 95–104
Salimifard K, Wright M (2001) Petri Net-based Modelling ofWorkflow Systems: An Overview. European Journal of Operational Research 134(3):664–676
WeskeM (2007) Business Process Management – Concepts, Languages, Architectures. Springer
Wheeler DL et al. (2008) Database Resources of the National Center for Biotechnology Information. Nucleic Acids Research 36:D13–D21
Wong PYH, Gibbons J (2008) A Process Semantics for BPMN. In: Liu S, Maibaum T, Araki K (eds) Formal Methods and Software Engineering, no. 5256 in Lecture Notes in Computer Science, Springer Berlin Heidelberg, pp 355–374
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Schäffer, M. (2014). Modeling Genome Data Processing Pipelines. In: Plattner, H., Schapranow, MP. (eds) High-Performance In-Memory Genome Data Analysis. In-Memory Data Management Research. Springer, Cham. https://doi.org/10.1007/978-3-319-03035-7_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-03035-7_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03034-0
Online ISBN: 978-3-319-03035-7
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)