A Linked Data Approach to Sharing Workflows and Workflow Results

  • Conference paper
Leveraging Applications of Formal Methods, Verification, and Validation (ISoLA 2010)

Abstract

A bioinformatics analysis pipeline is often highly elaborate, due to the inherent complexity of biological systems and the variety and size of datasets. A digital equivalent of the ‘Materials and Methods’ section in wet laboratory publications would be highly beneficial to bioinformatics, for evaluating evidence and examining data across related experiments, while introducing the potential to find associated resources and integrate them as data and services. We present initial steps towards preserving bioinformatics ‘materials and methods’ by exploiting the workflow paradigm for capturing the design of a data analysis pipeline, and RDF to link the workflow, its component services, run-time provenance, and a personalized biological interpretation of the results. An example shows the reproduction of the unique graph of an analysis procedure, its results, provenance, and personal interpretation of a text mining experiment. It links data from Taverna, myExperiment.org, BioCatalogue.org, and ConceptWiki.org. The approach is relatively ‘light-weight’ and unobtrusive to bioinformatics users.
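
The linking idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `example.org` URIs, the predicate names (`usesService`, `executionOf`, `generatedBy`, `interprets`), and the helper functions are all hypothetical, chosen only to show how a workflow, a component service, a provenance record, a result, and a personal interpretation might be connected as RDF-style triples and then traversed as one graph.

```python
# Minimal sketch, standard library only. Every URI and predicate below is a
# hypothetical placeholder, not a vocabulary term taken from the paper.
EX = "http://example.org/"


def uri(local):
    """Wrap a local name as an N-Triples style URI reference."""
    return f"<{EX}{local}>"


# (subject, predicate, object) statements linking the four kinds of resource
# the abstract mentions: workflow, service, run-time provenance, interpretation.
triples = [
    (uri("workflow/1"), uri("vocab#usesService"), uri("service/textMiner")),
    (uri("run/1"), uri("vocab#executionOf"), uri("workflow/1")),
    (uri("result/1"), uri("vocab#generatedBy"), uri("run/1")),
    (uri("interpretation/1"), uri("vocab#interprets"), uri("result/1")),
]


def to_ntriples(statements):
    """Serialize statements one per line in N-Triples syntax."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in statements)


def linked_from(start, statements):
    """Return every resource reachable from `start` by following
    subject -> object links transitively."""
    seen, stack = set(), [start]
    while stack:
        node = stack.pop()
        for s, _p, o in statements:
            if s == node and o not in seen:
                seen.add(o)
                stack.append(o)
    return seen


if __name__ == "__main__":
    print(to_ntriples(triples))
    # Starting from the interpretation, the walk reaches the result, the run
    # that produced it, the workflow that was run, and the service it used.
    for resource in sorted(linked_from(uri("interpretation/1"), triples)):
        print("reachable:", resource)
```

In this sketch, starting from a single interpretation is enough to recover the chain back to the workflow design and its service, which is the kind of traceability a digital 'materials and methods' record would need.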

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Roos, M. et al. (2010). A Linked Data Approach to Sharing Workflows and Workflow Results. In: Margaria, T., Steffen, B. (eds) Leveraging Applications of Formal Methods, Verification, and Validation. ISoLA 2010. Lecture Notes in Computer Science, vol 6415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16558-0_29

  • DOI: https://doi.org/10.1007/978-3-642-16558-0_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16557-3

  • Online ISBN: 978-3-642-16558-0

  • eBook Packages: Computer Science, Computer Science (R0)
