Semantic Mediation to Improve Reproducibility for Biomolecular NMR Analysis
Two barriers to computational reproducibility are the difficulty of recording the critical metadata required to rerun a computation and the difficulty of translating the semantics of that metadata so that alternate approaches can easily be configured to verify the result. We are addressing this problem in the context of biomolecular NMR computational analysis by developing a series of linked ontologies that define the semantics of the various software tools used by researchers for data transformation and analysis. Building from a core ontology representing the primary observational data of NMR, the linked data approach allows metadata to be translated in order to configure alternate software approaches for a given computational task. In this paper we illustrate the utility of this approach with a small sample of the core ontology as well as tool-specific semantics for two third-party software tools. This approach to semantic mediation will help support automated validation of the reliability of a computation, in which the same processing workflow is implemented with different software tools. In addition, the detailed semantics of both the data and the processing functionalities will provide a method for software tool classification.
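The translation step described in the abstract can be pictured with a minimal sketch. It assumes each tool-specific ontology links its own parameter names to concepts in a shared core ontology, so that metadata recorded for one tool can be re-expressed for another by passing through the core concepts. Everything named here is a hypothetical placeholder introduced for illustration: the two tools, their parameter names, the `core:` identifiers, and the `translate` helper are not taken from the paper's ontologies or from any real NMR software.

```python
# Hypothetical sketch: tool names, parameter names, and "core:" concept
# identifiers are illustrative placeholders, not terms from the actual ontologies.

# Each tool-specific mapping links that tool's parameter names to core concepts.
TOOL_A_TO_CORE = {
    "sw_hz": "core:SpectralWidth",
    "n_points": "core:NumberOfPoints",
}
TOOL_B_TO_CORE = {
    "SpectralWidth": "core:SpectralWidth",
    "Size": "core:NumberOfPoints",
}


def translate(metadata, source_to_core, target_to_core):
    """Translate tool-specific metadata by way of the shared core concepts.

    Returns the translated metadata plus the list of source parameters
    that have no counterpart in the target tool.
    """
    # Invert the target mapping so core concepts point at target parameter names.
    core_to_target = {core: name for name, core in target_to_core.items()}
    translated, unmapped = {}, []
    for name, value in metadata.items():
        core = source_to_core.get(name)       # None if the source term is unknown
        target = core_to_target.get(core)     # None if the target lacks the concept
        if target is None:
            unmapped.append(name)
        else:
            translated[target] = value
    return translated, unmapped


# Metadata recorded for hypothetical tool A, including one field with no core concept.
tool_a_config = {"sw_hz": 8000.0, "n_points": 1024, "comment": "test run"}
tool_b_config, unmapped = translate(tool_a_config, TOOL_A_TO_CORE, TOOL_B_TO_CORE)
print(tool_b_config)
print(unmapped)
```

Reporting the unmapped parameters separately, rather than silently dropping them, is a design choice of this sketch: it makes visible where the two tools' semantics do not align. A realistic mediation layer would likely also need unit and value conversions, which this sketch omits.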
Keywords: Ontology, Computational reproducibility, Provenance
This work was supported in part by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number GM-111135.