An Open Repository Model for Acquiring Knowledge About Scientific Experiments

  • Conference paper
Knowledge Engineering and Knowledge Management (EKAW 2016)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10024)

Abstract

The availability of high-quality metadata is key to facilitating discovery in the large variety of scientific datasets that are increasingly becoming publicly available. However, despite the recent focus on metadata, the diversity of metadata representation formats and the poor support for semantic markup typically result in metadata that are of poor quality. There is a pressing need for a metadata representation format that provides strong interoperation capabilities together with robust semantic underpinnings. In this paper, we describe such a format, together with open-source Web-based tools that support the acquisition, search, and management of metadata. We outline an initial evaluation using metadata from a variety of biomedical repositories.

Notes

  1. The @-prefixed notation follows JSON-LD; see Sect. 3.2.

  2. A complete template model specification can be found at http://metadatacenter.org/cedar-template-model.

  3. A JSON Schema validator can be found at http://www.jsonschemavalidator.net.

  4. A useful online JSON-LD tool can be found at http://json-ld.org/playground.

  5. The CEDAR Workbench is available at https://cedar.metadatacenter.net.

Acknowledgments

CEDAR is supported by the National Institutes of Health through an NIH Big Data to Knowledge program under grant 1U54AI117925. NCBO is supported by the NIH Common Fund under grant U54HG004028. We appreciate the collaborations offered by the ImmPort, BioSharing, HIPC, and LINCS communities.

Author information

Corresponding author

Correspondence to Martin J. O’Connor.

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

O’Connor, M.J., Martínez-Romero, M., Egyedi, A.L., Willrett, D., Graybeal, J., Musen, M.A. (2016). An Open Repository Model for Acquiring Knowledge About Scientific Experiments. In: Blomqvist, E., Ciancarini, P., Poggi, F., Vitali, F. (eds) Knowledge Engineering and Knowledge Management. EKAW 2016. Lecture Notes in Computer Science, vol 10024. Springer, Cham. https://doi.org/10.1007/978-3-319-49004-5_49

  • DOI: https://doi.org/10.1007/978-3-319-49004-5_49

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-49003-8

  • Online ISBN: 978-3-319-49004-5

  • eBook Packages: Computer Science (R0)
