Preparing Data at the Source to Foster Interoperability across Rare Disease Resources
The ability to combine heterogeneous data distributed across the globe is critically important to boost research on rare diseases, but it presents a number of methodological, representational and automation challenges. In this scenario, biomedical ontologies are of critical importance for enabling computers to aid in information retrieval and analysis across data collections.
This chapter presents an approach to preparing rare disease data for integration through the application of a global standard for computer-readable data and knowledge. This includes the use of common data elements, ontological codes and computer-readable data. This approach was developed under a number of domain-relevant requirements, such as controlled access to data, independence of the original sources, and the desire to combining the data sources with other computational workflows and data platforms.
KeywordsOntologies FAIR approach Linkable data Data integration Standardization Semantic model
- 4.International Rare Disease Research Consortium (IRDiRC) Policies and Guidelines, Long version (2013). Available from: http://www.irdirc.org/wp-content/uploads/2013/06/IRDiRC_policies_24MayApr2013.pdf. Accessed Dec 2016
- 8.Lynch C, Parastatidis S, Jacobs N, Van de Sompel H, Lagoze C (2007) The OAI-ORE Effort: Progress, Challenges, Synergies. Proceedings of the 7th ACM/IEEE-CS joint conference on digital libraries 80-80Google Scholar
- 10.McMurry J, Blomberg N, Burdett T, Conte N, Dumontier M et al (2015) 10 Simple rules for design, provision, and reuse of identifiers for web-based life science data. Zenodo. Available from: https://doi.org/10.5281/zenodo.31765. Accessed Dec 2016
- 12.Miles A, Bechhofer S (2009) SKOS Simple Knowledge Organization System Reference. World Wide Web Consortium. Available from: http://www.w3.org/TR/skos-reference/. Accessed Dec 2016
- 13.Orphanet Standard Operating Procedures, Version 02.1 (2016) Available from: http://www.orpha.net/orphacom/special/eproc_SOPs_V2.pdf. Accessed Dec 2016
- 16.RD-Connect “Bring Your Own Data (BYOD)” Workshop to Link Rare Disease Registries (September 29–30, 2016) National centre for rare diseases, Istituto Superiore di Sanità, Rome. Available from: http://www.iss.it/binary/cnmr4/cont/RD_Connect_BYOD_2016_preliminary_programme_rev12.07.2016.pdf. Accessed Dec 2016
- 17.Roos M, Wilkinson MD, Kaliyaperumal R, Thompson M, Carta C et al (2016) Registries of domain-relevant semantic reference models help bootstrap interoperability in domains with fragmented data resources. Proceedings of the 9th International Semantic Web Applications and Tools for the Life Sciences (SWAT4LS) Conference. Available from: http://www.swat4ls.org/wp-content/uploads/2016/10/paper-16.pdf. Accessed Dec 2016
- 22.Weibel S, Kunze J, Lagoze C, Wolf M (1998) Dublin core metadata for resource discovery. Available from: http://www.rfc-editor.org/info/rfc2413. Accessed: Dec 2016
- 23.Whetzel PL, Noy NF, Shah NH, Alexander PR, Nyulas C et al (2011) BioPortal: enhanced functionality via new web services from the National Center for Biomedical Ontology to access and use ontologies in software applications. Nucleic Acids Res 39(Web Server issue):W541–W545CrossRefPubMedPubMedCentralGoogle Scholar
- 24.Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M et al (2016) The FAIR guiding principles for scientific data management and stewardship. Sci Data 3(March):1600018Google Scholar