Integration of Disease Entries Across OMIM, Orphanet, and a Proprietary Knowledge Base
Integrating disease databases benefits physicians searching for disease information. However, current algorithmic matching is not powerful enough to automate the integration process. This paper reports our attempt to manually integrate disease entries from two public disease databases, Online Mendelian Inheritance in Man (OMIM) and Orphanet, with a proprietary disease knowledge base. During the process, we found that relations between synonyms require special handling, and we propose a set of resolution rules. Situations encountered throughout the integration suggest that variations in the cross-references would facilitate future integration of distinct disease databases.
Keywords: Disease knowledge base; Semantic integration; Ontology alignment