Abstract
Semantic interoperability based on ontologies allows systems to combine their information and process them automatically. The ability to extract meaningful fragments from ontology is a key for the ontology re-use and the construction of a subset will help to structure clinical data entries. The aim of this work is to provide a method for extracting a set of concepts for a specific domain, in order to help to define data elements of an EHR. Method: an extraction algorithm was developed to extract, from the National Cancer Institute’s Thesaurus (NCIT) and for a specific disease (i.e. prostate neoplasm), all the concepts of interest into a sub-ontology. The extraction takes as input, one or several key concepts. We compared all the concepts extracted to the concepts encoded manually and contained into the multi-disciplinary meeting report form (MDMRF). Results: NCIT contained 83143 concepts from which we extracted two sub-ontologies: sub-ontology 1 contained 434 concepts (by using a single key concept) and sub-ontology 2 contained 480 concepts (by using 5 additional keywords). The coverage of sub-ontology 2 to the MDMRF concepts was 51%. The low rate of coverage is due to the lack of definition or misclassification of the NCIT concepts. By providing a subset of concepts focused on a particular domain, this extraction method helps to optimize the binding process of data elements and at maintaining and enriching a domain ontology.
Preview
Unable to display preview. Download preview PDF.
Références
HL7 Templates. Disponible sur: 〈http://www.hl7.org/Special/committees/template/index.cfm〉 (Consulté le 01.12.2010)
Kalra D, Beale T, Heard S. The openEHR Foundation. Studies in Health Technology and Informatics 2005; 115: 153–173
Data element — Wikipedia, the free encyclopedia. Disponible sur: 〈http://en.wikipedia.org/wiki/Data_element〉 (Consulté le 01.12.2010)
Rosenbloom ST, Miller RA, Johnson KB, Elkin PL, Brown SH. Interface Terminologies: Facilitating Direct Entry of Clinical Data into Electronic Health Record Systems. J American Medical Informatics Association 2006; 13: 277–288
Daniel C, Buemi A, Mazuel L, Ouagne D, Charlet J. Functional Requirements of Terminology Services for Coupling Interface Terminologies to Reference Terminologies. MIE 2009; 205–209
Pathak J, Jiang G, Dwarkanath SO, Buntrock JD, Chute CG, Chute C. LexValueSets: an approach for context-driven value sets extraction. AMIA Annu Symposium Proc 2008; 556–560
Noy N, Musen M. The PROMPT suite: Interactive tools for ontology mapping and merging. International Journal of Human-Computer Studies 2003; 6(59): 983–1024
Seidenberg J, Rector A. Web ontology segmentation: analysis, classification and use. Proceedings of the 15th international conference on World Wide Web. ACM 2006; 13–22
Stuckenschmidt H, Klein M. Structure-based partitioning of large class hierarchies. Proc. ISWC 2004; 289–303
Rector A, Napoli A, Stamou G, Stoilos G, Wolger H, Pan J, D’Aquin M, Spaccapietra S, Tzouvaras V. Report on modularization of ontologies. Technical report, Knowledge Web Deliverable D2.1.3.1, (2005)
Ocean terminology toolset: Oceaninformatics product description, July 2007. Page 16–18. Format PDF. Disponible sur: 〈http://www.oceaninformatics.com/media/docs/ProductDescription-e95c85c6-4197-4700-8c5c-5de85561f009.pdf〉 (Consulté le 01.12.2010)
Fung KW, McDonald C, Srinivasan S. The UMLS-CORE project: a study of the problem list terminologies used in large healthcare institutions. J American Medical Informatics Association 2010; 17: 675–680
Classification TNM — Wikipedia, the free encyclopedia. Disponible sur: 〈http://en.wikipedia.org/wiki/Classification_TNM〉 (Consulté le 01.12.2010)
Lindberg DA, Humphreys BL, McCray AT. The Unified Medical Language System. Methods Inf Med 1993; 32: 281–91
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag France
About this chapter
Cite this chapter
Bourdé, A. et al. (2011). Vers la définition automatique des éléments de données des fiches RCP en cancérologie à partir d’une ontologie. In: Staccini, P.M., Harmel, A., Darmoni, S.J., Gouider, R. (eds) Systèmes d’information pour l’amélioration de la qualité en santé. Informatique et Santé, vol 1. Springer, Paris. https://doi.org/10.1007/978-2-8178-0285-5_11
Download citation
DOI: https://doi.org/10.1007/978-2-8178-0285-5_11
Publisher Name: Springer, Paris
Print ISBN: 978-2-8178-0284-8
Online ISBN: 978-2-8178-0285-5