Biomedical Concepts Extraction Based on Possibilistic Network and Vector Space Model
This paper proposes a new approach for indexing biomedical documents that combines a Possibilistic Network with a Vector Space Model. The latter carries out partial matching between documents and biomedical vocabularies. The main contribution of the proposed approach is to combine cosine similarity with the two measures of possibility and necessity to improve the estimate of the similarity between a document and a given concept. Possibility estimates the extent to which a document is not similar to the concept, while necessity confirms that the document is similar to the concept. Experiments carried out on the OHSUMED corpus showed encouraging results.
Keywords: Indexing, Biomedical documents, Possibilistic network, Vector space model, Partial matching
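As an illustration of the partial matching performed by the Vector Space Model, the following is a minimal sketch of cosine similarity between a document and a concept label. It is not the authors' implementation: the whitespace tokenization, the raw term-frequency weighting, the function name `cosine_similarity`, and the sample strings are all assumptions made for the example. The possibility and necessity measures are not reproduced here, since the abstract does not define them.

```python
import math
from collections import Counter


def cosine_similarity(doc_tokens, concept_tokens):
    """Cosine of the angle between two raw term-frequency vectors.

    Hypothetical helper: the weighting scheme (plain term counts) is an
    assumption, not the scheme used in the paper.
    """
    doc_tf = Counter(doc_tokens)
    concept_tf = Counter(concept_tokens)
    # Dot product over the terms that both vectors share.
    shared_terms = doc_tf.keys() & concept_tf.keys()
    dot = sum(doc_tf[t] * concept_tf[t] for t in shared_terms)
    doc_norm = math.sqrt(sum(v * v for v in doc_tf.values()))
    concept_norm = math.sqrt(sum(v * v for v in concept_tf.values()))
    # An empty vector has no direction, so treat the similarity as zero.
    if doc_norm == 0 or concept_norm == 0:
        return 0.0
    return dot / (doc_norm * concept_norm)


# Illustrative inputs only; they are not taken from the OHSUMED corpus.
document = "chronic kidney failure in elderly patients".split()
concept = "kidney failure chronic".split()
unrelated = "lung neoplasms".split()

score_match = cosine_similarity(document, concept)
score_unrelated = cosine_similarity(document, unrelated)
```

The match is partial in the sense that the concept scores above zero even though the document contains extra words and a different word order, while a concept sharing no terms with the document scores zero.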