Abstract
The main idea in this paper is to incorporate medical knowledge into the language modeling approach to information retrieval (IR). Our model uses the textual part of the ImageCLEFmed corpus together with the medical knowledge found in the Unified Medical Language System (UMLS) knowledge sources. UMLS allows us to create a conceptual representation of each sentence in the corpus, and we use these representations to build a graph model for each document. As in the standard language modeling approach, we evaluate the probability that a document graph model generates the query graph. Graphs are created from medical texts and queries, and are built for different languages with different methods. After developing the graph model, we present our tests, which mix different concept sources (i.e., languages and methods) when matching the query and text graphs. The results show that a language model over concepts performs well in IR, and that multiplying the concept sources improves the results further. Lastly, using relations between concepts (provided by the graphs under consideration) improves results when only a few conceptual sources are used to analyze the query.
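To make the scoring idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes a unigram language model over concept identifiers with Jelinek-Mercer smoothing, and it assumes that concept sources are combined by summing per-source log-likelihoods; neither choice is stated in the abstract. Relations between concepts are left out. The function names, the smoothing weight `lam`, and the concept identifiers such as `C1` are hypothetical placeholders.

```python
import math
from collections import Counter


def concept_log_likelihood(query_concepts, doc_concepts, coll_counts, lam=0.5):
    """Log-probability that a document's concept model generates the query concepts.

    Assumption: unigram model with Jelinek-Mercer smoothing against the collection.
    """
    doc_counts = Counter(doc_concepts)
    doc_len = len(doc_concepts)
    coll_size = sum(coll_counts.values())
    score = 0.0
    for concept in query_concepts:
        p_doc = doc_counts[concept] / doc_len if doc_len else 0.0
        p_coll = coll_counts.get(concept, 0) / coll_size if coll_size else 0.0
        p = (1 - lam) * p_doc + lam * p_coll
        if p == 0.0:
            # Concept unseen in both the document and the collection.
            return float("-inf")
        score += math.log(p)
    return score


def multi_source_score(query_by_source, doc_by_source, coll_by_source, lam=0.5):
    """Sum the log-likelihoods over every concept source used to analyze the query.

    Assumption: sources (a language/extraction-method pair) are treated as independent.
    """
    return sum(
        concept_log_likelihood(
            concepts, doc_by_source.get(source, []), coll_by_source[source], lam
        )
        for source, concepts in query_by_source.items()
    )


# Hypothetical toy data: one concept source ("en") and placeholder concept IDs.
collection = {"en": {"C1": 1, "C2": 2, "C3": 3}}
doc_a = {"en": ["C1", "C2", "C2"]}
doc_b = {"en": ["C3", "C3", "C3"]}
query = {"en": ["C2"]}

score_a = multi_source_score(query, doc_a, collection)
score_b = multi_source_score(query, doc_b, collection)
```

In this toy setting `doc_a`, which contains the query concept, receives a higher score than `doc_b`. Adding a second source to `query` simply adds another log-likelihood term to each document's score.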
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maisonnasse, L., Gaussier, E., Chevallet, J.P. (2008). Multiplying Concept Sources for Graph Modeling. In: Peters, C., et al. Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85760-0_73
DOI: https://doi.org/10.1007/978-3-540-85760-0_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85759-4
Online ISBN: 978-3-540-85760-0
eBook Packages: Computer Science (R0)