Abstract
We combine simple retrieval strategies with thesaural resources to translate documents and queries by text categorisation, enabling cross-language retrieval in a collection of medical images with case notes. The collection includes documents in French, English and German. Unlike most automatic categorisation systems, our approach can be applied with any controlled vocabulary and does not require training data. For the experiments we use Medical Subject Headings (MeSH), a terminology maintained by the National Library of Medicine that exists in 12 languages. The idea is to annotate every text of the collection (documents and queries) with a set of MeSH terms using our automatic text categoriser. Our results confirm that such an approach is competitive. The fusion of visual and textual content is also treated: simple linear approaches were used to combine text and visual features.
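As an illustration of what a simple linear combination of text and visual features can look like, the following Python sketch fuses two per-document score lists into one ranking. It is not the authors' implementation: the abstract does not specify the normalisation or the weights, so the min-max normalisation, the `text_weight` parameter and the function names `min_max_normalise` and `linear_fusion` are all assumptions made for this example.

```python
from typing import Dict, List, Tuple


def min_max_normalise(scores: Dict[str, float]) -> Dict[str, float]:
    """Rescale scores to [0, 1]; if all scores are equal, map them to 0.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def linear_fusion(
    text_scores: Dict[str, float],
    visual_scores: Dict[str, float],
    text_weight: float = 0.7,  # hypothetical default, not taken from the paper
) -> List[Tuple[str, float]]:
    """Rank documents by a weighted sum of normalised text and visual scores."""
    text_norm = min_max_normalise(text_scores)
    visual_norm = min_max_normalise(visual_scores)
    # A document missing from one run contributes 0.0 for that modality.
    docs = set(text_norm) | set(visual_norm)
    fused = {
        doc: text_weight * text_norm.get(doc, 0.0)
        + (1.0 - text_weight) * visual_norm.get(doc, 0.0)
        for doc in docs
    }
    return sorted(fused.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy scores for four hypothetical image identifiers.
    text_run = {"a": 2.0, "b": 1.0, "c": 0.0}
    visual_run = {"b": 10.0, "c": 0.0, "d": 5.0}
    for doc_id, score in linear_fusion(text_run, visual_run, text_weight=0.5):
        print(doc_id, round(score, 3))
```

Normalising each run before summing keeps one modality from dominating only because its raw scores are on a larger scale; the weight then expresses how much the textual evidence is trusted relative to the visual evidence.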
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gobeill, J., Müller, H., Ruch, P. (2007). Translation by Text Categorisation: Medical Image Retrieval in ImageCLEFmed 2006. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_88
DOI: https://doi.org/10.1007/978-3-540-74999-8_88
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer Science, Computer Science (R0)