Language Resources and Evaluation

, Volume 44, Issue 1, pp 159–180

An efficient any language approach for the integration of phrases in document retrieval

Article

DOI: 10.1007/s10579-009-9102-3

Cite this article as:
Doucet, A. & Ahonen-Myka, H. Lang Resources & Evaluation (2010) 44: 159. doi:10.1007/s10579-009-9102-3

Abstract

In this paper, we address the problem of the exploitation of text phrases in a multilingual context. We propose a technique to benefit from multi-word units in adhoc document retrieval, whatever the language of the document collection. We present principles to optimize the performance improvement obtained through this approach. The work is validated through retrieval experiments conducted on Chinese, Japanese, Korean and English.

Keywords

Multiword expressionsDocument retrievalEndogenous resources

Copyright information

© Springer Science+Business Media B.V. 2009

Authors and Affiliations

  1. 1.Department of Computer ScienceUniversity of CaenCaenFrance
  2. 2.Department of Computer ScienceUniversity of HelsinkiHelsinkiFinland