Advertisement

The PUCRS NLP-Group Participation in CLEF2006: Information Retrieval Based on Linguistic Resources

  • Marco Gonzalez
  • Vera Lúcia Strube de Lima
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4730)

Abstract

This paper presents the 2006 participation of the PUCRS NLP-Group in the CLEF Monolingual Ad Hoc Task for Portuguese. We took part in this campaign using the TR+ Model, which is based on nominalization, binary lexical relations (BLR), Boolean queries, and the evidence concept. Our alternative strategy for lexical normalization, the nominalization, is to transform a word (adjective, verb, or adverb) into a semantically corresponding noun. BLRs identify relationships between nominalized terms and capture phrasal cohesion mechanisms, like those between subject and predicate, subject and object (direct or indirect), noun and adjective or verb and adverb. In our strategy, an index unit (a descriptor) may be a single term or a BLR, and we adopt the evidence concept: the descriptor weighting depends on the occurrence of phrasal cohesion mechanisms, besides depending on frequency of occurrence. We describe these features, which implement lexical normalization and term dependence in an information retrieval system based on linguistic resources.

Keywords

Search engine information retrieval evaluation lexical normalization term dependence 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gamallo, P., Gonzalez, M., Agustini, A., Lopes, G., de Lima, V.L.S.: Mapping Syntactic Dependencies onto Semantic Relations. In: ECAI 2002, Workshop on Natural Language Processing and Machine Learning for Ontology Engineering, Lyon, France, pp. 15–22 (2002)Google Scholar
  2. 2.
    Crowther, J.: Oxford Advanced Learner’s Dictionary of Current English. Oxford University Press, New York (1995)Google Scholar
  3. 3.
    Ferreira, A.B.H.: Dicionário Aurélio Eletrônico – Século XXI. Nova Fronteira S.A., Rio de Janeiro (1999) Google Scholar
  4. 4.
    Gonzalez, M., de Lima, V.L.S., de Lima, J.V.: Binary Lexical Relations for Text Representation in Information Retrieval. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 21–31. Springer, Heidelberg (2005)Google Scholar
  5. 5.
    Gonzalez, M.: Termos e Relacionamentos em Evidência na Recuperação de Informação. PhD thesis, Instituto de Informática, UFRGS (2005) Google Scholar
  6. 6.
    Gonzalez, M., de Lima, V.L.S., de Lima, J.V.: Lexical normalization and term relationship alternatives for a dependency structured indexing system. In: Gelbukh, A. (ed.) CICLing 2006. LNCS, vol. 3878, pp. 394–405. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  7. 7.
    Gonzalez, M., de Lima, V.L.S., de Lima, J.V.: Tools for Nominalization: an Alternative for Lexical Normalization. In: Vieira, R., Quaresma, P., Nunes, M.d.G.V., Mamede, N.J., Oliveira, C., Dias, M.C. (eds.) PROPOR 2006. LNCS (LNAI), vol. 3960, pp. 100–109. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  8. 8.
    Gonzalez, M., de Lima, V.L.S., de Lima, J.V.: The PUCRS-PLN Group participation at CLEF 2006. In: Nardi, A., Peters, C., Vicedo, J.L. (eds.) Working Notes for the CLEF 2006 Workshop (2006), Published Online www.clef-campaign.org
  9. 9.
    Krovetz, R.: Viewing morphology as an inference process. Artificial Intelligence N. 118, 227–294 (2000)Google Scholar
  10. 10.
    Mira Mateus, M.H., Brito, A.M., Duarte, I., Faria, I.H.: Gramática da Língua Portuguesa. Lisboa: Ed. Caminho (2003) Google Scholar
  11. 11.
    di Nunzio, G.M., Ferro, N., Mandl, T., Peters, C.: CLEF 2006: Ad Hoc Track Overview. In: Nardi, A., Peters, C., Vicedo, J. L. (eds.), Working Notes for the CLEF 2006 Workshop (2006), Published Online www.clef-campaign.org
  12. 12.
    Perini, M.A.: Para uma Nova Gramática do Português. São Paulo: Ed. Ática (2000) Google Scholar
  13. 13.
    Robertson, S.E., Walker, S.: Some Simple Effective Approximations to the 2-Poisson Model for Probabilistic Weighted Retrieval. In: Proceedings of 17th Annual International ACM SIGIR Conference on Research and Development in IR, pp. 232–241 (1994)Google Scholar
  14. 14.
  15. 15.
  16. 16.
  17. 17.
    Ziviani, N.: Text Operations. In: Baeza-Yates, R., Ribeiro-Neto, B. (eds.) Modern Information Retrieval, ACM Press, New York (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Marco Gonzalez
    • 1
  • Vera Lúcia Strube de Lima
    • 1
  1. 1.Grupo PLN – Faculdade de Informática – PUCRS, Av. Ipiranga, 6681 – Prédio 16 - PPGCC, 90619-900 Porto AlegreBrazil

Personalised recommendations