The GeoTALP-IR System at GeoCLEF 2005: Experiments Using a QA-Based IR System, Linguistic Analysis, and a Geographical Thesaurus

  • Daniel Ferrés
  • Alicia Ageno
  • Horacio Rodríguez
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4022)


This paper describes the GeoTALP-IR system, a Geographical Information Retrieval (GIR) system, and evaluates it in the context of our participation in the CLEF 2005 GeoCLEF Monolingual English task.

The GIR system is based on Lucene and uses a modified version of the Passage Retrieval module of the TALP Question Answering (QA) system presented at the CLEF 2004 and TREC 2004 QA evaluation tasks. We designed a Keyword Selection algorithm based on a linguistic and geographical analysis of the topics. A Geographical Thesaurus (GT) was built from a set of publicly available geographical gazetteers and a geographical ontology. Our experiments show that using the Geographical Thesaurus for geographical indexing and retrieval improved the performance of our GIR system.


Document Retrieval; Geographical Analysis; Query Type; Name Entity; Search Policy
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Daniel Ferrés (1)
  • Alicia Ageno (1)
  • Horacio Rodríguez (1)
  1. TALP Research Center, Software Department, Universitat Politècnica de Catalunya, Barcelona, Spain
