The XLDB Group at CLEF 2004

  • Nuno Cardoso
  • Mário J. Silva
  • Miguel Costa
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3491)


This paper describes the participation of the XLDB Group in the CLEF monolingual ad hoc task for Portuguese. We present tumba!, a Portuguese search engine, and describe its architecture and underlying assumptions. We then discuss how we used tumba! in CLEF, giving details on our runs and our experiments with ranking algorithms.


Query Term; Query Expansion; Ranking System; Ranking Algorithm; Relevant Judgement
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Nuno Cardoso (1)
  • Mário J. Silva (1)
  • Miguel Costa (1)
  1. Grupo XLDB – Departamento de Informática, Faculdade de Ciências da Universidade de Lisboa
