MIRACLE at GeoCLEF Query Parsing 2007: Extraction and Classification of Geographical Information

  • Sara Lana-Serrano
  • Julio Villena-Román
  • José Carlos González-Cristóbal
  • José Miguel Goñi-Menoyo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5152)

Abstract

This paper describes the participation of the MIRACLE research consortium in the Query Parsing task of GeoCLEF 2007. Our system is composed of three main modules. The first is the Named Geo-entity Identifier, which identifies and tags geo-entities, i.e., it extracts the “where” component of the geographical query, if there is one. Next, the Query Analyzer parses the tagged query to identify the “what” and “geo-relation” components by means of a rule-based grammar. Finally, a two-level multiclassifier first decides whether the query is in fact a geographical query and, if so, determines the query type according to the kind of information the user is presumed to be looking for: map, yellow page or information.
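The three-stage flow described above can be illustrated with a minimal sketch. It is not the authors' implementation: the tiny gazetteer, the list of geo-relation cues, the keyword lists and all function names below are hypothetical stand-ins, and simple string rules replace the paper's rule-based grammar and multiclassifier.

```python
import re
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical toy gazetteer (assumption); a real system would use a far larger one.
GAZETTEER = {"madrid", "budapest", "new york"}
# Hypothetical geo-relation cues (assumption), standing in for grammar rules.
GEO_RELATIONS = ["north of", "south of", "near", "in"]
# Hypothetical keyword lists (assumption), standing in for the second-level classifier.
MAP_CUES = {"map", "directions"}
YELLOW_PAGE_CUES = {"hotel", "hotels", "restaurant", "restaurants"}


@dataclass
class ParsedQuery:
    what: str
    geo_relation: Optional[str]
    where: Optional[str]
    is_local: bool
    query_type: Optional[str]


def find_place(text: str) -> Optional[re.Match]:
    """Stage 1: locate the longest gazetteer entry that occurs as whole words."""
    best = None
    for name in GAZETTEER:
        match = re.search(rf"\b{re.escape(name)}\b", text)
        if match and (best is None or len(name) > len(best.group())):
            best = match
    return best


def split_relation(before: str) -> Tuple[str, Optional[str]]:
    """Stage 2: peel a geo-relation cue off the end of the text preceding the place."""
    for relation in sorted(GEO_RELATIONS, key=len, reverse=True):
        if before == relation or before.endswith(" " + relation):
            return before[: len(before) - len(relation)].strip(), relation
    return before, None


def classify(what: str, where: Optional[str]) -> Tuple[bool, Optional[str]]:
    """Stage 3: level one decides local vs. non-local, level two picks the type."""
    if where is None:
        return False, None
    tokens = set(what.split())
    if tokens & MAP_CUES:
        return True, "map"
    if tokens & YELLOW_PAGE_CUES:
        return True, "yellow page"
    return True, "information"


def parse_query(query: str) -> ParsedQuery:
    text = query.lower().strip()
    match = find_place(text)
    if match is None:
        is_local, query_type = classify(text, None)
        return ParsedQuery(text, None, None, is_local, query_type)
    where = match.group()
    what, relation = split_relation(text[: match.start()].strip())
    # Any text after the place name is treated as part of the "what" component.
    what = " ".join(part for part in (what, text[match.end():].strip()) if part)
    is_local, query_type = classify(what, where)
    return ParsedQuery(what, relation, where, is_local, query_type)
```

Calling `parse_query("Hotels near Madrid")` splits the query into a "what" of `hotels`, a geo-relation of `near` and a "where" of `madrid`, and the keyword rule labels it a yellow-page query. A query with no gazetteer match is rejected at the first classification level and receives no type.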

Keywords

Linguistic Engineering, classification, geographical IR, geographical entity recognition, gazetteer, Geonames, tagging, query classifier, WordNet



Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Sara Lana-Serrano (1, 3)
  • Julio Villena-Román (2, 3)
  • José Carlos González-Cristóbal (1, 3)
  • José Miguel Goñi-Menoyo (1)

  1. Universidad Politécnica de Madrid
  2. Universidad Carlos III de Madrid
  3. DAEDALUS - Data, Decisions and Language, S.A.
