XML Retrieval with a Natural Language Interface

  • Xavier Tannier
  • Shlomo Geva
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3772)


Effective information retrieval in XML documents requires the user to have good knowledge of the document structure and of a formal query language. XML query languages such as XPath and XQuery are too complex for end users. We present an approach to XML query processing that supports the specification of both textual and structural constraints in natural language. We implemented a system that evaluates both formal XPath-like queries and natural language XML queries. We present comparative results from tests performed with the INEX 2004 topics and XML collection. These results quantify the performance trade-off between natural language XML queries and formal queries, and the outcome is favourable.
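
To make the contrast between the two query styles concrete, the following is a minimal sketch of how a very restricted natural language request might be mapped to an XPath-like query in NEXI-style syntax. It is not the authors' system: the function `to_nexi`, the request pattern, and the word-to-element table `ELEMENT_WORDS` are hypothetical and purely illustrative, and a real interface would need genuine linguistic analysis rather than a single regular expression.

```python
import re

# Hypothetical mapping from words a user might type to XML element names.
# The tag names are assumptions chosen for illustration only.
ELEMENT_WORDS = {
    "article": "article",
    "articles": "article",
    "section": "sec",
    "sections": "sec",
    "paragraph": "p",
    "paragraphs": "p",
}

# Assumed request shape: "<find|retrieve> <element word> about <topic>".
REQUEST_PATTERN = re.compile(
    r"^(?:find|retrieve)\s+(\w+)\s+about\s+(.+?)\.?$", re.IGNORECASE
)


def to_nexi(request: str) -> str:
    """Translate a toy natural language request into a NEXI-style query."""
    match = REQUEST_PATTERN.match(request.strip())
    if match is None:
        raise ValueError(f"unsupported request: {request!r}")
    element_word, topic = match.groups()
    tag = ELEMENT_WORDS.get(element_word.lower())
    if tag is None:
        raise ValueError(f"unknown element word: {element_word!r}")
    # Structural constraint: the element path.
    # Textual constraint: the about() clause over that element.
    return f"//{tag}[about(., {topic.strip()})]"


if __name__ == "__main__":
    print(to_nexi("Find sections about natural language interfaces"))
```

The structural constraint ("sections") becomes the element path and the textual constraint becomes the `about()` clause, which is the kind of formal query an end user would otherwise have to write by hand.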


Keywords: Query Term, Version Management, Object Database, Inverted List, Child Element
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Xavier Tannier (1)
  • Shlomo Geva (2)
  1. École Nationale Supérieure des Mines de Saint-Etienne, Saint-Etienne, France
  2. Centre for Information Technology Innovation, Faculty of Information Technology, Queensland University of Technology, Brisbane, Australia
