Advertisement

A Cross-Lingual German-English Framework for Open-Domain Question Answering

  • Bogdan Sacaleanu
  • Günter Neumann
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4730)

Abstract

The paper describes QUANTICO, a cross-language open domain question answering system for German and English. The main features of the system are: use of preemptive off-line document annotation with syntactic information like chunk structures, apposition constructions and abbreviation-extension pairs for the passage retrieval; use of online translation services, language models and alignment methods for the cross-language scenarios; use of redundancy as an indicator of good answer candidates; selection of the best answers based on distance metrics defined over graph representations. Based on the question type two different strategies of answer extraction are triggered: for factoid questions answers are extracted from best IR-matched passages and selected by their redundancy and distance to the question keywords; for definition questions answers are considered to be the most redundant normalized linguistic structures with explanatory role (i.e., appositions, abbreviation’s extensions). The results of evaluating the system’s performance by CLEF were as follows: for the best German-German run we achieved an overall accuracy (ACC) of 42.33% and a mean reciprocal rank (MRR) of 0.45; for the best English-German run 32.98% (ACC) and 0.35 (MRR); for the German-English run 17.89% (ACC) and 0.17 (MRR).

Keywords

Noun Phrase Question Type Mean Reciprocal Rank Passage Retrieval Definition Question 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Neumann, G., Piskorski, J.: A shallow text processing core engine. Computational Intelligence 18(3), 451–476 (2002)CrossRefGoogle Scholar
  2. 2.
    Neumann, G., Sacaleanu, B.: Experiments on Robust NL Question Interpretation and Multilayered Document Annotation for a Cross-Language Question/Answering System. In: Peters, C., Clough, P.D., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds.) CLEF 2004. LNCS, vol. 3491, pp. 411–422. Springer, Heidelberg (2005)Google Scholar
  3. 3.
    Neumann, G., Sacaleanu, B.: Experiments on Cross-Linguality and Question-Type Driven Strategy Selection for Open-Domain QA. In: Peters, C., Gey, F.C., Gonzalo, J., Müller, H., Jones, G.J.F., Kluck, M., Magnini, B., de Rijke, M., Giampiccolo, D. (eds.) CLEF 2005. LNCS, vol. 4022, pp. 429–438. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  4. 4.
    Sacaleanu, B., Neumann, G.: Cross-Cutting Aspects of Cross-Language Question Answering Systems. In: Proceedings of the EACL workshop on Multilingual Question Answering - MLQA 2006, Trento, Italy (2006)Google Scholar
  5. 5.
    Alias-i.LingPipe1.7 (2006), http://www.alias-i.com/lingpipe

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Bogdan Sacaleanu
    • 1
  • Günter Neumann
    • 1
  1. 1.LT-Lab, DFKI, SaarbrückenGermany

Personalised recommendations