Interlingual Information Extraction as a Solution for Multilingual QA Systems

  • Jesús Cardeñosa
  • Carolina Gallardo
  • Miguel A. de la Villa
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5822)

Abstract

Information extraction systems have been dealt with at length from the viewpoint of users posing definite questions whose expected answer is to be found in a document collection. This has been tackled by means of systems that analyse the user query and try to use the grammar features of each language to find a possible answer. This approach has failed to work for users of other languages or for documents in different languages, save for a few languages for which the query can be machine translated to the target language or all languages. Where there are more languages, however, this approach is impracticable for information extraction in a reasonable time. The massively multilingual approach (> 6 languages) necessarily involves the language-independent representation of the contents, that is, using an interlingua. This paper reports a promising early trial of a method that launches a query in any language against a language-independent representation of the document set using a general-purpose UNL interlingua and receives a precise response.

Keywords

multilingualism information extraction UNL Interlingua 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hovy, E.H., Gerber, L., Hermjakob, U., Junk, M., Lin, C.-Y.: Question Answering in Webclopedia. In: Proc of the TREC-9 Conference. NIST, Gaithersburg (2000)Google Scholar
  2. 2.
    Plamondon, L., Kosseim, L.: QUANTUM: A Function-Based Question Answering System. In: Proceedings of the 15th Conference of the Canadian Society for Computational Studies of Intelligence (AI 2002), Calgary, Canada, pp. 281–292 (2002)Google Scholar
  3. 3.
    Voorhees, E.M.: Overview of the TREC 2001 Question Answering Track. In: Notebook Proceedings of The Tenth Text Retrieval Conference (TREC-X), Gaithersburg, Maryland, pp. 157–165 (2001)Google Scholar
  4. 4.
    Magnini, B., Romagnoli, S., Vallin, A., Herrera, J., Penas, A., Peinado, V., Verdejo, M.F., de Rijke, M.: The multiple language question answering track at CLEF 2003. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 471–486. Springer, Heidelberg (2004)Google Scholar
  5. 5.
    Sasaki, Y., Chen, H.-H., Hua Chen, K., Lin, C.-J.: Overview of the ntcir-5 cross-lingual question answering task (CLQA1). In: Proceedings of the Fifth NTCIR Workshop Meeting (December 2005)Google Scholar
  6. 6.
    Purwarianti, A., Tsuchiya, M., Nakagawa, S.: A machine learning approach for Indonesian question answering system. In: Proc. IASTED AIA 2007, Innsbruck, Austria, pp. 537–542, ( Feburary 2007)Google Scholar
  7. 7.
    Puscasu, G., Iftene, A., Pistol, I., Trandabat, D., Tufis, D., Ceausu, A., Stefanescu, D., Ion, R., Orasan, C., Dornescu, I., Moruz, A., Cristea, D.: Cross-lingual romanian to english question answering at CLEF 2006. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 385–394. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  8. 8.
    Dillinguer, M., Lommel, A.: Best practice Guide. Implementing machine translation. Localization Industry Standards Association (2004), http://www.lisa.org/products/bestPractice
  9. 9.
    Uchida, H.: The Universal Networking Language (UNL) Specifications, version 2005 (2006), http://www.undl.org/unlsys/unl/unl2005-e2006
  10. 10.
    Boguslavsky, I., Cardeñosa, J., Gallardo, C., Iraola, L.: The UNL initiative: An overview. In: Gelbukh, A. (ed.) CICLing 2005. LNCS, vol. 3406, pp. 377–387. Springer, Heidelberg (2005)Google Scholar
  11. 11.
    Bekios, J., Boguslavsky, I., Cardeñosa, J., Gallardo, C.: Automatic Construction of an Interlingual Dictionary for Multilingual Lexicography. In: Proceedings of the 2007 International Conference on Machine Learning; Models, Technologies & Applications, MLMTA 2007, pp. 215–220 (2007)Google Scholar
  12. 12.
    Boguslavsky, I., Cardeñosa, J., Gallardo, C.: A Novel Approach to Creating Disambiguated Multilingual Dictionarie. Applied Linguistics 30, 70–92 (2008)CrossRefGoogle Scholar
  13. 13.
    Cardeñosa, J., Gallardo, C., Iraola, L.: UNL as a text content representation language for information extraction. In: Larsen, H.L., Pasi, G., Ortiz-Arroyo, D., Andreasen, T., Christiansen, H. (eds.) FQAS 2006. LNCS (LNAI), vol. 4027, pp. 507–518. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  14. 14.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Jesús Cardeñosa
    • 1
  • Carolina Gallardo
    • 1
  • Miguel A. de la Villa
    • 1
  1. 1.Facultad de InformaticaValidation & Business Applications Research Group, Universidad Politécnica de MadridSpain

Personalised recommendations