Language Independent Answer Prediction from the Web

  • Alejandro Figueroa
  • Günter Neumann
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4139)


This work presents a strategy that aims to extract and rank predicted answers from the web based on the eigenvalues of a specially designed matrix. This matrix models the strength of the syntactic relations between words by means of the frequency of their relative positions in sentences extracted from web snippets. We assess the rank of predicted answers by extracting answer candidates for three different kinds of questions. Due to the low dependence upon a particular language, we also apply our strategy to questions from four different languages: English, German, Spanish, and Portuguese.


Search Engine Query Term Query Expansion Question Answering Exact Answer 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Moldovan, D., Harabagui, S., Clark, C., Bowden, M., Lehmann, J., Williams, J.: Experiments and Analysis of LCC’s two QA Systems over TREC 2004. In: TREC 2004 (2004)Google Scholar
  2. 2.
    Radev, D.: Panel on web-based question answering. In: AAAI Spring Symposium on New Directions in Question Answering (2003)Google Scholar
  3. 3.
    Neumann, G., Xu, F.: Mining natural language answers from the web. Web Intelligence and Agent Systems, 2, 123–135 (2004)Google Scholar
  4. 4.
    Clarke, C.L.A., Cormack, G.V., Lynam, T.R.: Exploiting Redundancy in Question Answering. In: SIGIR, pp. 358–365 (2001)Google Scholar
  5. 5.
    Ramakrishnan, G., Paranjape, D., Chakrabarti, S., Bhattacharyya, P.: Is Question Answering an Acquired Skill? In: Proceedings of the 13th international conference on World Wide Web, WWW 2004 (2004)Google Scholar
  6. 6.
    Dumais, S.T., Banko, M., Brill, E., Lin, J.J., Ng, A.Y.: Web question answering: is more always better? In: SIGIR, pp. 291–298 (2002)Google Scholar
  7. 7.
    Ravichandran, D., Hovy, E.H.: Learning surface text patterns for a Question Answering System. In: ACL, pp. 41–47 (2002)Google Scholar
  8. 8.
    Belew, R.K.: Finding out About: A Cognitive Perspective on Search Engine Technology and the WWW. Cambridge University Press, Cambridge (2000)MATHGoogle Scholar
  9. 9.
    Robertson, S.: Understanding Inverse Document Frequency: On theoretical arguments for IDF. Journal of Documentation 60(5) (2004)Google Scholar
  10. 10.
    Lita, L., Carbonell, J.: Instance-Based Question Answering: A Data-Driven Approach. In: EMNLP 2004 (2004)Google Scholar
  11. 11.
    Sasaki, Y., Carbonell, J.: Question Answering As Question-Biased Term Extraction: a New Approach Toward Multilingual (QA). In: Proceedings of ACL (2005)Google Scholar
  12. 12.
    Ravichandran, D., Ittycheriah, A., Roukos, S.: Automatic Derivation of Surface Text Patterns for a Maximum Entropy Based Question Answering System. In: HLT-NAACL (2003)Google Scholar
  13. 13.
    Del-Castillo-Escobedo, A., Gómez, M., Villaseñor-Pineda, L.: QA on the Web: A Preliminary Study for Spanish Language. In: ENC 2004 (2004)Google Scholar
  14. 14.
    Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science 41(6), 391–407 (1990)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Alejandro Figueroa
    • 1
  • Günter Neumann
    • 1
  1. 1.DFKIDeutsches Forschungszentrum für Künstliche IntelligenzSaarbrückenGermany

Personalised recommendations