Abstract
The problem of acquiring valuable information from the large amounts available today in electronic media requires automated mechanisms more natural and efficient than those already existing. The trend in the evolution of information retrieval systems goes toward systems capable of answering specific questions formulated by the user in her/his language. The expected answers from such systems are short and accurate sentences, instead of large document lists. On the other hand, the state of the art of these systems is focused -mainly- in the resolution of factual questions, whose answers are named entities (dates, quantities, proper nouns, etc). This paper proposes a model to represent source documents that are then used by question answering systems. The model is based on a representation of a document as a set of named entities (NEs) and their local lexical context. These NEs are extracted and classified automatically by an off-line process. The entities are then taken as instance concepts in an upper ontology and stored as a set of DAML+OIL resources which could be used later by question answering engines. The paper presents a case of study with a news collection in Spanish and some preliminary results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Burger, J., et al.: Issues, Tasks and Program Structures to Roadmap Research in Question & Answering (Q&A). NIST (2001)
Carreras, X., Padró, L.: A Flexible Distributed Architecture for Natural Language Analyzers. In: Proceedings of the LREC 2002, Las Palmas de Gran Canaria, Spain (2002)
Cowie, J., et al.: Automatic Question Answering. In: Proceedings of the International Conference on Multimedia Information Retrieval, RIAO 2000 (2000)
Hirshman, L., Gaizauskas, R.: Natural Language Question Answering: The View from Here. Natural Language Engineering 7 (2001)
Magnini, B., Romagnoli, S., Vallin, A., Herrera, J., Peñas, A., Peinado, V., Verdejo, F., Rijke, M.: The Multiple Language Question Answering Track at CLEF 2003. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 471–486. Springer, Heidelberg (2004)
Mann, G.S.: Fine-Grained Proper Noun Ontologies for Question Answering. In: SemaNet 2002: Building and Using Semantic Networks (2002)
Niles, I., Pease, A.: Toward a Standard Upper Ontology. In: Proceedings of the 2nd International Conference on Formal Ontology in Information Systems, FOIS 2001 (2001)
Prager, J., Radev, D., Brown, E., Coden, A., Samn, V.: The Use of Predictive Annotation for Question Answering in TREC8. NIST (1999)
Ravichandran, D., Hovy, E.: Learning Surface Text Patterns for a Question Answering System. In: ACL Conference (2002)
Schölkopf, B., Smola, A.J.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2001)
Solorio, T., López López, A.: Learning Named Entity Classifiers using Support Vector Machines. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 158–167. Springer, Heidelberg (2004) (to appear)
Stitson, M.O., Wetson, J.A.E., Gammerman, A., Vovk, V., Vapnik, V.: Theory of Support Vector Machines. Technical Report CSD-TR-96-17, Royal Holloway University of London, England (December 1996)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Vicedo, J.L., Izquierdo, R., Llopis, F., Muñoz, R.: Question Answering in Spanish. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 541–548. Springer, Heidelberg (2004)
Vicedo, J.L., Rodríguez, H., Peñas, A., Massot, M.: Los sistemas de Búsqueda de Respuestas desde una perspectiva actual. Revista de la Sociedad Española para el Procesamiento del Lenguaje Natural, vol. (31) (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pérez-Coutiño, M., Solorio, T., Montes-y-Gómez, M., López-López, A., Villaseñor-Pineda, L. (2004). Toward a Document Model for Question Answering Systems. In: Favela, J., Menasalvas, E., Chávez, E. (eds) Advances in Web Intelligence. AWIC 2004. Lecture Notes in Computer Science(), vol 3034. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24681-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-24681-7_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22009-1
Online ISBN: 978-3-540-24681-7
eBook Packages: Springer Book Archive