Abstract
Information extraction has almost always focused on extracting retrievable data from a text. Approaches that manage to extract elaborated information have seldom been devised. Through the use of interlingua-type language-independent contents representation, the semantic relations of the contents can be used to search a set of information concerning a particular entity. This way, the person asking a question to find out something about a city or a person, for example, would have to know no more than the name to be used to run a search. This approach is very promising as the person asking the question does not have to know what type of information he or she can request from a documentary source. Our work targets the goal of, given a user’s query, providing a complete report about such topic or event, composed of what we consider encyclopaedic knowledge. We describe the origins of this research and the followed procedure, as well as an illustrative case of this on-going research.
The production of this article has been sponsored by DAIL-Software S.L. www.dail-software.com
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Balog, K., Serdyukov, P., de Vries, A.P.: Overview of the TREC 2011 Entity Track. In: Proceedings of the 20th Text Retrieval Conference (TREC 2011), Gaithersburg, Maryland (2012)
Cardeñosa, J., Tovar, E.: Intelligent knowledge extraction from the Web. International Journal of Uncertainty, Fuzziness and Knowledge Based Systems 11(supp. 1), 117–134 (2003), doi:10.1142/S0218488503002302
Li, G., Deng, D., Feng, J.: Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD 2011), pp. 529–540. ACM, New York (2011), doi:10.1145/1989323.1989379
Dang, H.T., Lin, J., Kelly, D.: Overview of the TREC 2006 question answering track. In: Proceedings of the 15th Text Retrieval Conference, Gaithersburg, Maryland (2006)
Heie, M.H., Whittaker, E.W.D., Furui, S.: Question answering using statistical language modelling. Computer Speech & Language 26(3), 193–209 (2012), doi:10.1016/j.csl.2011.11.001
López, V., Fernández, M., Motta, E., Stieler, N.: PowerAqua: Supporting users in querying and exploring the Semantic Web. Semantic Web Journal 3(3), 249–265 (2011), doi:10.3233/SW-2011-0030
Sagara, T., Hagiwara, M.: Natural language neural network and its application to question-answering system. In: Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), Brisbane, Australia, pp. 1–7 (2012), doi:10.1109/IJCNN.2012.6252553
Unger, C., Bühmann, L., Lehmann, J., Ngomo, A.C.N., Gerber, D., Cimiano, P.: Template-based question answering over RDF data. In: Proceedings of the21st International Conference on World Wide Web (WWW 2012), pp. 639–648. ACM Press, New York (2012), doi:10.1145/2187836.2187923
Uchida, H.: The Universal Networking Language (UNL) Specifications, version 2005 (edition 2006) (2006), http://www.undl.org/unlsys/unl/unl2005-e2006
Cardeñosa, J., Gallardo, C., de la Villa, M.A.: Interlingual information extraction as a solution for multilingual QA systems. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) FQAS 2009. LNCS, vol. 5822, pp. 500–511. Springer, Heidelberg (2009), doi:10.1007/978-3-642-04957-6_43
Bateman, J., Magnini, B., Fabris, G.: The generalized upper model Knowledge Base: Organization and use. In: Mars, N.J.I. (ed.) Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing, pp. 60–72. IOS Press, Amsterdam (1995)
Boguslavsky, I., Cardeñosa, J., Gallardo, C.: A novel approach to creating disambiguated multilingual dictionaries. Applied Linguistics 30(1), 70–92 (2008), doi:10.1093/applin/amn036
EOLSS, Encyclopedia of Life Support Systems (2002), http://www.eolss.net/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cardeñosa, J., de la Villa, M.Á., Gallardo, C. (2013). Linguistic Patterns for Encyclopaedic Information Extraction. In: Larsen, H.L., Martin-Bautista, M.J., Vila, M.A., Andreasen, T., Christiansen, H. (eds) Flexible Query Answering Systems. FQAS 2013. Lecture Notes in Computer Science(), vol 8132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40769-7_57
Download citation
DOI: https://doi.org/10.1007/978-3-642-40769-7_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40768-0
Online ISBN: 978-3-642-40769-7
eBook Packages: Computer ScienceComputer Science (R0)