Linguistic Patterns for Encyclopaedic Information Extraction

Cardeñosa, Jesús; de la Villa, Miguel Ángel; Gallardo, Carolina

doi:10.1007/978-3-642-40769-7_57

Jesús Cardeñosa²⁴,
Miguel Ángel de la Villa²⁴ &
Carolina Gallardo²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8132))

Included in the following conference series:

International Conference on Flexible Query Answering Systems

1372 Accesses
4 Altmetric

Abstract

Information extraction has almost always focused on extracting retrievable data from a text. Approaches that manage to extract elaborated information have seldom been devised. Through the use of interlingua-type language-independent contents representation, the semantic relations of the contents can be used to search a set of information concerning a particular entity. This way, the person asking a question to find out something about a city or a person, for example, would have to know no more than the name to be used to run a search. This approach is very promising as the person asking the question does not have to know what type of information he or she can request from a documentary source. Our work targets the goal of, given a user’s query, providing a complete report about such topic or event, composed of what we consider encyclopaedic knowledge. We describe the origins of this research and the followed procedure, as well as an illustrative case of this on-going research.

The production of this article has been sponsored by DAIL-Software S.L. www.dail-software.com

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Balog, K., Serdyukov, P., de Vries, A.P.: Overview of the TREC 2011 Entity Track. In: Proceedings of the 20th Text Retrieval Conference (TREC 2011), Gaithersburg, Maryland (2012)
Google Scholar
Cardeñosa, J., Tovar, E.: Intelligent knowledge extraction from the Web. International Journal of Uncertainty, Fuzziness and Knowledge Based Systems 11(supp. 1), 117–134 (2003), doi:10.1142/S0218488503002302
Article MATH Google Scholar
Li, G., Deng, D., Feng, J.: Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD 2011), pp. 529–540. ACM, New York (2011), doi:10.1145/1989323.1989379
Chapter Google Scholar
Dang, H.T., Lin, J., Kelly, D.: Overview of the TREC 2006 question answering track. In: Proceedings of the 15th Text Retrieval Conference, Gaithersburg, Maryland (2006)
Google Scholar
Heie, M.H., Whittaker, E.W.D., Furui, S.: Question answering using statistical language modelling. Computer Speech & Language 26(3), 193–209 (2012), doi:10.1016/j.csl.2011.11.001
Article Google Scholar
López, V., Fernández, M., Motta, E., Stieler, N.: PowerAqua: Supporting users in querying and exploring the Semantic Web. Semantic Web Journal 3(3), 249–265 (2011), doi:10.3233/SW-2011-0030
Google Scholar
Sagara, T., Hagiwara, M.: Natural language neural network and its application to question-answering system. In: Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), Brisbane, Australia, pp. 1–7 (2012), doi:10.1109/IJCNN.2012.6252553
Google Scholar
Unger, C., Bühmann, L., Lehmann, J., Ngomo, A.C.N., Gerber, D., Cimiano, P.: Template-based question answering over RDF data. In: Proceedings of the21st International Conference on World Wide Web (WWW 2012), pp. 639–648. ACM Press, New York (2012), doi:10.1145/2187836.2187923
Chapter Google Scholar
Uchida, H.: The Universal Networking Language (UNL) Specifications, version 2005 (edition 2006) (2006), http://www.undl.org/unlsys/unl/unl2005-e2006
Cardeñosa, J., Gallardo, C., de la Villa, M.A.: Interlingual information extraction as a solution for multilingual QA systems. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds.) FQAS 2009. LNCS, vol. 5822, pp. 500–511. Springer, Heidelberg (2009), doi:10.1007/978-3-642-04957-6_43
Chapter Google Scholar
Bateman, J., Magnini, B., Fabris, G.: The generalized upper model Knowledge Base: Organization and use. In: Mars, N.J.I. (ed.) Towards Very Large Knowledge Bases: Knowledge Building and Knowledge Sharing, pp. 60–72. IOS Press, Amsterdam (1995)
Google Scholar
Boguslavsky, I., Cardeñosa, J., Gallardo, C.: A novel approach to creating disambiguated multilingual dictionaries. Applied Linguistics 30(1), 70–92 (2008), doi:10.1093/applin/amn036
Google Scholar
EOLSS, Encyclopedia of Life Support Systems (2002), http://www.eolss.net/

Download references

Author information

Authors and Affiliations

Validation & Business Applications Research Group, Universidad Politécnica de Madrid, Spain
Jesús Cardeñosa, Miguel Ángel de la Villa & Carolina Gallardo

Authors

Jesús Cardeñosa
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Ángel de la Villa
View author publications
You can also search for this author in PubMed Google Scholar
Carolina Gallardo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electronic Systems, Aalborg University, 6700, Esbjerg, Denmark
Henrik Legind Larsen
Department of Computer Science and Artificial Intelligence, University of Granada, 18071, Granada, Spain
Maria J. Martin-Bautista
Department of Computer Science and Arificial IntelIigence, University of Granada, 18071, Granada, Spain
María Amparo Vila
CBIT, Roskilde University, Universitetsvej 1, 4000, Roskilde, Denmark
Troels Andreasen
CBIT, Roskilde University, 4000, Roskilde, Denmark
Henning Christiansen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cardeñosa, J., de la Villa, M.Á., Gallardo, C. (2013). Linguistic Patterns for Encyclopaedic Information Extraction. In: Larsen, H.L., Martin-Bautista, M.J., Vila, M.A., Andreasen, T., Christiansen, H. (eds) Flexible Query Answering Systems. FQAS 2013. Lecture Notes in Computer Science(), vol 8132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40769-7_57

Download citation

DOI: https://doi.org/10.1007/978-3-642-40769-7_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40768-0
Online ISBN: 978-3-642-40769-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics