Information Extraction from Text Based on Semantic Inferentialism

Pinheiro, Vladia; Pequeno, Tarcisio; Furtado, Vasco; Nogueira, Douglas

doi:10.1007/978-3-642-04957-6_29

Vladia Pinheiro²³,
Tarcisio Pequeno²⁴,
Vasco Furtado^24,25 &
…
Douglas Nogueira²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5822))

Included in the following conference series:

International Conference on Flexible Query Answering Systems

759 Accesses
7 Citations

Abstract

One of the growing needs of information extraction (IE) from text is that the IE system must be able to perform enriched inferences in order to discover and extract information. We argue that one reason for the current limitation of the approaches that use semantics for that is that they are based on ontologies that express the characteristics of things represented by names, and seek to draw inferences and to extract information based on such characteristics, disregarding the linguistic praxis (i.e. the uses of the natural language). In this paper, we describe a generic architecture for IE systems based on Semantic Inferentialism. We propose a model that seeks to express the inferential power of concepts and how these concepts, combined in sentence structures, contribute to the inferential power of sentences. We demonstrate the validity of the approach and evaluate it by deploying an application for extracting information about crime reported in on line newspapers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Grishman, R.: Information Extraction: Techniques and Challenges. In: SCIE 1997: International Summer School on Information Extraction, pp. 10–27. Springer, Heidelberg (1997)
Google Scholar
Vieira, R., Lima, V.L.S.: Lingüística Computacional: Princípios e Aplicações. Anais do XXI Congresso da SBC. I Jornada de Atualização em Inteligência Artificial 3, 47–86 (2001)
Google Scholar
Dummett, M.: Truth and Other Enigmas. Duckworth, London (1978)
Google Scholar
Brandom, R.B.: Articulating Reasons. In: An Introduction to Inferentialism. Harvard University Press, Cambridge (2000)
Google Scholar
Lieberman, H., Paternó, F., Klann, M., Wulf, V.: End-User Development: an Emergin Paradigm. End User Development. Cap.1 (2005)
Google Scholar
Pinheiro, V., Pequeno, T., Furtado, V., Assunção, T., Freitas, E.: SIM: Um Modelo Semântico-Inferencialista para Sistemas de Linguagem Natural. In: VI Workshop em Tecnologia da Informação e da Linguagem Humana (TIL 2008). WebMedia, Brasil (2008)
Google Scholar
Liu, H., Singh, P.: ConceptNet: A Practical Commonsense Reasoning Toolkit. BT Technology Journal 22(4) (2004)
Google Scholar
Gentzen, G.: Untersuchungen über das logische Schliessen. Mathematische Zeitschrift 39, 176–210, 405–431 (1935); Szabo, M.: Translated as Investigations into Logical Deduction,and printed. In: The Collected Papers of Gerhard Gentzen, pp. 68–131. North-Holland, Amsterdam (1969)
Article MathSciNet Google Scholar
Prawitz, D.: Natural Deduction: A Proof Theoretical Study. Almqvist & Wiksell, Stockholm (1965)
MATH Google Scholar
Bick, E.: The Parsing System “Palavras”. In: Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. Aarhus University Press (2000)
Google Scholar
Borges, K., Laender, A.H.F., Medeiros, C., Davis Jr, C.A.: Discovering geographic locations in web pages using urban addresses. In: Proceedings of the 4th ACM workshop on Geographical Information Retrieval (GIR 2007), Lisboa, Portugal, pp. 31–36 (2007)
Google Scholar
Cohen, K., Hunter, L.: Getting started in text mining. PLoS Compt Biology 4(1) (2008)
Google Scholar
Hobbs, J., Appelt, D., Bear, J., Israel, D., Kameyama, M., Stickel, M., Tyson, M.F.: Fastus: A cascaded finite-state transducer for extracting information from natural-language text. In: Roche, E., Schabes, Y. (eds.) Finite-State Devices for Natural Language Processing, pp. 383–406. MIT Press, Cambridge (1997)
Google Scholar
Glickman, O., Jones, R.: Examining machine learning for adaptable end-to-end information extraction systems. In: AAAI 1999 Workshop on Machine Learning for Information Extraction (1999)
Google Scholar
Borkar, V., Deshmukh, K., Sarawagi, S.: Automatic segmentation of text into structured records. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, California, pp. 175–186 (2001)
Google Scholar
Geng, J., Yang, J.: AUTOBIB: Automatic extraction and integration of bibliographic information on the web. In: 29th VLDB Conference, Berlin, Germany (2003)
Google Scholar
Fellbaum, C. (ed.): WordNet: An electronic lexical database. MIT Press, Cambridge (1998)
MATH Google Scholar
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet Project. In: Proceedings of COLING-ACL (1998)
Google Scholar
Kaisser, M., Webber, B.: Question Answering based on Semantic Roles. In: ACL 2007 Workshop on Deep Linguistic Processing (2007)
Google Scholar
Saias, J., Quaresma, P.: A proposal for an ontology supported news reader and question-answer system. In: Proceedings of the 2nd Workshop on Ontologies and their Applications (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Ciências da Computação, Universidade Federal do Ceará, Campus do Pici-UFC, Fortaleza, Ceará, Brasil
Vladia Pinheiro
Mestrado em Informática Aplicada, Universidade de Fortaleza (UNIFOR), Av. Washington Soares, 1321, Fortaleza, Ceará, Brasil
Tarcisio Pequeno, Vasco Furtado & Douglas Nogueira
ETICE – Empresa de Tecnologia da Informação do Ceará, Av. Pontes Vieira 220, Fortaleza, Ceará, Brasil
Vasco Furtado

Authors

Vladia Pinheiro
View author publications
You can also search for this author in PubMed Google Scholar
Tarcisio Pequeno
View author publications
You can also search for this author in PubMed Google Scholar
Vasco Furtado
View author publications
You can also search for this author in PubMed Google Scholar
Douglas Nogueira
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Roskilde University, Universitetsvej 1, 4000, Roskilde, Denmark
Troels Andreasen & Henrik Bulskov &
Iona College, Machine Intelligence Institute, 10801, New Rochelle, NY, USA
Ronald R. Yager
Computer Science Dept., Research group PLIS: Programming, Roskilde University, Universitetsvej 1, 4000, Roskilde, Denmark
Henning Christiansen
Department of Computer Science and Engineering, Aalborg University Esbjerg, Niels Bohrs Vej 8, 6700, Esbjerg, Denmark
Henrik Legind Larsen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pinheiro, V., Pequeno, T., Furtado, V., Nogueira, D. (2009). Information Extraction from Text Based on Semantic Inferentialism. In: Andreasen, T., Yager, R.R., Bulskov, H., Christiansen, H., Larsen, H.L. (eds) Flexible Query Answering Systems. FQAS 2009. Lecture Notes in Computer Science(), vol 5822. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04957-6_29

Download citation

DOI: https://doi.org/10.1007/978-3-642-04957-6_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04956-9
Online ISBN: 978-3-642-04957-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics