Skip to main content

Deriving of Thematic Facts from Unstructured Texts and Background Knowledge

  • Conference paper
Knowledge Engineering and the Semantic Web (KESW 2014)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 468))

Included in the following conference series:

Abstract

When developing information-analytical systems (IAS) for various purposes it is often necessary to gather thematic facts which are of interest to experts in the field. The paper presents an approach that allows one to increase the completeness of fact extraction by using basic domain knowledge. The main idea of the approach is deriving new facts on the basis of facts explicitly stated in the text and basic knowledge contained in the corresponding ontologies. An architecture and algorithms of the system are discussed. The approach is illustrated by an example of extracting relevant facts using inference rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Feldman, R., Sanger, J.: The Text Mining Textbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge Univ. Press (2007)

    Google Scholar 

  2. Wimalasuriya, D., Dou, D.: Ontology-based information extraction: An introduction and a survey of current approaches. J. of Inf. Science 36(3), 306–323 (2010)

    Article  Google Scholar 

  3. Anantharangachar, R., Ramani, S., Rajagopalan, S.: Ontology Guided Information Extraction from Unstructured Text. Int. J. of Web & Sem. Tech. 4(1), 19–36 (2013)

    Article  Google Scholar 

  4. Buitelaar, P., Cimiano, P., Frank, A., Hartung, M., Racioppa, S.: Ontology-based Information Extraction and Integration from Heterogeneous Data Sources. Int. J. of Human Computer Studies 66, 759–788 (2008)

    Article  Google Scholar 

  5. Petasis, G., Möller, R., Karkaletsis, V.: BOEMIE: Reasoning-based Information Extraction. In: Proceedings of the 1st Workshop on Natural Language Processing and Automated Reasoning, pp. 60–75 (2013)

    Google Scholar 

  6. Suchanek, F.M., Sozio, M., Weikum, G.: SOFIE: A self-organizing framework for information extraction. In: Proceedings of the 18th International Conference on World Wide Web, Madrid, Spain, pp. 631–640 (2009)

    Google Scholar 

  7. Apache Http Client, http://hc.apache.org

  8. Apache Tika, http://tika.apache.org/

  9. GATE: General Architecture for Text Engineering, https://gate.ac.uk/

  10. Apache Open NLP, https://opennlp.apache.org/

  11. Apache Lucene, http://lucene.apache.org

  12. Apache Jena Core, https://jena.apache.org/documentation/rdf/

  13. Apache Jena SDB, http://jena.apache.org/documentation/sdb/

  14. A PROMISING HIGH-SPEED HELICOPTER (PSV) V-37, http://bastion-karpenko.ru/v-37_psv/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Yelagina, N., Panteleyev, M. (2014). Deriving of Thematic Facts from Unstructured Texts and Background Knowledge. In: Klinov, P., Mouromtsev, D. (eds) Knowledge Engineering and the Semantic Web. KESW 2014. Communications in Computer and Information Science, vol 468. Springer, Cham. https://doi.org/10.1007/978-3-319-11716-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-11716-4_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11715-7

  • Online ISBN: 978-3-319-11716-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics