Skip to main content

Relation Extraction from Documents for the Automatic Construction of Ontologies

  • Conference paper
  • 1401 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 184))

Abstract

The Semantic Web relies on domain ontologies that structure underlying data enabling comprehensive and transportable machine understanding. It takes so much time and efforts to construct domain ontologies because these ontologies have to be manually made by domain experts and knowledge engineers. To solve this problem, there have been some researches to semi-automatically construct ontologies. In this paper, we propose a hybrid method to extract relations from domain documents which combines a named relation approach and an unnamed relation approach. Our named relation approach is based on the Snowball system. We add the generalized pattern method to their methods. In our unnamed relation approach, we extract unnamed relations using association rules and clustering method. We also recommend candidate names of unnamed relations. We experiment and evaluate the proposed method by using Ziff documents set offered by TREC.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hearst, M.A.: Automatic Acquisition of Hyponyms from Large Text Corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (1992)

    Google Scholar 

  2. Kim, H.-s., Choi, I., Kim, M.: Refining Term Weights of Documents Using Term Dependencies. In: Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 552–553 (2004)

    Google Scholar 

  3. Lawrie, D., Croft, W.B., Rosenberg, A.: Finding Topic Words for Hierarchical Summarization. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 349–357 (2001)

    Google Scholar 

  4. Lawrie, D.J., Bruce Croft, W.: Generating Hierarchical Summaries for Web Searches. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 457–458 (2003)

    Google Scholar 

  5. Maedche, A., Staab, S.: Semi-Automatic Engineering of Ontologies from Text. In: Proceedings of the 12th International Conference on Sw Engineering and Knowledge Engineering, SEKE 2000 (2000)

    Google Scholar 

  6. Agichtein, E., Gravano, L.: Snowball: Extracting Relations from Large Plain-Text Collections. In: Proceedings of the ACM International Conference on Digital Libraries, DL 2000 (2000)

    Google Scholar 

  7. Frakes, W.B., Baeza-Yates, R.: Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs (1992)

    Google Scholar 

  8. Byrd, R.J., Ravin, Y.: Identifying and extracting relations from text. In: Proceedings of the 4th International Conference on Applications of Natural Language to Information Systems (1999)

    Google Scholar 

  9. Cui, H., Kan, M.-Y., Chua, T.-S.: Unsupervised Learning of soft patterns for generating definition. In: Proceedings of 13th International World Wide Web Conference (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Choi, I., Rho, S., Jeong, YS., Kim, M. (2011). Relation Extraction from Documents for the Automatic Construction of Ontologies. In: Park, J.J., Yang, L.T., Lee, C. (eds) Future Information Technology. Communications in Computer and Information Science, vol 184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22333-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22333-4_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22332-7

  • Online ISBN: 978-3-642-22333-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics