Abstract
The Semantic Web relies on domain ontologies that structure underlying data enabling comprehensive and transportable machine understanding. It takes so much time and efforts to construct domain ontologies because these ontologies have to be manually made by domain experts and knowledge engineers. To solve this problem, there have been some researches to semi-automatically construct ontologies. In this paper, we propose a hybrid method to extract relations from domain documents which combines a named relation approach and an unnamed relation approach. Our named relation approach is based on the Snowball system. We add the generalized pattern method to their methods. In our unnamed relation approach, we extract unnamed relations using association rules and clustering method. We also recommend candidate names of unnamed relations. We experiment and evaluate the proposed method by using Ziff documents set offered by TREC.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hearst, M.A.: Automatic Acquisition of Hyponyms from Large Text Corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (1992)
Kim, H.-s., Choi, I., Kim, M.: Refining Term Weights of Documents Using Term Dependencies. In: Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 552–553 (2004)
Lawrie, D., Croft, W.B., Rosenberg, A.: Finding Topic Words for Hierarchical Summarization. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 349–357 (2001)
Lawrie, D.J., Bruce Croft, W.: Generating Hierarchical Summaries for Web Searches. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 457–458 (2003)
Maedche, A., Staab, S.: Semi-Automatic Engineering of Ontologies from Text. In: Proceedings of the 12th International Conference on Sw Engineering and Knowledge Engineering, SEKE 2000 (2000)
Agichtein, E., Gravano, L.: Snowball: Extracting Relations from Large Plain-Text Collections. In: Proceedings of the ACM International Conference on Digital Libraries, DL 2000 (2000)
Frakes, W.B., Baeza-Yates, R.: Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs (1992)
Byrd, R.J., Ravin, Y.: Identifying and extracting relations from text. In: Proceedings of the 4th International Conference on Applications of Natural Language to Information Systems (1999)
Cui, H., Kan, M.-Y., Chua, T.-S.: Unsupervised Learning of soft patterns for generating definition. In: Proceedings of 13th International World Wide Web Conference (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choi, I., Rho, S., Jeong, YS., Kim, M. (2011). Relation Extraction from Documents for the Automatic Construction of Ontologies. In: Park, J.J., Yang, L.T., Lee, C. (eds) Future Information Technology. Communications in Computer and Information Science, vol 184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22333-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-22333-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22332-7
Online ISBN: 978-3-642-22333-4
eBook Packages: Computer ScienceComputer Science (R0)