Enhancing Relation Extraction by Eliciting Selectional Constraint Features from Wikipedia

Wang, Gang; Zhang, Huajie; Wang, Haofen; Yu, Yong

doi:10.1007/978-3-540-73351-5_29

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4592)

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

Abstract

Selectional constraints are usually checked when detecting semantic relations. Previous work usually defined the constraints manually based on a hand-crafted concept taxonomy, which is time-consuming and impractical for large-scale relation extraction. Further, the determination of entity type (e.g. NER) based on the taxonomy cannot achieve sufficiently high accuracy. In this paper, we propose a novel approach to extracting relation instances using features elicited from Wikipedia, a free online encyclopedia. The features are represented as selectional constraints and further employed to enhance the extraction of relations. We conduct case studies on the validation of the extracted instances for two common relations, hasArtist(album, artist) and hasDirector(film, director). Substantially high extraction precision (around 0.95) and validation accuracy (near 0.90) are obtained.
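
To illustrate the general idea of validating a candidate relation instance against selectional constraints, here is a minimal Python sketch. It is not the method of the paper: the entity features, the keyword-based constraints, and the names ENTITY_FEATURES, CONSTRAINTS, satisfies and validate are all hypothetical and hand-written for illustration, whereas the abstract describes features elicited from Wikipedia.

```python
from typing import Dict, Set, Tuple

# Hypothetical toy data: category-like features attached to each entity.
# In the paper such features are elicited from Wikipedia; here they are hard-coded.
ENTITY_FEATURES: Dict[str, Set[str]] = {
    "Abbey Road": {"1969 albums", "The Beatles albums"},
    "The Beatles": {"English rock music groups", "Musical quartets"},
    "Liverpool": {"Cities in England", "Port cities"},
}

# Hypothetical constraints: relation -> (keywords for argument 1, keywords for argument 2).
CONSTRAINTS: Dict[str, Tuple[Set[str], Set[str]]] = {
    "hasArtist": ({"albums"}, {"music groups", "musicians"}),
}


def satisfies(entity: str, keywords: Set[str]) -> bool:
    """Return True if any feature of the entity contains any constraint keyword."""
    features = ENTITY_FEATURES.get(entity, set())
    return any(k in f.lower() for f in features for k in keywords)


def validate(relation: str, subject: str, obj: str) -> bool:
    """Accept a candidate instance only if both arguments meet their constraint."""
    subj_keywords, obj_keywords = CONSTRAINTS[relation]
    return satisfies(subject, subj_keywords) and satisfies(obj, obj_keywords)


if __name__ == "__main__":
    print(validate("hasArtist", "Abbey Road", "The Beatles"))
    print(validate("hasArtist", "Abbey Road", "Liverpool"))
```

In this toy setting the first candidate is accepted and the second is rejected, because "Liverpool" carries no feature that matches the artist-side constraint. A realistic system would learn which features are discriminative rather than rely on fixed keywords.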

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai, 200240, China
Gang Wang, Huajie Zhang, Haofen Wang & Yong Yu

Editor information

Zoubida Kedad, Nadira Lammari, Elisabeth Métais, Farid Meziane, Yacine Rezgui

About this paper

Cite this paper

Wang, G., Zhang, H., Wang, H., Yu, Y. (2007). Enhancing Relation Extraction by Eliciting Selectional Constraint Features from Wikipedia. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_29

DOI: https://doi.org/10.1007/978-3-540-73351-5_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73350-8
Online ISBN: 978-3-540-73351-5
eBook Packages: Computer Science, Computer Science (R0)
