Relation classification via sequence features and bi-directional LSTMs

Ren, Yuanfang; Teng, Chong; Li, Fei; Chen, Bo; Ji, Donghong

doi:10.1007/s11859-017-1278-6

Computer Science
Published: 09 November 2017

Volume 22, pages 489–497, (2017)

Wuhan University Journal of Natural Sciences

Abstract

Structure features require complicated pre-processing and are likely to be domain-dependent. To reduce the time cost of pre-processing, we propose a novel neural network architecture for classifying the relation between two entities: a bi-directional long short-term memory recurrent neural network (Bi-LSTM-RNN) model based on low-cost sequence features such as words and part-of-speech (POS) tags. First, the model performs bi-directional recurrent computation along the tokens of a sentence. Then, the sequence is divided into five parts and standard pooling functions are applied over the token representations of each part. Finally, the pooled representations are concatenated and fed into a softmax layer for relation classification. We evaluate our model on two standard benchmark datasets in different domains, namely SemEval-2010 Task 8 and BioNLP-ST 2016 Task BB3. On SemEval-2010 Task 8, the performance of our model matches that of the state-of-the-art models, achieving an F₁ of 83.0%. On BioNLP-ST 2016 Task BB3, our model obtains an F₁ of 51.3%, which is comparable with that of the best system. Moreover, we find that the context between the two target entities plays an important role in relation classification and can serve as a replacement for the shortest dependency path.
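The pooling and classification steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the token representations have already been produced by a bi-directional LSTM, and it assumes the five parts are the context before the first entity, the first entity, the context between the entities, the second entity, and the context after the second entity (the abstract does not spell out the split). Max pooling, the zero vector for empty parts, and all function names are likewise illustrative choices.

```python
import math
from typing import List, Sequence, Tuple

Span = Tuple[int, int]  # half-open token span [start, end)


def split_five(n_tokens: int, e1: Span, e2: Span) -> List[Span]:
    """Assumed split: before e1, e1, between, e2, after e2."""
    (s1, t1), (s2, t2) = e1, e2
    if not (0 <= s1 < t1 <= s2 < t2 <= n_tokens):
        raise ValueError("entities must be non-empty, ordered, and non-overlapping")
    return [(0, s1), (s1, t1), (t1, s2), (s2, t2), (t2, n_tokens)]


def max_pool(vectors: Sequence[Sequence[float]], dim: int) -> List[float]:
    """Element-wise max over a part; an empty part yields a zero vector (assumption)."""
    if not vectors:
        return [0.0] * dim
    return [max(v[d] for v in vectors) for d in range(dim)]


def pooled_features(token_reprs: Sequence[Sequence[float]], e1: Span, e2: Span) -> List[float]:
    """Pool each of the five parts and concatenate the results."""
    dim = len(token_reprs[0])
    features: List[float] = []
    for start, end in split_five(len(token_reprs), e1, e2):
        features.extend(max_pool(token_reprs[start:end], dim))
    return features


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features: Sequence[float],
             weights: Sequence[Sequence[float]],
             bias: Sequence[float]) -> List[float]:
    """Linear layer followed by softmax over relation labels."""
    scores = [sum(w * x for w, x in zip(row, features)) + b
              for row, b in zip(weights, bias)]
    return softmax(scores)


if __name__ == "__main__":
    # Six tokens with 2-dimensional stand-ins for Bi-LSTM outputs (made-up values).
    reprs = [[0.1, 0.2], [0.9, -0.3], [0.4, 0.5], [-0.2, 0.7], [0.3, 0.8], [0.0, -0.1]]
    feats = pooled_features(reprs, e1=(1, 2), e2=(4, 5))
    # Three hypothetical relation labels with toy weights.
    probs = classify(feats, [[0.1] * 10, [0.0] * 10, [-0.1] * 10], [0.0, 0.0, 0.0])
    print(feats, probs)
```

In this sketch the feature vector always has five times the token dimension, regardless of sentence length, which is what lets a fixed-size softmax layer sit on top of variable-length input.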

Author information

Authors and Affiliations

School of Computer, Wuhan University, Wuhan, 430072, Hubei, China
Yuanfang Ren, Chong Teng, Fei Li, Bo Chen & Donghong Ji

Corresponding author

Correspondence to Donghong Ji.

Additional information

Foundation item: Supported by the China Postdoctoral Science Foundation (2014T70722) and the Humanities and Social Science Foundation of Ministry of Education of China (16YJCZH004)

About this article

Cite this article

Ren, Y., Teng, C., Li, F. et al. Relation classification via sequence features and bi-directional LSTMs. Wuhan Univ. J. Nat. Sci. 22, 489–497 (2017). https://doi.org/10.1007/s11859-017-1278-6

Received: 08 December 2016
Published: 09 November 2017
Issue Date: December 2017
DOI: https://doi.org/10.1007/s11859-017-1278-6

Key words

CLC number

TP 391
