Joint entity and relation extraction model based on directed-relation GAT oriented to Chinese patent texts

Zhao, Yushan; Li, Kuan-Ching; Wang, Tengke; Zhang, Shunxiang

doi:10.1007/s00500-024-09629-8

Joint entity and relation extraction model based on directed-relation GAT oriented to Chinese patent texts

Application of soft computing
Published: 08 February 2024

(2024)
Cite this article

Soft Computing Aims and scope Submit manuscript

Yushan Zhao ORCID: orcid.org/0000-0003-1147-4802¹,
Kuan-Ching Li¹,
Tengke Wang² &
…
Shunxiang Zhang²

838 Accesses
Explore all metrics

Abstract

The joint extraction task aims to construct an entity-relation triple comprising two entities and the relation between them. Existing joint models make it difficult to process too many overlapping relations in Chinese patent texts (CPT). This article introduces a joint entity and relation extraction model based on directed-relation graph attention network (DGAT) oriented to CPT to locate this problem. First, word-character tokens are obtained from CPT using BERT as the DGAT model input. Global tokens are expanded using the BiLSTM network to enhance contextual connection from the model input. Second, the DGAT model encodes the global tokens as a fully connected graph whose nodes represent the global tokens and edges denote the relations between global tokens. The edges with directed relation in the fully connected graph are assigned weights by the DGAT model, and other edges are pruned, resulting in a directed-relation-connected graph. Finally, the entity-relation triples are decoded using conditional random fields (CRF) from the directed relation-connected graph. Experimental results show that the proposed model was highly accurate based on the CPT dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Modeling Relational Data with Graph Convolutional Networks

Deep learning for named entity recognition: a survey

Article 28 March 2024

Event detection from text using path-aware graph convolutional network

Article 31 July 2021

Data availability

Data cannot be available for privacy reasons.

References

Bekoulis G, Deleu J, Demeester T, Develder C (2018) Joint entity recognition and relation extraction as a multi-head selection problem. Expert Syst Appl 114:34–45. https://doi.org/10.1016/j.eswa.2018.07.032
Article Google Scholar
Dai D, Xiao X, Lyu Y, Dou S, She Q, Wang H (2019) Joint extraction of entities and overlapping relations using position-attentive sequence labeling. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 6300–6308. https://doi.org/10.1609/aaai.v33i01.33016300
Eberts M, Ulges A (2020) Span-based joint entity and relation extraction with transformer pre-training. In: the 24th European conference on artificial intelligence 2020, pp 2006–2013. https://doi.org/10.3233/FAIA200321
Fu T-J, Li P-H, Ma W-Y (2019) Graphrel: modeling text as relational graphs for joint entity and relation extraction. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 1409–1418. https://doi.org/10.18653/v1/P19-1136
Gao C, Zhang X, Liu H, Yun W, Jiang J (2022) A joint extraction model of entities and relations based on relation decomposition. Int J Mach Learn Cybern 13:1833–1845. https://doi.org/10.1007/s13042-021-01491-6
Article Google Scholar
Gardent C, Shimorina A, Narayan S, Perez-Beltrachini L (2017) The WEBNLG challenge: generating text from RDF data. In: Proceedings of the 10th international conference on natural language generation, pp 124–133. https://doi.org/10.18653/v1/W17-3518
Geng Z, Zhang Y, Han Y (2021) Joint entity and relation extraction model based on rich semantics. Neurocomputing 429:132–140. https://doi.org/10.1016/j.neucom.2020.12.037
Article Google Scholar
Grohe M (2021) The logic of graph neural networks. In: Proceedings of the 36th annual ACM/IEEE symposium on logic in computer science. https://doi.org/10.1109/LICS52264.2021.9470677
Guo Z, Zhang Y, Lu W (2020) Attention guided graph convolutional networks for relation extraction. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp 241–251. https://doi.org/10.18653/v1/P19-1024
Guo M, Lu C, Liu Z, Cheng M, Hu S (2023) Visual attention network. Comput Vis Media 9(4):733–752. https://doi.org/10.1007/s41095-023-0364-2
Article Google Scholar
Hu Z, Dong Y, Wang K, Chang K-W, Sun Y (2020) Gpt-gnn: Generative pre-training of graph neural networks. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1857–1867. https://doi.org/10.1145/3394486.3403237
Huang J, Shen H, Hou L, Cheng X (2021) SDGNN: learning node representation for signed directed networks. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 196–203. https://doi.org/10.1609/aaai.v35i1.16093
Ji S, Pan S, Cambria E, Marttinen P, Yu PS (2022) A survey on knowledge graphs: representation, acquisition, and applications. IEEE Trans Neural Netw Learn Syst 33(2):494–514. https://doi.org/10.1109/TNNLS.2021.3070843
Article MathSciNet PubMed Google Scholar
Lai T, Cheng L, Wang D, Ye H, Zhang W (2022) RMAN: relational multi-head attention neural network for joint extraction of entities and relations. Appl Intell 52(3):3132–3142. https://doi.org/10.1007/s10489-021-02600-2
Article Google Scholar
Li Q, Ji H (2014) Incremental joint extraction of entity mentions and relations. In: Proceedings of the 52nd annual meeting of the association for computational linguistics (vol 1: long papers), pp 402–412. https://doi.org/10.3115/v1/p14-1038
Li X, Luo X, Dong C, Yang D, Luan B, He Z (2021) TDEER: an efficient translating decoding schema for joint extraction of entities and relations. In: Proceedings of the 2021 conference on empirical methods in natural language processing, pp 8055–8064. https://doi.org/10.18653/v1/2021.emnlp-main.635
Li J, Sun A, Han J, Li C (2022) A survey on deep learning for named entity recognition. IEEE Trans Knowl Data Eng 34(1):50–70. https://doi.org/10.1109/TKDE.2020.2981314
Article ADS Google Scholar
Liu K (2020) A survey on neural relation extraction. Sci China Technol Sci 63(10):1971–1989. https://doi.org/10.1007/s11431-020-1673-6
Article ADS Google Scholar
Liu W, Fu X, Zhang Y, Xiao W (2021) Lexicon enhanced chinese sequence labeling using Bert adapter. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (vol 1: long papers), pp 5847–5858. https://doi.org/10.18653/v1/2021.acl-long.454
Liu P, Guo Y, Wang F, Li G (2022) Chinese named entity recognition: the state of the art. Neurocomputing 473:37–53. https://doi.org/10.1016/j.neucom.2021.10.101
Article Google Scholar
Miwa M, Bansal M (2016) End-to-end relation extraction using lstms on sequences and tree structures. In: Proceedings of the 54th annual meeting of the association for computational linguistics (vol 1: long papers), pp 1105–1116. https://doi.org/10.18653/v1/P16-1105
Miwa M, Sasaki Y (2014) Modeling joint entity and relation extraction with table representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp 1858–1869. https://doi.org/10.3115/v1/D14-1200
Nayak T, Ng HT (2020) Effective modeling of encoder–decoder architecture for joint entity and relation extraction. Proc AAAI Conf Artif Intell 34(05):8528–8535. https://doi.org/10.1609/aaai.v34i05.6374
Article Google Scholar
Nimmani P, Vodithala S, Polepally V (2021) Neural network based integrated model for information retrieval. In: 2021 5th International conference on intelligent computing and control systems, pp 1286–1289. https://doi.org/10.1109/ICICCS51141.2021.9432241
Niu Z, Zhong G, Yu H (2021) A review on the attention mechanism of deep learning. Neurocomputing 452:48–62. https://doi.org/10.1016/j.neucom.2021.03.091
Article Google Scholar
Qiao B, Zou Z, Huang Y, Fang K, Zhu X, Chen Y (2022) A joint model for entity and relation extraction based on BERT. Neural Comput Appl 34(5):3471–3481. https://doi.org/10.1007/s00521-021-05815-z
Article Google Scholar
Shang Y-M, Huang H, Mao X (2022) Onerel: Joint entity and relation extraction with one module in one step. In: Proceedings of the AAAI conference on artificial intelligence, vol 36, pp 11285–11293. https://doi.org/10.1609/aaai.v36i10.21379
Shen Y, Fu H, Du Z, Chen X, Burnaev E, Zorin D, Zhou K, Zheng Y (2022) GCN-denoiser: mesh denoising with graph convolutional networks. ACM Trans Graph. https://doi.org/10.1145/3480168
Article Google Scholar
Takanobu R, Zhang T, Liu J, Huang M (2019) A hierarchical framework for relation extraction with reinforcement learning. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 7072–7079. https://doi.org/10.1609/aaai.v33i01.33017072
Tian T, Liu Y, Yang X, Lyu Y, Zhang X, Fang B (2020) QSAN: a quantum-probability based signed attention network for explainable false information detection. In: Proceedings of the 29th ACM international conference on information and knowledge management, pp 1445–1454. https://doi.org/10.1145/3340531.3411890
Wan Q, Wei L, Chen X, Liu J (2021) A region-based hypergraph network for joint entity-relation extraction. Knowl Based Syst 228:107298. https://doi.org/10.1016/j.knosys.2021.107298
Article Google Scholar
Wang S, Zhang Y, Che W, Liu T (2018) Joint extraction of entities and relations based on a novel graph scheme. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, pp 4461–4467. https://doi.org/10.24963/ijcai.2018/620
Wang X, Huang T, Wang D, Yuan Y, Liu Z, He X, Chua T-S (2021) Learning intents behind interactions with knowledge graph for recommendation. In: Proceedings of the web conference 2021, pp 878–887. https://doi.org/10.1145/3442381.3450133
Wei Z, Su J, Wang Y, Tian Y, Chang Y (2020a) A novel cascade binary tagging framework for relational triple extraction. https://doi.org/10.18653/v1/2020.acl-main.136
Wei Z, Su J, Wang Y, Tian Y, Chang Y (2020b) A novel cascade binary tagging framework for relational triple extraction. In: Proceedings of the 58th annual meeting of the association for computational linguistics, pp 1476–1488. https://doi.org/10.18653/v1/2020.acl-main.136
Ye Y, Ji S (2021) Sparse graph attention networks. IEEE Trans Knowl Data Eng 35:905–916. https://doi.org/10.1109/TKDE.2021.3072345
Ye D, Lin Y, Li P, Sun M (2022) Packed levitated marker for entity and relation extraction. In: Proceedings of the 60th annual meeting of the association for computational linguistics (vol 1: long papers), pp 4904–4917. https://doi.org/10.18653/v1/2022.acl-long.337
Yuan Y, Zhou X, Pan S, Zhu Q, Song Z, Guo L (2020) A relation-specific attention network for joint entity and relation extraction. In: Proceedings of the twenty-ninth international joint conference on artificial intelligence, pp 4054–4060. https://doi.org/10.24963/ijcai.2020/561
Zeng X, Zeng D, He S, Liu K, Zhao J (2018) Extracting relational facts by an end-to-end neural model with copy mechanism. In: Proceedings of the 56th annual meeting of the association for computational linguistics (vol 1: long papers), pp 506–514. https://doi.org/10.18653/v1/P18-1047
Zeng X, He S, Zeng D, Liu K, Liu S, Zhao J (2019) Learning the extraction order of multiple relational facts in a sentence with reinforcement learning. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing, pp 367–377. https://doi.org/10.18653/v1/D19-1035
Zhang Y, Qi P, Manning CD (2018) Graph convolution over pruned dependency trees improves relation extraction. In: Proceedings of the 2018 conference on empirical methods in natural language processing, pp 2205–2215. https://doi.org/10.18653/v1/D18-1244
Zhang S, Hu Z, Zhu G, Jin M, Li K-C (2021) Sentiment classification model for Chinese micro-blog comments based on key sentences extraction. Soft Comput 25(1):463–476. https://doi.org/10.1007/s00500-020-05160-8
Article Google Scholar
Zhang S, Zhu H, Xu H, Zhu G, Li K-C (2022a) A named entity recognition method towards product reviews based on BiLSTM-attention-CRF. Int J Comput Sci Eng 25(5):479–489. https://doi.org/10.1504/IJCSE.2022.126251
Zhang S, Xu H, Zhu G, Chen X, Li K (2022b) A data processing method based on sequence labeling and syntactic analysis for extracting new sentiment words from product reviews. Soft Comput 26(2):853–866. https://doi.org/10.1007/s00500-021-06228-9
Zhang S, Wu H, Xu X, Zhu G, Hsieh M-Y (2022c) CL-ECPE: contrastive learning with adversarial samples for emotion-cause pair extraction. Connect Sci 34(1):1877–1894. https://doi.org/10.1080/09540091.2022.2082383
Zhang N, Deng S, Ye H, Zhang W, Chen H (2022d) Robust triple extraction with cascade bidirectional capsule network. Expert Syst Appl 187:115806. https://doi.org/10.1016/j.eswa.2021.115806
Zhao K, Xu H, Cheng Y, Li X, Gao K (2021) Representation iterative fusion based on heterogeneous graph neural network for joint entity and relation extraction. Knowl Based Syst 219:106888. https://doi.org/10.1016/j.knosys.2021.106888
Article Google Scholar
Zheng S, Hao Y, Lu D, Bao H, Xu J, Hao H, Xu B (2017a) Joint entity and relation extraction based on a hybrid neural network. Neurocomputing 257:59–66. https://doi.org/10.1016/j.neucom.2016.12.075
Zheng S, Wang F, Bao H, Hao Y, Zhou P, Xu B (2017b) Joint extraction of entities and relations based on a novel tagging scheme. In: Proceedings of the 55th annual meeting of the association for computational linguistics, vol 1: long papers, pp 1227–1236. https://doi.org/10.18653/v1/P17-1113
Zhong Z, Chen D (2021) A frustratingly easy approach for entity and relation extraction. In: Proceedings of the 2021 conference of the North American chapter of the association for computational linguistics: human language technologies, pp 50–61. https://doi.org/10.18653/v1/2021.naacl-main.5
Zhou W, Huang K, Ma T, Huang J (2021) Document-level relation extraction with adaptive thresholding and localized context pooling. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 14612–14620. https://doi.org/10.1609/aaai.v35i16.17717
Zhu G, Sun Z, Zhang S, Wei S, Li K (2022) Causality extraction model based on two-stage GCN. Soft Comput 26(24):13815–13828. https://doi.org/10.1007/s00500-022-07370-8

Download references

Funding

This work was supported by the Anhui University Postgraduate Scientific Research Project (Grant No. YJS20210368), the National Natural Science Foundation of China (Grant NO.62076006), and the National Natural Science Foundation of China (Grant No. 60973050).

Author information

Authors and Affiliations

School of Mathematics and Big Data, Anhui University of Science and Technology, Taifeng Street, Huainan, 232001, Anhui, China
Yushan Zhao & Kuan-Ching Li
School of Computer Science and Engineering, Anhui University of Science and Technology, Taifeng Street, Huainan, 232001, Anhui, China
Tengke Wang & Shunxiang Zhang

Authors

Yushan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Kuan-Ching Li
View author publications
You can also search for this author in PubMed Google Scholar
Tengke Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shunxiang Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Kuan-Ching Li or Shunxiang Zhang.

Ethics declarations

Conflict of interest

The author Yushan Zhao declares that she has no conflict of interest. The author Kuan-Ching Li declares that she has no conflict of interest. The author Tengke Wang declares that she has no conflict of interest. The author Shunxiang Zhang declares that she has no conflict of interest. Also this manuscript is approved by all the authors for publication. Yushan Zhao would like to declare on behalf of all the co-authors that the work described was original research that has not been published previously. All the authors listed have approved the manuscript that is enclosed.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

No humans or any individual participants are involved in this study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhao, Y., Li, KC., Wang, T. et al. Joint entity and relation extraction model based on directed-relation GAT oriented to Chinese patent texts. Soft Comput (2024). https://doi.org/10.1007/s00500-024-09629-8

Download citation

Accepted: 30 December 2023
Published: 08 February 2024
DOI: https://doi.org/10.1007/s00500-024-09629-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Joint entity and relation extraction model based on directed-relation GAT oriented to Chinese patent texts

Abstract

Access this article

Similar content being viewed by others

Modeling Relational Data with Graph Convolutional Networks

Deep learning for named entity recognition: a survey

Event detection from text using path-aware graph convolutional network

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Joint entity and relation extraction model based on directed-relation GAT oriented to Chinese patent texts

Abstract

Access this article

Similar content being viewed by others

Modeling Relational Data with Graph Convolutional Networks

Deep learning for named entity recognition: a survey

Event detection from text using path-aware graph convolutional network

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation