Abstract
Creating a large-scale knowledge graph of electric power equipment faults will facilitate the development of automatic fault diagnosis and intelligent question answering (QA) in the electric power industry. However, most existing methods have lower accuracy in Chinese entity recognition, thus it is hard to build such a high-quality knowledge graph by extracting knowledge from Chinese technical literature. To solve the problem, a novel model called BERT–BiLSTM–CRF is proposed. It blends Bi-directional Encoder Representation from Transformers (BERT), Bi-directional Long Short-Term Memory (BiLSTM), and Conditional Random Field (CRF). The model firstly identifies and extracts electric power equipment entities from pre-processed Chinese technical literature. Then, the semantic relations between the entities are extracted based on the relation classification method based on dependency parsing. Finally, the extracted knowledge is stored in the Neo4j database in the form of the triplet and visualized in the form of a graph. Through the above steps, a Chinese knowledge graph of electric power equipment faults can be built. The novelty of the model just lies in its subtle blend: the BERT module can not only learn phrase-level information representation, but also learn rich semantic information features; the CRF module realizes the constraint on the label prediction value and reduces the irregular recognition rate, so the accuracy rate of entity recognition is improved. Taking the Chinese technological literature, which is about fault diagnosis of electric power equipment as the experimental object, the experimental results show that the model identifies and extracts Chinese entities more accurately than traditional methods. Thus, a comprehensive and accurate Chinese knowledge graph of electric power equipment faults could be constructed more easily.
Similar content being viewed by others
References
Qu Z, Chen S, Yang F et al (2014) An attribute reducing method for electric power big data preprocessing based on cloud computing technology. Autom Electr Power Syst 38(8):67–71
Li G, Jiao P, Wen F et al (2016) A partial order reduction based method for big data preprocessing in smart grid environment. Autom Electr Power Syst 40:98–106
Jindong C, Shengwen W (2019) Research on framework of industry-university-research information platform from perspective of consortium blockchain. J Mod Inf 39(8):143–151. https://doi.org/10.3969/j.issn.1008-0821.2019.08.018
Yan Y, Sheng G, Chen Y et al (2015) Cleaning method for big data of power transmission and transformation equipment state based on time sequence analysis. Autom Electr Power Syst 39(7):138–144
Steiner T, Verborgh R, Troncy R, Gabarro J, Van de Walle R (2012) Adding realtime coverage to the Google knowledge graph. In: 11th International Semantic Web Conference (ISWC). Citeseer, vol 914, pp 65–68
Zhao M, Wang H, Guo J et al (2019) Construction of an industrial knowledge graph for unstructured Chinese text learning. Appl Sci 9(13):2720
Liu Q, Li Y, Duan H et al (2016) Knowledge graph construction techniques. J Comput Res Dev 53(3):582–600
Banko M, Cafarella MJ, Soderland S et al (2007) Open information extraction from the web. In: IJCAI 2007, Proceedings of the 20th International Joint Conference on Artificial Intelligence, Hyderabad, India, January 6–12, 2007. Morgan Kaufmann Publishers Inc.
Bollacker KD, Evans C, Paritosh P et al (2008) Freebase: a collaboratively created graph database for structuring human knowledge. In: Sigmod Conference. ACM
Bizer C, Lehmann J, Kobilarov G et al (2009) DBpedia—a crystallization point for the Web of Data. J Web Semant 7(3):154–165
Yesha R, Gangopadhyay A, Siegel EL (2015) A graph-based method for analyzing electronic medical records, pp 1036–1041
Cheng B, Zhang Y, Cai D et al (2018) Construction of traditional Chinese medicine Knowledge Graph using Data Mining and Expert Knowledge. In: International conference on network infrastructure and digital content
Bean DM, Wu H, Iqbal E et al (2017) Knowledge graph prediction of unknown adverse drug reactions and validation in electronic health records. Sci Rep 7(1):16416
Wang W, Wang Z, Pan L et al (2016) Research on the construction of bilingual movie knowledge graph. Acta entiarum Naturalium Universitatis Pekinensis
Pappu SJ, Bhatt N, Pasumarthy R et al (2018) Identifying topology of low voltage distribution networks based on smart meter data. IEEE Trans Smart Grid 9(5):5113–5122
Chenghu L, Xin MO, Dayi XU et al (2015) Construction and application of regional dispatching power distribution automation system. Guangdong Electric Power
Yu C, Jian-Hui YE, Yong-Chao YU (2016) Research and implementation of intelligent search based on power grid ontology-based knowledge base. Power Energy 37(1):1–6
Joshi KP, Elluri L, Nagar A et al (2020) An integrated knowledge graph to automate cloud data compliance. IEEE Access 8:148541–148555
Wang H (2019) An error recognition method for power equipment defect records based on knowledge graph technology. Front Inf Technol Electron Eng 20(11):1564–1577
Tong R, Cheng-Lin S, Hao-Fen W et al (2016) Construction of traditional Chinese medicine knowledge graph and its application. J Med Inform 37(4):8–13
Mikolov T (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst 26:3111–3119
Guo H, Jiang J, Hu G et al (2005) Chinese named entity recognition based on multilevel linguistic features. In: Natural Language Processing-ijcnlp, First International Joint Conference, Hainan Island, China, March, Revised Selected Papers. DBLP
Somsap S, Seresangtakul P (2020) Isarn Dharma word segmentation using a statistical approach with named entity recognition. ACM Trans Asian Low-Resour Lang Inform Process 19(2):1–16
Gligic L, Kormilitzin A, Goldberg P et al (2020) Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks. Neural Netw 121:132–139
Sundermeyer M, Ney H, Schlüter R (2015) From feedforward to recurrent LSTM neural networks for language modeling. IEEE/ACM Trans Audio Speech Lang Process 23(3):517–529. https://doi.org/10.1109/TASLP.2015.2400218
Xiao J, Xiao S, Wang H et al (2017) Research on storage and query of massive RDF data. J Beijing Inf Sci Technol Univ 32(3):63–69
Acknowledgements
This work was supported in part by the Plan of Science and Technical Innovation Development of Jilin City (No. 201901041146), the Chinese Scholarship Council (No. 201908220190), National Key Research and Development Program (No. 2020YFB1707804), and the Science and Technical Research Project of Jilin Provincial Education Department (No. JJKH20190711KJ).
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Meng, F., Yang, S., Wang, J. et al. Creating Knowledge Graph of Electric Power Equipment Faults Based on BERT–BiLSTM–CRF Model. J. Electr. Eng. Technol. 17, 2507–2516 (2022). https://doi.org/10.1007/s42835-022-01032-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42835-022-01032-3