A benchmark and comprehensive survey on knowledge graph entity alignment via representation learning

Zhang, Rui; Trisedya, Bayu Distiawan; Li, Miao; Jiang, Yong; Qi, Jianzhong

doi:10.1007/s00778-022-00747-z

Special Issue Paper
Published: 24 May 2022

Volume 31, pages 1143–1168, (2022)
The VLDB Journal

Rui Zhang (ORCID: orcid.org/0000-0002-8132-6250)¹, Bayu Distiawan Trisedya², Miao Li², Yong Jiang¹ & Jianzhong Qi²

Abstract

In the last few years, the interest in knowledge bases has grown exponentially in both the research community and the industry due to their essential role in AI applications. Entity alignment is an important task for enriching knowledge bases. This paper provides a comprehensive tutorial-type survey on representative entity alignment techniques that use the new approach of representation learning. We present a framework for capturing the key characteristics of these techniques, propose a benchmark addressing the limitation of existing benchmark datasets, and conduct extensive experiments using our benchmark. The framework gives a clear picture of how various techniques work. The experiments yield important results about the empirical performance of the techniques and how various factors affect the performance. One important observation not stressed by previous work is that techniques making good use of attribute triples and relation predicates as features stand out as winners. We are also the first to investigate the question of how to perform entity alignments on large-scale knowledge graphs such as the full Wikidata and Freebase (in Experiment 5).
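
As a rough illustration of what entity alignment via representation learning means in practice, the following minimal sketch assumes that entities from two knowledge graphs have already been embedded into a shared vector space, and aligns each entity of the first graph with its most similar entity in the second. It is not the method of any technique surveyed in the article; the entity names, the vectors, and the helper functions `cosine` and `align` are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def align(emb_1, emb_2):
    """Map each entity in emb_1 to its nearest neighbour in emb_2.

    Assumption: both dictionaries hold embeddings that live in the same
    vector space, so that similar vectors indicate the same real-world entity.
    """
    return {
        e1: max(emb_2, key=lambda e2: cosine(v1, emb_2[e2]))
        for e1, v1 in emb_1.items()
    }


# Hypothetical toy embeddings (not taken from the article or its benchmark).
kg1 = {
    "kg1:Melbourne": [0.9, 0.1, 0.0],
    "kg1:Shenzhen": [0.1, 0.8, 0.3],
}
kg2 = {
    "kg2:City_of_Melbourne": [0.85, 0.15, 0.05],
    "kg2:Shenzhen_City": [0.05, 0.9, 0.25],
}

for source, target in align(kg1, kg2).items():
    print(source, "->", target)
```

Greedy nearest-neighbour matching is used here only because it is the simplest inference step; it does not enforce one-to-one alignments, which is one reason assignment-style matching is sometimes used instead.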

Notes

Our benchmark and all the code for our experiments are available at https://github.com/ruizhang-ai/EA_for_KG.
https://wiki.dbpedia.org/downloads-2016-10.

Author information

Authors and Affiliations

Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
Rui Zhang & Yong Jiang
The University of Melbourne, Parkville, Australia
Bayu Distiawan Trisedya, Miao Li & Jianzhong Qi

Corresponding author

Correspondence to Rui Zhang.

About this article

Cite this article

Zhang, R., Trisedya, B.D., Li, M. et al. A benchmark and comprehensive survey on knowledge graph entity alignment via representation learning. The VLDB Journal 31, 1143–1168 (2022). https://doi.org/10.1007/s00778-022-00747-z

Received: 15 March 2021
Revised: 17 January 2022
Accepted: 26 March 2022
Published: 24 May 2022
Issue Date: September 2022
DOI: https://doi.org/10.1007/s00778-022-00747-z
