Skip to main content
Log in

Cross-platform product matching based on entity alignment of knowledge graph with raea model

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Product matching aims to identify identical or similar products sold on different platforms. By building knowledge graphs (KGs), the product matching problem can be converted to the Entity Alignment (EA) task, which aims to discover the equivalent entities from diverse KGs. The existing EA methods inadequately utilize both attribute triples and relation triples simultaneously, especially the interactions between them. This paper introduces a two-stage pipeline consisting of rough filter and fine filter to match products from eBay and Amazon. For fine filtering, a new framework for Entity Alignment, R elation-aware and A ttribute-aware Graph Attention Networks for E ntity A lignment (RAEA), is employed. RAEA focuses on the interactions between attribute triples and relation triples, where the entity representation aggregates the alignment signals from attributes and relations with Attribute-aware Entity Encoder and Relation-aware Graph Attention Networks. The experimental results indicate that the RAEA model achieves significant improvements over 12 baselines on EA task in the cross-lingual dataset DBP15K (6.59% on average Hits@1) and delivers competitive results in the monolingual dataset DWY100K. The source code for experiments on DBP15K and DWY100K is available at github (https://github.com/Mockingjay-liu/RAEA-model-for-Entity-Alignment).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

Data Availability

All datasets are open-source and the sources are cited, except for the eBay product data due to the business interest.

Notes

  1. https://nijianmo.github.io/amazon/index.html

  2. https://kgma.github.io/

  3. https://github.com/PaddlePaddle/PaddleNLP/tree/develop/model_zoo/uie

  4. https://huggingface.co/sentence-transformers/paraphrase-multilingual-MPnet-base-v2

References

  1. Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating embeddings for modeling multi-relational data. In: Advances in Neural Information Processing Systems, pp. 26 (2013)

  2. Cao, Y., Liu, Z., Li, C., Liu, Z., Li, J.-Z., Chua, T.-S.: Multi-channel graph neural network for entity alignment. In: ACL (2019)

  3. Chen, L., Li, Z., Xu, T., Wu, H., Wang, Z., Yuan, N.J., Chen, E.: Multi-modal siamese network for entity alignment. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD ’22, pp. 118–126, New York,. Association for Computing Machinery (2022)

  4. Chen, M., Zhang, Y., Kou, X., Li, Y., Zhang, Y.: R-gat: relational graph attention network for multi-relational graphs. ArXiv2109, :05922 (2021)

  5. Chen, M., Tian, Y., Chang, K.-W., Skiena, S., Zaniolo, C.: Co-training embeddings of knowledge graphs and entity descriptions for cross-lingual entity alignment. In: IJCAI (2018)

  6. Chen, M., Tian, Y., Yang, M., Zaniolo, C.: Multilingual knowledge graph embeddings for cross-lingual knowledge alignment. In: IJCAI (2017)

  7. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: Bert: pre-training of deep bidirectional transformers for language understanding. ArXiv1810, :04805 (2019)

  8. Dong, X.L., He, X., Kan, A., Li, X., Liang, Y., Ma, J., Xu, Y.E., Zhang, C., Zhao, T., Saldana, G.B., et al.: Autoknow: self-driving knowledge collection for products of thousands of types. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 2724–2734 (2020)

  9. Duchi, J., Hazan, E., Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization, Journal of machine learning research, 12(7) (2011)

  10. Fu, C., Wang, T., Nie, H., Han, X., Sun, L. 0001: Iscas_icip at mwpd-2020 task 1: Product matching based on deep entity matching frameworks. In: MWPD@ ISWC (2020)

  11. Gao, T., Yao, X., Chen, D.: Simcse: simple contrastive learning of sentence embeddings. In: EMNLP (2021)

  12. Gao, Y., Liu, X., Wu, J., Li, T., Wang, P., Chen, L.: Clusterea: scalable entity alignment with stochastic training and normalized mini-batch similarities. In: KDD (2022)

  13. Ge, C., Liu, X., Chen, L., Gao, Y., Zheng, B.: LargeEA. In: Proceedings of the VLDB Endowment, 15(2):237–245 (2021)

  14. Hao, Y., Zhang, Y., He, S., Liu, K., Zhao, J.: A joint embedding method for entity alignment of knowledge bases. In: CCKS (2016)

  15. Li, B., Zhou, H., He, J., Wang, M., Yang, Y., Li, L.: On the sentence embeddings from pre-trained language models. In: EMNLP (2020)

  16. Li, C., Cao, Y., Hou, L., Shi, J., Li, J., Chua, T.-S.: Semi-supervised entity alignment via joint knowledge embedding model and cross-graph model. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (2019)

  17. Li, C., Cao, Y., Hou, L., Shi, J., Li, J., Chua, T.-S.: Semi-supervised entity alignment via joint knowledge embedding model and cross-graph model. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 2723–2732 (2019)

  18. Xinhang L., Zhang, Y., Xing, C.: Jointly learning knowledge embedding and neighborhood consensus with relational knowledge distillation for entity alignment. In: Proceedings of the ACM Web Conference 2022 arXiv (2022)

  19. Lin, Y., Liu, Z., Luan, H., Sun, M., Rao, S., Liu, S.: Modeling relation paths for representation learning of knowledge bases. In: EMNLP (2015)

  20. Lin, Z., Zhang, Z., Wang, M., Shi, Y., Wu, X., Zheng, Y.: Multi-modal contrastive representation learning for entity alignment (2022)

  21. Liu, B., Hua, W., Zuccon, G., Zhao, G., Zhang, X.: High-quality task division for large-scale entity alignment. In: Proceedings of the 31st ACM International Conference on Information and Knowledge Management ACM (2022)

  22. Liu, Z., Cao, Y., Pan, L., Li, J., Chua, T.-S.: Exploring and evaluating attributes, values, and structures for entity alignment. In: The Conference on Empirical Methods in Natural Language Processing (EMNLP), p 10 (2020)

  23. Mao, X., Wang, W., Yuanbin, W., Lan, M.: Boosting the speed of entity alignment 10 : Dual attention matching network with normalized hard sample mining. In: Proceedings of the Web Conference 2021, WWW ’21, Association for Computing Machinery, pp. 821–832 (2021)

  24. Mao, X., Wang, W., Yuanbin, W., Lan, M.: Lightea: A scalable, robust, and interpretable entity alignment framework via three-view label propagation arXiv (2022)

  25. Mao, X., Wang, W., Xu, H., Lan, M., Yuanbin, W.: Mraea: An efficient and robust entity alignment approach for cross-lingual knowledge graph : 420–428 (2020)

  26. Ni, J., Li, J., McAuley, J.: Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 188–197 (2019)

  27. Peeters, R., Bizer, C.: Cross-language learning for product matching. WWW Companion, 2022a. https://doi.org/10.1145/3487553 pp 3524234 (2022)

  28. Peeters, R., Bizer, C., Glavaš, G.: Intermediate training of bert for product matching. small 745(722), 2–112 (2020)

    Google Scholar 

  29. Reimers, N., Gurevych, I.: Sentence-bert: sentence embeddings using siamese bert-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, pp. 11 (2019)

  30. Song, K., Tan, X., Qin, T., Lu, J., Liu, T.-Y., Lin, H.: Mpnet: Masked and permuted pre-training for language understanding. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F. (eds.) Advances in Neural Information Processing Systems, 33, pp. 16857–16867. Curran Associates, Inc. (2020)

  31. Sun, Z., Hu, W., Li, C.: Cross-lingual entity alignment via joint attribute-preserving embedding. In: International Semantic Web Conference, pp. 628–644. Springer (2017)

  32. Sun, Z., Hu, W., Zhang, Q., Qu, Y.: Bootstrapping entity alignment with knowledge graph embedding. In: IJCAI, vol. 18, pp 4396–4402 (2018)

  33. Sun, Z., Huang, J., Hu, W., Chen, M., Guo, L., Qu, Y.: Transedge: Translating relation-contextualized embeddings for knowledge graphs. In: International Semantic Web Conference, pp 612–629. Springer (2019)

  34. Sun, Zequn, Wang, Chengming, Wei, H u, Chen, Muhao, Dai, Jian, Zhang, Wei, Yuzhong, Q u: Knowledge graph alignment network with gated multi-hop neighborhood aggregation. In: Proceedings of the AAAI Conference on Artificial Intelligence, 34, pp 222–229 (2020)

  35. Surisetty, A., Chaurasiya, D., Kumar, N., Singh, A., Dhama, G., Malhotra, A., Arora, A., Dey, V.: Reps: Relation, position and structure aware entity alignment. In: Companion Proceedings of the Web Conference 2022, WWW ’22, pp 1083–1091, New York,. Association for Computing Machinery (2022)

  36. Trisedya, Bayu Distiawan, Qi, Jianzhong, Zhang, Rui: Entity alignment between knowledge graphs using attribute embeddings. In: Proceedings of the AAAI Conference on Artificial Intelligence 33, pp. 297–304 (2019)

  37. Wang, Y., Wang, L., Li, Y., Di, H., Chen, Wei, Liu, T.-Y.: A theoretical analysis of ndcg ranking measures. In: Proceedings of the 26th annual conference on learning theory (COLT 2013), vol. 8, p 6 (2013)

  38. Wang, Z., Lv, Q., Lan, X., Zhang, Y.: Cross-lingual knowledge graph alignment via graph convolutional networks. In: Proceedings of the 2018 conference on empirical methods in natural language processing, pp 349–357 (2018)

  39. Yuting, W., Liu, X., Feng, Y., Wang, Z., Yan, R., Zhao, D.: Relation-aware entity alignment for heterogeneous knowledge graphs. 1908.08210. In: IJCAI (2019)

  40. Yuting, W., Liu, X., Feng, Y., Wang, Z., Zhao, D.: Neighborhood matching network for entity alignment. In: ACL, p 11 (2020)

  41. Xin, K., Sun, Z., Hua, W., Hu, W., Qu, J., Zhou, X.: Large-scale entity alignment via knowledge graph merging, partitioning and embedding. In: Proceedings of the 31st ACM International Conference on Information and Knowledge Management ACM (2022)

  42. Xu, C., Su, F., Lehmann, J.: Time-aware graph neural network for entity alignment between temporal knowledge graphs. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp 8999–9010, Online and Punta Cana, Dominican Republic, Association for Computational Linguistics (2021)

  43. Xu, C., Su, F., Bo, X., Lehmann, J.: Time-aware Entity Alignment Using Temporal Relational Attention. In: Proceedings of the ACM Web Conference 2022, WWW ’22,788–797, New York, Association for Computing Machinery (2022)

  44. Xu, D., Chuanwei, R., Evren, K., Sushant, K., Kannan, A.: Product knowledge graph embedding for e-commerce. In: Proceedings of the 13th international conference on Web search and data mining, pp 672–680 (2020)

  45. Xu, K., Wang, L., Yu, M., Feng, Y., Song, Y., Wang, Z., Yu, D.: Cross-lingual knowledge graph alignment via graph matching neural network. In: The 57th Annual Meeting of the Association for Computational Linguistics (ACL) (2019)

  46. Yang, Z., Dai, Z., Yang, Y., Carbonell, J., R Salakhutdinov, R., V Le, Q.: Xlnet: Generalized autoregressive pretraining for language understanding. In: Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 32. Curran Associates, Inc (2019)

  47. Ye, R., Li, X., Fang, Y., Zang, H., Wang, M.: A vectorized relational graph convolutional network for multi-relational network alignment. In: IJCAI, pp. 4135–4141 (2019)

  48. Yu, D., Yiming, Y., Ruohong, Z., Wu, Y.: Knowledge embedding based graph convolutional network. In: Proceedings of the Web Conference vol. 2021, pp. 1619–1628 (2021)

  49. Zeng, W., Zhao, X., Li, X., Tang, J., Wang, W.: On entity alignment at scale. VLDB J. 31(5), 1009–1033 (2022)

    Article  Google Scholar 

  50. Zhang, Q., Sun, Z., Hu, W., Chen, M., Guo, L., Qu, Y.: Multi-view knowledge graph embedding for entity alignment. In: In the 28th International Joint Conference on Artificial Intelligence (IJCAI) (2019)

  51. Zhang, R., Trisedy, B.D., Li, M., Jiang, Y., Qi, J.: A comprehensive survey on knowledge graph entity alignment via representation learning. arXiv preprint arXiv:2103.15059 (2021)

  52. Zhong, Z., Zhang, M., Fan, J., Dou, C.: Semantics driven embedding learning for effective entity alignment. In: 2022 IEEE 38th International Conference on Data Engineering (ICDE), pp. 2127–2140 (2022)

  53. Zhu, H., Xie, R., Liu, Z., Sun, M.: Iterative entity alignment via knowledge embeddings. In: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) (2017)

  54. Zhu, R., Ma, M., Wang, P.: Raga: relation-aware graph attention networks for global entity alignment. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp 501–513. Springer (2021)

Download references

Funding

This study was supported partly by the National Key Research and Development Program of China (No. 2019YFE0198600), and partly by InnoHK initiative, the government of the HKSAR, and the Laboratory for AI-Powered Financial Technologies.

Author information

Authors and Affiliations

Authors

Contributions

QZ and HY initialized the project and managed the study. WL, JP, XZhang, XG, YY and XZhao collected the data and performed formal analysis. All authors analyzed the data. WL and QZ wrote the initial manuscript. All authors contributed to the editing of the paper.

Corresponding author

Correspondence to Qingpeng Zhang.

Ethics declarations

Ethics approval and consent to participate

No ethical issue involved.

Competing interests

No competing interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, W., Pan, J., Zhang, X. et al. Cross-platform product matching based on entity alignment of knowledge graph with raea model. World Wide Web 26, 2215–2235 (2023). https://doi.org/10.1007/s11280-022-01134-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-022-01134-y

Keywords

Navigation