Abstract
The real world data sets with multi-typed objects and multi-typed relations can be structured as heterogeneous information networks (HIN). Clustering is one of the most significant process in HIN since it provides useful insights of hidden patterns of objects and their complex relation structure. However, grouping multi-relational target objects without losing their rich semantics and unknown number of clusters is a challenging task. Hence, we use the meta-path concepts to compute the similarity matrix between each pair of objects by exploring the different relations to preserve their semantics. Subsequently, we employ the Affinity Propagation (AP) clustering approach that can automatically generate clusters and corresponding exemplars (cluster center) for each object based on the similarity matrix. The basic motivation of using AP algorithm is its effectiveness, scalability and the speed on detecting community/clustering of networked data and yet it has not been applied in HIN. However, the performance of AP algorithm depends on two parameters: i) preference p and ii) damping factor \(\lambda \) which causes the algorithm to be non-converged and produce unsatisfactory clustering results. Although some existing methods have been developed to handle this issue, it still faces two challenges: i) slow convergence ii) high computation for finding optimal clustering. In this paper, we presented an enhanced AP (EAP) clustering approach to overcome this issue by updating their parameter values based on different strategies, to improve the AP performance on an HIN data set. The experimental results show that the proposed method can accelerate the algorithm’s convergence to evaluate optimal clustering compared to the other methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Shi, C., Li, Y., Zhang, J., Sun, Y., Philip, S.Y.: A survey of heterogeneous information network analysis. IEEE Trans. Knowl. Data Eng. 29(1), 17–37 (2016)
Sun, Y., Han, J.: Mining heterogeneous information networks: a structural analysis approach. ACM SIGKDD Explor. Newsl. 14(2), 20–28 (2013)
Yi, D., Su, J., Liu, C., Chen, W.-H.: Data-driven situation awareness algorithm for vehicle lane change. In: 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 998–1003. IEEE (2016)
Sun, Y., Han, J.: Mining Heterogeneous Information Networks: Principles and Methodologies. Morgan & Claypool Publishers (2012)
Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: PathSim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)
Zhang, J., Yu, P.S., Zhou, Z.-H.: Meta-path based multi-network collective link prediction. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1286–1295 (2014)
Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Vlasblom, J., Wodak, S.J.: Markov clustering versus affinity propagation for the partitioning of protein interaction graphs. BMC Bioinform. 10(1), 1–14 (2009)
Wang, K., Zhang, J., Li, D., Zhang, X., Guo, T.: Adaptive affinity propagation clustering. arXiv preprint. arXiv:0805.1096 (2008)
Refianti, R., Mutiara, A.B., Juarna, A., Suhendra, A.: A preference model on adaptive affinity propagation. Int. J. Electr. Comput. Eng. (IJECE) 8(3), 1805–1813 (2018)
Meng, C., Cheng, R., Maniu, S., Senellart, P., Zhang, W.: Discovering meta-paths in large heterogeneous information networks. In: Proceedings of the 24th International Conference on World Wide Web, pp. 754–764 (2015)
Fu, G., Ding, Y., Seal, A., Chen, B., Sun, Y., Bolton, E.: Predicting drug target interactions using meta-path-based semantic network analysis. BMC Bioinform. 17(1), 1–10 (2016)
Sun, Y., Norick, B., Han, J., Yan, X., Yu, P.S., Yu, X.: PathSelClus: integrating meta-path selection with user-guided object clustering in heterogeneous information networks. ACM Trans. Knowl. Discov. Data (TKDD) 7(3), 1–23 (2013)
Luo, C., Pang, W., Wang, Z.: Semi-supervised clustering on heterogeneous information networks. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.) PAKDD 2014. LNCS (LNAI), vol. 8444, pp. 548–559. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-06605-9_45
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Rajan, D., Jiang, S., Yi, D., Pang, W., Coghill, G.M. (2024). Enhanced Affinity Propagation Clustering on Heterogeneous Information Network. In: Panoutsos, G., Mahfouf, M., Mihaylova, L.S. (eds) Advances in Computational Intelligence Systems. UKCI 2022. Advances in Intelligent Systems and Computing, vol 1454. Springer, Cham. https://doi.org/10.1007/978-3-031-55568-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-031-55568-8_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55567-1
Online ISBN: 978-3-031-55568-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)