CHIN: Classification with META-PATH in Heterogeneous Information Networks
Most real-word data can be modeled as heterogeneous information networks (HINs), which are composed of multiple types of nodes and links. Classification for objects in HINs is a fundamental problem with broad applications. However, traditional methods cannot involve in heterogeneous information networks. These approaches could not involve the relatedness between objects and various path semantics. In this paper, we proposed a novel framework called CHIN for classification. It utilizes the relevance measurement on objects to iteratively label objects in HINs. As different meta-path performs different accuracy for classification, the proposed framework incorporates the weights of meta-paths. As our experiments show, CHIN generates more accurate classes than the other classification algorithm, but also provides meaningful weights for meta-paths for classification task.
KeywordsClassification Meta-path Heterogeneous information networks
This work is supported by National Key R&D Program of China (No. 2017YFC08033007), the National Natural Science of Foundation of China (No. 91546111, 91646201) and Basic Research Funding of Beijing University of Technology (No. 040000546318516).
- 1.Sun, Y., Han, J., Yan, X., Yu, P.S., Wu, T.: PathSim: meta path-based top-k similarity search in heterogeneous information networks. Proc. VLDB Endow. 4(11), 992–1003 (2011)Google Scholar
- 2.Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 797–806. ACM (2009)Google Scholar
- 4.Völkel, M., Krötzsch, M., Vrandecic, D., Haller, H., Studer, R.: Semantic wikipedia. In: Proceedings of the 15th International Conference on World Wide Web, pp. 585–594. ACM (2006)Google Scholar
- 9.Gupta, M., Kumar, P., Bhasker, B.: DPRel: a meta-path based relevance measure for mining heterogeneous networks. Inf. Syst. Front., 1–17 (2017)Google Scholar
- 10.Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. J. Mach. Learn. Res. 8(May), 935–983 (2007)Google Scholar
- 11.Wan, C., Li, X., Kao, B., Yu, X., Gu, Q., Cheung, D., Han, J.: Classification with active learning and meta-paths in heterogeneous information networks. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 443–452. ACM (2015)Google Scholar
- 12.Ji, M., Han, J., Danilevsky, M.: Ranking-based classification of heterogeneous information networks. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011, pp. 1298–1306. ACM, New York (2011)Google Scholar
- 14.Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems, pp. 321–328 (2004)Google Scholar
- 15.Ji, M., Sun, Y., Danilevsky, M., Han, J., Gao, J.: Graph regularized transductive classification on heterogeneous information networks. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010. LNCS (LNAI), vol. 6321, pp. 570–586. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15880-3_42CrossRefGoogle Scholar
- 16.Macskassy, S.A., Provost, F.: A simple relational classifier. Technical report, New York Univ NY STERN School of Business (2003)Google Scholar