Graph Regularized Transductive Classification on Heterogeneous Information Networks

Ji, Ming; Sun, Yizhou; Danilevsky, Marina; Han, Jiawei; Gao, Jing

doi:10.1007/978-3-642-15880-3_42

Ming Ji²³,
Yizhou Sun²³,
Marina Danilevsky²³,
Jiawei Han²³ &
…
Jing Gao²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6321))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

3516 Accesses
104 Citations

Abstract

A heterogeneous information network is a network composed of multiple types of objects and links. Recently, it has been recognized that strongly-typed heterogeneous information networks are prevalent in the real world. Sometimes, label information is available for some objects. Learning from such labeled and unlabeled data via transductive classification can lead to good knowledge extraction of the hidden network structure. However, although classification on homogeneous networks has been studied for decades, classification on heterogeneous networks has not been explored until recently.

In this paper, we consider the transductive classification problem on heterogeneous networked data which share a common topic. Only some objects in the given network are labeled, and we aim to predict labels for all types of the remaining objects. A novel graph-based regularization framework, GNetMine, is proposed to model the link structure in information networks with arbitrary network schema and arbitrary number of object/link types. Specifically, we explicitly respect the type differences by preserving consistency over each relation graph corresponding to each type of links separately. Efficient computational schemes are then introduced to solve the corresponding optimization problem. Experiments on the DBLP data set show that our algorithm significantly improves the classification accuracy over existing state-of-the-art methods.

Research was sponsored in part by the U.S. National Science Foundation under grant IIS-09-05215, and by the Army Research Laboratory under Cooperative Agreement Number W911NF-09-2-0053 (NS-CTA). The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation here on.

Download to read the full chapter text

Chapter PDF

HetPathMine: A Novel Transductive Classification Algorithm on Heterogeneous Information Networks

Robust Classification of Information Networks by Consistent Graph Learning

Robust Semi-Supervised Learning on Multiple Networks with Noise

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Banerjee, A., Basu, S., Merugu, S.: Multi-way clustering on relation graphs. In: SDM 2007 (2007)
Google Scholar
Bekkerman, R., El-Yaniv, R., McCallum, A.: Multi-way distributional clustering via pairwise interactions. In: ICML 2005, pp. 41–48 (2005)
Google Scholar
Belkin, M., Niyogi, P., Sindhwani, V.: Manifold regularization: A geometric framework for learning from examples. J. Mach. Learn. Res. 7, 2399–2434 (2006)
MathSciNet Google Scholar
Chakrabarti, S., Dom, B., Indyk, P.: Enhanced hypertext categorization using hyperlinks. In: SIGMOD 1998, pp. 307–318. ACM, New York (1998)
Chapter Google Scholar
Chung, F.R.K.: Spectral Graph Theory. Regional Conference Series in Mathematics, vol. 92. AMS, Providence (1997)
MATH Google Scholar
Friedman, N., Getoor, L., Koller, D., Pfeffer, A.: Learning probabilistic relational models. In: IJCAI 1999 (1999)
Google Scholar
Gao, J., Liang, F., Fan, W., Sun, Y., Han, J.: Graph-based consensus maximization among multiple supervised and unsupervised models. In: Advances in Neural Information Processing Systems (NIPS), vol. 22, pp. 585–593 (2009)
Google Scholar
Long, B., Zhang, Z.M., Wu, X., Yu, P.S.: Spectral clustering for multi-type relational data. In: ICML 2006, pp. 585–592 (2006)
Google Scholar
Lu, Q., Getoor, L.: Link-based classification. In: ICML 2003 (2003)
Google Scholar
Macskassy, S.A., Provost, F.: A simple relational classifier. In: Proc. of MRDM-2003 at KDD-2003, pp. 64–76 (2003)
Google Scholar
Macskassy, S.A., Provost, F.: Classification in networked data: A toolkit and a univariate case study. J. Mach. Learn. Res. 8, 935–983 (2007)
Google Scholar
Neville, J., Jensen, D.: Relational dependency networks. J. Mach. Learn. Res. 8, 653–692 (2007)
Google Scholar
Neville, J., Jensen, D., Friedland, L., Hay, M.: Learning relational probability trees. In: KDD 2003, pp. 625–630 (2003)
Google Scholar
Neville, J., Jensen, D., Gallagher, B.: Simple estimators for relational bayesian classifiers. In: ICDM 2003, p. 609 (2003)
Google Scholar
Sen, P., Getoor, L.: Link-based classification. Technical Report CS-TR-4858, University of Maryland (February 2007)
Google Scholar
Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: KDD 2009, pp. 797–806 (2009)
Google Scholar
Taskar, B., Abbeel, P., Koller, D.: Discriminative probabilistic models for relational data. In: UAI, pp. 485–492 (2002)
Google Scholar
Taskar, B., Segal, E., Koller, D.: Probabilistic classification and clustering in relational data. In: IJCAI 2001, pp. 870–876 (2001)
Google Scholar
Yin, Z., Li, R., Mei, Q., Han, J.: Exploring social tagging graph for web object classification. In: KDD 2009, pp. 957–966 (2009)
Google Scholar
Zhang, T., Popescul, A., Dom, B.: Linear prediction models with graph regularization for web-page categorization. In: KDD 2006, pp. 821–826 (2006)
Google Scholar
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: NIPS 16 (2003)
Google Scholar
Zhu, X., Ghahramani, Z., Lafferty, J.D.: Semi-supervised learning using gaussian fields and harmonic functions. In: ICML 2003, pp. 912–919 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
Ming Ji, Yizhou Sun, Marina Danilevsky, Jiawei Han & Jing Gao

Authors

Ming Ji
View author publications
You can also search for this author in PubMed Google Scholar
Yizhou Sun
View author publications
You can also search for this author in PubMed Google Scholar
Marina Danilevsky
View author publications
You can also search for this author in PubMed Google Scholar
Jiawei Han
View author publications
You can also search for this author in PubMed Google Scholar
Jing Gao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain
José Luis Balcázar
Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain
Francesco Bonchi
Yahoo! Research Barcelona, Avinguda Diagnonal 177, 08018, Barcelona, Spain
Aristides Gionis
TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France
Michèle Sebag

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ji, M., Sun, Y., Danilevsky, M., Han, J., Gao, J. (2010). Graph Regularized Transductive Classification on Heterogeneous Information Networks. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2010. Lecture Notes in Computer Science(), vol 6321. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15880-3_42

Download citation

DOI: https://doi.org/10.1007/978-3-642-15880-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15879-7
Online ISBN: 978-3-642-15880-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Graph Regularized Transductive Classification on Heterogeneous Information Networks

Abstract

Chapter PDF

Similar content being viewed by others

HetPathMine: A Novel Transductive Classification Algorithm on Heterogeneous Information Networks

Robust Classification of Information Networks by Consistent Graph Learning

Robust Semi-Supervised Learning on Multiple Networks with Noise

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Graph Regularized Transductive Classification on Heterogeneous Information Networks

Abstract

Chapter PDF

Similar content being viewed by others

HetPathMine: A Novel Transductive Classification Algorithm on Heterogeneous Information Networks

Robust Classification of Information Networks by Consistent Graph Learning

Robust Semi-Supervised Learning on Multiple Networks with Noise

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation