On network-based kernel methods for protein-protein interactions with applications in protein functions prediction
Predicting protein functions is an important issue in the post-genomic era. This paper studies several network-based kernels including local linear embedding (LLE) kernel method, diffusion kernel and laplacian kernel to uncover the relationship between proteins functions and protein-protein interactions (PPI). The author first construct kernels based on PPI networks, then apply support vector machine (SVM) techniques to classify proteins into different functional groups. The 5-fold cross validation is then applied to the selected 359 GO terms to compare the performance of different kernels and guilt-by-association methods including neighbor counting methods and Chi-square methods. Finally, the authors conduct predictions of functions of some unknown genes and verify the preciseness of our prediction in part by the information of other data source.
Key wordsDiffusion kernel kernel method Laplacian kernel local linear embedding (LLE) kernel protein function prediction support vector machine.
Unable to display preview. Download preview PDF.
- R. Kondor and J. Lafferty, Diffusion kernels on graphs and other discrete input spaces, Proc Int Conf Machine Learning, 2002: 315–322.Google Scholar
- R. Lanckriet, M. Deng, M. Cristianini, et al., Kernel-based data fusion and its application to protein function prediction in yeast, Proceedings of the Pacific Symposium on Biocomputing, 2004, January 3–8, 300–311.Google Scholar
- W. Ching, L. Li, Y. Chan, and H. Mamitsika, A Study of network-based kernel methods on protein-protein interaction for protein functions prediction, The Third International Symposium on Optimization and Systems Biology (OSB 2009), Lecture Notes in Operations Research, Series 11, 2009, 11: 25–32.Google Scholar