Skip to main content

Heterogeneous PPI Network Representation Learning for Protein Complex Identification

  • Conference paper
  • First Online:
Bioinformatics Research and Applications (ISBRA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 13760))

Included in the following conference series:

  • 661 Accesses

Abstract

Protein complexes are critical units for studying a cell system. How to accurately identify protein complexes has always been the focus of research. Most of the existing methods are based on the topological structure of the Protein-Protein Interaction (PPI) network and introduce some biological information to analyze the correlation between proteins to identify protein complex. However, these methods only comprise a homogenous network of biological information and protein nodes. Most of them ignore that different types of nodes have different importance for protein complex identification. Therefore, there is an urgent need for a method to integrate different types of biological information. This paper proposes a new protein complex identification method GHAE based on heterogeneous network representation learning. Firstly, GHAE combines Gene Ontology (GO) attribute information and PPI data to construct a heterogeneous PPI network. Secondly, based on the constructed network, we use the heterogeneous representation learning method to obtain the vector representation of protein nodes. Finally, we propose a complex identification method based on a heterogeneous network to identify protein complexes. Extensive experiments show that our method achieves state-of-the-art performance in most cases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hanna, E.M., Zaki, N.: Dynamic protein-protein interaction networks and the detection of protein complexes: an overview. In: Proceedings of the International Conference on Bioinformatics and Computational Biology, p. 1 (2014)

    Google Scholar 

  2. Xu, Y., Zhou, J., Zhou, S., Guan, J.: CPredictor3.0: Detecting protein complexes from PPI networks with expression data and functional annotations. BMC Syst. Biol. 11(S7), 45–56 (2017)

    Article  Google Scholar 

  3. Adamcsek, B., Palla, G., Farkas, I.J., et al.: CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 22(8), 1021–1023 (2006)

    Article  CAS  PubMed  Google Scholar 

  4. Bader, G.D., Hogue, C.W.V.: An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 4(1), 1–27 (2003)

    Article  Google Scholar 

  5. Liu, G., Wong, L., Chua, H.N.: Complex discovery from weighted PPI networks. Bioinformatics 25(15), 1891–1897 (2009)

    Article  CAS  PubMed  Google Scholar 

  6. Nepusz, T., Yu, H., Paccanaro, A.: Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods 9(5), 471–472 (2012)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Asur, S., Ucar, D., Parthasarathy, S.: An ensemble framework for clustering protein–protein interaction networks. Bioinformatics 23(13), i29–i40 (2007)

    Article  CAS  PubMed  Google Scholar 

  8. Zaki, N., Efimov, D., Berengueres, J.: Protein complex detection using interaction reliability assessment and weighted clustering coefficient. BMC Bioinformatics 14(1), 1–9 (2013)

    Article  Google Scholar 

  9. Leung, H.C.M., Xiang, Q., Yiu, S.M., et al.: Predicting protein complexes from PPI data: a core-attachment approach. J. Comput. Biol. 16(2), 133–144 (2009)

    Article  CAS  PubMed  Google Scholar 

  10. Wu, M., Li, X., Kwoh, C.K., et al.: A core-attachment based method to detect protein complexes in PPI networks. BMC Bioinform. 10(1), 1–16 (2009)

    Article  Google Scholar 

  11. Chin, C.H., Chen, S.H., Ho, C.W., et al.: A hub-attachment based method to detect functional modules from confidence-scored protein interactions and expression profiles. BMC Bioinform. 11(1), 1–9 (2010)

    Google Scholar 

  12. King, A.D., Pržulj, N., Jurisica, I.: Protein complex prediction via cost-based clustering. Bioinformatics 20(17), 3013–3020 (2004)

    Article  CAS  PubMed  Google Scholar 

  13. Li, X.L., Foo, C.S., Ng, S.K.: Discovering protein complexes in dense reliable neighborhoods of protein interaction networks. Comput. Syst. Bioinform. 6, 157–168 (2007)

    Google Scholar 

  14. Lambrix, P., Habbouche, M., Perez, M.: Evaluation of ontology development tools for bioinformatics. Bioinformatics 19(12), 1564–1571 (2003)

    Article  CAS  PubMed  Google Scholar 

  15. Kipf, T.N., Welling, M.: Semi-supervised classification with graph convolutional networks. In: ICLR (2017)

    Google Scholar 

  16. Xu, B., et al.: Protein complexes identification based on go attributed network embedding. BMC Bioinform. 19 (2018). https://doi.org/10.1186/s12859-018-2555-x

  17. Perozzi, B., Al-Rfou, R., Deepwalk, S.S.: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 701–710. ACM (2014)

    Google Scholar 

  18. Ou, M., Cui, P., Pei, J., et al.: Asymmetric transitivity preserving graph embedding. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1105–1114. San Francisco, USA (2016)

    Google Scholar 

  19. Tang, J., Qu, M., Wang, M., et al.: Line: large-scale information network embedding. In: Proceedings of the 24th International Conference on World Wide Web, pp. 1067–1077 (2015)

    Google Scholar 

  20. Heimann, M., Koutra, D.: On generalizing neural node embedding methods to multi-network problems. KDD MLG Workshop (2017)

    Google Scholar 

Download references

Acknowledgment

This work is supported by grant from the Natural Science Foundation of China (No. 62072070).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yijia Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhou, P., Zhang, Y., Chen, F., Pang, K., Lu, M. (2022). Heterogeneous PPI Network Representation Learning for Protein Complex Identification. In: Bansal, M.S., Cai, Z., Mangul, S. (eds) Bioinformatics Research and Applications. ISBRA 2022. Lecture Notes in Computer Science(), vol 13760. Springer, Cham. https://doi.org/10.1007/978-3-031-23198-8_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-23198-8_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-23197-1

  • Online ISBN: 978-3-031-23198-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics