Ligand Affinity Prediction with Multi-pattern Kernels

  • Conference paper
Discovery Science (DS 2016)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9956)

Abstract

We consider the problem of affinity prediction for protein ligands. For this purpose, small molecule candidates can readily serve as inputs to regression algorithms if they are represented as vectors indexed by a set of physico-chemical properties or structural features of their molecular graphs. There are plenty of so-called molecular fingerprints, each with a characteristic composition or generation of features. This raises the question of which fingerprint to choose for a given learning task. Moreover, none of the standard fingerprints systematically gathers all circular and tree patterns, independent of size, together with the adjacency information of atoms. Since structural and neighborhood information are crucial for the binding capacity of small molecules, we combine the features of existing graph kernels in a novel way so that both aspects are covered and the fingerprint choice becomes part of the learning process. More precisely, we apply the Weisfeiler-Lehman labeling algorithm to encode neighborhood information in the vertex labels. Based on the relabeled graphs, we calculate four types of structural features: cyclic patterns, tree patterns, shortest paths, and the Weisfeiler-Lehman labels. We combine these different views using several multi-view regression algorithms. Our experiments demonstrate that affinity prediction profits from the application of multiple views, outperforming state-of-the-art single fingerprint approaches.
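
The Weisfeiler-Lehman relabeling step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the dictionary-based graph encoding, the uncompressed string labels, and the number of iterations are all assumptions made for illustration. Each round replaces a vertex label by the old label together with the sorted labels of its neighbours, and the counts of all labels seen serve as one simple feature view whose dot product is a kernel value.

```python
from collections import Counter


def wl_relabel(adjacency, labels, iterations=2):
    """Return the vertex labelings after 0..iterations rounds of WL refinement.

    adjacency: dict mapping each vertex to a list of neighbouring vertices
    labels:    dict mapping each vertex to its initial label (e.g. atom symbol)
    """
    current = dict(labels)
    history = [dict(current)]
    for _ in range(iterations):
        refined = {}
        for vertex, neighbours in adjacency.items():
            # New label = old label plus the sorted multiset of neighbour labels.
            neighbourhood = sorted(current[u] for u in neighbours)
            refined[vertex] = current[vertex] + "(" + ",".join(neighbourhood) + ")"
        current = refined
        history.append(dict(current))
    return history


def wl_label_features(adjacency, labels, iterations=2):
    """Count every label that occurs in any refinement round."""
    counts = Counter()
    for round_labels in wl_relabel(adjacency, labels, iterations):
        counts.update(round_labels.values())
    return counts


def linear_kernel(features_a, features_b):
    """Dot product of two sparse label-count vectors."""
    shared = features_a.keys() & features_b.keys()
    return sum(features_a[k] * features_b[k] for k in shared)


# Hypothetical toy inputs: heavy-atom graphs of ethanol (C-C-O) and propane (C-C-C).
chain = {0: [1], 1: [0, 2], 2: [1]}
ethanol = wl_label_features(chain, {0: "C", 1: "C", 2: "O"}, iterations=1)
propane = wl_label_features(chain, {0: "C", 1: "C", 2: "C"}, iterations=1)
print(linear_kernel(ethanol, propane))  # prints 8
```

Keeping the labels as uncompressed strings makes them directly comparable across molecules, at the cost of label length growing with each round; a practical implementation would typically compress labels with a dictionary shared across the whole data set.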

Notes

  1. See [8] for a definition of such a canonical representation.

  2. Binding database, https://www.bindingdb.org/bind/index.jsp.

  3. openbabel.org.

  4. Available at http://research.microsoft.com/en-us/um/people/manik/code/smo-mkl/download.html.

  5. https://www.csie.ntu.edu.tw/~cjlin/libsvm/.

References

  1. Balfer, J., Bajorath, J.: Artifacts in support vector regression-based compound potency prediction revealed by statistical and activity landscape analysis. PLoS ONE 10 (2015)

  2. Bender, A., Jenkins, J.L., Scheiber, J., Sukuru, S.C.K., Glick, M., Davies, J.W.: How similar are similarity searching methods? A principal component analysis of molecular descriptor space. J. Chem. Inf. Model. 49, 108–119 (2009)

  3. Borgwardt, K.M., Kriegel, H.-P.: Shortest-path kernels on graphs. In: Proceedings of ICDM, pp. 74–81 (2005)

  4. Cherkasov, A., Muratov, E.N., Fourches, D., Varnek, A., Baskin, I., Cronin, M., et al.: QSAR modeling: where have you been? Where are you going to? J. Med. Chem. 57, 4977–5010 (2014)

  5. Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge University Press, New York (2000)

  6. Cortes, C., Mohri, M., Rostamizadeh, A.: \(L_2\) regularization for learning kernels. In: Proceedings of UAI, pp. 109–116 (2009)

  7. Gaüzère, B., Brun, L., Villemin, D.: Treelet kernel incorporating cyclic, stereo and inter pattern information in Chemoinformatics. Pattern Recogn. 48, 356–367 (2014)

  8. Horváth, T., Gärtner, T., Wrobel, S.: Cyclic pattern kernels for predictive graph mining. In: Proceedings of KDD, pp. 158–167 (2004)

  9. Liu, W., Meng, X., Xu, Q., Flower, D.R., Li, T.: Quantitative prediction of mouse class I MHC peptide binding affinity using support vector machine regression (SVR) models. BMC Bioinform. 7 (2006)

  10. Myint, K.-Z., Wang, L., Tong, Q., Xie, X.-Q.: Molecular fingerprint-based artificial neural networks QSAR for ligand biological activity predictions. Mol. Pharm. 9, 2912–2923 (2012)

  11. Ning, X., Rangwala, H., Karypis, G.: Multi-assay-based structure-activity-relationship models: improving structure-activity-relationship models by incorporating activity information from related targets. J. Chem. Inf. Model. 49, 2444–2456 (2009)

  12. Ralaivola, L., Swamidass, S.J., Saigo, H., Baldi, P.: Graph kernels for chemical informatics. Neural Netw. 18, 1093–1110 (2005)

  13. Rogers, D., Hahn, M.: Extended connectivity fingerprints. J. Chem. Inf. Model. 50, 742–754 (2010)

  14. Shervashidze, N., Schweitzer, P., van Leeuwen, E.J., Mehlhorn, K., Borgwardt, K.M.: Weisfeiler-Lehman graph kernels. J. Mach. Learn. Res. 12, 2539–2561 (2011)

  15. Schölkopf, B., Herbrich, R., Smola, A.J.: A generalized representer theorem. In: Helmbold, D., Williamson, B. (eds.) COLT 2001. LNCS (LNAI), vol. 2111, pp. 416–426. Springer, Heidelberg (2001). doi:10.1007/3-540-44581-1_27

  16. Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Stat. Comput. 14, 199–222 (2004)

  17. Sugaya, N.: Ligand efficiency-based support vector regression models for predicting bioactivities of ligands to drug target proteins. J. Chem. Inf. Model. 54, 2751–2763 (2014)

  18. Qiu, S., Lane, T.: Multiple kernel support vector regression for siRNA efficacy prediction. IEEE/ACM Trans. Comput. Biol. Bioinform. 4983, 367–378 (2008)

  19. Vishwanathan, S.V.N., Sun, Z., Theera-Ampornpunt, N., Varma, M.: Multiple kernel learning and the SMO algorithm. In: Proceedings of NIPS, pp. 2361–2369 (2010)

Acknowledgements

We thank Dr. Martin Vogt from the Department of Life Science Informatics, B-IT, University of Bonn, for preparing the protein dataset and making it available to us. Furthermore, we thank Dr. Martin Vogt and his colleagues for many valuable discussions on this topic. We would also like to thank Prof. Thomas Gärtner for guidance and advice.

Author information

Corresponding author

Correspondence to Katrin Ullrich.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Ullrich, K., Mack, J., Welke, P. (2016). Ligand Affinity Prediction with Multi-pattern Kernels. In: Calders, T., Ceci, M., Malerba, D. (eds) Discovery Science. DS 2016. Lecture Notes in Computer Science, vol 9956. Springer, Cham. https://doi.org/10.1007/978-3-319-46307-0_30

  • DOI: https://doi.org/10.1007/978-3-319-46307-0_30

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46306-3

  • Online ISBN: 978-3-319-46307-0

  • eBook Packages: Computer Science, Computer Science (R0)
