Skip to main content

Advertisement

Log in

Locally Linear Embedding for dimensionality reduction in QSAR

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

Current practice in Quantitative Structure Activity Relationship (QSAR) methods usually involves generating a great number of chemical descriptors and then cutting them back with variable selection techniques. Variable selection is an effective method to reduce the dimensionality but may discard some valuable information. This paper introduces Locally Linear Embedding (LLE), a local non-linear dimensionality reduction technique, that can statistically discover a low-dimensional representation of the chemical data. LLE is shown to create more stable representations than other non-linear dimensionality reduction algorithms, and to be capable of capturing non-linearity in chemical data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Hansch, C. and Leo, A., Exploring QSAR: Fundamentals and Applications in Chemistry and Biology. ACS Professional Reference Book, 1995.

  • L. Saul S. Roweis (2002) J. Mach. Learn. Res., 4 119

    Google Scholar 

  • I.T. Jolliffe (2002) Principal Component Analysis Springer New York

    Google Scholar 

  • J. Tenenbaum V. Silva Particlede J. Langford (2000) Science 290 IssueID5500 2319

    Google Scholar 

  • S. Roweis L. Saul (2000) Science 290 IssueID5500 2323

    Google Scholar 

  • B. Schölkopf A. Smola K.-R. Müller (1998) Neural Comput., 10 1299

    Google Scholar 

  • Bengio, Y., Paiement, J., Vincent, P., Delalleau, O., Le Roux, N. and Ouimet M., Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering. In Thrun, S., Saul, L. and Schölkopf, B. (Eds), Advances in Neural Information Processing Systems 16, 2004.

  • Williams, C.K.I. and Seeger, M., Using the Nyström method to speed up kernel machines. In Leen, T., Dietterich, T. and Tresp, V. (Eds), Advances in Neural Information Processing Systems 13. Cambridge, MA, 2001, pp. 682–688.

  • Shawe-Taylor, J., Cristianini, N. and Kandola., J., On the concentration of spectral properties. In Dietterich, T., Becker, S. and Ghahramani, Z. (Eds), Advances in Neural Information Processing Systems 14, 2002.

  • Shawe-Taylor, J. and Williams, C., The stability of kernel principal components analysis and its relation to the process eigenspectrum. In Becker, S., Thrun, S. and Obermayer, K. (Eds), Advances in Neural Information Processing Systems 15, 2003.

  • Zwald, L., Bousquet, O. and Blanchard, G., Statistical Properties of Kernel Principal Component Analysis. In Shawe-Taylor, J. and Singer, Y. (Eds), Learning Theory: Proceedings of 17th Annual Conference on Learning theory, COLT 2004, Banff, Canada, July 1–4. Vol. 3120 of Lecture Notes in Computer Science. Springer, Berlin, Germany, 2004, pp. 594–608.

  • T. Cox M. Cox (1994) Multidimensional Scaling Chapman & Hall London

    Google Scholar 

  • B. Schölkopf A. Smola K.-R. Müller (1996) Nonlinear Component Analysis as a Kernel Eigenvalue Problem Max Planck Institute for Biological Cybernetics Tübingen, Germany

    Google Scholar 

  • B. Schölkopf C.J.C. Burges A.J. Smola (1999) Advances in Kernel Methods–Support Vector Learning MIT Press Cambridge, MA

    Google Scholar 

  • D. Rumelhart G. Hinton R. Williams (1986) Nature 323 533

    Google Scholar 

  • I. Frank J. Friedman (1993) Technometrics 35 IssueID2 109

    Google Scholar 

  • P. Harrison G. Barlin L. Davies S. Ireland P. Matyus M. Wong (1996) Eur. J. Med. Chem. 31 651

    Google Scholar 

  • F. Burden M. Ford D. Whitley D. Winkler (2000) J. Chem. Inf. Comput. Sci. 40 1423

    Google Scholar 

  • B. Orlek F. Blaney F. Brown M. Clark M. Hadley J. Hatcher G. Riley H. Rosenberg H. Wadsworth P. Wyman (1991) J. Med. Chem. 34 2726

    Google Scholar 

  • Santavy, M. and Labute, P., SVL: The Scientific Vector Language, 1997. www.chemcomp.com/Journal_of_CCG/Features/svl.htm.

  • T. Halgren (1996) J. Comput. Chem. 17 490

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P.-J. L’Heureux.

Rights and permissions

Reprints and permissions

About this article

Cite this article

L’Heureux, PJ., Carreau, J., Bengio, Y. et al. Locally Linear Embedding for dimensionality reduction in QSAR. J Comput Aided Mol Des 18, 475–482 (2004). https://doi.org/10.1007/s10822-004-5319-9

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-004-5319-9

Keywords

Navigation