Summary

Many different measures of structural similarity have been suggested for matching chemical structures, each such measure focusing upon some particular type of molecular characteristic. The multi-faceted nature of biological activity suggests that an appropriate similarity measure should encompass many different types of characteristic, and this article discusses the use of data fusion methods to combine the results of searches based on multiple similarity measures. Experiments with several different types of dataset and activity suggest that data fusion provides a simple, but effective, approach to the combination of individual similarity measures. The best results were generally obtained with a fusion rule that sums the rank positions achieved by each molecule in searches using individual measures.
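
The sum-of-ranks fusion rule mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the molecule identifiers, the similarity scores, and the tie-breaking rule (alphabetical by identifier) are all assumptions made for illustration. The sketch also assumes that every search scores the same set of molecules.

```python
from typing import Dict, List


def ranks_from_scores(scores: Dict[str, float]) -> Dict[str, int]:
    """Convert similarity scores to rank positions (1 = most similar).

    Ties are broken alphabetically by molecule identifier; this is an
    assumption of the sketch, not something stated in the chapter.
    """
    ordered = sorted(scores, key=lambda mol: (-scores[mol], mol))
    return {mol: pos for pos, mol in enumerate(ordered, start=1)}


def sum_rank_fusion(searches: List[Dict[str, float]]) -> List[str]:
    """Fuse several similarity searches by summing each molecule's ranks.

    Each element of `searches` maps molecule id -> similarity score for one
    similarity measure. Molecules with the smallest rank sum come first.
    Assumes every search covers the same molecules.
    """
    rank_tables = [ranks_from_scores(s) for s in searches]
    molecules = rank_tables[0].keys()
    fused = {mol: sum(table[mol] for table in rank_tables) for mol in molecules}
    return sorted(fused, key=lambda mol: (fused[mol], mol))


# Hypothetical scores from two different similarity measures.
search_a = {"mol1": 0.9, "mol2": 0.7, "mol3": 0.2}
search_b = {"mol1": 0.4, "mol2": 0.8, "mol3": 0.6}

fused_ranking = sum_rank_fusion([search_a, search_b])
print(fused_ranking)  # ['mol2', 'mol1', 'mol3']
```

Here mol2 is ranked 2nd and 1st (sum 3), mol1 is ranked 1st and 3rd (sum 4), and mol3 is ranked 3rd and 2nd (sum 5), so the fused ranking places mol2 first. Working with rank positions rather than raw scores avoids having to put similarity values from different measures on a common scale.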

© 2000 Kluwer Academic Publishers

About this chapter

Cite this chapter

Ginn, C.M., Willett, P., Bradshaw, J. (2000). Combination of molecular similarity measures using data fusion. In: Klebe, G. (ed.) Virtual Screening: An Alternative or Complement to High Throughput Screening?, vol 20. Springer, Dordrecht. https://doi.org/10.1007/0-306-46883-2_1

  • DOI: https://doi.org/10.1007/0-306-46883-2_1

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-0-7923-6633-1

  • Online ISBN: 978-0-306-46883-4

  • eBook Packages: Springer Book Archive
