Assessing the Effect of 2D Fingerprint Filtering on ILP-Based Structure-Activity Relationships Toxicity Studies in Drug Design
The rational development of new drugs is a complex and expensive process. A myriad of factors affect the activity of putative candidate molecules in vivo and the propensity for causing adverse and toxic effects is recognised as the major hurdle behind the current “target-rich, lead-poor” scenario.
Structure-Activity Relationship studies, using relationalMachine Learning algorithms, proved already to be very useful in the complex process of rational drug design. However, a typical problem with those studies concerns the use of available repositories of previously studied molecules. It is quite often the case that those repositories are highly biased since they contain lots of molecules that are similar to each other. This results from the common practice where an expert chemist starts off with a lead molecule, presumed to have some potential, and then introduces small modifications to produce a set of similar molecules. Thus, the resulting sets have a kind of similarity bias.
In this paper we assess the advantages of filtering out similar molecules in order to improve the application of relational learners in Structure-Activity Relationship (SAR) problems to predict toxicity. Furthermore, we also assess the advantage of using a relational learner to construct comprehensible models that may be quite valuable to bring insights into the workings of toxicity.
KeywordsHydrogen Bond Donor Inductive Logic Programming Relational Learner Similar Molecule Toxic Molecule
Unable to display preview. Download preview PDF.
- 1.Plewczynski, D.: Tvscreen: Trend vector virtual screening of large commercial compounds collections. In: BIOTECHNO 2008, pp. 59–63 (2008)Google Scholar
- 2.Graham, J., Page, C., Kamal, A.: Accelerating the drug design process through parallel inductive logic programming data mining. In: CSB 2003, p. 400 (2003)Google Scholar
- 7.Tiwari, A., Knowles, J., Avineri, E., Dahal, K., Roy, R. (eds.): Advances in the Application of Machine Learning Techniques in Drug Discovery, Design and Development. Applications of Soft Compt.: Recent Trends. Advances in Soft Compt. Springer, Heidelberg (2006)Google Scholar
- 9.Neagu, D., Craciun, M., Stroia, S., Bumbaru, S.: Hybrid intelligent systems for predictive toxicology - a distributed approach. In: International Conference on Intelligent Systems Design and Applications, pp. 26–31 (2005)Google Scholar
- 14.Srinivasan, A.: The Aleph Manual (2003), http://web.comlab.ox.ac.uk/oucl/research/areas/machlearn/Aleph
- 15.Pereira, M., Costa, V.S., Camacho, R., Fonseca, N.A., Simoes, C., Brito, R.: Comparative study of classification algorithms using molecular descriptors in toxicological databases. In: Brasilian Symposium on Bioinformatics (2009)Google Scholar