Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling

van Westen, Gerard J. P.; Bender, Andreas; Overington, John P.

doi:10.1007/s12154-014-0112-2

Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling

Opinion Paper
Open access
Published: 15 May 2014

Volume 7, pages 119–123, (2014)
Cite this article

Download PDF

You have full access to this open access article

Journal of Chemical Biology

Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling

Download PDF

Gerard J. P. van Westen¹,
Andreas Bender² &
John P. Overington¹

2675 Accesses
2 Citations
12 Altmetric
1 Mention
Explore all metrics

Abstract

Resistance to pesticides is an increasing problem in agriculture. Despite practices such as phased use and cycling of ‘orthogonally resistant’ agents, resistance remains a major risk to national and global food security. To combat this problem, there is a need for both new approaches for pesticide design, as well as for novel chemical entities themselves. As summarized in this opinion article, a technique termed ‘proteochemometric modelling’ (PCM), from the field of chemoinformatics, could aid in the quantification and prediction of resistance that acts via point mutations in the target proteins of an agent. The technique combines information from both the chemical and biological domain to generate bioactivity models across large numbers of ligands as well as protein targets. PCM has previously been validated in prospective, experimental work in the medicinal chemistry area, and it draws on the growing amount of bioactivity information available in the public domain. Here, two potential applications of proteochemometric modelling to agrochemical data are described, based on previously published examples from the medicinal chemistry literature.

Quantitative estimation of pesticide-likeness for agrochemical discovery

Article Open access 12 September 2014

A large-scale crop protection bioassay data set

Article Open access 07 July 2015

Multi-scale QSAR Approach for Simultaneous Modeling of Ecotoxic Effects of Pesticides

Introduction

In agriculture, resistance to pesticides forms a complex and growing problem, which includes the development of resistance to insecticides [17], fungicides [16], as well as herbicides [7, 25, 6]. For each of these resistance types, a multitude of different resistance mechanisms are possible, all of which, however, lead to phenotypic resistance (i.e. the concentration of pesticide needed to kill the pests is higher for resistant variants compared to wild type). Commonly observed resistance mechanisms are similar to those observed in microbial and cancer mechanisms of resistance; examples include increased expression of efflux proteins, increased expression of metabolizing proteins, and point mutations in the protein targeted by the agrochemical agent [8, 10, 24]. Due to the spectrum of possible adaptations in the target organism, it is difficult to capture and model all potential forms of resistance for a certain compound in a model a priori, which is analogous to antibiotic and anti-cancer drug resistance. Out of these possibilities, the current opinion article will deal specifically with the impact of point mutations at the ligand-binding site and their effect on resistance. This is an area for which there is prior successful experience in the medicinal chemistry and drug design field, including prospective experimental validation of the models developed. Here, previous research of the authors as well as other related groups will be outlined, with the aim to transfer these methods also to the world of agrochemical research [22, 20].

Complementary ligand and target information

The binding affinity between a ligand (e.g. a small molecule or RNAi) [26, 1], and a target (usually a protein, but potentially a ribosomal target) [18], is governed by properties of both ligand and target (and the physiological environment, e.g. ionic strength and pH). Hence, it follows that changes to either one of the binding partners will, in most cases, translate into a difference (weaker or stronger binding) in affinity and subsequent efficacy. This mechanism is also exploited when a candidate molecule is optimized through iterative design of synthetic molecules to have a high affinity towards a desired protein target via quantitative structure-activity relationship (QSAR) studies, which is a standard technique during lead optimization in drug discovery. In the context of agrochemistry, extending this principle from single targets to multiple targets would be equivalent to optimizing a candidate molecule towards the desired pest, while avoiding unwanted effects, e.g. in humans (as well as other species). This procedure exploits the differences in the sequences and structures between the targets in these species (corresponding to a variation on the target side). In order to now model the activity of multiple ligands against multiple targets, ‘proteochemometric modelling’ (PCM) can be used, a computational technique that simultaneously uses properties of both ligand and target space. Hence, PCM can be used to make predictions for unknown ligands as well as unknown protein targets (Fig. 1; for reviews see [15, 23]). For clarity, these new sequences could be an orthologous sequence from a new pathogen or new sequences generated under selective pressure giving rise to resistant pathogens. The reader is referred to recent reviews and references therein for an overview of the more general concept of computational models of bioactivity and (QSAR) models (these include an overview of various applicable machine learning algorithms and descriptors) [23, 5]. Previously, PCM has been successfully applied to model the impact of protein mutations on the resistance of viruses to drugs. Examples include the human immunodeficiency virus (HIV), also including prospective experimental validation [14, 22, 20], as well as the dengue virus [19]. Given that the problem at hand is similar—in both cases, the impact of sequence mutations/differences of a target protein on ligand binding or efficacy needs to be understood—we anticipate that the concept of PCM will also be transferable to the field of agrochemicals, possibly with some domain-dependent modifications.

Resistance models

The first aim of PCM is to model the bioactivity of ligands against large number of related proteins, which allows the scientist to anticipate the bioactivity spectrum of a molecule in a multidimensional fashion. In the literature, it has been shown that multitarget models combining ligand and target properties (i.e. PCM models) generally perform better than single target models, which are trained on ligand-only properties (i.e. conventional QSAR models) [14, 19, 22, 23, 20]. Moreover, the multitarget nature of these models allows rationalization of resistance, as these models were able to deconvolute the individual contributions of amino acid substitutions (Fig. 2a [20]) or substructures within ligands that vary across a chemical series (Fig. 2b [22]). For instance in the case of HIV protease, it is known that residue 82 can mutate from the wild-type valine to a threonine, and that this simple, conservative, single-point mutation leads to (cross) resistance amongst all clinical inhibitors with the exception of Darunavir [12]. PCM models have been able to reproduce these results, as shown in Fig. 2a.

In the case of agrochemistry, the authors are of the opinion that the nature of the PCM technique would be equally suited to identify potential agrochemicals that have the most favourable resistance profile. Similarly, models could be used to deconvolute contributions of mutations to an increase or reduction of resistance displayed by individual mutants.

Multispecies models

Knowledge on the relationship between homologues targets in different species can be exploited using PCM, which is of relevance either in cases where off-target effects of a compound in a species needs to be avoided, but also importantly where the aim is to target multiple distinct species in a designed spectrum of activity. Previously, a proof-of-concept study [21] employed a single bioactivity model to simultaneously model both the human and rat orthologues of the adenosine receptors (G protein-coupled receptors) using data from the ChEMBL [9] database. The model was able to identify several novel ligands that were experimentally validated, and one of the ligands showed high affinity in the nanomolar range. Upon further inspection, it could be found that the selection of this ligand from a database was likely due to information from other species, underlining the value of integrating as much information from bioactivity space as reasonably possible. In Fig. 3a, a multidimensional scaling analysis (MDS) of the similarity between these eight proteins is shown. From the figure, it is apparent that orthologues (genes closely related in sequence and having the same function in different species) are more similar in the particular definition used than paralogues (genes which are similar in sequence but which have different functions in the same species). Constructing multispecies models on this type of data allows the rationalization of differences in activity between different species, as well as its application for compound design (such as in the above study). As is the case in the application to resistance, this model interpretation can be performed both from the ligand (chemical) point of view and from the target (protein or RNA) point of view, leading on the one hand to the elucidation of chemical features to guide compound optimization, and on the other hand to mutations driving resistance from the protein side.

These approaches are also transferable to the pesticide field. Analogous to the aforementioned adenosine receptors, gamma-aminobutyric acid A (GABA-A) ligand-gated ion channels can be aligned to capture and represent the (dis)similarities between species. These ion channels form the target for phenylpyrazole insecticides, and resistance has been demonstrated through point mutations [2, 4, 3]. An MDS of the similarity between mammalian and insect isoforms of these complexes is shown in Fig. 3b, illustrating that mammalian channels are more similar to each other than they are to their insect counterparts. Furthermore, the selected arthropods display a larger variation between species than do the selected mammals. As it was previously demonstrated that it is possible to model the protein similarity space shown in Fig. 3a (for the adenosine receptors), it stands to reason that it is feasible to model the space shown in Fig. 3b (representing the GABA-A ion channels). PCM models are agnostic of the particular target and application area they are used in—their applicability depends on the amount of data available both from the chemical and biological side (however, this requirement should not be neglected). Hence, the data visualized in Fig. 3b should allow for the construction of a predictive model that can predict activity (and toxicity) of candidate compounds on GABA-A in the species included in the analysis, in a manner similar to the adenosine receptors described above.

Finally, it should be noted that the ribosome has gained significant attention as a druggable target (specifically for antibiotics) [18]. It has been shown that bacteria can gain resistance via multiple diverse mutations in 23S ribosomal RNA [11, 13]. Yet a complementary mode of action leading to novel pesticides should provide a useful addition to established modes of action and increase the resistance threshold.

Conclusions

In summary, PCM is a versatile quantitative modelling technique for interactions between ligands and their biological targets. In the current opinion paper, we highlight application of the technique and precedence for success from related fields and sketch applications to the agrochemical context, comprising insecticides, fungicides, and herbicides. Based on the prospective experimental validation that has been performed for enzymes and receptors in previous studies, such work is likely to be successful. Firstly, there is the modelling and prediction of resistance towards pesticides by weeds, fungi, or insects. A second application is the prediction of activity of pesticides or other agricultural chemicals in organisms were this is undesired (‘off-targets’). Finally, the technique can be used in virtual screening in the identification of potential new agrochemicals, aimed at a higher resistance threshold (broader activity against multiple mutants), or potential agrochemicals anticipated to have less toxic effects on non-pest species. We anticipate a great future for studies in this area, as the cost of sequencing (which directly relates to the generation of protein-side descriptors) continues to drop and the amount of bioactivity data available increases, both of which increases our ability to generate predictive PCM models considerably.

References

Auer C, Frederick R (2009) Crop improvement using small RNAs: applications and predictive ecological risk assessments. Trends Biotechnol 27(11):644–51
Article CAS Google Scholar
Bloomquist JR (1996) Ion channels as targets for insecticides. Annu Rev Entomol 41(1):163–90
Article CAS Google Scholar
Buckingham SD, Biggin PC, Sattelle BM, Brown LA, Sattelle DB (2005) Insect GABA receptors: splicing, editing, and targeting by antiparasitics and insecticides. Mol Pharmacol 68(4):942–51
Article CAS Google Scholar
Caboni P, Sammelson RE, Casida JE (2003) Phenylpyrazole insecticide photochemistry, metabolism, and GABAergic action: ethiprole compared with fipronil. J Agric Food Chem 51(24):7055–61. doi:10.1021/jf030439l
Article CAS Google Scholar
Cherkasov A, Muratov EN, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuz’min VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A (2013) QSAR modeling: where have you been? Where are you going to? J Med Chem. doi:10.1021/jm4004285
Google Scholar
Devine MD, Shukla A (2000) Altered target sites as a mechanism of herbicide resistance. Crop Prot 19(8):881–9
Article CAS Google Scholar
Erickson JM, Rahire M, Rochaix J-D, Mets L (1985) Herbicide resistance and cross-resistance: changes at three distinct sites in the herbicide-binding protein. Science 228(4696):204–7
Article CAS Google Scholar
Ffrench-Constant RH, Rocheleau TA, Steichen JC, Chalmers AE (1993) A point mutation in a Drosophila GABA receptor confers insecticide resistance. Nature 363(6428):449–51
Article CAS Google Scholar
Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40(D1):D1100–7. doi:10.1093/nar/gkr777
Article CAS Google Scholar
Hosie AM, Baylis HA, Buckingham SD, Sattelle DB (1995) Actions of the insecticide fipronil, on dieldrin-sensitive and- resistant GABA receptors of Drosophila melanogaster. Br J Pharmacol 115(6):909–12. doi:10.1111/j.1476-5381.1995.tb15896.x
Article CAS Google Scholar
Hummel H, Boück A (1987) 23S ribosomal RNA mutations in halobacteria conferring resistance to the anti-80S ribosome targeted antibiotic anisomycin. Nucleic Acids Res 15(6):2431–43
Article CAS Google Scholar
Johnson VA, Calvez V, Paredes R, Pillay D, Shafer RW, Wensing AM, Richman DD (2013) Update of the drug resistance mutations in HIV-1: March 2013. Top Antivir Med 21:6–14
Google Scholar
Kim JM, Kim JS, Kim N, Kim Y-J, Kim IY, Chee YJ, Lee C-H, Jung HC (2008) Gene mutations of 23S rRNA associated with clarithromycin resistance in Helicobacter pylori strains isolated from Korean patients. J Microbiol Biotechnol 18(9):1584–9
CAS Google Scholar
Lapins M, Eklund M, Spjuth O, Prusis P, Wikberg JES (2008) Proteochemometric modeling of HIV protease susceptibility. BMC Bioinforma 9:181. doi:10.1186/1471-2105-9-181
Article Google Scholar
Lapinsh M, Prusis P, Gutcaits a, Lundstedt T, Wikberg JE (2001) Development of proteo-chemometrics: a novel technology for the analysis of drug-receptor interactions. Biochim Biophys Acta 1525:180–90
Article CAS Google Scholar
Ma Z, Michailides TJ (2005) Advances in understanding molecular mechanisms of fungicide resistance and molecular detection of resistant genotypes in phytopathogenic fungi. Crop Prot 24(10):853–63
Article CAS Google Scholar
Oakeshott JG, Devonshire AL, Claudianos C, Sutherland TD, Horne I, Campbell PM, Ollis DL, Russell RJ (2005) Comparing the organophosphorus and carbamate insecticide resistance mutations in cholin- and carboxyl-esterases. Chem Biol Interact 157:269–75
Article Google Scholar
Poehlsgaard J, Douthwaite S (2005) The bacterial ribosome as a target for antibiotics. Nat Rev Microbiol 3(11):870–81
Article CAS Google Scholar
Prusis P, Lapins M, Yahorava S, Petrovska R, Niyomrattanakit P, Katzenmeier G, Wikberg JES (2008) Proteochemometrics analysis of substrate interactions with dengue virus NS3 proteases. Bioorg Med Chem 16:9369–77. doi:10.1016/j.bmc.2008.08.081
Article CAS Google Scholar
Van Westen GJP, Hendriks A, Wegner JK, IJzerman AP, Van Vlijmen HWT, Bender A (2013) Significantly improved HIV inhibitor efficacy prediction employing proteochemometric models generated from antivirogram data. PLoS Comput Biol 9(2):e1002899
Article Google Scholar
Van Westen GJP, Van den Hoven OO, Van der Pijl R, Mulder-Krieger T, de Vries H, Wegner JK, IJzerman AP, Van Vlijmen HWT, Bender A (2012) Identifying novel adenosine receptor ligands by simultaneous proteochemometric modeling of rat and human bioactivity data. J Med Chem 55(16):7010–20. doi:10.1021/jm3003069
Article Google Scholar
Van Westen GJP, Wegner JK, Geluykens P, Kwanten L, Vereycken I, Peeters A, IJzerman AP, Van Vlijmen HWT, Bender A (2011) Which compound to select in lead optimization? Prospectively validated proteochemometric models guide preclinical development. PLoS ONE 6:e27518. doi:10.1371/journal.pone.0027518
Article Google Scholar
Van Westen GJP, Wegner JK, IJzerman AP, Van Vlijmen HWT, Bender A (2011) Proteochemometric modeling as a tool for designing selective compounds and extrapolating to novel targets. Med Chem Commun 2:16–30
Article Google Scholar
Walsh C (2003) Antibiotics: actions, origins, resistance. Am Soc Microbiol (ASM)
Warwick SI (1991) Herbicide resistance in weedy plants: physiology and population biology. Annu Rev Ecol Evol Syst 22:95–114
Article Google Scholar
Whyard S, Singh AD, Wong S (2009) Ingested double-stranded RNAs can act as species-specific insecticides. Insect Biochem Mol Biol 39(11):824–32
Article CAS Google Scholar

Download references

Funding

GvW thanks EMBL (EIPOD) and Marie Curie Actions (COFUND) for funding. AB thanks Unilever and the European Research Commission (Starting Grant ERC-2013-StG 336159 MIXTURE) for funding. JPO thanks the Wellcome Trust for funding under the Strategic Award (WT086151/Z/08/Z).

Author information

Authors and Affiliations

European Molecular Biology Laboratory European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
Gerard J. P. van Westen & John P. Overington
Unilever Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
Andreas Bender

Authors

Gerard J. P. van Westen
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Bender
View author publications
You can also search for this author in PubMed Google Scholar
John P. Overington
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gerard J. P. van Westen.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

van Westen, G.J.P., Bender, A. & Overington, J.P. Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling. J Chem Biol 7, 119–123 (2014). https://doi.org/10.1007/s12154-014-0112-2

Download citation

Received: 11 February 2014
Accepted: 24 April 2014
Published: 15 May 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s12154-014-0112-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling

Abstract

Similar content being viewed by others

Quantitative estimation of pesticide-likeness for agrochemical discovery

A large-scale crop protection bioassay data set

Multi-scale QSAR Approach for Simultaneous Modeling of Ecotoxic Effects of Pesticides

Introduction

Complementary ligand and target information

Resistance models

Multispecies models

Conclusions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Towards predictive resistance models for agrochemicals by combining chemical and protein similarity via proteochemometric modelling

Abstract

Similar content being viewed by others

Quantitative estimation of pesticide-likeness for agrochemical discovery

A large-scale crop protection bioassay data set

Multi-scale QSAR Approach for Simultaneous Modeling of Ecotoxic Effects of Pesticides

Introduction

Complementary ligand and target information

Resistance models

Multispecies models

Conclusions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation