Naïve Bayesian Models for Vero Cell Cytotoxicity

Perryman, Alexander L.; Patel, Jimmy S.; Russo, Riccardo; Singleton, Eric; Connell, Nancy; Ekins, Sean; Freundlich, Joel S.

doi:10.1007/s11095-018-2439-9

Naïve Bayesian Models for Vero Cell Cytotoxicity

Research Paper
Published: 29 June 2018

Volume 35, article number 170, (2018)
Cite this article

Pharmaceutical Research Aims and scope Submit manuscript

Alexander L. Perryman¹,
Jimmy S. Patel¹,
Riccardo Russo²,
Eric Singleton²,
Nancy Connell²,
Sean Ekins³ &
…
Joel S. Freundlich ORCID: orcid.org/0000-0002-3411-3455^1,2

818 Accesses
25 Citations
1 Altmetric
Explore all metrics

Abstract

Purpose

To advance translational research of potential therapeutic small molecules against infectious microbes, the compounds must display a relative lack of mammalian cell cytotoxicity. Vero cell cytotoxicity (CC₅₀) is a common initial assay for this metric. We explored the development of naïve Bayesian models that can enhance the probability of identifying non-cytotoxic compounds.

Methods

Vero cell cytotoxicity assays were identified in PubChem, reformatted, and curated to create a training set with 8741 unique small molecules. These data were used to develop Bayesian classifiers, which were assessed with internal cross-validation, external tests with a set of 193 compounds from our laboratory, and independent validation with an additional diverse set of 1609 unique compounds from PubChem.

Results

Evaluation with independent, external test and validation sets indicated that cytotoxicity Bayesian models constructed with the ECFP_6 descriptor were more accurate than those that used FCFP_6 fingerprints. The best cytotoxicity Bayesian model displayed predictive power in external evaluations, according to conventional and chance-corrected statistics, as well as enrichment factors.

Conclusions

The results from external tests demonstrate that our novel cytotoxicity Bayesian model displays sufficient predictive power to help guide translational research. To assist the chemical tool and drug discovery communities, our curated training set is being distributed as part of the Supplementary Material.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A machine learning platform to estimate anti-SARS-CoV-2 activities

Article 03 May 2021

ProfhEX: AI-based platform for small molecules liability profiling

Article Open access 09 June 2023

A Method for the In Vitro Cytotoxicity Assessment of Anti-cancer Compounds and Materials Using High Content Screening Analysis

Abbreviations

ADME/Tox:: Absorption, metabolism, distribution, excretion and toxicity
AID:: Assay Identification number on PubChem BioAssay
ECFP_6:: Extended class fingerprints of maximum diameter 6
FCFP_6:: Molecular function class fingerprints of maximum diameter 6
NPV:: Negative predictive value (filtering rate)
PPV:: Positive predictive value (hit rate)
QSAR:: Quantitative Structure-Activity Relationships
ROC:: Receiver-operator characteristic
SAR:: Structure-Activity Relationship
SMILES:: Simplified molecular-input line-entry system
Vero CC₅₀ :: Vero cell (African green monkey kidney cell) 50% cytotoxicity value

References

Kola I, Landis J. Can the pharmaceutical industry reduce attrition rates? Nat Rev Drug Discov. 2004;3(8):711–5.
Article PubMed CAS Google Scholar
Schoonen WG, Westerink WM, Horbach GJ. High-throughput screening for analysis of in vitro toxicity. EXS. 2009;99:401–52.
PubMed CAS Google Scholar
Segall MD, Barber C. Addressing toxicity risk when designing and selecting compounds in early drug discovery. Drug Discov Today. 2014;19(5):688–93.
Article PubMed CAS Google Scholar
Chekmarev DS, Kholodovych V, Balakin KV, Ivanenkov Y, Ekins S, Welsh WJ. Shape signatures: new descriptors for predicting cardiotoxicity in silico. Chem Res Toxicol. 2008;21(6):1304–14.
Article PubMed PubMed Central CAS Google Scholar
Polak S, Wisniowska B, Fijorek K, Glinka A, Polak M, Mendyk A. The open-access dataset for insilico cardiotoxicity prediction system. Bioinformation. 2011;6(6):244–5.
Article PubMed PubMed Central Google Scholar
Ekins S, Williams AJ, Xu JJ. A predictive ligand-based Bayesian model for human drug-induced liver injury. Drug Metab Dispos. 2010;38(12):2302–8.
Article PubMed CAS Google Scholar
Greene N, Fisk L, Naven RT, Note RR, Patel ML, Pelletier DJ. Developing structure-activity relationships for the prediction of hepatotoxicity. Chem Res Toxicol. 2010;23(7):1215–22.
Article PubMed CAS Google Scholar
Rodgers AD, Zhu H, Fourches D, Rusyn I, Tropsha A. Modeling liver-related adverse effects of drugs using knearest neighbor quantitative structure-activity relationship method. Chem Res Toxicol. 2010;23(4):724–32.
Article PubMed PubMed Central CAS Google Scholar
Liew CY, Lim YC, Yap CW. Mixed learning algorithms and features ensemble in hepatotoxicity prediction. J Comput Aided Mol Des. 2011;25(9):855–71.
Article PubMed CAS Google Scholar
Ekins S. Progress in computational toxicology. J Pharmacol Toxicol Methods. 2014;69(2):115–40.
Article PubMed CAS Google Scholar
Zhang H, Chen QY, Xiang ML, Ma CY, Huang Q, Yang SY. In silico prediction of mitochondrial toxicity by using GA-CG-SVM approach. Toxicol in Vitro. 2009;23(1):134–40.
Article PubMed CAS Google Scholar
Lin Z, Will Y. Evaluation of drugs with specific organ toxicities in organ-specific cell lines. Toxicol Sci. 2012;126(1):114–27.
Article PubMed CAS Google Scholar
Lakshminarayana SB, Huat TB, Ho PC, Manjunatha UH, Dartois V, Dick T, et al. Comprehensive physicochemical, pharmacokinetic and activity profiling of anti-TB agents. J Antimicrob Chemother. 2015;70(3):857–67.
Article PubMed CAS Google Scholar
Riss TL, Moravec RA. Use of multiple assay endpoints to investigate the effects of incubation time, dose of toxin, and plating density in cell-based cytotoxicity assays. Assay Drug Dev Technol. 2004;2(1):51–62.
Article PubMed CAS Google Scholar
Manjunatha UH, Smith PW. Perspective: challenges and opportunities in TB drug discovery from phenotypic screening. Bioorg Med Chem. 2015;23(16):5087–97.
Article PubMed CAS Google Scholar
Franzblau SG, DeGroote MA, Cho SH, Andries K, Nuermberger E, Orme IM, et al. Comprehensive analysis of methods used for the evaluation of compounds against Mycobacterium tuberculosis. Tuberculosis (Edinb). 2012;92(6):453–88.
Article CAS Google Scholar
Kim H, Yoon SC, Lee TY, Jeong D. Discriminative cytotoxicity assessment based on various cellular damages. Toxicol Lett. 2009;184(1):13–7.
Article PubMed CAS Google Scholar
Schrey AK, Nickel-Seeber J, Drwal MN, Zwicker P, Schultze N, Haertel B, et al. Computational prediction of immune cell cytotoxicity. Food Chem Toxicol. 2017;107(Pt A):150–66.
Article PubMed CAS Google Scholar
Moon H, Cong M. Predictive models of cytotoxicity as mediated by exposure to chemicals or drugs. SAR QSAR Environ Res. 2016;27(6):455–68.
Article PubMed CAS Google Scholar
Adhikari N, Halder AK, Saha A, Das Saha K, Jha T. Structural findings of phenylindoles as cytotoxic antimitotic agents in human breast cancer cell lines through multiple validated QSAR studies. Toxicol in Vitro. 2015;29(7):1392–404.
Article PubMed CAS Google Scholar
Ekins S, Freundlich JS, Hobrath JV, Lucile White E, Reynolds RC. Combining computational methods for hit to lead optimization in Mycobacterium tuberculosis drug discovery. Pharm Res. 2014;31(2):414–35.
Article PubMed CAS Google Scholar
Stouch TR, Kenyon JR, Johnson SR, Chen XQ, Doweyko A, Li Y. In silico ADME/Tox: why models fail. J Comput Aided Mol Des. 2003;17(2–4):83–92.
Article PubMed CAS Google Scholar
Johnson SR. The trouble with QSAR (or how I learned to stop worrying and embrace fallacy). J Chem Inf Model. 2008;48(1):25–6.
Article PubMed CAS Google Scholar
Ekins S, Reynolds RC, Kim H, Koo M-S, Ekonomidis M, Talaue M, et al. Bayesian models leveraging bioactivity and cytotoxicity information for drug discovery. Chem Biol. 2013;20:370–8.
Article PubMed PubMed Central CAS Google Scholar
Ekins S, Perryman AL, Clark AM, Reynolds RC, Freundlich JS. Machine learning model analysis and data visualization with small molecules tested in a mouse model of Mycobacterium tuberculosis infection (2014-2015). J Chem Inf Model. 2016;56(7):1332–43.
Article PubMed PubMed Central CAS Google Scholar
Perryman AL, Stratton TP, Ekins S, Freundlich JS. Predicting mouse liver microsomal stability with "pruned" machine learning models and public data. Pharm Res. 2016;33(2):433–49.
Article PubMed CAS Google Scholar
Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Zhou Z, et al. PubChem's BioAssay database. Nucleic Acids Res. 2012;40(Database issue):D400–12.
Article PubMed CAS Google Scholar
Smith CJ, Hansch C, Morton MJ. QSAR treatment of multiple toxicities: the mutagenicity and cytotoxicity of quinolines. Mutat Res. 1997;379(2):167–75.
Article PubMed CAS Google Scholar
Skibo EB, Xing C, Dorr RT. Aziridinyl quinone antitumor agents based on indoles and cyclopent[b]indoles: structure-activity relationships for cytotoxicity and antitumor activity. J Med Chem. 2001;44(22):3545–62.
Article PubMed CAS Google Scholar
Weinstein JN, Myers TG, O'Connor PM, Friend SH, Fornace AJ Jr, Kohn KW, et al. An information-intensive approach to the molecular pharmacology of cancer. Science. 1997;275(5298):343–9.
Article PubMed CAS Google Scholar
Swamidass SJ, Chen J, Bruand J, Phung P, Ralaivola L, Baldi P. Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity. Bioinformatics. 2005;21(Suppl 1):i359–68.
Article PubMed CAS Google Scholar
Lee AC, Shedden K, Rosania GR, Crippen GM. Data mining the NCI60 to predict generalized cytotoxicity. J Chem Inf Model. 2008;48(7):1379–88.
Article PubMed PubMed Central CAS Google Scholar
Molnar L, Keseru GM, Papp A, Lorincz Z, Ambrus G, Darvas F. A neural network based classification scheme for cytotoxicity predictions:validation on 30,000 compounds. Bioorg Med Chem Lett. 2006;16(4):1037–9.
Article PubMed CAS Google Scholar
Guha R, Schurer SC. Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays. J Comput Aided Mol Des. 2008;22(6–7):367–84.
Article PubMed CAS Google Scholar
Boik JC, Newman RA. Structure-activity models of oral clearance, cytotoxicity, and LD50: a screen for promising anticancer compounds. BMC Pharmacol. 2008;8:12.
Article PubMed PubMed Central CAS Google Scholar
Huang R, Southall N, Xia M, Cho MH, Jadhav A, Nguyen DT, et al. Weighted feature significance: a simple, interpretable model of compound toxicity based on the statistical enrichment of structural features. Toxicol Sci. 2009;112(2):385–93.
Article PubMed PubMed Central CAS Google Scholar
Langdon SR, Mulgrew J, Paolini GV, van Hoorn WP. Predicting cytotoxicity from heterogeneous data sources with Bayesian learning. J Cheminform. 2010;2(1):11.
Article PubMed PubMed Central CAS Google Scholar
Chang CY, Hsu MT, Esposito EX, Tseng YJ. Oversampling to overcome overfitting: exploring the relationship between data set composition, molecular descriptors, and predictive modeling methods. J Chem Inf Model. 2013;53(4):958–71.
Article PubMed CAS Google Scholar
Mervin LH, Cao Q, Barrett IP, Firth MA, Murray D, McWilliams L, et al. Understanding cytotoxicity and Cytostaticity in a high-throughput screening collection. ACS Chem Biol. 2016;11(11):3007–23.
Article PubMed CAS Google Scholar
Stratton TP, Perryman AL, Vilcheze C, Russo R, Li SG, Patel JS, et al. Addressing the metabolic stability of Antituberculars through machine learning. ACS Med Chem Lett. 2017;8(10):1099–104.
Article PubMed CAS Google Scholar
Hu Y, Unwalla R, Denny RA, Bikker J, Di L, Humblet C. Development of QSAR models for microsomal stability: identification of good and bad structural features for rat, human and mouse microsomal stability. J Comput Aided Mol Des. 2010;24(1):23–35.
Article PubMed CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Pharmacology, Physiology and Neuroscience, and Medicine, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA
Alexander L. Perryman, Jimmy S. Patel & Joel S. Freundlich
Division of Infectious Diseases, Department of Medicine, and the Ruy V. Lourenço Center for the Study of Emerging and Re-emerging Pathogens, Rutgers University–New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA
Riccardo Russo, Eric Singleton, Nancy Connell & Joel S. Freundlich
Collaborations Pharmaceuticals, Inc., Main Campus Drive Lab 3510, Raleigh, North Carolina,, 27606, USA
Sean Ekins

Authors

Alexander L. Perryman
View author publications
You can also search for this author in PubMed Google Scholar
Jimmy S. Patel
View author publications
You can also search for this author in PubMed Google Scholar
Riccardo Russo
View author publications
You can also search for this author in PubMed Google Scholar
Eric Singleton
View author publications
You can also search for this author in PubMed Google Scholar
Nancy Connell
View author publications
You can also search for this author in PubMed Google Scholar
Sean Ekins
View author publications
You can also search for this author in PubMed Google Scholar
Joel S. Freundlich
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joel S. Freundlich.

Ethics declarations

Conflicts of Interest

S.E. is the Founder and CEO of Collaborations Pharmaceuticals Inc.

Electronic supplementary material

ESM 1

(DOCX 1.29 mb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Perryman, A.L., Patel, J.S., Russo, R. et al. Naïve Bayesian Models for Vero Cell Cytotoxicity. Pharm Res 35, 170 (2018). https://doi.org/10.1007/s11095-018-2439-9

Download citation

Received: 26 January 2018
Accepted: 05 June 2018
Published: 29 June 2018
DOI: https://doi.org/10.1007/s11095-018-2439-9

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Naïve Bayesian Models for Vero Cell Cytotoxicity