Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus SARS-CoV-2

Gousiadou, Chrysoula; Doganis, Philip; Sarimveis, Haralambos

doi:10.1007/s13721-023-00410-9

Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus SARS-CoV-2

Original Article
Open access
Published: 06 February 2023

Volume 12, article number 16, (2023)
Cite this article

Download PDF

You have full access to this open access article

Network Modeling Analysis in Health Informatics and Bioinformatics Aims and scope Submit manuscript

Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus SARS-CoV-2

Download PDF

Chrysoula Gousiadou ORCID: orcid.org/0000-0002-7093-3055¹,
Philip Doganis¹ &
Haralambos Sarimveis¹

2095 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Responding to the pandemic caused by SARS-CoV-2, the scientific community intensified efforts to provide drugs effective against the virus. To strengthen these efforts, the “COVID Moonshot” project has been accepting public suggestions for computationally triaged, synthesized, and tested molecules. The project aimed to identify molecules of low molecular weight with activity against the virus, for oral treatment. The ability of a drug to cross the intestinal cell membranes and enter circulation decisively influences its bioavailability, and hence the need to optimize permeability in the early stages of drug discovery. In our present work, as a contribution to the ongoing scientific efforts, we employed artificial neural network algorithms to develop QSAR tools for modelling the PAMPA effective permeability (passive diffusion) of orally administered drugs. We identified a set of 61 features most relevant in explaining drug cell permeability and used them to develop a stacked regression ensemble model, subsequently used to predict the permeability of molecules included in datasets made available through the COVID Moonshot project. Our model was shown to be robust and may provide a promising framework for predicting the potential permeability of molecules not yet synthesized, thus guiding the process of drug design.

Machine Learning Exploration of the Relationship Between Drugs and the Blood–Brain Barrier: Guiding Molecular Modification

Article 11 April 2024

Neural Network Ensemble Based QSAR Model for the BBB Challenge: A Review

A machine learning platform to estimate anti-SARS-CoV-2 activities

Article 03 May 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

From the onset of the pandemic caused by the virus SARS-CoV-2, the scientific community intensified efforts to provide drugs effective against the disease COVID-19 (Ferreira and Andricopulo 2020). In this context, the COVID Moonshot project (Delft et al. 2021) was launched as a worldwide collaboration between scientists aiming to identify pre-clinical candidate molecules potent against the virus, for oral use. To date, almost 21,000 structurally diverse molecules have been submitted to the project’s website (PostEra 2022). Whilst the bioactivity of the molecules designed and submitted to the project is currently under investigation and while sub-micromolar IC₅₀ has been reported for a number of them (PostEra 2022), important factors such as permeability, selectivity, pharmacokinetics, pharmacodynamics and toxicity remain to be optimized to improve their drug-like profile (Erlanson 2020). Permeability across the biological membranes decisively influences the degree of a drug’s absorption and bioavailability (Homayun et al. 2019) and is therefore routinely evaluated through high-throughput screening (HTS) in the early stages of drug design using either cell-based or cell-free permeation systems (Masungi et al. 2008; Balani et al. 2005; Berben et al. 2018).

A cell-free, low-cost and easy to handle in vitro method for rapid in vivo permeability predictions—the PAMPA assay (Parallel Artificial Membrane Permeability Assay)—was introduced in 1998 by Kansy et al. (1998). PAMPA predicts in vivo permeability only via passive diffusion and is currently a preferred HTS method in the pharmaceutical industry (Schmidt and Lynch 2022). PAMPA is successful in establishing structure–activity/structure–property relationships (SARs/SPRs) and hit-to-lead optimization due to the lack of active transport systems or metabolizing enzymes (Fortuna et al. 2007). An analytical description of the method is provided in Sect. 2 section of the manuscript.

Apart from the costly, time and effort-consuming experimental studies, in silico approaches like quantitative structure–activity relationship (QSAR) models are reliably used as HTS methods for hit-to-lead optimization (Chi et al. 2019; Sun et al. 2017; Oja and Maran 2015a, b, c, 2016a, b, 2018; Roy et al. 2021; Diukendjieva et al. 2019) due to the short computational time required to screen large-sized datasets and the high accuracy of the models in making correct predictions. Nevertheless, despite its advantages, the QSAR approach has a limited role in the drug design process, being mainly used in the early stages for the exclusion of molecules with a low permeability profile (Dahlgren and Lennernäs 2019). Further contribution to the drug development process is rather restricted, primarily due to the choice of the descriptors involved in the analyses that fail to provide clear insight into which structural features influence permeability. Indeed, alternate models have been reported that link different molecular descriptors to the permeation process. For the most part, the Lipinski-like physicochemical properties of molecules (e.g. lipophilicity, hydrogen bond donors and acceptors, and molecular mass) (Lipinski 2000; Lipinski et al. 2001) and charge-related surface area descriptors were successfully related to permeability (Chi et al. 2019; Sun et al. 2017; Oja and Maran 2015a, b, c, 2016a, b, 2018; Roy et al. 2021; Diukendjieva et al. 2019), but no sufficient physical explanation could be derived from the models’ predictions. A list of QSAR models recently reported in literature for predicting PAMPA permeability, along with information on the statistical methods, the descriptors and data used is presented in Table 1.

Table 1 QSAR models for predicting PAMPA permeability

Full size table

To bring new insight into the above-mentioned problem, we used the dataset of 190 molecules curated by Chi et al. (2019) to model the PAMPA permeability of molecules. We developed a QSAR approach using a set of 61 selected descriptors that, apart from effectively mapping chemical space, allow for structural interpretation of the molecules. As will be analytically discussed in Sect. 4, the set contains Lipinski-like physicochemical features (Lipinski 2000; Lipinski et al. 2001) as well as BCUT structural descriptors (Stanton 1999) from which chemical structures can be uniquely deduced (Masek et al. 2008) and visualized (Guha and Willighagen 2012), giving clear insight into the influence of different groups on the permeation ability of molecules and vice versa, i.e. how permeability will be affected by structural changes. Drug discovery decisions can be made, as for example a targeted modification of a chemical structure or the selection of a chemical series with promising permeation profile for further refinement.

Using the selected descriptors, we subsequently employed artificial neural network (ANN) algorithms (Günther et al. 2010) to create a highly accurate “stacked regression” ensemble QSAR model (Wolpert 1992; Breiman 1996) to predict the permeation ability of molecules. Well-trained ANN models show increased accuracy and precision and are routinely used to solve complex problems (Irshad et al. 2020; Kaur et al. 2020; Sarker et al. 2020; Alloqmani et al. 2021; Jaber Alsolami et al. 2021). Our ensemble combined two neural network base models generated using a model development dataset split explicitly into a training and a test set. The models were fitted on the training set—using 20-fold cross-validation with three repeats—and validated on the test set to have an early estimation of their predictive performance on new data. The optimization of hidden layers (number of layers and neurons) of the models was based on the values of R-square and root mean-square error (RMSE) (Alexander et al. 2015; Kvålseth 1985) metrics indicating the predictive ability of the models on both datasets. The predictions of the models on the training set were subsequently combined using an ANN algorithm to create a stacked ensemble. The ability of the model to generalize well, i.e. to make accurate predictions on unseen data was further evaluated using independent external validation datasets (Irshad et al. 2020; Ho et al. 2020). Details on model generation, characteristics and performance metrics are provided in Sect. 3 of the manuscript. Our AΝΝ ensemble outperformed the HSVR model reported by Chi et al. (2019), with the Pearson correlation between observed and predicted logPe values for the 190 molecules being 0.97 and 0.93, respectively.

On the whole, the contribution of our approach can be summarized as follows:

we identified a set of 61 theoretically calculated descriptors with high relevance in explaining the permeability of molecules, from which chemical structures can be uniquely deduced;
we created an ensemble ANN regression model that can predict with high accuracy the PAMPA permeability of compounds of interest in a very short time (5 min to train the models using a system with CPU @ 2.69 GHz and RAM 12 GB).

The ANN ensemble was further used to predict the PAMPA permeability of molecules contributed by medicinal chemists to the COVID Moonshot project and downloaded from the PostEra site (PostEra 2022) (Supporting Information1, sheets S1.6 & S1.7 respectively). Our goal in doing so was to join and strengthen the ongoing efforts towards the development from scratch of new, orally administered, target-specific drugs with optimized absorption profile for COVID-19 treatment.

Briefly, this manuscript has been structured as follows: in Sect. 2, the experimental details of the study are analytically described and a diagram summarizing the workflow is provided; in Sect. 3, the results of the analysis and the creation of the QSAR models are presented; in Sect. 4, the contribution of the present study in predicting permeation ability of molecules is extensively discussed; and Sect. 6 contains information on the working environment and the availability of data and code.

2 Experimental

Special consideration was given to the consistency, quality and completeness of the data; hence, a homogenous, publicly available dataset (Chi et al. 2019) (Supporting Information1, sheet S1.1) with recorded PAMPA permeability for 190 molecules measured under the same experimental protocol was used to build the models. This is important, since permeability measurements heavily depend on the applied experimental protocol (Chi et al. 2019; Dahlgren and Lennernäs 2019; Avdeef et al. 2004). Also, as different types of measurements result in different permeability coefficients (Chi et al. 2019; Dahlgren and Lennernäs 2019), we note that in the present work we have modelled the effective permeability coefficient (logPe), analytically described below.

2.1 Description of the PAMPA method

The PAMPA method measures the permeability via passive diffusion, based on an artificial non-cell lipid membrane without pores, active transport systems or metabolizing enzymes (Fortuna et al. 2007). Used therefore as a HTS method, PAMPA is very successful in establishing the structure–activity relationships (SARs) and hit-to-lead optimization (Fortuna et al. 2007). The PAMPA system is a ‘sandwich’ consisting of two 96-well plates and includes three compartments. Substances move from a donor compartment, through a lipid-infused artificial membrane into an acceptor compartment (Kansy et al. 1998; Chi et al. 2019). The donor, membrane and acceptor compartments emulate the gastrointestinal tract, the intestinal epithelium and the blood circulation, respectively. To date, PAMPA models have been developed that exhibit a high degree of correlation with permeation across a variety of barriers, including the gastrointestinal tract (Avdeef et al. 2004), Caco-2 cultures (Bermejo et al. 2004; Avdeef et al. 2005), blood–brain barrier (Dagenais et al. 2009) and skin (Sinkó et al. 2012). The simplicity and stability of the PAMPA system allow for variability in the experimental settings, e.g. changing the pH values in the donor compartment offers the possibility to measure permeability under different physiological conditions in the intestinal pathway (Berben et al. 2018; Kansy et al. 1998). PAMPA measurements are shown to compare well with human intestinal absorption, except for some problematic cases concerning compounds with limited solubility or specific drug classes and compounds absorbed by active transport (Fortuna et al. 2007).

2.2 Permeability measurements and experimental setup

The influence of pH on the absorption through the intestine of drug-like molecules has been previously reported (Oja and Maran 2015a, b, c, 2016a, b, 2018). Indeed, the intestinal environment may present a variation in terms of pH values that possibly affects the absorption properties of substances (Oja and Maran 2018; Avdeef 2001). In keeping with this, the PAMPA assay has been used to measure the pH-dependent permeability profiles of various compounds (Oja and Maran 2015a, b, c, 2016a, b, 2018).

The present QSAR study is based on the effective membrane permeability (Chi et al. 2019; Dahlgren and Lennernäs 2019) measurements initially performed on a series of acidic, basic, neutral and amphoteric compounds at pH 7.4 by Oja and Maran (2015a, b, c, 2016a, b, 2018) and subsequently curated by Chi et al. (2019) in a dataset of 190 selected molecules (Supporting Information S1.1).

The effective membrane permeability coefficient (logPe) was calculated according to the equation (Oja and Maran 2016a):

$$log \left(Pe\left(\frac{cm}{s}\right)\right)=log \left( -\frac{2.303. {V}_{D}}{A.(t-{\tau }_{ss).{\varepsilon }_{a}}} . \left(\frac{1}{1+{r}_{v}}\right). {log}_{10}\left[1-\left(\frac{1+{r}_{v}^{-1}}{1-{R}_{M}} \right) . \frac{{C}_{A }\left(t\right)}{{C}_{D }\left(0\right)} \right]\right),$$

where V_D is the volume of the solution in the donor side, A is the membrane area, t is the time point of the experiment, t_ss is the lag time, e_α is the apparent membrane porosity, r_v is the ratio of volumes of the donor and acceptor sides (r_v = V_D/V_A), C_D(0) is the initial compound concentration in the donor side, C_A(t) is the concentration in the acceptor side at time tand R_M is the membrane retention ratio:

$$R_{M}=1-( \frac{{C}_{D }\left(t\right)}{{C}_{D}\left(0\right)}- \frac{{V}_{A }{C}_{A}(t)}{{V}_{D}{C}_{D}(0)}).$$

As the cutoff value for the membrane permeability depends on the experimental system, we note here that, for the specific experimental setup and for pH 7.4, a rough approximation may be employed (Chi et al. 2019) according to which logPe values ≥ -6.2 correspond to compounds with higher permeability, whereas logPe values < -6.2 would indicate lower permeability in general.

2.3 Partitioning of the data for model development and validation: calculation of molecular descriptors and model performance statistics

2.3.1 Train, test and external validation datasets (supporting information 1, sheet S1.1)

A visualization of the data split for the logPe modelling is presented in Fig. 1. The individual subsets were saved as CSV files for reading into the R modelling workflows and these CSV files are provided in the code archive available on Zenodo (Gousiadou 2021), along with a README file explaining their contents and guidance on how to reproduce results via running the available code files.

2.3.1.1 Model development data: train and test subsets

For model development and initial evaluation, a dataset of 174 molecules randomly selected out of the set of 190 compounds was further randomly split into explicit train (80%, 141) and test (20%, 33) subsets. The train set was used to fine-tune the algorithm parameters and fit the models, while the test set provided an early estimate of their predictive performance (Table 2).

Table 2 Modelling the effective membrane permeability (logPe) of compounds (190)

Full size table

2.3.1.2 External validation set

For the external validation of the final models, we used the following sets of data (Supporting Information 1, S1.1): a. 16 molecules initially partitioned from the dataset of 190 compounds, which were set aside to create an independent external validation set; b. a set containing the chemical structures of two anti-COVID-19 drugs, namely Paxlovid (Owen et al. 2021) and Remdesivir (Jang et al. 2021) with reported permeability (not PAMPA). The SMILES strings of the drugs were downloaded from the PubChem database (Sayers et al. 2022).

2.3.2 Calculation of molecular descriptors

A single 3D conformation was created from SMILES for each structure using the publicly available Bioclipse software (Spjuth et al. 2007, 2009). An SDF file containing the 3D coordinates of the molecules was imported in R, and the rcdk package (Guha 2007) was used to automatically calculate a number of descriptor variables. The CDK descriptors (Java Library for Chemoinformatics) are divided broadly into three main groups, that is, atomic, bond and molecular and belong to the specific categories “topological”, “geometrical”, “hybrid”, “constitutional”, and “electronic”. The calculation resulted in 286 descriptors for each molecule. Non-informative descriptors were removed, that is, all variables with zero variance (zero values for all molecules). This process reduced the number of descriptors to 232.

2.3.3 Model performance statistics

For the comparison and evaluation of the predictive performance of models, we primarily employed the Pearson’s correlation coefficient, the coefficient of determination (R², Eqs. 1 and 2) and the "root mean-square error" (RMSE, Eq. 3) metrics (Alexander et al. 2015; Kvålseth 1985). The best models were those with the smaller RMSE and greater R² values. Whilst different R² (“Rsquared”) and related statistics may be reported in the literature (Alexander et al. 2015; Kvålseth 1985; Roy et al. 2009), here we employed Eqs. (1), (2) and (3) recommended as generally suited for QSAR studies (Alexander et al. 2015; Kvålseth 1985). Assuming that the difference between the mean experimental and predicted values is zero, “R-squared” can be interpreted as the proportion of the variability in the response captured by each model (Alexander et al. 2015; Kvålseth 1985). However, under certain circumstances, e.g. due to the average prediction being significantly shifted from the average experimental value or due to outliers, R² (Eq. 1) can be negative.

We note that, where statistics are reported with the subscript “cv” (R²_CV, ^‡R²_CV, RMSE_CV), this means that the model built on a cross-validation training subset was applied to the corresponding validation fold, with the performance statistic being averaged across all folds and repeats of cross-validation. (Supporting Information1, sheet S1.4). The coefficients of determination reported as R² and R²_CV were calculated using Eq. (1), whilst the coefficients of determination ^‡R² and ^‡R²_CV were calculated using Eq. (2). For the coefficients of determination depicted as R² and ^‡R² (without the “cv” indication), the corresponding calculations were made by applying the models to data other than those used to train them, i.e. the test and external validation sets. Where correlation statistics are referred to as “resubstitution” estimates, this means that the model trained on the training set was applied to that training set (Hawkins 2004). These are not estimates of predictive performance, but may provide insight into the degree of overfitting when compared to the corresponding statistics on truly independent data.

$$\mathrm{R}2 =1-\frac{\sum {\left(y-\hat{y}\right)}^{2}}{\sum {\left(y-\bar{y}\right)}^{2}},$$

(1)

$$\ddagger \mathrm{R}2 = {\left(\frac{cov\left(y,\hat{y}\right)}{\sqrt{var\left(y\right). var\left(\hat{y}\right)}}\right)}^{2},$$

(2)

$$\mathrm{RMSE }=\sqrt{\frac{\sum_{i=1}^{N}{\left({y}_{i}-{\hat{y}}_{i}\right)}^{2}}{N}},$$

(3)

where y and ŷ are the observed and predicted values, respectively, and $\bar{y}$ is the mean of the observed values.

2.3.4 Workflow for model development and validation

The workflow followed in this study is summarized in Fig. 2. The different stages of the analysis are clearly shown, i.e. data separation, data pre-processing and feature selection as well as the development and validation of the models.

3 Results

3.1 Data pre-processing and feature selection

An initial exploratory analysis of the dataset (190 molecules, 232 descriptors) revealed a high correlation (> 0.80) between the 127 descriptors. As it is always desirable to have a reduced set of uncorrelated, nonredundant, and informative descriptors that allow for interpretable prediction models, we reduced data dimensionality using feature elimination methods. Feature selection was performed using the training set of 141 molecules with 232 descriptors and the corresponding logPe values. The method selected for the feature elimination was based on a wrapper approach (John et al. 1994). Wrapper methods are search algorithms that treat the predictors as inputs and utilize model performance as the criterion to be optimized (Ambroise and McLachlan 2002). Using the caret package (version 6.0–84) in R, we performed a simple backwards selection of descriptors (Recursive Feature Elimination, RFE) with random forest (randomForest package—version 4.6–14) (Svetnik et al. 2003). Random forest has a built-in feature selection (Svetnik et al. 2004) as well as variable importance estimation utilized for the RFE approach (Kuhn 2019; Kotu and Deshpande 2019). We used the version of the algorithm that incorporates resampling (rfe) (Kuhn 2019) and applied an outer resampling method of 20-fold cross-validation with three repeats to reduce the risk of overfitting of the model to the descriptors and to get performance estimates that incorporate the variation due to feature selection. By employing the resampling method, we improved the generalization performance of the model and obtained a more probabilistic assessment of descriptor importance than a ranking based on a single fixed data set. The best performance was based on the Root Mean-Square Error (RMSE_CV) (Alexander et al. 2015) and corresponded to a subset of 61 descriptor variables—ranked according to their significance in predicting the logPe values (Fig. 3), (Supporting Information1, sheet S1.5)—which we further used to build our models.

3.2 QSAR models created using the selected 61 descriptors

The data in the train set were subsequently pre-processed, i.e. normalized in the range 0–1. The same pre-process parameters were applied for normalization of the test (33 molecules) and external validation (16 molecules) datasets. Next, we used the training data and employed a sophisticated ensemble modelling approach known as “stacked regression” (Breiman 1996). Ensemble approaches combine the predictions of multiple learning algorithms for obtaining improved predictive performance, which could not otherwise be obtained from any of the constituent learners alone. Although an ensemble may have multiple base models within the model, it acts and performs as a single model (Kotu and Deshpande 2019). The advantage of such a “metalearner” is that the generalization error of the prediction is minimized by deducing the biases of the base models with respect to a provided learning set. This deduction proceeds by generalizing in a second space—whose inputs are the predictions of the base learners on a given dataset and whose output is the actual outcome—and trying to make predictions on new, unseen data (Wolpert 1992).

For the present regression analysis, we used the resilient backpropagation method (Günther et al. 2010; Riedmiller and Braun 1993) to generate feedforward neural network models (Hornik et al. 1989) to be combined in an ensemble. This method is considered one of the fastest for regression analyses and does not require predefining of the overall learning rate. We employed the resilient backpropagation algorithm with weight backtracking (rpart +) (Svetnik et al. 2003), available in the neuralnet package (version 1.44.2) (Günther et al. 2010), and trained multi-layer perceptrons (MLPs) that predicted permeability by calculating the following function:

$$ \begin{aligned} o\left(x\right)& =f\left({w}_{0}+\sum_{j=1}^{j}{w}_{j}.f\left({w}_{oj}+\sum_{i=1}^{n}{w}_{ij}{x}_{i}\right)\right)\\&=f\left({w}_{o }+\sum_{j=1}^{j}{w}_{j}.f({w}_{oj}+{{\varvec{w}}}_{j}^{ T}{\varvec{x}})\right),\end{aligned} $$

where w₀ denotes the intercept of the output neuron, w_oj the intercept of the jth hidden neuron, w_j the synaptic weight that corresponds to the synapse starting at the jth, hidden neuron and leading to the output neuron, w = (w_1j,…, w_nj) the vector of all synaptic weights corresponding to the synapses leading to the jth hidden neuron, and x the vector of all covariates (x₁,…, x_n). More specifically, a number of neurons are organized in consequent layers connected by synapses, and the output of every neuron in one layer is the input to a neuron in the next layer. All the covariates (descriptors) are arranged in separate neurons to form the input layer, while the output layer consists of the response variable. The intermediate layers are referred to as hidden layers. A weight indicating the effect of the corresponding neuron is attached to each one of the synapses (Günther et al. 2010). These weights are the parameters of the backpropagation ANN models and during the training process are modified by the algorithm to minimize the error function that measures the difference between the observed (o) and predicted (y) output values:

$$E=\frac{1}{2}\sum_{l=1}^{L}\sum_{h=1}^{H}{{(o}_{ith}-{y}_{ith})}^{2},$$

where l = 1,…,L is the index for the observations and h = 1,…,H is the index for the output nodes.

The algorithm rpart + uses only the sign instead of the magnitude of the partial derivatives to update the weights. Based on this method and using the previously selected 61 descriptors, we trained a series of neural network learners on the train set of 141 molecules to compare their performance. As the selection of the number of hidden units in an ANN is not an exact science, trial and error played a significant role in this process. The algorithm was applied on the training data using different number of hidden layers and neurons and employing a resampling method of 20-fold cross-validation with three repeats. The final choice of hidden units (Table 2) was based on a compromise between the quality of the model (learning well from the data, avoiding overfitting, etc.) and complexity/computational speed. However, it was observed that the use of a relatively small number of covariates (61 descriptors) significantly reduced the complexity of the models, despite increasing the hidden layers and neurons in a number of them. Further fine-tuning regarding the hidden layers was performed based on the resulting root mean-square error and R-squared values (RMSE_CV and ^‡R²_CV)—calculated according to Eqs. (3) and (2), respectively (described in “Model Performance Statistics” section) and presented as the average across all folds and repeats of cross-validation. The models were subsequently used to predict the logPe values of the 33 molecules in the test set, which provided a less biased evaluation of the models’ effectiveness in predicting unseen data. Based on the results from both datasets, two ANN based learners with optimized parameters were finally selected to be combined in a stacked ensemble (Table 2B).

The architecture and complexity of the Ensemble NN model is analytically presented in Table 3 and Fig. 4.

Table 3 Result matrix for the EnsembleNN

Full size table

An overall evaluation of the variable importance performed by the models NN1 and NN2 while being generated, along with a description of the 20 variables selected as most informative by this process, is provided in Supporting Information1, sheet S1.5. In addition, Fig. 5 presents a correlation chart of the top 6 out of the 20 most important descriptors, along with the modelled end point Observed Log Pe. The distributions of the variables, their correlation to each other and to the output as well as their individual contribution in explaining the variability of the output are depicted.

A visual comparison of the modelling results—based on the evaluation metrics ‡R2_CV, RMSE_CV and MAE_cv (Willmott and Matsuura 2005)—for the predictive performance of the models NN1 and NN2 obtained via cross-validation on the training set (141 molecules) with optimized hyperparameters is depicted in Fig. 5. In addition, in Table 4 and Fig. 6, a pairwise comparison of the cross-validated results for the selected models NN1 and NN2 is shown. As can be seen, the correlation between the two models is very low (0.40), which means that each model has captured different aspects of the data and the information they provide has limited redundancy. They are therefore well suited to be combined in an ensemble.

Table 4 Pairwise comparison of the cross-validation results for the selected and optimized models NN1 and NN2 (Table 1A). The metric used is root mean-squared error (RMSE). The models were not strongly correlated (0.40), indicating that they were informative in different ways and suitable to be combined in an ensemble

Full size table

We subsequently trained an ANN stacked ensemble (Table 2B)—employing again the resilient backpropagation algorithm and applying 20-fold cross-validation with three repeats—using as input variables the predictions of the base models on the train set and as output (target) variable the corresponding experimental values of logPe. The whole process resulted in the creation of the ensemble model EnsembleNN with boosted predictive performance (Fig. 7). In Fig. 8, an illustration of the performance of the base models NN1 and NN2 as well as the EnsembleNN—obtained via cross-validation on the training set (141 molecules)—is provided by evaluation curves that assess the performance of the models and compare the results with the random pick (baseline) (Mount and Zumel 2020).

3.3 Domain of applicability (DOA) of the ensemble NN model

For estimating the applicability domain (AD) of the ensemble, we employed the standard deviation (SD) method, extensively used in ensemble modelling (Cao et al. 2017; Tetko et al. 2008; Sushko et al. 2010). SD measures model reliability by incorporating information about the model itself and is based on the assumption that if for a given compound the predictions of the models in the ensemble differ significantly, then the ensemble prediction for this compound is likely to be unreliable. For a set of predictions concerning a compound j given by a set of k trained models in the ensemble, SD is calculated using the following equation:

$$SD\left(j\right)=\sqrt{\frac{{\sum }_{i=1}^{k}{{(y}_{i}-\bar{y})}^{2}}{k-1}},$$

(4)

where y_i is the prediction of the ith model for compound j and ȳ is the mean prediction y_j(i) (i = 1..k).

The EnsembleNN has an AD threshold of approximately three times the maximum SD value of the train data (3*0.23) (Cao et al. 2017), within which the bulk of the ensemble predictions are shown to be reliable (Fig. 9). For new samples with sd values larger than the threshold, the logPe predictions are likely to be inaccurate.

Subsequently, we evaluated the ability of the EnsembleNN to make accurate predictions on the hitherto unseen data of the two external validation sets (normalized with the same parameters used for data pre-processing in the train set). These predictions were completely unbiased, since the external validation sets had not in any way participated previously in the development or selection of the base models (Supporting Information 1, S1.1). On the first external validation set of the 16 molecules the EnsembleNN showed enhanced performance, making predictions with 89% correlation to the observed logPe values (Table 2D, Fig. 10). The second external validation set consisted of two anti-COVID-19 drugs, namely Paxlovid and Remdesivir for which the permeation ability (not measured with PAMPA) is known (Hung et al. 2022; Schäfer et al. 2022). The two drugs have chemical structures very different from those included in the dataset of 190 molecules: Paxlovid is a new, orally administered, target-specific antiviral drug that has excellent permeation ability and is currently state-of-the-art treatment against COVID-19. Remdesivir, a nucleoside analogue, has been used in the early stages of the pandemic after drug repurposing and due to its low permeability was administered only intravenously. The model made correct predictions within the applicability domain for both molecules: high permeability was predicted for Paxlovid and poor permeability for Remdesivir (predicted logPe – 5.29, sd = 0.44 and – 6.72, sd = 0.18, respectively).

Backpropagation depends heavily on the training data (Schäfer et al. 2022). To assert the robustness of the EnsembleNN, we have tested model generalization performance as noise rises in the training data. Since in our analysis the chemical structures are represented by theoretically calculated descriptors, noise would only be expected in the output variable, i.e. the experimentally measured logPe. We therefore added noise to the LogPe values in the training data and repeated the analysis. The results are presented in Supporting Information 1, S1.8.

Whilst the ANN models can make highly accurate predictions, they provide little explanatory insight into the relative influence of the independent variables in the prediction process (Olden et al. 2004). On these grounds, a more straightforward means of obtaining general insights into the influence of individual descriptors on the modelled logPe variable has been employed here. We used the rpart (Therneau and Atkinson 2018) algorithm to create a single decision tree on the entire dataset of 190 molecules, using the selected set of 61 descriptors. The decision path (Fig. 11) shows the features—along with their threshold values—associated with every decision. A description of the features is provided in Supporting Information1, S1.5.

3.4 Linear model created using only structural descriptors

To explore the percentage of the LogPe variation that a model built on descriptors other than the Lipinski-like properties could explain, we used a subset of 6 descriptors (out of the 61) carrying structural information to build a linear model (Faraway 2005). The set included four structural descriptors from the BCUT chemical space as well as the geometrical FNSA.3 and the MDEO.11 descriptors (Supporting Information 1, sheet S1.5) (Guha 2007). Following the same protocol as previously described (same data partitioning to train, test and external validation datasets, normalization, etc.) we created a linear regression model, and the results are presented in Table 5.

Table 5 Modelling the effective membrane permeability (logPe) of compounds (190)

Full size table

As can be seen, approximately 41% of the LogPe variation is captured by the six descriptors, with the Pearson correlation between the experimental and the predicted LogPe values being 64% for the train and external validation sets and 73% for the test set.

3.5 Implementation of the EnsembleNN model

Following development and validation, we used the ensemble model EnsembleNN to predict the logPe at pH 7.4 of 4520 molecules contributed by medicinal chemists to the COVID Moonshot Project and downloaded from the PostEra site (Kansy et al. 1998) on May 1st, 2020. Our engagement with this data emerged as an activity within the European Union’s Horizon 2020 project NanoCommons Translational Access (TA) (NanoCommons Translational Access (TA) xxxx) and was initiated by Tim Dudgeon from the software company Informatics Matters Ltd. (Informatics Matters Ltd xxxx), who created a repository project board on GitHub (Dudgeon xxxx) dealing with the ADME (Absorption, Distribution, Metabolism and Excretion) analysis of the molecules included in the above-mentioned dataset, for which activity data were not available. As a follow-up, on February 2nd, 2021, we also downloaded from the PostEra site 1561 molecules for which biological activity is available and made predictions on their logPe values.

The data were provided as SMILES strings of the molecules, from which a single 3D conformation was created for each structure with the publicly available Bioclipse software (Spjuth et al. 2007, 2009) and the previously selected 61 descriptors were calculated using the rcdk package in R. Pre-processing of the data (range from 0–1) was performed with the same parameters used for the development and external validation datasets. Predictions on the permeability of the molecules were performed with the ensemble model EnsembleNN, and the results together with data on the molecules´ activity (when available) are presented in Supporting Information S1, sheets S1.6 and S1.7.

Both Rapidfire and fluorescence assays were used to measure the bioactivity of the molecules. The reported IC₅₀ values (uM) represent the average over multiple dose–response runs of the molecules in each assay (https:, , github.com, postera-ai, COVID_moonshot_submissions. xxxx; Lu et al. 2016).

The structures of selected molecules (47 out of 1561) exhibiting inhibitory activity against M^pro with IC₅₀ < 100 uM in both assays, along with their predicted logPe, are depicted in Supporting Information 2.

4 Discussion

The fact that a drug’s transport via passive diffusion is strongly connected to certain physicochemical properties has been previously shown by Lipinski (Lipinski 2000; Lipinski et al. 2001). Lipinski and later Veber (Veber et al. 2002) suggested “rules of thumb” to be followed by medicinal chemists, concerning the accepted margins of property values (nHBDon < 5, nHBAcc < 10, MW < 500, XlogP < 5, TPSA < 140 A²) that ensure oral bioavailability. Indeed, across the two methods (ANN and single decision tree) used to model logPe, the topological descriptor tpsaEfficiency—representing the polar surface area of a molecule expressed as a ratio to molecular size—ranked first on the list of features evaluated as most relevant (Supporting Information1, S1.5). In the decision tree, tpsaEfficiency is depicted as the root node, as well as the second and third node (Fig. 11). The list of high ranking descriptors invariably included—although in different order depending on the method selected—features related to lipophilicity (octanol/water partition coefficients XlogP and AlogP) and the number of hydrogen bond donors in a molecule (nHBDon). A clear path in the decision tree indicated by the nodes 1, 2, 5, 10, 11 and 22 suggests that for acidic compounds (nAcid > = 0.44)—shown to have “lower permeability”—charge is influential, whereas for the non-acidic molecules (nAcid < = 0.44) lipophilicity and the number of hydrogen bond donors are decisive. The results from both modelling methods suggesting that non-polar, lipophilic and uncharged molecules are more likely to penetrate the highly hydrophobic intestinal cell membranes chime with previous reports (Chi et al. 2019; Oja and Maran 2015a, b, c, 2016a, b, 2018). However, given the pH variation in the intestinal environment (Avdeef 2001), measuring membrane permeability at only neutral pH (7.4) may eliminate compounds with good absorption characteristics at other pH values (Oja and Maran 2015a, 2015b, 2016a). Acidic compounds like valsartan or salicylic acid (IDs 186 and 144, Supporting Information 1, S1.1) show good permeability in acidic pH (5) only (logPe values – 5.99 and –5.53, respectively) (Oja and Maran 2015b).

4.1 Importance of descriptors

Unlike previous attempts mostly depending on the combination of few “Lipinski-like” properties to explain the permeability of molecules, the present work aims to highlight the importance of additionally using meaningful structural information in modelling LogPe. Indeed, in Sect. 3.3 of the manuscript, we have shown that a set of six structural descriptors could capture approximately 41% of the variation in our data. Most interestingly, the BCUT descriptors are previously reported to allow for reversible decoding (Masek et al. 2008). While the combination of Lipinski-like descriptors may carry sufficient information to deduce substructural features, the high precision coordinates in BCUTs reveal a high level of detail from which a unique chemical structure or closely related analogues can be derived (Masek et al. 2008). The BCUT metrics introduced by Pearlman (Pearlman and Smith 2022) are whole-molecule descriptors that combine two or more measures of atom-based properties into a single value and are significant in measuring molecular diversity (Stanton 1999). Being extensions of previously developed parameters (Burden 1989), they further expand the number and types of atomic features that can be considered and provide a greater variety of proximity measures and weighting (Stanton 1999). Currently, three weighting schemes are employed: atomic weight (BCUTw), partial charge (BCUTc) and polarizability (BCUTp). In Fig. 12 the negative relationship between BCUTc1h and LogPe ( permeability) as well as between BCUTc1 and XLogP (lipophilicity) is presented.

Each dot on both sides of the line represents an observation, i.e. a molecule with an observed logPe and a calculated BCUTc1h value. In each scatterplot, the dots are sized according to a third variable, i.e. the structural descriptor BCUTw1h. It can be observed that more than one structural combinations could lead to the same LogPe and XlogP values.

The geometric descriptor FNSA.3, which combines the surface area and partial charge information (charge weighted partial negative surface area/total molecular surface area) was estimated by the ANN models as highly significant, which is in accord with previous reports (14–19). In Fig. 5, where the correlation of the top 6 out of 61 most important descriptors along with the modelled end point logPe is presented, we can see that a positive linear association is observed between the descriptor FNSA.3 and logPe. This is better shown in Fig. 13, where a general illustration of the relationship between the two variables is provided. Each dot on both sides of the line represents an observation, i.e. a molecule with an observed logPe and a calculated FNSA.3 value. The overall pattern of the graph suggests that higher FNSA.3 values are generally associated with increased permeability (approximately logPe ≥ -6.2). In each scatterplot, the dots are sized according to a third variable, i.e. the descriptors nHBDon, XlogP and TopoPSA (topological polar surface area), respectively, to explore their influence on the observed permeability. It can be clearly seen that an increase of FNSA.3 combined with low nHBDon and TopoPSA values and high XlogP (> 0, < 6) result in increased permeability.

In Table 6, structures of the ten best performing molecules (out of the 1561, PostEra) in terms of their activity against M^pro (r_avg_IC₅₀ < 1uM) are presented, along with their logPe as predicted by the model EnsembleNN. The “Lipinski-like” properties of the synthesized compounds are compliant with the rule of five and high permeability is predicted for the majority of them. Electron-withdrawing sulphonyl functional groups in molecules 1145 and 1245 appear to have negative effect on their membrane permeability. Although the presence of sulphonyl groups increases the number of hydrogen bond acceptors and enhance the binding affinity of drugs with target proteins through hydrogen bond interactions, they also increase polarity and affect solubility and acid–base properties (Fei et al. 2016), features already shown to be highly relevant for the membrane permeability of molecules.

Table 6 Structures of the ten best performing molecules (out of the 1561) in terms of their activity against the main protease M^pro (r_avg_IC50 < 1uM) are presented, along with their logPe as predicted by the model EnsembleNN

Full size table

5 Conclusions

In the present work, we employed a “stacked regression” ensemble approach to model the effective membrane permeability coefficient LogPe of 190 compounds, measured by the PAMPA assay at pH 7.4. On the whole, built on the set of 61 selected descriptors, our ensemble ANN model provides a method for the quantification of drug-likeness, particularly useful for drugs that move beyond the traditional rule of five space to access challenging targets (Alex et al. 2011). Unlike Lipinski’s rule, used for filtering large datasets of molecules and achieving no other discrimination of compounds beyond a qualitative pass or fail, our model allows for the optimization of relevant physicochemical properties of new molecules of interest—even before they are synthesized—through careful multi-criteria evaluation and control.

6 Data, software and code availability

Publicly available permeability data (Chi et al. 2019) containing the SMILES strings of 190 structurally diverse drug or drug-like molecules with recorded effective permeability (logPe) values were used for creating the QSAR models in the present work. The data—carefully curated by Chi et al. (Chi et al. 2019) and based on previous reports by Oja et Maran (2015a, b, c, 2016a, b, 2018)—were generated with the same experimental protocol and were therefore highly homogenous. A single 3D conformation was created for each structure using the publicly available Bioclipse software (Spjuth et al. 2007, 2009). Data analysis and QSAR modelling were performed using the publicly available R Statistical Programming Language (version 3.5.1, 64bit and 4.0.3, 64BIT) (R Core Team 2018). R is both a language and an environment for statistical computing and graphics, providing a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering) and graphical techniques and is highly extensible. R is designed around a true computer language allowing users to add additional functionality by defining new functions. Extended functionalities are added to R by installing a number of packages, including Machine Learning algorithms implemented as third party libraries. The following R packages were used for the analysis: rcdk (Guha 2007), randomForest (Liaw and Wiener 2002), caret (Kuhn 2019, 2008), rpart (Therneau and Atkinson 2018), rpart.plot (Milborrow 2019), caretEnsemble (Deane-Mayer and Knowles 2016), tidyverse (Wickham et al. 2019a), mlbench (Leisch and Dimitriadou 2010), corrplot (Wei and Simko 2017), neuralnet (Günther et al. 2010), and dplyr (Wickham et al. 2019b), magrittr (Bache and Wickham 2014), WVPlots (Mount and Zumel 2020). The R code, a file detailing the versions of all R packages and individual subsets saved as CSV files for reading into the R modelling workflows have been made available on Zenodo (Gousiadou 2021), along with a README file explaining their contents and guidance on how to reproduce results via running the available code files.

The model has been implemented as a web service in the Jaqpot 5 modelling platform (Sarimveis 2019; ) and is available at the following URL: https://app.jaqpot.org/model/NDmy6udCm9UjmEMq0Zaq under the NanoCommons organization. In the “overview” tab, details about the model are presented. For accessing the model, the interested user should first register in Jaqpot 5 and then become a member of the NanoCommons organization by sending an e-mail to: E-mail: hsarimv@central.ntua.gr.

7 Supporting information

Supplementary material for this work is included in the Supporting Information 1, as different sheets of an Excel workbook (also including the SMILES strings of all the compounds) and in the Supporting Information 2 where the structures of 47 most bioactive molecules (out of 1561 downloaded from the PostEra site with recorded bioactivity against M^pro) along with their recorded IC₅₀ and predicted LogPe values are depicted.

Data availability

The data that support the findings of this study are openly available in Zenodo Online Repository https://zenodo.org/record/5504324#.Y5sPp3bMJaQ.

References

https://github.com/postera-ai/COVID_moonshot_submissions
Alex A, Millan DS, Perez M et al (2011) Intramolecular hydrogen bonding to improve membrane permeability and absorption in beyond rule of five chemical space. Med Chem Commun 2:669–674. https://doi.org/10.1039/C1MD00093D
Article Google Scholar
Alexander DLJ, Tropsha A, Winkler DA (2015) Beware of R2: simple, unambiguous assessment of the prediction accuracy of QSAR and QSPR models. J Chem Inf Model 55:1316–1322. https://doi.org/10.1021/acs.jcim.5b00206
Article Google Scholar
Alloqmani, A., B., Y., Irshad, A., Alsolami, F. Deep learning based anomaly detection in images: Insights, challenges and recommendations. International Journal of Advanced Computer Science and Applications 2021, 12. https://doi.org/10.14569/IJACSA.2021.0120428
Ambroise C, McLachlan GJ (2002) Selection Bias in Gene Extraction on the Basis of Microarray Gene-Expression Data. Proc Natl Acad Sci USA 99:6562–6566. https://doi.org/10.1073/pnas.102102699
Article MATH Google Scholar
An G (1996) The effects of adding noise during backpropagation training on a generalization performance. Neural Comput 8:643–674. https://doi.org/10.1162/neco.1996.8.3.643
Article Google Scholar
Avdeef A (2001) Physicochemical profiling (solubility, permeability and charge state). Curr Top Med Chem 1:277–351. https://doi.org/10.2174/1568026013395100
Article Google Scholar
Avdeef A, Artursson P, Neuhoff S, Lazorova L, Gråsjö J, Tavelin S (2005) Caco-2 permeability of weakly basic drugs predicted with the Double-Sink PAMPA pKa(flux) method. Pharm Sci 24:333–349. https://doi.org/10.1016/j.ejps.2004.11.011
Article Google Scholar
Avdeef A, Nielsen PE, Tsinman O (2004) PAMPA—a drug absorption in vitro model 11. Matching the in vivo unstirred water layer thickness by individual-well stirring in microtitre plates. Pharm Sci 22:365–374. https://doi.org/10.1016/j.ejps.2004.04.009
Article Google Scholar
Bache, S. M.; Wickham, H. 2014. “magrittr: A forward-pipe operator for R.” R package version 1.5. https://CRAN.R-project.org/package=magrittr
Balani SK, Miwa GT, Gan L-S, Wu J-T, Lee FW (2005) Strategy of utilizing in vitro and in vivo ADME tools for lead optimization and drug candidate selection. Curr Top Med Chem 5:1033–1038. https://doi.org/10.2174/156802605774297038
Article Google Scholar
Berben P, Bauer-Brandl A, Brandl M, Faller B, Flaten GE, Jacobsen A-C, Brouwers J, Augustijns P (2018) Drug permeability profiling using cell-free permeation tools: overview and applications. Eur J Pharm Sci 119:219–233. https://doi.org/10.1016/j.ejps.2018.04.016
Article Google Scholar
Bermejo M, Avdeef A, Ruiz A, Nalda R, Ruell JA, Tsinman O, González I, Fernández C, Sánchez G, Garrigues TM, Merino V (2004) PAMPA—a drug absorption in vitro model 7. Comparing rat in situ, Caco-2, and PAMPA permeability of fluoroquinolones. Pharm Sci 21:429–441. https://doi.org/10.1016/j.ejps.2003.10.009
Article Google Scholar
Breiman L (1996) Stacked regressions. Mach Learn 24:49–64. https://doi.org/10.1007/BF00117832.2
Article MATH Google Scholar
Burden, F.R. Molecular Identification Number for Substructure Searches. J. Chem. Inf. Comput, Sci. 1989, 29, 225–227. doi: https://doi.org/10.1021/ci00063a011.
Cao, D.-S., Deng, Z.-K., Zhu, M.-F., Yao, Z.-J., Dong, J., Zhao, R.-G. Ensemble partial least squares regression for descriptor selection, outlier detection, applicability domain assessment, and ensemble modeling in QSAR/QSPR modeling. Journal of Chemometrics 2017, 31, e2922. https://doi.org/10.1002/cem.2922
Chi C-T, Lee M-H, Weng C-F, Leong MK (2019) In silico prediction of PAMPA effective permeability using a two-QSAR approach. Int J Mol Sci 20:3170–3194. https://doi.org/10.3390/ijms20133170
Article Google Scholar
Dagenais C, Avdeef A, Tsinman O, Dudley A, Beliveau R (2009) P-glycoprotein deficient mouse in situ blood–brain barrier permeability and its prediction using an in combo PAMPA model. Eur J Pharm Sci 38:121–137. https://doi.org/10.1016/j.ejps.2009.06.009
Article Google Scholar
Dahlgren D, Lennernäs H (2019) Intestinal permeability and drug absorption: predictive experimental. Comput in Vivo Approach Pharm 11:411–429. https://doi.org/10.3390/pharmaceutics11080411
Article Google Scholar
Deane-Mayer, Z. A.; Knowles, J. E.. 2016. “caretEnsemble: Ensembles of Caret Models.” R package version 2.0.0. https://CRAN.R-project.org/package=caretEnsemble.
von Delft F, Calmiano M, Chodera J et al (2021) A white-knuckle ride of open COVID drug discovery. Nature 594:330–332. https://doi.org/10.1038/d41586-021-01571-1
Article Google Scholar
Diukendjieva A, Alov P, Tsakovska I et al (2019) In vitro and in silico studies of the membrane permeability of natural flavonoids from Silybum marianum (L.) Gaertn. And their derivatives. Phytomedicine 53:79–85. https://doi.org/10.1016/j.phymed.2018.09.001
Article Google Scholar
Dudgeon, T. https://github.com/tdudgeon/jupyter_mpro/blob/master/ADMET-moonshot.ipynb (last accessed 24/02/2021).
Erlanson DA (2020) Many small steps towards a COVID-19 drug. Nat Commun 11:5048. https://doi.org/10.1038/s41467-020-18710-3
Article Google Scholar
Faraway, J. Linear Models with R. Chapman & Hall/CRC, 2005, Boca Raton.
Fei Z, Jiang W, Xiao D et al (2016) Application of Sulfonyl in Drug Design. Chinese Journal of Organic Chemistry 36:490. https://doi.org/10.6023/cjoc201510006
Article Google Scholar
Ferreira LG, Andricopulo AD (2020) COVID-19: small-molecule clinical trials landscape. Curr Top Med Chem 2020:1577–1580. https://doi.org/10.2174/156802662018200703154334
Article Google Scholar
Fortuna A, Alves G, Falcão A (2007) The importance of permeability screening in drug discovery process: PAMPA, Caco-2 and rat everted gut assays. Curr Top Pharmacol 11:63–86
Google Scholar
Gousiadou, C., 2021. “Development of Neural Network Models to Predict the PAMPA Effective Permeability of New, Orally Administered Drugs Active Against the Coronavirus SARS-CoV-2.". Zenodo Online Repository https://zenodo.org/record/5504324#.Y5sPp3bMJaQ
Guha R (2007) Chemical informatics functionality in R. J Stat Softw 18:1–16. https://doi.org/10.18637/jss.v018.i05
Article Google Scholar
Guha R, Willighagen E (2012) A survey of quantitative descriptions of molecular structure. Curr Top Med Chem 12:1946–1956. https://doi.org/10.2174/156802612804910278
Article Google Scholar
Günther, F., Fritsch, S. neuralnet: Training of Neural Networks. The R journal, 2010, 2, ISSN 2073–4859. https://doi.org/10.32614/RJ-2010-006
Hawkins DM (2004) The Problem of Overfitting. J Chem Inf Model 44:1–12. https://doi.org/10.1021/ci0342472
Article Google Scholar
Ho SY, Phua K, Wong L, Bin Goh WW (2020) Extensions of the external validation for checking learned model interpretability and generalizability. Patterns 1:100129. https://doi.org/10.1016/j.patter.2020.100129
Article Google Scholar
Homayun B, Lin X, Choi H-J (2019) Challenges and recent progress in oral drug delivery systems for biopharmaceuticals. Pharmaceutics 11:129. https://doi.org/10.3390/pharmaceutics11030129
Article Google Scholar
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2:359–366. https://doi.org/10.1016/0893-6080(89)90020-8
Article MATH Google Scholar
Hung Y-P, Lee J-C, Chiu C-W et al (2022) Oral nirmatrelvir/ritonavir therapy for covid-19: the dawn in the dark? Antibiotics 11:220. https://doi.org/10.3390/antibiotics11020220
Article Google Scholar
Informatics Matters Ltd. https://www.informaticsmatters.com/ (last accessed 24/02/2021).
Irshad K, Khan AI, Irfan SA et al (2020) Utilizing artificial neural network for prediction of occupants thermal comfort: a case study of a test room fitted with a thermoelectric air-conditioning system. IEEE Access 8:99709–99728. https://doi.org/10.1109/ACCESS.2020.2985036
Article Google Scholar
Jaber Alsolami F, Saad Al-Malaise AL, Ghamdi A, Irshad Khan AB et al (2021) Impact assessment of COVID-19 pandemic through machine learning models. Computers, Materials & Continua 68, 2895–2912. https://doi.org/10.32604/cmc.2021.017469
Jang WD, Jeon S, Kim S, Lee SY (2021) Drugs repurposed for COVID-19 by virtual screening of 6,218 drugs and cell-based assay. Proc Natl Acad Sci 118:e2024302118. https://doi.org/10.1073/pnas.2024302118
Jaqpot https://infrastructure.nanocommons.eu/services/5/jaqpot-5-computational-platform-for-in-silico-modelling/
John, G. H.; Kohavi, R.; Pfleger, K. “Irrelevant Features and the Subset Selection Problem.” In Machine Learning Proceedings 1994, 121–129. Burlington, MA: Morgan Kauffman, doi:https://doi.org/10.1016/B978-1-55860-335-6.50023-4.
Kansy M, Senner F, Gubernator K (1998) Physicochemical high throughput screening: parallel artificial membrane permeation assay in the description of passive absorption processes. J Med Chem 41:1007–1010. https://doi.org/10.1021/jm970530e
Article Google Scholar
Kaur J, Khan AI, Abushark YB et al (2020) Security risk assessment of healthcare web application through adaptive neuro-fuzzy inference system: a design perspective</p>. Risk Manag Healthcare Policy 13:355–371. https://doi.org/10.2147/RMHP.S233706
Article Google Scholar
Kotu, V.; Deshpande, B. Chapter 2 - Data Science Process. Data Science (2nd Edition) 2019, edited by Kotu, V., Deshpande, 19–37. Morgan Kaufmann, ISBN 9780128147610, https://doi.org/10.1016/B978-0-12-814761-0.00002-2.
Kuhn, M. 2019. “caret: Classification and Regression Training R package version 6.0–84.” http://topepo.github.io/caret/index.html.
Kuhn, M. “Building Predictive Models in R Using the Caret Package.” Journal of Statistical Software 2008 28: 1–26. https://doi.org/10.18637/jss.v028.i05
Kvålseth OT (1985) Cautionary note about R2. Am Stat 39(4):279–285. https://doi.org/10.1080/00031305.1985.10479448
Article Google Scholar
Leisch, F.; Dimitriadou, E. 2010. “mlbench: Machine Learning Benchmark Problems.” R package version 2.1–1. http://rdrr.io/cran/mlbench.
Liaw A, Wiener M (2002) Classification and Regression by randomForest. R News 2:18–22
Google Scholar
Lipinski CA (2000) Drug-like properties and the causes of poor solubility and poor permeability. J Pharmacol Toxicol Methods 44:235–249. https://doi.org/10.1016/s1056-8719(00)00107-6
Article Google Scholar
Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46:3–26. https://doi.org/10.1016/s0169-409x(00)00129-0
Article Google Scholar
Lu H, Kopcho L, Ghosh K et al (2016) Development of a RapidFire mass spectrometry assay and a fluorescence assay for the discovery of kynurenine aminotransferase II inhibitors to treat central nervous system disorders. Anal Biochem 15:56–65. https://doi.org/10.1016/j.ab.2016.02.003
Article Google Scholar
Masek BB, Shen L, Smith KM, Pearlman RS (2008) Sharing chemical information without sharing chemical structure. J Chem Inf Model 48:256–261. https://doi.org/10.1021/ci600383v
Article Google Scholar
Masungi C, Mensch J, Van Dijck A et al (2008) Parallel artificial membrane permeability assay (Pampa) combined with a 10-day multiscreen Caco-2 cell culture as a tool for assessing new drug candidates. Pharmazie 63:194–199
Google Scholar
Milborrow, S. 2019. “rpart.plot: Plot ’rpart’ Models: An Enhanced Version of ’plot.rpart’.” R package version 3.0.8. https://CRAN.R-project.org/package=rpart.plot.
Mount, J.; Zumel, N. WVPlots: Common Plots for Analysis. 2020 R package version 1.3.1. https://CRAN.R-project.org/package=WVPlots.
NanoCommons Translational Access (TA). https://www.nanocommons.eu/ta-access/ (last accessed 24/02/2021.
Oja M, Maran U (2015a) The permeability of an artificial membrane for wide range of pH in human gastrointestinal tract: experimental measurements and quantitative structure-activity relationship. Mol Inf 34:493–506. https://doi.org/10.1002/minf.201400147
Article Google Scholar
Oja M, Maran U (2015b) Quantitative structure–permeability relationships at various pH values for acidic and basic drugs and drug-like compounds. SAR QSAR Environ Res 26:701–719. https://doi.org/10.1080/1062936X.2015.1085896
Article Google Scholar
Oja M, Maran U (2016a) Quantitative structure–permeability relationships at various pH values for neutral and amphoteric drugs and druglike compounds. SAR QSAR Environ Res 27:813–832. https://doi.org/10.1080/1062936X.2016.1238408
Article Google Scholar
Oja M, Maran U (2018) pH-permeability profiles for drug substances: Experimental detection, comparison with human intestinal absorption and modelling. Eur J Pharm Sci 123:429–440. https://doi.org/10.1016/j.ejps.2018.07.014
Article Google Scholar
Oja M, Maran U (2015c) Data for: Quantitative structure-permeability relationships at various pH values for acidic and basic drugs and drug-like compounds. QsarDB repository, QDB.166. 2015c. http://dx.doi.org/https://doi.org/10.15152/QDB.166.
Oja M, Maran U (2016b) Data for: Quantitative structure-permeability relationships at various pH values for neutral and amphoteric drugs and drug-like compounds. QsarDB repository, QDB.184. 2016b. http://dx.doi.org/https://doi.org/10.15152/QDB.184.
Olden JD, Joy MK, Death RG (2004) An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecol Model 178:389–397. https://doi.org/10.1016/j.ecolmodel.2004.03.013
Article Google Scholar
Owen DR, Allerton CMN, Anderson AS et al (2021) An oral SARS-CoV-2 Mpro inhibitor clinical candidate for the treatment of COVID-19. Science 374:1586–1593. https://doi.org/10.1126/science.abl4784
Article Google Scholar
Pearlman, R.S., Smith, K.M. In 3D-QSAR and Drug Design: Recent AdVances; Kubinyi, H., Martin, Y., Folkers, G., Eds.; Kluwer Academic: Dordrecht, Netherlands, 1997; pp 339–353.
PostEra (2022) COVID Moonshot: an international effort to discover a COVID antiviral. https://covid.postera.ai/covid (Accessed 19/07/2022)
R Core Team. 2018. A Language and Environment for Statistical Computing; R Foundation for Statistical Computing. Vienna: Austria, http://www.R-project.org.
Riedmiller, M., Braun, H. A direct adaptive method for faster backpropagation learning: The RPROP algorithm. IEEE International Conference on Neural Networks 1993, 586–591. https://doi.org/10.1109/ICNN.1993.298623
Roy D, Dutta D, Wishart DS, Kovalenko A (2021) Predicting PAMPA permeability using the 3D-RISM-KH theory: Are we there yet? J Comput Aided Mol Des 35:261–269. https://doi.org/10.1007/s10822-020-00364-4
Article Google Scholar
Roy, P. P., S. Paul, I. Mitra, and K. Roy.. “On Two Novel Parameters for Validation of Predictive QSAR Models. ”Molecules (Basel, Switzerland) 2009, 14, 5, 1660–1701, doi:https://doi.org/10.3390/molecules14051660.
Sarimveis, H. (2019), "Jaqpot - An open-source web platform for creating, using, testing and sharing predictive models in nano-informatics," https://ncihub.org/resources/2268.
Sarker IH, Abushark YB, Alsolami F, Khan AI (2020) Intrudtree: a machine learning based cyber security intrusion detection model. Symmetry 12:754. https://doi.org/10.3390/sym12050754
Article Google Scholar
Sayers EW, Bolton EE, Brister JR et al (2022) Database resources of the national center for biotechnology information. Nucleic Acids Res 50:D20–D26. https://doi.org/10.1093/nar/gkab1112
Article Google Scholar
Schmidt D, Lynch J (2022) Evaluation of the reproducibility of Parallel Artificial Membrane Permeation Assays (PAMPA) https://www.sigmaaldrich.com/DK/en/technical-documents/technical-article/research-and-disease-areas/pharmacology-and-drug-discovery-research/evaluation-of-the-reproducibility-of-pampa
Schäfer, A., Martinez, D. R., Won, J. J., et al. Therapeutic treatment with an oral prodrug of the remdesivir parental nucleoside is protective against SARS-CoV-2 pathogenesis in mice. Science Translational Medicine 2022, 14, eabm3410. https://doi.org/10.1126/scitranslmed.abm3410
Sinkó B, Garrigues TM, Balogh GT, Nagy ZK, Tsinman O, Avdeef A, Takács-Novák K (2012) Skin-PAMPA: a new method for fast prediction of skin penetration. Eur J Pharm Sci 45:698–707. https://doi.org/10.1016/j.ejps.2012.01.011
Article Google Scholar
Spjuth O, Alvarsson J, Berg A, Eklund M, Kuhn S, Mäsak C, Torrance G, Wagener J, Willighagen EL, Steinbeck C, Wikberg JES (2009) Bioclipse 2: a scriptable integration platform for the life sciences. BMC Bioinf 10:397–402
Article Google Scholar
Spjuth O, Helmus T, Willighagen EL, Kuhn S, Eklund M, Wagener J, Murray-Rust P, Steinbeck C, Wikberg JES (2007) Bioclipse: an open source workbench for chemo- and bioinformatics. BMC Bioinf 8:59–69. https://doi.org/10.1186/1471-2105-8-59
Article Google Scholar
Stanton DT (1999) Evaluation and use of BCUT descriptors in QSAR and QSPR studies. J Chem Inf Comput Sci 39:11–20. https://doi.org/10.1021/ci980102x
Article Google Scholar
Sun H, Nguyen K, Kerns E et al (2017) Highly predictive and interpretable models for PAMPA permeability. Bioorg Med Chem 25:1266–1276. https://doi.org/10.1016/j.bmc.2016.12.049
Article Google Scholar
Sushko I, Novotarskyi S, Körner R et al (2010) Applicability domain for in silico models to achieve accuracy of experimental measurements. J Chemom 24:202–208. https://doi.org/10.1002/cem.1296
Article Google Scholar
Svetnik, V.; Liaw, A.; Tong, C.; Culberson, J. C.; Sheridan, R. P.; Feuston, B. P. “Random Forest: A Classification and Regression Tool for Compound Classification and QSAR modeling.” J Chem Inf Model. 2003, 43. https://doi.org/10.1021/ci034160g
Svetnik, V.; Liaw, A.; Tong, C.; Wang, T. “Application of Breiman’s Random Forest to Modeling Structure-Activity Relationships of Pharmaceutical Molecules.” In Multiple Classifier Systems. MCS 2004. Lecture Notes in Computer Science, edited by F. Roli, J. Kittler, and T. Windeatt, Vol. 3077, 334–343. Springer, Berlin, Heidelberg.doi:https://doi.org/10.1007/978-3-540-25966-4_33.
Tetko IV, Sushko I, Pandey AK et al (2008) Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: Focusing on applicability domain and overfitting by variable selection. J Chem Inf Model 48:1733–1746. https://doi.org/10.1021/ci800151m
Article Google Scholar
Therneau, T.; Atkinson, B. 2018. “rpart: Recursive Partitioning and Regression Trees.” R package version 4.1–13. https://CRAN.R-project.org/package=rpart.
Veber DF, Johnson SR, Cheng H-Y et al (2002) Molecular properties that influence the oral bioavailability of drug candidates. J Med Chem 45:2615–2623. https://doi.org/10.1021/jm020017n
Article Google Scholar
Wei, T.; Simko, V. 2017. “R Package "Corrplot": Visualization of a Correlation Matrix (Version 0.84).” Available from https://github.com/taiyun/corrplot.
Wickham, H.; Averick, M.; Bryan, J.; Chang, W.; McGowan, L.; François, R.; Grolemund, G. et al. “Welcome to the Tidyverse.” Journal of Open Source Software 2019a, 4 , 1686–1692. https://doi.org/10.21105/joss.01686
Wickham, H.; François, R.; Henry, L.; Müller, K. 2019b. “dplyr: A Grammar of Data Manipulation.” R package version 0.8.3. https://CRAN.R-project.org/package=dplyr.
Willmott C, Matsuura K (2005) Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim Res 30:79–82. https://doi.org/10.3354/cr030079
Article Google Scholar
Wolpert DH (1992) Stacked generalization. Neural Netw 5:241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
Article Google Scholar

Download references

Acknowledgements

This work received funding from the European Union's Horizon 2020 research and innovation programme via NanoCommons Project under grant agreement No 731032.

Funding

Open access funding provided by HEAL-Link Greece.

Author information

Authors and Affiliations

School of Chemical Engineering, National Technical University of Athens, Heroon Polytechneiou 9, 15780, Zografou, Athens, Greece
Chrysoula Gousiadou, Philip Doganis & Haralambos Sarimveis

Authors

Chrysoula Gousiadou
View author publications
You can also search for this author in PubMed Google Scholar
Philip Doganis
View author publications
You can also search for this author in PubMed Google Scholar
Haralambos Sarimveis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chrysoula Gousiadou.

Ethics declarations

Conflict of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 1056 kb)

Supplementary file2 (DOCX 274 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gousiadou, C., Doganis, P. & Sarimveis, H. Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus SARS-CoV-2. Netw Model Anal Health Inform Bioinforma 12, 16 (2023). https://doi.org/10.1007/s13721-023-00410-9

Download citation

Received: 11 January 2022
Revised: 05 January 2023
Accepted: 06 January 2023
Published: 06 February 2023
DOI: https://doi.org/10.1007/s13721-023-00410-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Development of artificial neural network models to predict the PAMPA effective permeability of new, orally administered drugs active against the coronavirus SARS-CoV-2

Abstract

Similar content being viewed by others

Machine Learning Exploration of the Relationship Between Drugs and the Blood–Brain Barrier: Guiding Molecular Modification

Neural Network Ensemble Based QSAR Model for the BBB Challenge: A Review

A machine learning platform to estimate anti-SARS-CoV-2 activities

1 Introduction

2 Experimental

2.1 Description of the PAMPA method

2.2 Permeability measurements and experimental setup

2.3 Partitioning of the data for model development and validation: calculation of molecular descriptors and model performance statistics

2.3.1 Train, test and external validation datasets (supporting information 1, sheet S1.1)

2.3.1.1 Model development data: train and test subsets

2.3.1.2 External validation set

2.3.2 Calculation of molecular descriptors

2.3.3 Model performance statistics

2.3.4 Workflow for model development and validation

3 Results

3.1 Data pre-processing and feature selection

3.2 QSAR models created using the selected 61 descriptors

3.3 Domain of applicability (DOA) of the ensemble NN model

3.4 Linear model created using only structural descriptors

3.5 Implementation of the EnsembleNN model

4 Discussion

4.1 Importance of descriptors

5 Conclusions

6 Data, software and code availability

7 Supporting information

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (XLSX 1056 kb)

Supplementary file2 (DOCX 274 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation