Background

Among numerous infections influencing present-day humanities, dementia is quite possibly the most severe health problem. The most well-known type of dementia is AD which is liable for a large percentage of the circumstances. In line with these realities, 40 years to come, a hundred million AD patients may spring up [1]. A decrease in memory is one out of the numerous characteristics of AD which result in neuronal atrophy [2]. The existing pharmacological administration of communication and emotional signs of dementia (BPSD) offers restricted viability, related to genuine adverse effects bringing about an enhanced danger of death [3].

In the cure of AD, quite a lot of scientific trials have made known that ACHEIs are favourable treatments [4, 5]. Tacrine, galantamine, rivastigmine and donepezil are some ACHEIs presently used, but their clinical usefulness is inadequate since the inadequate number of available drugs, their low effectiveness and numerous unwanted side effects such as hepatotoxicity [6]; therefore, AD remains incurable [7,8,9], to increase their activity and lessen adverse side effects [10], and there is need to synthesis and study new compounds as ACHEIs is required [11].

Computational techniques are on a fundamental level similar to high-throughput separating both target and ligand structure, and data are required. The extending highlights of 3D ligand-based pharmacophore screening and pharmacokinetic inspect, improve the lead atoms choice with ligand viability [12, 13].

The present study aimed to identify and validate that N-aryl derivatives bind to the energetic site of ChEIs [14]. Theoretical methods have been effectively used to recognize the necessary structural structures for their selective inhibitory activity. The quantitative structure–activity relationships (QSAR) approaches have the capability of diminishing considerably the time and exertion needed for the disclosure of the different treatments [15]. Molecular descriptors that address the variety of the underlying properties of the particles are significant progress in building the QSAR models [15]. Subsequently, the utilization of QSAR in the improvement of a hypothetical model to calculate the biological activities of a set of compounds is a vital strategy used in the QSAR methodology and is found in the literature [16].

Also, the following approaches were used to determine the efficacy of the ligand inhibitor molecules: design of molecules, 3D pharmacophore, molecular docking, and ADMET studies are essential approaches to determining ligand inhibitors.

Methods

Data set, generation of the molecular structures and descriptor calculation

The QSAR studies contain 75 N-aryl derivatives with the anti-Alzheimer activity that were selected from the literature [17,18,19,20]. For this study, ChemDraw Ultra version 12.0, Spartan’14 version 1.1.4 and PaDEL-Descriptor Software, version 2.18 were used, anti-Alzheimer activity values of the designated molecules were described in IC50 (µmol); it was transformed to the inverse of IC50 and then to negative Log of IC50 [21]. 3D of the structures was obtained from the drawn 2D molecular structures by ChemDraw software; optimization and DFT calculations were made on the molecules by the Spartan’14 software and B3LYP/6-31G** basis set which yielded several molecular properties [22].

Data pre-treatment, splitting and descriptors change

All descriptors segments with perpetual and with a difference not exactly 0.001 were removed from the descriptor pool. Correspondence scrutiny was done on any descriptor of any pair with a relationship more prominent than 0.8, which was disposed of. The Modified K-mediod1.2 [23] was utilized to share the data. Descriptors in the model with great positive or negative assessment information of the training set were changed by standardization [24] utilizing the equation under

$${P}^{n} = \frac{P-{P}_{\mathrm{max}}}{{P}_{\mathrm{max}}-{P}_{\mathrm{min}}}$$
(1)

where Pn is the standardized molecular properties, Pmax is the maximum value of the property section, and Pmin is the least value in the column.

Model molecular properties collection

An algorithm in Material Studio, version 7.0 best describes the difference in activity of the studied molecules that were employed to choose a grouping of descriptors [25]. The algorithm has some merits; one of them is the production of several models at a time [26].

Model development, selection and QSAR model validation

The Molegro Data Modeler programming was used to build the model and export into the Molegro worksheet, the descriptors and activities of the compounds. The selection of models built was done dependent on the conditions of R2, Q2, and R2pred [27, 28].

Mean effect and variance inflation factor (VIF)

The mean effect was restrained towards activities of the produced model and the impact of the calculated descriptors.

$${\mathrm{Mean}}\, {\mathrm{Effect}} = \frac{{\beta }_{p}{\sum }_{i}^{n}{D}_{p}}{{\sum }_{p}^{q}\left({\beta }_{p}{\sum }_{i}^{r}{D}_{p}\right)}$$
(2)

where βp modifies the molecular properties p's coefficient, Dp modifies each estimation of framework property in the preparation set, and q adjusts with the count of model properties present and r represents the count of particles utilized as preparing set [27, 28]. Variance inflation factor (VIF) of multi-co-linearity between molecular properties blends selected by the algorithm was assessed. In Eq. (3), R_ip^2 is the relationship coefficient of the different relapse between the property I and the excess j molecular properties in the model [28].

$${\mathrm{VIF}}_{i}= \frac{1}{1-{R}_{ij}^{2}}$$
(3)

Models applicability domain (ADS′)

The estimation with a given consistency which a model make defines the AD′ of the response and chemical space [29, 30]. Those compounds that fall within the AD′ can be considered dependable if only the model can be positioned to use for selecting compounds, which means AD′ is defined and predicted [31]. Arrangement of a hat matrix H maps the vector of experimental values to the vector of fitted values defined by the leverage method which was used for AD′ in this work to build models [32]. The caution leverage (l*) signifies the boundary of typical values for irregularities of X and it is stated as l* \(= \frac{3\left(m+1\right)}{X}\) where X stands for the sum of training compounds, and m the summation of model descriptors existent. li value > l* and standardized residual values >  ± 3 within standard deviance units were seen as abnormalities of the compounds [32]. Two compounds appeared outside the warning value l* = 0.3 signifying outliers after a close observation of the AD′ (Fig. 2).

Molecular docking studies

Predicted protein–ligand relations on new investigation techniques that mix disparity change with a cavity expectation strategy were carried out using the PyRx Virtual Screen tool software [33]. Protein information bank contains Human AChEI (PDB: 4EY7) with a high-resolution 2.35 Å crystal structure. Unwanted and heteroatom particles were removed from the protein contained in the PDB record, while hydrogen was added to the protein segment and saved, the PDB was opened with Material Studio programming. The PyRx Virtual Screen tool depression location calculation and the docking was accomplished to predict the limiting method of the ligand and the unbiased protein of the scoring capacity and the saved PDB was then brought into the PyRx Virtual Screen tool interphase where the limiting pocket was guided and characterized [34].

Results

Data arrangement and QSAR models


QSAR model

$$\begin{aligned} {\mathbf{pIC50}} & = - 0.000120539*{\mathbf{ATS4i}} - {1}.{975262189}*{\mathbf{MATS2e}} \\ & \quad + {1}.0{2527}0{698}*{\mathbf{SpMax7}}\_{\mathbf{Bhs}} + 0.00{24}0{1}0{8}0*{\mathbf{EHOMO}} \\ & \quad + 0.00{784}0{871}*{\mathbf{MW}}-{1}.00{289}. \, N = {6}0R^{2} \\ & \quad = 0.{936}R_{{{\text{Adj}}}}^{2} = 0.907724 Q_{{{\text{cv}}}}^{2} = 0.88\,{\text{LOF }} \\ & \quad = 0.0{154}R_{{{\text{ext}}}}^{2} = 0.{881}, \;N_{{{\text{ext}}}} = 15. \\ \end{aligned}$$
(4)

The importance of the parameters selected was by virtue as the top value of R2 = 0.936, R2Adj = 0.907, Q2cv = 0.88 and R2ext = 0.881.

Descriptors elucidation

Minimum standard agreement for a reliable and influential model, the internal as well as the external justification considerations. A rise in physicochemical factors of descriptors SpMax7_Bhs, EHOMO (Homo) and MW (Molecular Weight) will increase inhibitory activities of N-aryl derivatives against Alzheimer disease since their figures were positive. Also, negative coefficients descriptors; ATS4i and MATS2e suggests that inhibitory activities of N-aryl derivatives will rise in contrast to the enzyme with decreasing standards of the descriptors (Additional file 1).

In silico design of anti-Alzheimer compound

Also, the seven designed compounds;1-(4-hydroxy-3-nitrophenyl)pyrrolidine-2,5dione (PIC50 = 1.3968),1(hydroxyphenyl)pyrrolidine-2,5-dione (PIC50 = 2.4182),1-(-4-hydroxy-3,5dinitrophenyl)pyrrlidine-2,4-dione (PIC50 = 3.124),1-(4-hydroxy-2,3,5trinitrophenyl)pyrrolidine-2,5-dione (pIC50 = 2.7643),1-(4-hydroxy-2 nitrophenyl)pyrrolidine-2,5-dione (pIC50 = 1.6064), 1-(2,4-dihydroxy-6-nitrophenyl)pyrrolidine-2,5-dione (pIC50 = 1.9319) and 1-(2,4,6-trihydroxy-3,5-dinitrophenyl)pyrrolidine-2,5-dione (pIC50 = 2.7478) were established to have better activities relative to the existing drug (Donepezil) (pIC50 = 2.7643), with compound 1-(-4-hydroxy-3,5-dinitrophenyl)pyrrlidine-2,4-dione (pIC50 = 3.124) having the largely better activity.

Molecular docking investigation and virtual screening

Investigations were carried out to regulate and calculate binding affinity towards protein target and visualize/elucidate molecular interactions between the receptor and ligand to identify types of amino acids accountable for the chemical connections at the active site of the inhibitors. Computed binding energy scores resulting from the docked complexes range from − 7.4 to − 10.1 kcal/mol with RMSD values for the majority of the complexes (over 80%) estimated to be less than 2.0 Å (Table 1). This suggests that the ligands were effectively docked to the dynamic site of the receptor and the performance of the Docking Algorithms used for this study was very reliable because a docking complex with an RMSD value less than 2 Å is considered a successful and correct docking prediction [33, 34]. The results of docked complexes with their binding energy were reported in Table 1.

Table 1 Molecular docking result of the designed compounds using Lamarckian genetic algorithm

ADMET property predictions and physicochemical parameters evaluation of some selected compounds

As the collaboration of an inhibitor with a compound cannot ensure its appropriateness as a medication, to further reinforce the after-effects of three-dimension-QSAR and docking studies, ADMET properties were considered on the designed molecules revealed in Table 2. The capacity to reach a target in bioactive structure was surveyed utilizing the SwissADME [35] and pkCSM [36]. Critically, the advancements carried out in these stages can envisage by a reasonable level of certainty, the false-positive outcomes ordinarily saw in biological tests of small molecules [37]. Tables 3 and 4 reveal physicochemical parameters and pharmacokinetics properties of designed compounds obtained values; Table 3 shows to be within the suitable ranges for oral bioavailability as drug candidates and obeyed Lipinski’s rule of five [38]. This implies that these compounds possessed a good physiochemical attribute generally considered in clinical trials that accounts for the good transport properties of a drug candidate [39].

Table 2 Molecular structures of design N-aryl derivatives and their theoretical activities
Table 3 Physicochemical properties of some designed compounds
Table 4 ADMET/Pharmacokinetics properties of selected ligands with highest binding affinity

Discussion

The dataset Table 5 was fragmented into 60 training set and 15 test set by clustering method. “Dt” was used in Table 5 to identify test set molecules. Several models were generated, the best was picked based on good statistical information as Eq. (4). The model 4-parametric equation containing five descriptors and an error term. Under the equation are the statistical parameters to ascertain the robustness and predictive ability of the equation. After careful validation and examination, the chosen model close by its approval boundary is introduced in Eq. (4).

Table 5 Comparison of experimental predicted and residual of the data set

Hence, the molecule in the AD of the model can be used to make extrapolation for identified molecule without activity. Figure 1 shows the experimental activity plotted against predicted activity for training and tests sets of the model were evenly distributed on both side of the line, shown in Fig. 2, is the plot of experimental activities against the predicted activities of the test sets. The external model validation showed an exact relationship among the experimental and predicted pIC50 of the test set; also, the analytical power of the model is in the great linearity of the plot.

Fig. 1
figure 1

Experimental activity plotted against predicted activity for training and tests sets of the model

Fig. 2
figure 2

Experimental versus predicted test set value’s graph

The model William’s plot in Fig. 3 depicted their AD for the built model. A square area is bounded by 0 < l* < 0.3 and − 2.5 < SDR < 2.5, where l* is the model warning leverage in the AD of the model as described in the research.

Fig. 3
figure 3

The model William’s plot

Better action guides the application for the advancement of particles in interpreting descriptors confined in a QSAR model. Along these lines, descriptors appeared in the model of this research, their normal regression number and incidence are in Table 6. Table 7 shows the reflection on the importance of the molecular properties as well as the correlation amongst them.

Table 6 Molecular descriptor and their regression coefficient
Table 7 Used molecular properties correlation matrix with VIF and mean effect

Several hypothetic novel derivatives were designed using the template (Fig. 4) which was gotten from the molecule with number D35rm in the dataset (Table 5). Noble AD’s leverage value, superb standardized residual and relatively high activity are some of the factors considered in chosen the template. The facts acquired from the molecular properties confined in the model, guided the design of the derivatives. The template was modified via the adding and elimination of a selection of substituents such as –NH2, –OH, COOH and –NO2 sets. All experimental compounds (Table 5) have lesser activities compared to the seven designed compounds (Table 2).

Fig. 4
figure 4

2D structure of Ligand template

Comparison of the designed ligands and the co-crystallized ligand. Compound one (1) Van der Waals bond has THR, ASP, PHE, LEU, PHE residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has ARG, SER, and TYR residues which is similar to the interaction in the co-crystallized ligand. Compound two (2) Van der Waals bond has LEU, ASN, SER, ARG, THR, GLN, and ILE residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has SER, TYR, and ARG has residues which is similar to the interaction in the co-crystallized ligand, also, π–π stacked has TRP residues which is similar to the interaction in the co-crystallized ligand. Compound five (5) Van der Waals bond has VAL, LEU, THR, and GLN residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has SER, and ARG residues which is similar to the interaction in the co-crystallized ligand and π–π Stacked has TRP residue which is similar to the interaction in the co-crystallized ligand. Compound Seven (7) Van der Waals bond has THR, PHE, and ASP residues which is similar to the interaction in the co-crystallized ligand, and conventional Hydrogen Bond has ARG, and TYR residues which is similar to the interaction in the co-crystallized ligand. Compound Eight (8) Van der Waals bond has THR, LEU, PHE, and GLN residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has TYR, ARG, and SER residues which is similar to the interaction in the co-crystallized ligand. Compound sixteen (16) Van der Waals bond has LEU, GLN, and VAL residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has ARG, and PHE residues which is similar to the interaction in the co-crystallized ligand. Compound Eighteen (18) Van der Waals bond has LEU, VAL, and GLN residues which is similar to the interaction in the co-crystallized ligand, conventional Hydrogen Bond has TYR, and GLN residues which is similar to the interaction in the co-crystallized ligand. All the designated compounds showed decent binding affinity and better molecular interaction at the dynamic site of the target evidenced from their binding affinity scores and the RMSD values. The binding affinity scores of the selected compounds (− 7.4 to − 10.1 kcal/mol) were found to be better compared to an FDA approved drug (Donepezil) with the binding affinity of − 6.5 kcal/mol. Also, Fig. 5 shows the structure of human AChEI (PDB ID: 4EY7), visualized through PyRx, 2D Interactions and Hydrogen bond interaction, respectively.

Fig. 5
figure 5

A Structure of human ACE (PDB ID: 4EY7), visualized through PyRx, B 2D interactions, C hydrogen bond interaction

The bioavailability radar plots showed the oral accessibility of our recommended bioactive compounds (Fig. 6), parameters of the studied molecule, make available a graphical snapshot of the drug-likeness. Lastly, Fig. 7 shows the 3D interactions of the designed compounds and the protein target.

Fig. 6
figure 6

Radar plots

Fig. 7
figure 7

3D interactions of designed ligands and protein target

Conclusions

In this work, a QSAR and virtual screening were acquired by structure-based (docking), seven ligands acquired from a blended virtual convention. As a result, we can infer that:

  • The seven chose intensifies meet these standards of no possible poisonousness, great pharmacotherapeutic profile anticipated qualities determined which is better than the reference compounds.

  • An assortment of screening techniques utilized, particularly the blended methodology, served for self-approval of the outcomes, just as to investigate, all the more effectively, the underlying variety of substance compounds, in search of a first lead.

  • The quick possibilities are purchasing these mixtures and leading broad natural movement tests (and selectivity), whose outcomes will manage the advancement of a lead is hence gotten, just as more point by point investigation of the outcomes approved.