Molecular simulations
The molecular simulation studies were performed using the Discovery Studio v 3.1. (DS, Accelrys, Inc., San Diego, USA). Three different approaches, Bayesian modeling, pharmacophore modeling, and density functional theory (DFT), were applied to the discovery of chemical features important for the potency of SIRT1 activators.
BM generation
In chemoinformatics, the Bayesian network (an emerging new technique in drug discovery) acts as the best alternative tool for similarity-based virtual screening approaches. To generate and validate the BMs, a set of 360 SIRT1 activators were collected from various literatures and patents (Bemis et al., 2009; Mai et al., 2009; Vu et al., 2009) with a reported biological activity value (EC50). The collected SIRT1 activators were sketched with the help of ACD ChemSketch v12 and transformed to 3D using DS. Based on the EC50 values the collected 360 activators were divided into 184 highly active activators (EC50 ≤5 μM) and 176 least active activators (EC50 >5 μM). The force field of Chemistry at HARvard Macromolecular Mechanics, CHARMM (Brooks et al., 1983), a flexible and comprehensive empirical energy function that is a summation of many individual energy terms, was applied to all SIRT1 activators. The quality of the BM was directly proportional to the preeminent of training set compounds, descriptors/fingerprints, and statistical method (Prathipati et al., 2008).
Preparation of training set based on diversity
The molecular diversity in the training set plays a major role in finding an important fragment in the generation of BM. Hence, the Diverse Molecules/Library Analysis was applied to select the training and test sets compounds based on the subset of structure diversity with respect to functional class fingerprints of maximum diameter 4, FCFP_4 (Prathipati et al., 2008). The curated activators (360 activators) were divided into training set (20 highly active and 20 least active) and test set (164 highly active and 156 least active) based on a maximum dissimilarity approach. DS provides three types of conformational analysis: FAST, BEST, and CAESAR quality analysis (Sakkiah et al., 2011b). BEST conformation analysis ensures the best coverage of the conformational space, hence it was used to generate the conformations for the training and test set activators. A maximum number of 255 diverse conformations were generated for each compound in training and test sets using Monte-Carlo-like algorithm with an energy range of 20 kcal mol−1 together with the poling algorithm (Smellie et al., 1995).
Selection of molecular descriptors
Descriptors are classified based on the molecular representation such as 1D (molecular formula, volume, and LogP), 2D (molecular connectivity/topology), 3D (molecular geometry/stereochemistry/pharmacophore), and 4D/5D (conformational ensembles). Here, we focused on the 2D descriptors including ALogP, molecular properties counts and element counts, surface and volume using 2D estimation and Estate Keys to generate the BMs. The main aim in calculating the descriptors is to check how well these descriptors are correlated with the reported biological activity of compounds.
Fingerprints selection
Laplacian-modified Bayesian analysis combined with fingerprints is especially useful for high-throughput data analysis because it is fast, easily automated and scales linearly with the number of samples (Rogers et al., 2005). Hence, we selected 12 different fingerprints including (i) atom type extended connectivity fingerprints counts (ECFC), (ii) atom type extended connectivity fingerprints (ECFP), (iii) atom type connectivity fingerprints counts (EPFC), (iv) atom type daylight path-based fingerprints (EPFP), (v) functional class daylight path-based fingerprint counts (FPFC), (vi) functional class daylight path-based fingerprints (FPFP), (vii) functional class extended connectivity fingerprints counts (FCFC), (viii) functional class extended connectivity fingerprints (FCFP), (ix) ALogP types extended connectivity fingerprint counts (LCFC), (x) ALogP extended connectivity fingerprint (LCFP), (xi) ALogP types daylight path-based fingerprint counts (LPFC), and (xii) ALogP types daylight path-based fingerprints (LPLP), with a diameters of 4, 6, 8, and 10 to design BMs. Extended connectivity fingerprints generate higher-order features with each feature representing the presence of a structural unit. Atom environment counts generate higher-order features (Bender et al., 2004) and hashed atom environment counts use a hashing algorithm to create an integer representation of the atom environment counts. The only difference in the generation of a functional class or atom type is the assignment of the initial atom code for each heavy, non-hydrogen atom of the molecule. The initial code assigned to an atom type is based on the number of connections to the atom, element type, charge, atom mass, and valence. For the functional class, initial atom code is based on quick estimate of the functional role the atom plays, this role indicates that the atom must be a combination of hydrogen-bond acceptor (HBA), hydrogen-bond donor (HBD), positively ionized or positively ionizable (PI), aromatic, and halogen. Hence, these different fingerprints were chosen to find which one was more appropriate to differentiate the highly active from least active SIRT1 activators.
Laplace modified naïve Bayesian classifier
In recent studies, Bayesian inference network was introduced as a promising similarity search approach. In this work, Laplacian-corrected Bayesian classifier algorithm (Prathipati et al., 2008) was used to generate BMs for SIRT1 activators. Multiple reference structures were used or more weights assigned to some fragments in molecular structure. This implementation of Bayesian statistics used information from both highly active and least active SIRT1 activators in the training set and removed features from the model which were deemed to be less important. The following steps were taken for BM generation: (i) features of the sample were generated for each compound, (ii) the weight was calculated for each feature using a Laplacian-adjusted probability estimate, (iii) weights were summed to provide a probability estimate which is a relative predictor of the likelihood of that sample being from the good subset, and (iv) Laplacian-corrected estimator was used to adjust the uncorrected probability estimate of a feature to account for different sampling frequencies of different features. The value of “1” was assigned to highly active activators and “0” for least active activators. The BM was built based on Bayes’ theorem:
$$P(h/d) = P(d/h)P(h)/P(d),$$
where h denotes the model. D is the observed data, P(h) indicates the prior belief (probability of pharmacophore model h before observing any data), P(d) is the data evidence (marginal probability of the data), P(d/h) is the likelihood (probability of data d if pharmacophore model h is true), and P(h/d) is the posterior probability (probability of pharmacophore model h being true given the observed data d; Sakkiah et al., 2013b). The modeling process creates a predictive model from the training set that can then be applied to score samples in the test set, and the score can be used to prioritize samples for screening.
Qualitative pharmacophore model generation
Pharmacophore modeling is one the most potent techniques used to identify the critical chemical features from known inhibitors/activators of a particular target. The ligand-based approach is one of the most powerful tools in the rational drug design process. Qualitative hypothesis (Hypo) using the Hip-Hop algorithm was utilized to generate a Hypo for SIRT1 activators.
Training set preparation
Initially 45 good SIRT1 activators were collected from the literature. The diversity of the training set was directly proportional to the quality of the generated Hypo. Hence Cluster Ligands protocol was used to select 6 highly diverse SIRT1 activators from the 45 activators. Cluster Ligands protocol recognized the common pattern or chemical features present in the SIRT1 activators and assigned a set of molecules into subsets or clusters based on the root mean square difference in the Tanimoto distance for fingerprints, such that molecules with similar properties clustered. The cluster analysis was performed by a relocation method based on maximal dissimilarity portioning. Initially, it randomly chose the data set as the first cluster center and based on the distance record from the first center, it selected the next cluster center. The process was repeated until a sufficient number of cluster centers were achieved. Six clusters were obtained through this process and finally six SIRT2 activators were selected as a training set (one SIRT1 activator from each cluster).
Pharmacophore model generation
DS provides a dictionary of chemical features important in drug-enzyme/receptor interactions such as HBA, HBD, hydrophobic (Hy), Hy aliphatic (HAli), Hy aromatic (HAro), ring aromatic (RA), and PI and negative ionizable (NI) chemical groups. Sakkiah et al. (2009) reported that the HBA, HBD, RA, PI, and Hy chemical features were important for a molecule to enhance SIRT1 activity. Hence these five chemical features were selected to generate a qualitative Hypo models.
The qualitative pharmacophore generation for SIRT1 activators was performed in three-steps (Sakkiah et al., 2011a; Ferreira da Silva et al., 2004): (i) generation of the conformation for each molecule in the training set, (ii) each conformer is examined for the presence of certain chemical features, and (iii) a 3D configuration of chemical features common to the input molecules is determined. In the Hypo generation methodology, the highest weight value of “2” was assigned for all compounds which ensures that all the chemical features present in the compound will be considered in building Hypo space and “0” for the principal and maximum omitting features columns (Arooj et al., 2013). The ranking score for each individual Hypo was calculated based on a ranking formula and the default definition of the “FIT” of a molecule to the Hypo, in order to determine the probability that a selected Hypo mapped with the training set molecule by a chance correlation. The top 10 common feature Hypos were generated with the best ranking scores. The quality of the Hypo was predicted by calculating the “fit-value” and this value was defined as the weight (f) × [1 − SSE(f)], where f is the mapping features, SSE(f) is the sum over location constraints c on f of [D(c)/T(c)] 2, D is the displacement of the feature from the center of the location constraint, and T (tolerance) is the radius of the location constraint sphere for the feature.
Pharmacophore model validation
The test set was prepared to identify the best pharmacophore model, as well as to check how accurately it was able to differentiate the highly active from moderately/least active SIRT1 activators. The test set contained 130 activators collected from the literature and classified into three sets based on their activity values: highly active (EC50 <5 μM), moderately active (5 ≤ EC50 ≤ 50 μM), and least active (EC50 >50 μM). All the activators were energy minimized by applying CHARMM force field and 255 conformations were generated for each compound based on the energy values. To select a best Hypo, Ligand Pharmacophore mapping module was used to screen the test set to find which Hypo was able to pick a reliable number of highly active SIRT1 activators.
Preparation of drug-like database
Chemists typically prioritize screening hits on the basis of “drug-like properties,” synthetic accessibility, intellectual property potential, and potency. Hence, a drug-like database was generated by removing the non-drug-like compounds with a high probability of having undesirable molecular features that would hamper their development. Initially, a simple Rule of 5 was applied to filter Maybridge (60,000), Chembridge (50,000), NCI (~200,000), and ChemDiv (~0.7 million) databases on the basis of their likelihood to be orally availability. Rule of 5 (Lipinski, 2000) states that the molecular weight should be less than or equal to 500 kDa, LogP less than or equal to 5, HBA (O and N) less than or equal to 10, and HBD (OH and NH) less than or equal to 5. Furthermore, absorption, distribution, metabolism, excretion, and toxicity, ADMET (Sakkiah et al., 2010, 2011b) was applied to select the compounds having less toxicity, should not cross the blood–brain barrier (BBB), good solubility, and absorption. The ADMET functionality estimates the values of BBB penetration, solubility, cytochrome P450 (CYP450) 2D6 inhibition, hepatotoxicity, human intestinal adsorption (HIA), plasma protein binding (PPB) and access a broad range of ligand toxicity measures. This approach is based on the assumption that compounds resembling known drugs are more likely to possess desirable biological properties such as low toxicity, high oral absorption and permeability, resistance to metabolic degradation, and the absence of rapid excretion. Finally, Prepare ligand module was used to eliminate the duplicate structures, as well as to generate the possible tautomer’s and isomers.
Virtual screening using BM and pharmacophore models
Initially, the best BM was used as an input for virtual screening to select compounds from the drug-like database. BM screening retrieved compounds from the drug-like database with similar molecular fragments present in highly active SIRT1 activators. The screened compounds which had the similar molecular fragment were further validated by mapping to best qualitative pharmacophore model to identify whether these compounds had the important chemical features found in highly active SIRT1 activators.
Density functional theory
There is growing evidence that DFT provides an accurate description of the electronic and structural properties of small molecules by computing the electronic structure of matter. The energy of SIRT1 activators was calculated using Calculate Energy module by combining the quantum mechanics (QM) and molecular mechanics (MM) force field (Sherwood et al., 2003). DFT calculated the QM–MM single point energies and geometry optimization minimizations using Dmol3 as the quantum server with CHARMM force field (Momany and Rone, 1992). This protocol simulated the systems by dividing the input into two regions, central and outer regions, which were treated by QM and MM methods. It also calculated the electronic orbital properties for a molecule including the highest occupied atomic orbital (HOMO) and lowest occupied atomic orbital (LUMO). Bayesian training set molecules were optimized by applying the QM–MM at Becke exchange plus Lee–Yang–Parr correlation (BLYP) hybrid DFT. The optimized molecules were used to calculate the HOMO and LUMO energy values.