Combining spatial and chemical information for clustering pharmacophores

Zhou, Lingxiao; Griffith, Renate; Gaeta, Bruno

doi:10.1186/1471-2105-15-S16-S5

Research
Open access
Published: 08 December 2014

Volume 15, article number S5, (2014)

BMC Bioinformatics

Lingxiao Zhou¹,
Renate Griffith² &
Bruno Gaeta¹

Abstract

Background

A pharmacophore model consists of a group of chemical features arranged in three-dimensional space that can be used to represent the biological activities of the described molecules. Clustering of molecular interactions of ligands on the basis of their pharmacophore similarity provides an approach for investigating how diverse ligands can bind to a specific receptor site or different receptor sites with similar or dissimilar binding affinities. However, efficient clustering of pharmacophore models in three-dimensional space is currently a challenge.

Results

We have developed a pharmacophore-assisted Iterative Closest Point (ICP) method that groups pharmacophores in a manner relevant to their biochemical properties, such as binding specificity. The implementation takes pharmacophore files as input and produces distance matrices. The method integrates both alignment-dependent and alignment-independent concepts.

Conclusions

We applied our three-dimensional pharmacophore clustering method to two sets of experimental data: 31 globulin-binding steroids and 4 groups of selected antibody-antigen complexes. Results were translated from distance matrices to Newick format and visualised as dendrograms. For the steroid dataset, the resulting classification of ligands shows good correspondence with existing classifications. For the antibody-antigen dataset, the classification reflects both antigen type and binding antibody. Overall, the method runs quickly and classifies the data in agreement with their binding affinities or antigens.

Background

Pharmacophore methods are widely used in drug discovery research projects [1]. As defined in the International Union of Pure and Applied Chemistry (IUPAC) glossary of terms [2], a pharmacophore describes chemical features and their spatial arrangement in active molecules and targets involved in specific biochemical interactions. Several software tools provide solutions for pharmacophore modelling and generation, including Accelrys Discovery Studio [3], LigandScout [4] and ZINCPharmer [5].

Pairwise comparison of pharmacophores requires defining a similarity metric. Generally, there are two categories of similarity measurements: alignment-dependent methods and alignment-independent methods [6]. Alignment-independent methods usually target binary fingerprint descriptors, such as 3-point pharmacophore fingerprints [7] or 4-point pharmacophore fingerprints [8]. They calculate similarities with measurements such as the Tanimoto similarity (also called the Jaccard index, as it was originally introduced by Paul Jaccard [9]). Alignment-dependent methods [6] are, in most cases, based on shape or shape plus pharmacophore similarity measurements. Superimposition or overlays are widely used in this category of methods. However, chemical information is typically not involved in the shape-based methods. The OpenEye [10] colour-Tanimoto is an exception: it sums overlaps using customised pharmacophore features. However, this requires painstaking manual definition of the target features.
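As a brief illustration of the alignment-independent similarity mentioned above, the following minimal sketch computes the Tanimoto (Jaccard) similarity between two binary fingerprints. The representation of a fingerprint as a set of "on" bit indices, the function name `tanimoto` and the example bit values are assumptions made here for illustration; they are not taken from any of the cited tools.

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto (Jaccard) similarity of two fingerprints given as sets of 'on' bit indices."""
    if not fp_a and not fp_b:
        return 1.0  # convention assumed here: two empty fingerprints count as identical
    shared = len(fp_a & fp_b)
    # |A and B| / |A or B|, with |A or B| = |A| + |B| - |A and B|
    return shared / (len(fp_a) + len(fp_b) - shared)


# Hypothetical fingerprints: 2 shared bits out of 5 distinct bits.
fp_1 = {1, 4, 7, 9}
fp_2 = {1, 4, 8}
print(tanimoto(fp_1, fp_2))
```

A distance suitable for clustering can then be taken as one minus this similarity.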

For grouping pharmacophores at a quantitative level, it is important to find an optimal partition method. Cluster analysis, or clustering, aims to separate data into groups or clusters based on pairwise distances, so that similar objects are grouped together more closely than dissimilar objects. A clustering activity involves several fundamental steps, including data extraction, similarity measurement, clustering and validation [11]. In cheminformatics applications, hierarchical clustering is one of the most popular approaches. The group average method (GA) and Ward's method [12] are two examples of hierarchical methods. Partition evaluation is a significant step in judging a clustering method. If the clustering method is applied to a benchmark dataset of known classification, then validation methods such as the Rand index and the adjusted Rand index [13] can be applied to compare the results of the clustering method with the benchmark classification. Otherwise, unsupervised evaluation measures such as the Davies-Bouldin index [14] can be used.
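To make the two benchmark-based measures concrete, here is a minimal sketch of the Rand index and the adjusted Rand index computed from pair counts, following the usual contingency-table formulation. The function name `rand_indices` and the toy labels are assumptions for illustration; the sketch assumes at least two items.

```python
from collections import Counter
from math import comb


def rand_indices(labels_a, labels_b):
    """Return (Rand index, adjusted Rand index) for two labelings of the same items."""
    n = len(labels_a)
    pairs = comb(n, 2)  # total number of item pairs (assumes n >= 2)
    # Pairs placed together in both labelings, in labeling A, and in labeling B.
    together_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    together_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    together_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    # Agreements = pairs together in both + pairs separated in both.
    rand = (pairs + 2 * together_both - together_a - together_b) / pairs
    expected = together_a * together_b / pairs
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return rand, 1.0  # degenerate case, e.g. every item in one cluster in both labelings
    return rand, (together_both - expected) / (maximum - expected)


# Toy example: a clustering compared with a benchmark labeling.
benchmark = [1, 1, 2, 2]
clustering = [1, 2, 1, 2]
print(rand_indices(benchmark, clustering))
```

The adjusted index corrects for chance agreement, so it can be negative when a clustering agrees with the benchmark less often than random labeling would.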

We present here a pharmacophore-aided Iterative Closest Point (ICP) clustering method for grouping pharmacophores using both their structural and chemical information. In this paper, Discovery Studio Modelling Environment (http://accelrys.com), release 3.5 or 4.0, is used to generate the pharmacophores. Discovery Studio defines six features from which to construct pharmacophore models: Hydrogen bond acceptor, Hydrogen bond donor, Hydrophobic, Positive ionisable (from Catalyst's definition, a "Group that is, or can be, positively charged at physiological pH,") [3], Negative ionisable (from Catalyst's definition, a "Group that is, or can be, negatively charged at physiological pH,") [3] and Ring aromatic (from Catalyst's definition, a "Five- or six-membered aromatic ring (vector)") [3]. A computer vision method, Iterative Closest Point (ICP) [15], is employed to calculate pharmacophore structural distances, and a greedy alignment method is applied to measure the chemical distance. These two distance measures are then combined prior to hierarchical clustering. The method is evaluated relative to existing methods using two sets of experimental data. The results demonstrate that the proposed method is not only of benefit for classification of pharmacophores, but also has the potential to facilitate research in the field of antibody-antigen interactions.

Methods

Data preparation and pharmacophore generation

Two experimental data sets were used in testing. The first set of 31 globulin-binding steroids (Figure 1) was introduced by Cramer et al. [17]. In recent years, this dataset has been studied using a range of clustering methods and descriptors [16–19]. We compare our proposed method to a previous study [16] that used four-point pharmacophores as molecular descriptors.

Antibody-antigen binding is known to be highly specific [20]. Pharmacophores, by definition [2], can describe features involved in the interaction between compounds and target. Therefore, our second evaluation involves classifying pharmacophores generated from antibody-antigen complexes. The complexes were obtained from the Protein Data Bank [21], and information about the antibodies and antigens was gathered from SACS [22], an online self-maintaining database. After applying the selection criteria (human-sourced antibody-antigen complexes), 207 entries were selected and aligned by Clustal Omega [23]. To simplify the evaluation, 41 complexes were selected, corresponding to 3 differently named antibodies (17B, 2F5 and Anti-HIV V3 FAB 2557) and 2 types of antigens (GP120 and GP41) (see Additional file 1). However, Discovery Studio does not accept compounds of over 1000 atoms, or proteins, as ligands. Therefore, each of the large (over 1000 atoms) protein antigens had to be cut into several parts and saved in molecule format (SD file format). The cutting was based on the potential contact surface on the antigens. The potential contact surfaces were determined by finding the amino acids neighbouring the antibody chains (distance equal to or less than 2.5 Å).
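The contact-surface selection described above can be sketched as a simple distance filter. This is only an illustration under stated assumptions: the text does not say how the distance was measured, so the sketch assumes an atom-to-atom distance; the function name `contact_residues`, the tuple layout and the residue labels are hypothetical and the coordinates are toy values.

```python
from math import dist


def contact_residues(antigen_atoms, antibody_atoms, cutoff=2.5):
    """Return antigen residue ids having at least one atom within `cutoff` Å of an antibody atom.

    antigen_atoms: iterable of (residue_id, (x, y, z)) tuples (assumed layout)
    antibody_atoms: iterable of (x, y, z) tuples
    """
    antibody_atoms = list(antibody_atoms)
    contacts = set()
    for residue_id, xyz in antigen_atoms:
        if residue_id in contacts:
            continue  # residue already known to be in contact
        if any(dist(xyz, ab) <= cutoff for ab in antibody_atoms):
            contacts.add(residue_id)
    return sorted(contacts)


# Toy coordinates: only residue "R1" has an atom within 2.5 Å of the antibody.
antigen = [("R1", (0.0, 0.0, 0.0)), ("R1", (1.5, 0.0, 0.0)), ("R2", (10.0, 0.0, 0.0))]
antibody = [(2.0, 0.0, 0.0), (0.0, 3.0, 0.0)]
print(contact_residues(antigen, antibody))
```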

Discovery Studio Modelling Environment, release 4.0, generates the pharmacophores as *.chm files. Several protocols were employed for generating the pharmacophores. The autopharmacophore generation protocol selects pharmacophores using a Genetic Function Approximation (GFA) model [24]. This protocol aims to generate pharmacophore models from a single input molecule. Thirty-one pharmacophores were generated using this protocol. The pharmacophore details for the globulin-binding steroids have been recorded (see Additional file 2). For protein-ligand interactions, the GFA model as coded in the receptor-ligand pharmacophore generation protocol was used to produce structure-based pharmacophore models. Antibody-small molecule complexes and antibody-protein parts were processed using this protocol. The details of the 41 antibody-antigen complex pharmacophores are listed in the table in Additional file 3. In this table, partial pharmacophores for large protein antigens were combined. The combination details are explained in the next section.

Parsing pharmacophore files

The pharmacophore files produced by Discovery Studio include information such as the name, coordinates, vector and tolerance of the pharmacophore features. For our method, a set of Perl scripts was written to parse the pharmacophore files in a series of steps. Structural and chemical information was extracted from the pharmacophore files. To simplify the calculation, some vector features, such as hydrogen bond donors and hydrogen bond acceptors, were represented as one point. The coordinates of this point were given by the centroid of the vector. Some statistics for each pharmacophore model were calculated and recorded, including the names of the features in each model and a count for each feature type. In the final stage of the parsing, the centroids of all pharmacophore models were normalized to (0,0,0), and new coordinates were calculated.
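The coordinate handling described above can be sketched as follows. This is not the authors' Perl code: it is a minimal Python illustration that assumes a vector feature is given by two end points whose midpoint serves as its centroid, and the names `vector_feature_point`, `center_on_origin` and `feature_counts` are hypothetical.

```python
from collections import Counter


def vector_feature_point(head, tail):
    """Collapse a vector feature (e.g. an H-bond donor) to the midpoint of its two end points."""
    return tuple((h + t) / 2 for h, t in zip(head, tail))


def center_on_origin(points):
    """Translate 3D feature points so that their centroid lies at (0, 0, 0)."""
    n = len(points)
    centroid = tuple(sum(p[i] for p in points) / n for i in range(3))
    return [tuple(p[i] - centroid[i] for i in range(3)) for p in points]


def feature_counts(feature_names):
    """Count how many features of each type a pharmacophore model contains."""
    return Counter(feature_names)


# Toy model with two features.
donor = vector_feature_point((0.0, 0.0, 0.0), (2.0, 4.0, 6.0))
model = center_on_origin([donor, (3.0, 4.0, 5.0)])
print(model, feature_counts(["HBD", "HYD"]))
```

Centring every model on the origin gives the subsequent alignment a common starting position.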

ICP based structural distance calculation

The clustering was implemented in Matlab using the Iterative Closest Point (ICP) algorithm. ICP [15, 25] is a method for minimizing the sum of squared distances between two sets of points. It is widely used in the fields of computer vision and robot navigation. The following is a summary of the ICP algorithm we implemented. It uses the 3D structural information of two pharmacophores p and q to generate a rotation matrix R and a translation vector T.

For k = 1 to k_max

1. Selection and matching: build a k-d tree [26] and find the closest neighbour pairs with a KNN search.
2. Rejection: if matches to edge vertices or worst matches are detected, reject those point pairs.
3. Weighting: weight the matched points by the compatibility of their normals:
   $W = n_{p} \cdot n_{q}$ (1)
4. Error minimization: calculate R with singular value decomposition (SVD) [27]:
   $R = V U^{T}$ (2)
   and calculate T:
   $T = \bar{q} - R \bar{p}$ (3)
5. Assign and apply the transformation.

End for
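For readers unfamiliar with step 4, the following sketch shows where equations (2) and (3) come from in the standard SVD solution of the weighted least-squares rigid alignment problem. The cross-covariance matrix $H$, the per-pair weights $w_i$ and the error function $E$ are notation introduced here for illustration; the text above does not define them, so this should be read as the textbook construction rather than a description of the exact implementation.

```latex
% Weighted centroids of the matched point pairs (p_i, q_i)
\bar{p} = \frac{\sum_i w_i \, p_i}{\sum_i w_i}, \qquad
\bar{q} = \frac{\sum_i w_i \, q_i}{\sum_i w_i}

% Error metric minimised in each iteration
E(R, T) = \sum_i w_i \, \lVert q_i - (R \, p_i + T) \rVert^{2}

% Cross-covariance of the centred points and its SVD
H = \sum_i w_i \, (p_i - \bar{p}) (q_i - \bar{q})^{T} = U \Sigma V^{T}

% Minimiser of E, matching equations (2) and (3)
R = V U^{T}, \qquad T = \bar{q} - R \, \bar{p}
```

If $\det(V U^{T}) = -1$ the result is a reflection rather than a rotation; the customary remedy is to flip the sign of the last column of $V$ before forming $R$.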

Figure 2 demonstrates this implementation by applying the ICP algorithm to our antibody-antigen dataset. Blue points represent the template set; the green and red points represent the second set, with the green points showing the initial pharmacophore locations and the red points showing them after application of the transformation.

The structural distance between the two pharmacophores was calculated using the root-mean-square deviation (RMSD). RMSD values were normalized by dividing by the maximum distance. Finally, an N×N structural distance matrix was produced, where N is the number of pharmacophore models.
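A minimal sketch of this distance calculation is given below. It makes two assumptions that the text does not spell out: each transformed point is paired with its closest template point (found here by brute force rather than with a k-d tree), and "maximum distance" is read as the largest RMSD in the matrix. The function names are hypothetical.

```python
from math import dist, sqrt


def rmsd_to_closest(moved, template):
    """RMSD between each transformed point and its closest point in the template set."""
    squared = [min(dist(p, q) for q in template) ** 2 for p in moved]
    return sqrt(sum(squared) / len(squared))


def normalise_by_max(matrix):
    """Divide every entry by the largest entry so that all values fall in [0, 1]."""
    largest = max(max(row) for row in matrix)
    if largest == 0:
        return [list(row) for row in matrix]  # all distances are zero; nothing to scale
    return [[value / largest for value in row] for row in matrix]


# Toy example: every moved point sits exactly 1 unit from its closest template point.
moved = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
template = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
print(rmsd_to_closest(moved, template))
print(normalise_by_max([[0.0, 2.0], [2.0, 0.0]]))
```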

Greedy alignment-based chemical distance calculation

The second significant part of the method is to compute a chemical distance matrix. A greedy alignment method was introduced to compare the chemical differences between pharmacophore models. This alignment approach was coded in Matlab like the ICP algorithm. In this method, a pharmacophore scoring matrix, as used in the Pharmacophore Alignment Search Tool (PhAST) [28], played an important role. The procedure of the greedy alignment is as follows. Let us consider two pharmacophore lists {p_i} (pharmacophores 1) and {q_j} (pharmacophores 2). n is the number of features in {p_i} and m is the number of features in {q_j}.

1. Find common features from both groups and remove them.
2. Find the "best-unmatched" features (the feature pair with the lowest dissimilarity score):
   a. Remove them.
   b. Increase the penalty score.
3. Calculate gaps (|n-m|):
   a. Increase the penalty score.

The chemical distance matrix was calculated for each possible pair of pharmacophores. The matrix was then normalized by the maximum value of the gap penalty (by dividing each value in the matrix by the gap penalty * max(n, m)). A gap penalty score of 14 per position was used in the calculation, as in the PhAST method [28].
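The greedy alignment and its normalization can be sketched as below. The gap penalty of 14 comes from the text, but the dissimilarity scores are placeholders invented for this illustration and are NOT the PhAST scoring matrix; the feature abbreviations and the function names are likewise hypothetical. The sketch is in Python rather than the Matlab used for the original implementation.

```python
from collections import Counter

GAP_PENALTY = 14  # per-position gap penalty quoted in the text

# Placeholder scores for illustration only; these are not the PhAST values.
DEFAULT_MISMATCH = 7
MISMATCH = {frozenset(("HBA", "HBD")): 4}


def dissimilarity(a, b):
    """Look up the (placeholder) dissimilarity score of two different feature types."""
    return MISMATCH.get(frozenset((a, b)), DEFAULT_MISMATCH)


def chemical_distance(features_p, features_q):
    """Greedy chemical distance between two lists of feature-type names, scaled to [0, 1]."""
    n, m = len(features_p), len(features_q)
    if max(n, m) == 0:
        return 0.0
    # Step 1: remove the features common to both lists (multiset intersection).
    count_p, count_q = Counter(features_p), Counter(features_q)
    common = count_p & count_q
    rest_p = list((count_p - common).elements())
    rest_q = list((count_q - common).elements())
    penalty = 0
    # Step 2: repeatedly pair the remaining features with the lowest dissimilarity.
    while rest_p and rest_q:
        a, b = min(((x, y) for x in rest_p for y in rest_q),
                   key=lambda pair: dissimilarity(*pair))
        rest_p.remove(a)
        rest_q.remove(b)
        penalty += dissimilarity(a, b)
    # Step 3: the features still left over are the |n - m| gaps.
    penalty += GAP_PENALTY * abs(n - m)
    # Normalise by the largest possible penalty, gap penalty * max(n, m).
    return penalty / (GAP_PENALTY * max(n, m))


print(chemical_distance(["HBA", "HYD"], ["HBA", "HYD"]))
print(chemical_distance(["HBA"], ["HBA", "HYD"]))
```

Because steps 1 and 2 always remove one feature from each list, the number of features left for step 3 is exactly |n-m|, which is why the gap term can be computed directly from the list lengths.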

Combined distance matrix

In the final step of the method, the structural distance matrix and the chemical distance matrix were integrated to form a mixed distance matrix. The combined matrix includes a geometric term S and a chemical term C:

$D = \lambda S + (1 - \lambda) C$
(4)

In equation (4), λ can be adjusted to change the weights of 3D and chemical data. The workflow for the complete procedure can be found in Figure 3.
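Equation (4) amounts to an element-wise weighted sum of the two normalized matrices. The short sketch below illustrates this with plain nested lists; the function name `combine`, the default weight of 0.5 and the toy matrices are assumptions for illustration, since the text does not state which value of λ was used.

```python
def combine(structural, chemical, lam=0.5):
    """Element-wise D = lam * S + (1 - lam) * C for two equally sized distance matrices."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie between 0 and 1")
    return [[lam * s + (1 - lam) * c for s, c in zip(row_s, row_c)]
            for row_s, row_c in zip(structural, chemical)]


# Toy 2x2 matrices: lam = 1 keeps only the 3D term, lam = 0 only the chemical term.
S = [[0.0, 0.2], [0.2, 0.0]]
C = [[0.0, 0.6], [0.6, 0.0]]
print(combine(S, C, lam=0.5))
```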

Results

Globulin-binding steroids

After applying our clustering method, a 31*31 distance matrix was generated. The tree (Figure 4) was created using T-REX [29] from the combined matrix and using the neighbour joining method. This tree was compared with trees produced from the same dataset by two other methods [16]. One of the trees (Figure 5) was generated with the group average method [30], and the other one (Figure 6) was derived using Ward's method [12].
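To show how a distance matrix can become a Newick string, the sketch below builds a tree by group-average (UPGMA) agglomeration. Note the substitution: the tree in this study was produced by T-REX with neighbour joining, which is not reproduced here; group-average clustering is used only because it is short to write down. The function name `upgma_newick` and the toy matrix are hypothetical, and the output encodes topology only, without branch lengths.

```python
def upgma_newick(names, dist):
    """Group-average clustering of a symmetric distance matrix, returned as a Newick string."""
    # Active clusters: id -> (newick text, number of leaves)
    clusters = {i: (name, 1) for i, name in enumerate(names)}
    # Pairwise distances keyed by (smaller id, larger id)
    d = {(i, j): dist[i][j]
         for i in range(len(names)) for j in range(i + 1, len(names))}
    next_id = len(names)
    while len(clusters) > 1:
        (a, b), _ = min(d.items(), key=lambda item: item[1])  # closest pair
        (text_a, size_a), (text_b, size_b) = clusters.pop(a), clusters.pop(b)
        # Distance from the merged cluster to each other cluster: size-weighted mean.
        merged = {}
        for c in clusters:
            d_ac = d[(min(a, c), max(a, c))]
            d_bc = d[(min(b, c), max(b, c))]
            merged[(c, next_id)] = (size_a * d_ac + size_b * d_bc) / (size_a + size_b)
        d = {key: value for key, value in d.items() if a not in key and b not in key}
        d.update(merged)
        clusters[next_id] = (f"({text_a},{text_b})", size_a + size_b)
        next_id += 1
    (text, _), = clusters.values()
    return text + ";"


# Toy matrix: A and B are closest, so they are joined first.
names = ["A", "B", "C"]
matrix = [[0, 1, 4],
          [1, 0, 5],
          [4, 5, 0]]
print(upgma_newick(names, matrix))
```

The resulting string can be loaded into any Newick-aware tree viewer to draw a dendrogram.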

For further comparison, a table of binding affinity information for the 31 molecules from the literature [31] is provided as a gold standard to evaluate all three methods (Table 1). The 31 molecules were divided into two groups based on this corticosteroid-binding globulin (CBG) binding affinity data, group 1 (CBG < -6.2) and group 2 (CBG > -6.2), to provide a reference clustering (Table 2). For the clusterings produced by Rodriguez et al. [16] and the clustering produced by our method, the 31 compounds were also labeled based on the clustering results (Table 2). These clusterings were then compared to the reference CBG clustering using the Rand Index and adjusted Rand Index methods [13]. The evaluation results are shown in Table 3. All methods performed equally well in recreating the benchmark clustering.

Table 1 Binding affinities of the 31 globulin binding steroids [31].

Table 2 Group labeling for 31 globulin binding steroids.

Table 3 Evaluation of different clustering methods for 31 globulin binding steroids.

Antibody-antigen complex dataset

In this section, the ICP-based pharmacophore-aided method was applied to classify 4 groups of pharmacophores. The clustering method generated a 41*41 distance matrix. T-REX translated the distance matrix into a dendrogram (Figure 7).

To evaluate the result, we categorised the 41 complexes into two groups based on their antigens, as a benchmark clustering. Results from the new method were clustered into 4 groups (Figure 7 and Additional file 1). There were two large clusters G1 (antigen GP41), G2 (antigen GP120). Complexes 3D0L and 3D0V were misclassified, so we labelled them as G3 (3D0L) and G4 (3D0V). These two classifications were compared using the Rand Index and Adjusted Rand Index. The results (Table 4) demonstrate an excellent agreement between the two classifications.

Table 4 Evaluation of 3D plus chemical clustering method for antibody-antigen complexes.

Discussion

In the dataset of 31 steroid compounds, some pairs have been reported that should be grouped closely together [16]: (21, 26), (7, 30) and (19, 29), which differ only by a methyl group. Molecules 5 and 16 differ only by the stereochemistry of one centre on the A ring. Comparison of Figures 4, 5 and 6 demonstrates that all three methods successfully grouped those reported pairs. The special structures of compounds 4 and 31 led to a misclassification (they were placed in the group with binding affinity < -6.2) in all three methods. Molecules 21 and 26 were incorrectly clustered as an exceptional cluster by our new method. With the exception of those molecules (21 and 26), the group average method, Ward's method and our method all produced trees with the same two superclasses. The methods of Rodriguez et al. and the new method have the same Rand Index value and a very close adjusted Rand Index. Additionally, all Rand Index and adjusted Rand Index scores are above the threshold for a 'good' clustering (0.5 for Rand Index, 0 for adjusted Rand Index).

Considering the application of the proposed method to the 41 antibody-antigen complexes, the pharmacophores were generally classified into two large superclusters based on their antigens. One supercluster included all complexes with GP41 or a GP41 analog as antigen. The second supercluster contained all the complexes with GP120 or one of its fragments as antigen. The classification did more than identify the antigens: within each supercluster, pharmacophores also formed clusters corresponding to their binding antibody (e.g. G1 with 17B as antibody and G4 with ANTI-HIV-1 V3 FAB 2557). Additionally, the Rand Index and adjusted Rand Index were both very high, which indicates that the ICP-aided method performed well in clustering. In addition, some interesting structural and chemical features highlighted by other researchers could be identified in the results. In complex 1U8H, the Glu662 substitution has been reported to involve a water network rearrangement, and thus this complex is structurally different from the other 1U8* complexes [32]. This can be seen in the unexpected position of 1U8H in a clustering based solely on the 3D distance calculated using ICP (Figure 8). In the same paper, 1U8L was reported to have chemical differences from the other 1U8* complexes. This can be seen in a dendrogram based solely on chemical distances (Figure 9). However, when 3D and chemical distances were combined, 1U8L and 1U8H were correctly clustered with other complexes with similar antigens (Figure 10).

Conclusions

A method combining a structural distance based on ICP and a "chemical" distance has been developed and has been demonstrated to successfully partition pharmacophores based on the types of antigens in a set of antibody/antigen complexes or on binding affinity in a set of steroids. In addition, the method is fast: the comparison of 41 pharmacophores took only around 30 seconds on a desktop computer (Apple iMac, 2.7 GHz Intel Core i5, 8 GB memory). However, the method requires the numbers of pharmacophores being compared to be similar, and it was less accurate when the ratio Max(Number_of_Pharmacophores)/Min(Number_of_Pharmacophores) was larger than 2.

References

1. Leach AR, Gillet VJ, Lewis RA, Taylor R: Three-Dimensional Pharmacophore Methods in Drug Discovery. J Med Chem. 2010, 53 (2): 539-558. 10.1021/jm900817u.
2. Wermuth G, Ganellin CR, Lindberg P, Mitscher LA: Glossary of terms used in medicinal chemistry (IUPAC Recommendations 1998). Pure Appl Chem. 1998, 70 (5): 1129-1143.
3. Sutter J, Li JB, Maynard AJ, Goupil A, Luu T, Nadassy K: New Features that Improve the Pharmacophore Tools from Accelrys. Curr Comput-Aid Drug. 2011, 7 (3): 173-180. 10.2174/157340911796504305.
4. Wolber G, Dornhofer AA, Langer T: Efficient overlay of small organic molecules using 3D pharmacophores. Journal of computer-aided molecular design. 2006, 20 (12): 773-788.
5. Koes DR, Camacho CJ: ZINCPharmer: pharmacophore search of the ZINC database. Nucleic Acids Res. 2012, 40 (W1): W409-W414. 10.1093/nar/gks378.
6. MacCuish JD, MacCuish NE: Chemoinformatics applications of cluster analysis. Wires Comput Mol Sci. 2014, 4 (1): 34-48. 10.1002/wcms.1152.
7. Good AC, Kuntz ID: Investigating the Extension of Pairwise Distance Pharmacophore Measures to Triplet-Based Descriptors. Journal of computer-aided molecular design. 1995, 9 (4): 373-379. 10.1007/BF00125178.
8. Mason JS, Cheney DL: Library design and virtual screening using multiple 4-point pharmacophore fingerprints. Pac Symp Biocomput. 2000, 576-587.
9. Jaccard P: The distribution of the flora in the alpine zone. New Phytologist. 1912, 11 (2): 37-50. 10.1111/j.1469-8137.1912.tb05611.x.
10. Hawkins PCD, Skillman AG, Nicholls A: Comparison of shape-matching and docking as virtual screening tools. J Med Chem. 2007, 50 (1): 74-82. 10.1021/jm0603365.
11. Jain AK, Murty MN, Flynn PJ: Data clustering: A review. Acm Comput Surv. 1999, 31 (3): 264-323. 10.1145/331499.331504.
12. Ward JH: Hierarchical Grouping to Optimize an Objective Function. J Am Stat Assoc. 1963, 58 (301): 236-&. 10.1080/01621459.1963.10500845.
13. Hubert L, Arabie P: Comparing Partitions. J Classif. 1985, 2 (2-3): 193-218.
14. Davies DL, Bouldin DW: Cluster Separation Measure. Ieee T Pattern Anal. 1979, 1 (2): 224-227.
15. Besl PJ, Mckay ND: A Method for Registration of 3-D Shapes. Ieee T Pattern Anal. 1992, 14 (2): 239-256. 10.1109/34.121791.
16. Rodriguez A, Tomas MS, Perez JJ, Rubio-Martinez J: Assessment of the performance of cluster analysis grouping using pharmacophores as molecular descriptors. J Mol Struc-Theochem. 2005, 727 (1-3): 81-87. 10.1016/j.theochem.2005.02.030.
17. Cramer RD, Patterson DE, Bunce JD: Comparative Molecular-Field Analysis (Comfa) .1. Effect of Shape on Binding of Steroids to Carrier Proteins. J Am Chem Soc. 1988, 110 (18): 5959-5967. 10.1021/ja00226a005.
18. Wagener M, Sadowski J, Gasteiger J: Autocorrelation of Molecular-Surface Properties for Modeling Corticosteroid-Binding Globulin and Cytosolic Ah Receptor Activity by Neural Networks. J Am Chem Soc. 1995, 117 (29): 7769-7775. 10.1021/ja00134a023.
19. Bultinck P, Carbo-Dorca R: Molecular quantum similarity matrix based clustering of molecules using dendrograms. J Chem Inf Comp Sci. 2003, 43 (1): 170-177. 10.1021/ci025602b.
20. Ramos-Vara JA: Technical aspects of immunohistochemistry. Vet Pathol. 2005, 42 (4): 405-426. 10.1354/vp.42-4-405.
21. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28 (1): 235-242. 10.1093/nar/28.1.235.
22. Allcorn LC, Martin ACR: SACS - Self-maintaining database of antibody crystal structure information. Bioinformatics. 2002, 18 (1): 175-181. 10.1093/bioinformatics/18.1.175.
23. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li WZ, Lopez R, McWilliam H, Remmert M, Soding J: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011, 7:
24. Rogers D, Hopfinger AJ: Application of Genetic Function Approximation to Quantitative Structure-Activity-Relationships and Quantitative Structure-Property Relationships. J Chem Inf Comp Sci. 1994, 34 (4): 854-866. 10.1021/ci00020a020.
25. Chen Y, Medioni G: Object Modeling by Registration of Multiple Range Images. 1991 Ieee International Conference on Robotics and Automation. 1991, 1-3: 2724-2729.
26. Dandamudi SP, Sorenson PG: An Empirical Performance Comparison of Some Variations of the K-D Tree and Bd Tree. Int J Comput Inf Sci. 1985, 14 (3): 135-159. 10.1007/BF00991003.
27. Alter O, Brown PO, Botstein D: Singular value decomposition for genome-wide expression data processing and modeling. Proc Natl Acad Sci USA. 2000, 97 (18): 10101-10106. 10.1073/pnas.97.18.10101.
28. Hahnke V, Hofmann B, Grgat T, Proschak E, Steinhilber D, Schneider G: PhAST: pharmacophore alignment search tool. Journal of computational chemistry. 2009, 30 (5): 761-771. 10.1002/jcc.21095.
29. Alix B, Boubacar DA, Vladimir M: T-REX: a web server for inferring, validating and visualizing phylogenetic trees and networks. Nucleic Acids Res. 2012, 40 (W1): W573-W579. 10.1093/nar/gks485.
30. Lohse-Bossenz H, Kunina-Habenicht O, Kunter M: Estimating within-group agreement in small groups: A proposed adjustment for the average deviation index. Eur J Work Organ Psy. 2014, 23 (3): 456-468. 10.1080/1359432X.2012.748189.
31. Robert D, Amat L, Carbo-Dorca R: Three-dimensional quantitative structure-activity relationships from tuned molecular quantum similarity measures: Prediction of the corticosteroid-binding globulin binding affinity for a steroid family. J Chem Inf Comp Sci. 1999, 39 (2): 333-344. 10.1021/ci980410v.
32. Bryson S, Julien JP, Hynes RC, Pai EF: Crystallographic definition of the epitope promiscuity of the broadly neutralizing anti-human immunodeficiency virus type 1 antibody 2F5: vaccine design implications. J Virol. 2009, 83 (22): 11862-11875. 10.1128/JVI.01604-09.
33. Corper AL, Sohi MK, Bonagura VR, Steinitz M, Jefferis R, Feinstein A, Beale D, Taussig MJ, Sutton BJ: Structure of human IgM rheumatoid factor Fab bound to its autoantigen IgG Fc reveals a novel topology of antibody-antigen interaction. Nat Struct Biol. 1997, 4 (5): 374-381. 10.1038/nsb0597-374.
34. Ekiert DC, Bhabha G, Elsliger MA, Friesen RHE, Jongeneelen M, Throsby M, Goudsmit J, Wilson IA: Antibody Recognition of a Highly Conserved Influenza Virus Epitope. Science. 2009, 324 (5924): 246-251. 10.1126/science.1171491.

Acknowledgements

The publication costs for this article were funded by a grant from the School of Computer Science and Engineering, UNSW Australia.

This article has been published as part of BMC Bioinformatics Volume 15 Supplement 16, 2014: Thirteenth International Conference on Bioinformatics (InCoB2014): Bioinformatics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/15/S16.

Author information

Authors and Affiliations

School of Computer Science and Engineering, UNSW Australia, Sydney, NSW, Australia
Lingxiao Zhou & Bruno Gaeta
School of Medical Sciences/Pharmacology, UNSW Australia, Sydney, NSW, Australia
Renate Griffith

Corresponding author

Correspondence to Lingxiao Zhou.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

All authors read and approved the final manuscript.

Electronic supplementary material

12859_2014_6738_MOESM1_ESM.xls

Additional file 1: Antibody-antigen complexes. This table summarises antibody-antigen complexes used in this study with their cluster number as assigned by the ICP-based method. (XLS 43 KB)

Additional file 2: Number of pharmacophore features in the 31 globulin binding steroids used in this study (XLS 40 KB)

Additional file 3: Number of pharmacophore features in the 41 antibody-antigen complexes used in this study (XLS 32 KB)

About this article

Cite this article

Zhou, L., Griffith, R. & Gaeta, B. Combining spatial and chemical information for clustering pharmacophores. BMC Bioinformatics 15 (Suppl 16), S5 (2014). https://doi.org/10.1186/1471-2105-15-S16-S5
