Abstract
Background
Osteopontin (Eta, secreted sialoprotein 1, opn) is secreted from different cell types including cancer cells. Three splice variant forms namely osteopontin-a, osteopontin-b and osteopontin-c have been identified. The main astonishing feature is that osteopontin-c is found to be elevated in almost all types of cancer cells. This was the vital point to consider it for sequence analysis and structure predictions which provide ample chances for prognostic, therapeutic and preventive cancer research.
Methods
Osteopontin-c gene sequence was determined from Breast Cancer sample and was translated to protein sequence. It was then analyzed using various software and web tools for binding pockets, docking and druggability analysis. Due to the lack of homological templates, tertiary structure was predicted using ab-initio method server – I-TASSER and was evaluated after refinement using web tools. Refined structure was compared with known bone sialoprotein electron microscopic structure and docked with CD44 for binding analysis and binding pockets were identified for drug designing.
Results
Signal sequence of about sixteen amino acid residues was identified using signal sequence prediction servers. Due to the absence of known structures of similar proteins, three dimensional structure of osteopontin-c was predicted using I-TASSER server. The predicted structure was refined with the help of SUMMA server and was validated using SAVES server. Molecular dynamic analysis was carried out using GROMACS software. The final model was built and was used for docking with CD44. Druggable pockets were identified using pocket energies.
Conclusions
The tertiary structure of osteopontin-c was predicted successfully using the ab-initio method and the predictions showed that osteopontin-c is of fibrous nature comparable to firbronectin. Docking studies showed the significant similarities of QSAET motif in the interaction of CD44 and osteopontins between the normal and splice variant forms of osteopontins and binding pockets analyses revealed several pockets which paved the way to the identification of a druggable pocket.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Background
Cancer results from alterations that disrupt the appropriate controls and balances that direct normal cellular growth and development. These changes resulting in altered gene products or altered gene expression can occur in two classes of genes that interact with each other: genes that inhibit tumor suppressor genes and genes that facilitate cell growth and development [1]. Malignant tumors are characterized by dysregulated growth control, the overcoming of replicative senescence and the formation of metastases. Several growth factors and cytokines play pivotal roles in the regulation of proliferation, survival, adhesion and migration of neoplastic cells [2]. Decades of scrutiny into the molecular basis of cancer have largely focused on what causes oncogenic transformation and the incipient emergence of tumors [3]. The invasion of tumor cells is a complex, multistage process. To facilitate the cell motility, invading cells need to change the cell-cell adhesion properties, rearrange the extracellular matrix environment, suppress anoikis and recognize their cytoskeletons [4].
A biomarker is any substance, which when detected in biological samples or tissue, is associated with an increased risk of a disease. Serum biomarkers are produced by body organs or tumors and when detected in high amounts in the blood, can be suggestive of tumor activity. These markers are nonspecific for cancer and can be produced by normal organs as well. Most biomarkers are used infrequently for screening purposes. They are more often used to evaluate treatment effects or to assess the potential for metastatic disease in patients with established disease. Osteopontin (OPN) was identified as one such biomarker [5]. Osteopontin is a secreted glycoprotein that plays important roles in a wide range of biological processes, including tissue remodeling, inflammation, angiogenesis, tumor development and immunity to infectious disease [6]. Osteopontin also increases expression of HIF-1α through phosphatidyl inositol 3′–kinase/Acutely transforming retrovirus AKT8 in rodent T cell lymphoma (PI3-K/Akt) pathway [7].
The OPN is a 32.5-kDa multifunctional protein with multiple phosphorylation and glycosylation sites and contains an arginine-glycine-aspartic acid-binding (RGD) domain as well as two heparin-binding sites, one thrombin cleavage site (RSK [arginine-serine-lysine]) and a calcium-binding site. The protein functions as both a cell attachment protein and a cytokine that has a signaling function through the action of two cell adhesion molecules: αvβ3-integrin and CD44 [8]. It is also a tumor-associated protein, which mediates tumor transformation and malignant progression. OPN has been proposed to promote tumor progression through several mechanisms, including increased cell survival, migration, invasion, neovascularization, and modulation of immune function. The RGD domain of OPN functionally mediates cell adhesion, migration and invasion through integrin engagement. Interaction between the RGD domain of OPN and integrin receptors leads to Nuclear Factor-KappaB (NF-kB) and Focal adhesion kinase (FAK) actvation mainly through decreased apoptosis. These data indicate that the predominant mechanism, by which OPN promotes tumor growth and metastasis through the RGD domain, is enhancement of survival in the tumor microenvironment [9]. When OPN is cleaved at the RSK site by thrombin, it is separated into two approximately equivalent sized pieces, including N-terminal and C-terminal fragments. Thrombin is activated by tissue factor (TF) which is overexpressed on the surface of cancer cells. Both N-terminal and C-terminal fragments increases adhesion and migration of cancer cells through interaction with integrins and cyclophilin C respectively [10]. Enhanced OPN expression has been detected at the tumor site as well as in plasma and serum of patients with various types of cancers [11].
The existence in humans, of two osteopontin splice variants with deletions of exon 4 referred to as osteopontin-c or exon 5 called osteopontin-b and the normal osteopontin referred as osteopontin-a has been described by Young et al. [12]. Alternative splicing occurs in a region in a molecule that is upstream of the central integrin binding domain and the C-terminal CD44 binding domain. Interestingly, osteopontin-b expressed by transfection is unstable and the protein is degraded in the proteosome. In addition, osteopontin-b RNA is present at consistently low levels of expression in breast tissue specimens [13]. Osteopontin-a was found to be expressed in both normal and cancer cells to a lesser extent whereas osteopontin-c transcripts were never detected in the normal tissue samples but were present only in tumor cells [14]. The splice variant osteopontin-c, which does not contain the sequence encoded in exon-4, lacks an important domain for calcium induced aggregation and transglutamination. Lack of this domain forms the soluble form of the protein [15]. Among the three splice variants of osteopontin expressed in breast cancer, the shortest form, osteopontin-c, supports anchorage-independence more effectively than the full length form, osteopontin-a. Splice variant form, osteopontin-c, is brought about through the gain of function by the cancer cells, reflected in the activation of unique signal transduction pathways. Osteopontin-c coordinately induces oxidoreductase genes that are associated with the mitochondrial energy metabolism and with the hexose mono phosphate shunt [14].
Taken together, this growing list of studies suggests that osteopontin blood levels have a potential as a prognostic or diagnostic marker in prostate, breast, head and neck and other cancers. It should be noted, however, that osteopontin is unlikely to be a blood marker that is specific to cancer because osteopontin levels are also elevated in other conditions including sepsis, kidney disease and cardiovascular disease. But, the identification of the splice variant form of osteopontin-c solved this problem [14]. In order to study further about the role and function of osteopontin-c, the three dimensional structure might be useful, which is yet to be determined through x-ray crystallographic or NMR techniques. In this context, in silico structure prediction of osteopontin-c was carried out along with sequence analysis and docking studies.
Methods
Web based tools
The web based tools were used for the purpose of translation, similarity studies, tertiary structure prediction, model refinement, model evaluation, binding pockets prediction and docking. The list of websites along with web site addresses is shown in Table 1.
Sequence source
Osteopontin-c gene sequence was determined from breast cancer sample and it was translated to protein sequence using ExPASy Translate tool (http://web.expasy.org/translate/) [16].
Sequence analysis
Signal sequence of osteopontin-c was predicted using signalP [17] and PrediSi servers [18, 19]. Tertiary structure prediction was carried out using I-TASSER tool [20]. Critical Assessment of Techniques for Protein Structure Prediction (CASP) is a community-wide experiment for testing the state-of-the-art of protein structure predictions which takes place every two years since 1994. The I-TASSER server (as “Zhang-Server”) participated in the Server Section of 7th (2006), 8th (2008), and 9th CASPs (2010), and was ranked as the No 1 server in CASP7, CASP8 and CASP9. Thus, this server selected for tertiary structure prediction. The c-score is a confidence score for estimating the quality of predicted models by I-TASSER. It is calculated based on the significance of threading template alignments and the convergence parameters of the structure assembly simulations. The c-score is typically in the range of (-5, 2), where a c-score of higher value signifies a model with a high confidence and vice-versa [20]. The quality of the predicted structure was examined using an online metaserver SAVES, which uses Procheck [21], WhatCheck [22], Verify3D [23], ERRAT [24] and PROVE [25] servers. The predicted structure was refined using SUMMA server [26]. Molecular dynamic analysis was carried out using GROMACS (GROningen MAchine for Chemical Simulations) software [27, 28]. Structure visualization was carried out using Accelrys’ Discovery Studio Visualizer 1.7. Tertiary structure of osteopontin-a also predicted by I-TASSER server and subjected to other treatments as mentioned for osteopontin-c.
Determination of conserved regions (domains)
In order to determine conserved regions (domains) in osteopontin-c of human, it was aligned with rabbit, cattle, chicken, house mouse, Norway rat and water buffalo osteopontin sequences using clustalW [29] at http://www.genome.jp/tools/clustalw/. RSK and RGD domain comparison was achieved by using Discovery Studio Visualizer (Accelrys Discovery Studio Visualizer, version 1.7, 2007; Accelrys Software Inc., San Diego). Tertiary structures of thrombin cleaved fragments were also predicted by I-TASSER server. The C-terminal fragment of osteopontin-c was used for hypothetical polymer formation using ICM Molsoft tool. Six subunits were utilized for the formation of polymer formation using import option of the ICM Molsoft tool.
Docking
The predicted tertiary structures of osteopontin-a and osteopontin-c were docked with CD44 using Cluspro. ClusPro is the first fully integrated server that includes both docking and discrimination steps for predicting the structure of protein–protein complexes. The server can be used to discriminate a set of potential complex structures from several docking algorithms, or it can generate its own structures using DOT or ZDOCK [30].
Binding pockets predictions
PocketFinder and Q-Site finder were utilized for binding pocket predictions [31]. PocketFinder is based on the Ligsite algorithm written by Hendlich et al.[32] which was used to predict small molecule binding sites in proteins. Q-Site finder uses the interaction energy between the protein and a simple van der Waals probe to locate energetically favourable binding sites.
Druggable pocket predictions
DoGSiteScorer is an automated pocket detection and analysis tool which can be used for protein druggability assessment. Based on the three dimensional coordinates of a protein, its potential active sites on the protein surface are calculated with DoGSiteScorer. DoGSiteScorer is a grid-based function prediction method which uses a difference of Gaussian filter to detect potential pockets on the protein surface and splits them into subpockets. Subsequently, global properties, describing the size, shape and chemical features of the predicted pockets are calculated. Per default, a simple score is provided for each pocket, based on a linear combination on the three descriptors describing volume, hydrophobicity and enclosure. For the discrimination of the druggability, a subset of meaningful descriptors is used in a support vector maschine (libsvm). The druggability model was trained and tested on the druggable cavity directory dataset consisting of 1069 structures and yielded prediction accuracies of 88%. For each queried input structure, a druggability score between zero and one is returned. The higher the score the more druggable the pocket is estimated to be [33–35].
Results and discussion
Sequence analysis
Osteopontin-c gene sequence was determined from breast cancer sample and deposited to Genbank with ID JF412667. With the ExPASy Translate tool, a peptide sequence was deduced, consisting of 287 amino acid residues. This sequence was 100% identical to the protein sequence in GenPept database (NP_001035149.1). Both the signal prediction tools namely SignalP and PrediSi indicated the presence of a potential signal peptide in osteopontin-c protein. Signal sequence prediction servers predicted an N-terminal cleavage site between 16th and 17th amino acid residues of osteopontin-c sequence. After predicting the signal sequence, first 16 amino acid residues were identified as signal peptide and were removed from osteopontin-c sequence. The remaining protein sequence was utilized for tertiary structure prediction because during protein folding under in-vivo condition, the signal sequence is removed.
X-ray crystallography, NMR and cryo-electron microscopic studies were used in wet-lab for three dimensional structure predictions of proteins [36]. NMR studies carried out to predict three dimensional structure of only RGD tripeptide sequence of osteopontin [37]. Prediction of protein structure from amino-acid sequences has been one of the most challenging problems in computational structural biology for many years. Historically, protein structure prediction was classified into three categories: (i) comparative modeling, (ii) threading, and (iii) ab-initio folding. The first two approaches build protein models by aligning query sequences onto solved template structures. When close templates are identified, high-resolution models could be built by the template-based methods. If templates are absent from the Protein Data Bank (PDB) library, the models need to be built from scratch, i.e. ab-initio folding. This is the most difficult category of protein structure prediction. Such difficult task was attained using I-TASSER tool [20]. Tertiary structure of osteopontin-c was predicted using I-TASSER Server [38]. Predicted tertiary structure is shown in Figure 1. I-TASSER result provided five models with different c-scores, cluster density and number of decoys as shown in Table 2.
Predicted model-2 contains the highest c-score and also showed high similarity with electron microscopic structure of bone sialoprotein (BSP), which belongs to same Small integrin-binding ligand N-linked glycoproteins (SIBLINGs) protein family. Both the Predicted model-2 and bone sialoprotein were found to have thread with globular domain structure. Electron crystallography is a form of microscopy that uses a beam of electrons to construct images of small solids such as proteins. This process is used to determine and predict the structure and arrangement of a protein from secondary structure crystals such as alpha helices or beta sheets based on electron scattering. By electron crystallography method BSP structure determined. The BSP is a monomer possessing a globular structure with a diameter of 10 ± 1 nm that is linked to a thread-like structure of 25 ± 6 nm length. The globule is likely to correspond to the C-terminal part and the threadlike structure to N-terminal part of the protein [39].
Small integrin-binding ligand N-linked glycoproteins (SIBLINGs), a family of five integrin binding glycophosphoproteins comprising osteopontin (OPN), bone sialoprotein (BSP), dentin matrix protein 1 (DMP1), dentin sialophosphoprotein (DSPP) and matrix extracellular phosphoglycoprotein (MEPE), are an emerging group of molecular tools that cancer cells use to facilitate their expansion. SIBLINGs are soluble, secreted proteins that can act as modulators of cell adhesion as well as autocrine and paracrine factors by their interaction with cell surface receptors such as integrins. BSP and OPN are two members of the SIBLING family of genetically related proteins that are clustered on human chromosome 4. These two proteins have several common binding partners like CD44, integrins, matrix metalloproteinases (MMP), and complement factor H (CFH). Because of that, they had common interaction domains like RGD and in turn structure [40, 41].
Predicted tertiary structure of osteopontin-c had three domains, namely N-terminal domain, central domain and C-terminal domain. RGD and RSK motifs and two helical regions and three turns were present in central domain. N-terminal end domain consists of four antiparallel sheets, two helical regions and five turns. C-terminal end domain consists of one sheet, one helical region and one turn. Earlier hypothetical structure for osteopontin was predicted by Ganss. It was an open extended and flexible structure. Model-2 of I-TASSER result supported the proposal of Ganss [42].
Model-2 was refined using SUMMA Server. The predicted structure was refined by fixing side chains, fixing problematic loops, removal of amino acid clashes (bumps) and energy minimization. Potential functions used in structure prediction and refinements are typically grouped into two general classes: traditional “physical” molecular mechanics potentials and statistically derived 'knowledge-based” potentials [26]. The refinements did not yield any drastic change in the initial predicted structure augmenting the correctness of the predicted structure, which was confirmed by superimposition studies.
Refined structure and predicted structure were superimposed using Accelry’s Discovery Studio Visualizer 1.7 software. RMSD value for the superimposition was found to be 0.92 A°. The superimposed structure is shown in Figure 2. It was found that the structure of osteopontin-c was similar in structure to fibronectin which was determined by Amit sharma et al., [43]. Fibronectin (FN) is a large glycoprotein found on cell surfaces, in the connective tissue matrix and in extracellular fluids. It participates in cell adhesion, spreading, migration, extracellular matrix formation, hemostasis and thrombosis. FN binds to fibrin, collagens, gelatin, DNA, integrins, heparin and proteoglycans. Due to the common interacting partners both have similar structures [43].
SAVES validation server results showed that refined structure was slightly improved from the predicted structures. Comparisons of validation server results are listed in the Table 3. Lesser values in ProCheck, WhatCheck and PROVE servers result of refined structure showed that the structure quality was improved. Verify 3D values were found to be insignificant. Higher value in ERRAT result also supported the structure refinement process. Molecular dynamics (MD) is a computer simulation of physical movements of atoms and molecules [44]. Molecular dynamic simulation with explicit waters for 10 nanoseconds was assessed by GROMACS software [45]. The root-mean-square deviation (RMSD) variations for the molecular dynamic simulation are given in the Figure 3. It clearly showed that osteopontin-c had average RMSD deviation of 2.79 A°. Anyway, the refined structure was found to be an improved one. But one cannot reject predicted structure of osteopontin-c because the structure does not showed drastic variation in tertiary structure during molecular dynamic simulations. RMSD values below 5–6 A° are generally considered being characteristic for a stable protein in molecular dynamics simulations [46].
Determination of conserved regions (domains)
The amino acid sequences of osteopontin were derived from human [47], mouse [48], rat [49], rabbit [50], Water buffalo [51] and cattle [52]. Multiple sequence alignment was carried out using ClustalW program at Eurobean Bioinformatic Institute. Based on sequence alignments, the amino acid sequence was divided into nine parts which were represented in Figure 4. Of these nine parts, only five parts have known functions i.e. PolyD – Binds with calcium, RGD – Integrin binding site, GLRS – Thrombin cleavage site, Eighth part – Calcium Binding site, and Ninth part- Heparin Binding site. Of these conserved regions, Poly D (4th), GRGDS (6th) and GLRS (7th) regions are well known for their functions [53]. Human and Rabbit sequences were found to have 64% similarity whereas human and chicken were found to have only 21% similarity score in multiple sequence alignment. Multiple sequence alignment is shown in Figure 5. Phylogenetic relationships between these organisms are shown in Figure 6. Distinct differences were found to be present between human and chicken which could reflect functional and developmental differences between chicken and mammalian osteopontins [54].
Predicted osteopontin-a and osteopontin-c contain both RGD and RSK motifs which can be seen in Figure 7. Difference in the role of osteopontin-c from osteopontin-a in cancer can be clearly seen from Figure 7 that is the exposure of RSK domain. Due to the exposure of RSK domain, thrombin can easily cleave osteopontin-c into two fragments N-terminal and C-terminal fragments. The fragments were then subjected to tertiary structure prediction. Predicted structures of N-terminal and C-terminal fragments are shown in Figure 8. Predicted structure showed totally different structure when compared with the intact osteopontin-a and osteopontin-c. This might be the essential reason for the role of osteopontin-c in cancer progression and metastasis. C-terminal fragments’ polymer might form channel like structures which is shown in Figure 8. Presence of RGD and RSK motifs in osteopontin were reported by several studies [55, 56]. Thrombin cleaves between R and S residues of RSK sequence [55, 56]. This cleavage occurs within six amino acid residues of the GRGDS sequence, raising the interesting possibility that thrombin-cleavage further activates osteopontin by allowing greater accessibility of the GRGDS domain to cell surface receptors [57–60]. Xuan et al.[61] have shown that thrombin cleavage of osteopontin abolishes its cell binding function. The N-terminal GRGDS-containing osteopontin thrombin-cleavage fragment is highly active in promoting tumor cell migration [62]. The N-terminal fragment contains two integrin binding sites. These integrin binding domains include a SVVYGLR domain and a well-characterized RGD domain [63, 64]. Furthermore, a recent study by Mi et al. [65] demonstrated that the thrombin-cleaved COOH-terminal fragment of osteopontin can activate downstream signaling and influence breast cancer cell migration and invasion in-vitro. The COOH-terminal fragment of osteopontin binds with another marker of metastatic function (cyclophilin C or rotamase) to the CD147 cell surface glycoprotein (also known as extracellular matrix metalloproteinase inducer or EMMPRIN), to activate Akt1/2 and matrix metalloproteinase-2 [65, 66].
Previous studies in bovine osteopontin clearly demonstrated that gln-x-gln sequence is required for transglutaminase activity of cross linking osteopontin through gamma-glutamyl-epsilon-lysino peptidyl bond formation [67]. Multiple sequence alignment in Figure 9 clearly showed that the transglutaminase acting site is not present in osteopontin-c. The probable site is present between 46th and 52nd residues (PDPSQKQ) in osteopontin-a. The absence of this region in osteopontin-c clearly showed that transglutaminase will not act on it. This might be the essential reason for the pathological role of osteopontin-c and it may not be crosslinked with extracellular matrix and thus it results in cell migration. This fact was also supported by the lack of exon-4 expression in osteopontin-c by Weber [15].
Docking
Interestingly, OPN binding to CD44 results in the propagation of cytosolic signals that enhance integrin activation and thus migration in cancer cells. Binding of OPN with CD44 actively promote local proteolysis through binding with MMP3 and also activate complement factor H (CF-H) which protects cancer cell from complement mediated lysis. Both integrins and CD44 have well-established roles in tumor progression. Therefore, interfering with these receptor–ligand interactions by controlling receptor cell surface expression, blocking receptor–ligand binding or suppressing associated signal transduction are promising ways to block both tumor development and metastatic dissemination [40]. Thus, CD44 is selected as a potential receptor for docking studies. Docking analysis was carried out using CD44 as receptor and osteopontin types osteopontin-a and osteopontin-c as ligands. Docking results were analyzed manually and interaction sites were analyzed using Discovery Studio Visualizer 1.7 neighbour analysis tool. The results of ClusPro docking studies between CD44 and osteopontin-a and osteopontin-c showed for the first time that both of them interact in a similar site. Interaction of osteopontin to CD44 was proved by many studies [39, 68] but none of the previous studies determined the interaction sites. Docked structures of CD44 and osteopontins and the interaction domains of CD44 and osteopontins were shown in Figure 10. It was identified that aspargine (233rd residue), serine (234th residue) and threonine (237th residue) residues were essential for interactions in osteopontin isoform sequences. These residues might form the QSAET motif essential for the interaction of CD44 and osteopontins which is reported first time in the present study. Of these three residues, serine and threonine were found to be highly conserved in various species as was evident from the multiple sequence alingment. As the outcome, it was found out that serine and threonine residues are essential for interaction with CD44. This result supports the result of serine (234th residue) residue being glycosylated as a post translational modification in human osteopontin [69].
Binding pockets prediction
Binding pockets prediction is an essential step towards drug designing and docking studies [70]. Predicted binding pockets are shown in Figure 11. Pocket Finder detected ten pockets in both osteopontin-a and osteopontin-c. Eighth pocket of osteopontin-a was found to be present in the close region of RSK motif whereas no pocket contains RGD motif. First pocket of osteopontin-c was found to contain both RSK and RGD motifs. This result again confirms the role of osteopontin-c in cancer biology with respect to RSK and RGD motifs. Q-site finder predicted ten binding pockets from both osteopontin-a and osteopontin-c. Osteopontin-a has two pockets, namely, sixth and seventh, which contain RSK and RGD motifs in its outer layer, respecively. Osteopontin-c was found to have second and seventh pockets with RGD and RSK motifs, respectively. From all these results, the main novelty identified was that osteopontin-c contains binding pockets with highest rank with RGD and RSK motif, whereas osteopontin-a does not. Osteopontin promotes tumor growth and metastasis through the RGD domain [9].
Druggable pocket identification
Pockets are ranked according to interaction energy, and it is assumed that these relate to locations where a putative ligand could bind and optimise its van der Waals interaction energy. The total energy of the pocket defines its ability to bind a small molecule and therefore its druggability [71]. Druggable pockets differ from binding pockets because druggable pocket was predicted using the druggable cavity directory dataset of drugs. A druggable pocket in osteopontin-c was predicted by DoGSiteScorer [34] and manual inspection of the pocket with their energies with the help of Swiss PDB Viewer 3.7 tool. It was found that the Q-site finder predicted eighth pocket with APSD (170th to 173rd residues) and TSQLD (184th to 188th residues) motifs can be used as a drug target in osteopontin-c due to the presence of pocket with maximum energy and druggable score. Amino acid residues of this predicted pocket was found to be conserved which was proved by multiple sequence analysis [Figure 5]. Elevated expression of osteopontin-c has been found in many cancers and the level of its expression is associated with the metastatic potential of cancer. Thus targeting osteopontin using the druggable pocket would be a logical approach for cancer management [72].
Conclusion
The key finding of the present study is the discovery, for the first time, of the binding site of CD44 in osteopontin-c which has aspargine (233rd residue), serine (234th residue) and threonine (237th residue) residues. During the course of the study, a novel druggable pocket with APSD and TSQLD motifs was also found, which will be useful for future computer aided drug designing studies. Another important finding is that the RSK sequence is exposed to thrombin in osteopontin-c splice variant only as evidenced by the predicted tertiary structure, which explains the fact that only osteopontin-c is involved in metastasis. Due to the action of thrombin, osteopontin-c is fragmented into N-terminal and C-terminal fragments easily. Hypothetical proposal for the formation of channel like conformation by osteopontin-c was achieved with help of C-terminal fragment and favors cancer cell migration and metastasis. Absence of “PDPSQKQ” sequence in osteopontin-c avoids full length protein polymerization by transglutaminase-2 and favors metastasis. On the other hand, full length osteopontin-a is polymerized by transglutaminase-2 and thus, osteopontin-a cannot help metastasis because polymer favors cell adhesion. Obviously, experimental elucidation might be useful for further validation of real time tertiary structure of osteopontin-c. Until then, the present predicted structure might be used for computational drug design for osteopontin-c with respect to prevention of cancer.
References
Berns EM, Foekens JA, van Putten WL, van Staveren IL, Portengen H, De Koning WC, Klijn JG: Prognostic factors in human primary breast cancer: comparison of c-myc and HER2/neu amplification. J Steroid Biochem Mol Biol 1992, 43(1–3):13–19.
Rangaswami H, Bulbule A, Kundu GC: Osteopontin: role in cell signaling and cancer progression. TRENDS Cell Biol 2006, 16(2):79–87. 10.1016/j.tcb.2005.12.005
Gupta GP, Massague J: Cancer metastatsis: building a framework. Cell 2006, 127: 679–695. 10.1016/j.cell.2006.11.001
Woodhouse EC, Chuaqui RF, Liotta LA: General mechanisms of metastasis. Cancer 1997, 80: 1529–1537. 10.1002/(SICI)1097-0142(19971015)80:8+<1529::AID-CNCR2>3.0.CO;2-F
Rodrigues LR, Teixeira JA, Schmitt FL, Paulsson M, Lindmark-Mansson H: The role of osteopontin in tumor progression and metastasis in breast cancer. Cancer Epidemiol Biomarkers Prev 2007, 16(6):1087–1097. 10.1158/1055-9965.EPI-06-1008
Wai PY, Kuo PC: Osteopontin: regulation in tumor metastasis. Cancer Metastasis Rev 2008, 27: 103–118. 10.1007/s10555-007-9104-9
Song G, Cai QF, Mao YB, Ming YL, Bao SD, Ouyang GL: Osteopontin promotes ovarian cancer progression and cell survival and increases HIF-1alpha expression through the PI3-K/Akt pathway. Cancer Sci 2008, 99(10):1901–1907.
Song G, Ouyang G, Mao Y, Ming Y, Bao S, Hu T: Osteopontin promotes gastric cancer metastasis by augmenting cell survival and invasion through Akt-mediated HIF-1alpha up-regulation and MMP9 activation. J Cell Mol Med 2009, 13: 1706–1718. 10.1111/j.1582-4934.2008.00540.x
Courter D, Cao H, Kwok S, Kong C, Banh A, Kuo P, Bouley DM, Vice C, Brustugun OT, Denko NC, Koong AC, Giaccia A, Le Q-T: The RGD domain of human osteopontin promotes tumor growth and metastasis through activation of survival pathways. PLoS One 2010, 5: e9633. 10.1371/journal.pone.0009633
Beausoleil MS, Schulze EB, Goodale D, Postenka CO, Allan AL: Deletion of the thrombin cleavage domain of osteopontin mediates breast cancer cell adhesion, proteolytic activity, tumorgenicity, and metastasis. BMC Cancer 2011, 11: 25. 10.1186/1471-2407-11-25
Chakraborty G, Jain S, Behera R, Ahmed M, Sharma P, Kumar V, Kundu GC: The multifaceted roles of osteopontin in cell signaling, tumor progression and angiogenesis. Curr Mol Me 2006, 6(8):819–830. 10.2174/156652406779010803
Young MF, Kerr JM, Termine JD, Wewer UM, Wang MG, McBride OW, Fisher LW: cDNA cloning, mRNA distribution and heterogeneity, chromosomal location, and RFLP analysis of human osteopontin (OPN). Genomics 1990, 7: 491–502. 10.1016/0888-7543(90)90191-V
Mirza M, Shaughnessy E, Hurley JK, Vanpatten KA, Pestano GA, He B, Weber GF: Osteopontin-c is a selective marker of breast cancer. Int J Cancer 2008, 122(4):889–897. 10.1002/ijc.23204
He B, Mirza M, Weber GF: An osteopontin splice variant induces anchorage independence in human breast cancer cells. Oncogene 2006, 25(15):2192–2202. 10.1038/sj.onc.1209248
Weber GF: Molecular mechanisms of metastasis. Cancer Lett 2008, 270: 181–190. 10.1016/j.canlet.2008.04.030
Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, Bairoch A: ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res 2003, 31(13):3784–3788. 10.1093/nar/gkg563
Bendtsen JD, Nielsen H, von Heijne G, Brunak S: Improved prediction of signal peptides: signalP 3.0. J Mol Biol 2004, 340: 783–795. 10.1016/j.jmb.2004.05.028
Nielsen H, Engelbrecht J, Brunak S, von Heijne G: Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng 1997, 10: 1–6. 10.1093/protein/10.1.1
Vijayaraj N, Elasri MO: Structure and function predictions of Msa protein in staphylococcus aureus . BMC Bioinforma 2007, 8(suppl 7):S5. 10.1186/1471-2105-8-S7-S5
Wu S, Skolnick J, Zhang Y: Ab-initio modeling of small proteins by iterative TASSER simulations. BMC Biol 2007, 5: 17. 10.1186/1741-7007-5-17
Laskoswki RA, MacArthur MW, Moss DS, Thorton JM: PROCHECK: a program to check the stereo-chemical quality of protein structures. J Appl Cryst 1993, 26: 283–291. 10.1107/S0021889892009944
Hooft RWW, Vriend G, Sander C, Abola EE: Errors in protein structures. Nature 1996, 381: 272–272.
Luthy R, Bowie JU, Eisenberg D: Assessment of protein models with three-dimensional profiles. Nature 1992, 356: 83–85. 10.1038/356083a0
Colovos C, Yeates TO: Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci 1993, 2: 1511–1519. 10.1002/pro.5560020916
Pontius J, Richelle J, Wodak SJ: Deviations from standard atomic volumes as a quality measure for protein crystal structures. J Mol Biol 1996, 264: 121–136. 10.1006/jmbi.1996.0628
Summa CM, Levitt M: Near native structure refinement using in vacuo energy minimization. Proc Natl Acad Sci USA 2007, 104(9):3177–3182. 10.1073/pnas.0611593104
Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC: GROMACS: fast, flexible, and free. J Comput Chem 2005, 26: 1701–1718. 10.1002/jcc.20291
Hess B, Kutzner C, van der Spoel D, Lindahl E: GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput 2008, 4: 435–447. 10.1021/ct700301q
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG: ClustalW and ClustalX version 2. Bioinformatics 2007, 23(21):2947–2948. 10.1093/bioinformatics/btm404
Comeau SR, Gatchell DW, Vajda S, Camacho CJ: ClusPro: an automated docking and discrimination method for the prediction of protein complexes. Bioinformatics 2004, 20(1):45–50. 10.1093/bioinformatics/btg371
Alasdair TR, Jackson L, Jackson RM: Q-SiteFinder: an energy-based method for the prediction of protein–ligand binding sites. Bioinformatics 2005, 21(9):1908–1916. 10.1093/bioinformatics/bti315
Hendlich M, Rippmann F, Barnickel G: LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins. J Mol Graph Model 1997, 15(6):359–363. 10.1016/S1093-3263(98)00002-3
Volkamer A, Griewel A, Grombacher T, Rarey M: Analyzing the topology of active sites: on the prediction of pockets and subpockets. J Chem Inf Model 2010, 50(11):2041–2052. 10.1021/ci100241y
Volkamer A, Kuhn D, Grombacher T, Rippmann F, Rarey M: Combining global and local measures for structure-based druggability predictions. J Chem Inf Model 2012, 52(2):360–372. 10.1021/ci200454v
Schmidtke P, Barril X: Understanding and predicting druggability. A high-throughput method for detection of drug binding sites. J Med Chem 2010, 53(15):5858–5867. 10.1021/jm100574m
Columbus L, Peti W, Etezady-Esfarjani T, Herrmann T, Wuthrich K: NMR structure determination of the conserved hypothetical protein TM1816 from Thermotoga maritima. Proteins 2005, 60(3):552–557. 10.1002/prot.20465
Pettersson E, Luning B, Mickos H, Heinegard D: Synthesis, NMR and function of an O-phosphorylated peptide, comprising the RGD-adhesion sequence of osteopontin. Acta Chem Scand 1991, 45(6):604–608.
Zhang Y: I-TASSER server for protein 3D structure prediction. BMC Bioinforma 2008, 9: 40. 10.1186/1471-2105-9-40
Wuttke M, Muller S, Nitsche DP, Paulsson M, Hanisch FG, Maurer P: Structural characterization of human recombinant and bone-derived bone sialoprotein. J Biol Chem 2001, 276(39):36839–36848. 10.1074/jbc.M105689200
Bellahcene A, Castronovo V, Ogbureke KU, Fisher LW, Fedarko NS: Small integrin-binding ligand N-linked glycoproteins (SIBLINGs): multifunctional proteins in cancer. Nat Rev Cancer 2008, 8(3):212–226. 10.1038/nrc2345
Fisher LW, Torchia DA, Fohr B, Young MF, Fedarko NS: Flexible structures of SIBLING proteins, bone sialoprotein, and osteopontin. Biochem Biophys Res Commun 2001, 280(2):460–465. 10.1006/bbrc.2000.4146
Ganss B, Kim RH, Sodek J: Bone sialoprotein. Crit Rev Oral Biol Med 1999, 10: 79–98. 10.1177/10454411990100010401
Sharma A, Askari JA, Humphries MJ, Jones EY, Stuart DI: Crystal structure of a heparin- and integrin-binding segment of human fibronectin. EMBO J 1999, 18(6):1468–1479. 10.1093/emboj/18.6.1468
Zhong S, Chen X, Zhu X, Dziegielewska B, Bachman KE, Ellenberger T, Ballin JD, Wilson GM, Tomkinson AE, MacKerell AD: Identification and validation of human DNA ligase inhibitors using computer-aided drug design. J Med Chem 2008, 51(15):4553–4562. 10.1021/jm8001668
Dibrov A, Myal Y, Leygue E: Computational modelling of protein interactions: energy minimization for the refinement and scoring of association decoys. Acta Biotheor 2009, 57(4):419–428. 10.1007/s10441-009-9085-x
Gyimesi G, Ramachandran S, Kota P, Dokholyan NV, Sarkadi B, Hegedus T: ATP hydrolysis at one of the two sites in ABC transporters initiates transport related conformational transitions. Biochimica et Biophysica Acta (BBA) – Biomembranes 2011, 1808(12):2954–2964. 10.1016/j.bbamem.2011.07.038
Dai J, Peng L, Fan K, Wang H, Wei R, Ji G, Cai J, Lu B, Li B, Zhang D, Kang Y, Tan M, Qian W, Guo Y: Osteopontin induces angiogenesis through activation of PI3K/AKT and ERK1/2 in endothelial cells. Oncogene 2009, 28(38):3412–3422. 10.1038/onc.2009.189
Grassinger J, Haylock DN, Storan MJ, Haines GO, Williams B, Whitty GA, Vinson AR, Be CL, Li S, Sorensen ES, Tam PP, Denhardt DT, Sheppard D, Choong PF, Nilsson SK: Thrombin-cleaved osteopontin regulates hemopoietic stem and progenitor cell functions through interactions with alpha9beta1 and alpha4beta1 integrins. Blood 2009, 114(1):49–59. 10.1182/blood-2009-01-197988
Hirata A, Sugahara T, Nakamura H: Localization of runx2, osterix, and osteopontin in tooth root formation in rat molars. J Histochem Cytochem 2009, 57(4):397–403.
Nasu K, Ishida T, Setoguchi M, Higuchi Y, Akizuki S, Yamamoto S: Expression of wild-type and mutated rabbit osteopontin in Escherichia coli, and their effects on adhesion and migration of P388D1 cells. Biochem J 1995, 307(PT1):257–265.
Tantia MS, Mishra B, Bharani Kumar ST, Mishra BP, Kataria RS, Mukesh M, Vijh RK: Characterization of osteopontin gene of Bubalus bubalis . Animal 2008, 2: 987–990.
Yamniuk AP, Burling H, Vogel HJ: Thermodynamic characterization of the interactions between the immunoregulatory proteins osteopontin and lactoferrin. Mol Immunol 2009, 46(11–12):2395–2402.
Sodek J, Ganss B, Mckee MD: Osteopontin. Crit Rev Oral Biol Med 2000, 11(3):279–303. 10.1177/10454411000110030101
Thayer JM, Schoenwolf GC: Early expression of osteopontin in the chick is restricted to rhombomeres 5 and 6 and to a subpopulation of neural crest cells that arise from these segments. Anat Rec 1998, 250: 199–209. 10.1002/(SICI)1097-0185(199802)250:2<199::AID-AR10>3.0.CO;2-J
Bautista DS, Jian-wu XS, Charulata HS, Chambers FA, Harrisson JF: Inhibition of Arg-Gly-Asp (RGD)-mediated cell adhesion to osteopontin by a monoclonal antibody against osteopontin. J Biol Chem 1994, 269(37):23280–23285.
Schulze EB, Hedley BD, Goodale D, Postenka CO, Al-Katib W, Tuck AB, Chambers AF, Allan AL: The thrombin inhibitor Argatroban reduces breast cancer malignancy and metastasis via osteopontin-dependent and osteopontin-independent mechanisms. Breast Cancer Res Treat 2008, 112: 243–254. 10.1007/s10549-007-9865-4
Senger DR, Perruzzi CA, Papadopoulos-Sergiou A, Van De Water L: Adhesive properties of osteopontin: regulation by a naturally occurring thrombin-cleavage in close proximity to the GRGDS cell-binding domain. Mol Biol Cell 1994, 5: 565–574. 10.1091/mbc.5.5.565
Senger DR, Perruzzi CA, Papadopoulos A: Elevated expression of secreted phosphoprotein I (osteopontin, 2ar) as a consequence of neoplastic transformation. Anticancer Res 1989, 9(5):1291–1300.
Senger DR, Perruzzi CA, Papadopoulos A, Tenen DG: Purification of a human milk protein closely similar to tumor-secreted phosphoproteins and osteopontin. Biochim Biophys Acta 1989, 996(1–2):43–48.
Denhardt DT, Noda M, O’Regan AW, Pavlin D, Berman JS: Osteopontin as a means to cope with environmental insults: regulation of inflammation, tissue remodeling, and cell survival. J Clin Invest 2001, 107: 1055–1061. 10.1172/JCI12980
Xuan JW, Hota C, Chambers AF: Recombinant GSThuman osteopontin fusion protein is functional in RGDdependent cell adhesion. J Cell Biochern 1994, 64: 247–255.
Senger RD, Perruzzi AC: Cell migration promoted by a potent GRGDS-containing thrombin-cleavage fragment of osteopontin. Biochim Biophys Acta 1996, 1314: 13–24. 10.1016/S0167-4889(96)00067-5
Yokosaki Y, Matsuura N, Sasaki T, Murakami I, Schneider H, Higashiyama S, Saitoht Y, Yamakido M, Taooka Y, Sheppard D: The integrin alpha9beta1 binds to a novel recognition sequence (SVVYGLR) in the thrombin-cleaved amino-terminal fragment of osteopontin. J Biol Chem 1999, 274: 36328–36334. 10.1074/jbc.274.51.36328
Shigeyuki K, Yasuyuki Y, Mmasahiro M, Tatsuya S, Yuko H, Hiroe T, Rashid MM, Morimoto J, Inobe M, Shijubo N, Chambers AF, Uede T: Mapping of functional epitopes of osteopontin by monoclonal antibodies raised against defined internal sequences. J Cell Biochem 2002, 84: 420–432. 10.1002/jcb.10039
Mi Z, Oliver T, Guo H, Gao C, Kuo PC: Thrombin-cleaved COOH-terminal osteopontin peptide binds with cyclophilin C to CD147 in murine breast cancer cells. Cancer Res 2007, 67(9):4088–4097. 10.1158/0008-5472.CAN-06-4066
Yurchenko V, Constant S, Bukrinsky M: Dealing with the family: CD147 interactions with cyclophilins. Immunology 2006, 117: 301–309. 10.1111/j.1365-2567.2005.02316.x
Sorensen ES, Rasmussen LK, Moller L, Jensen PH, Hojrup P, Petersen TE: Localization of transglutaminase-reactive glutamine residues in bovine osteopontin. Biochem J 1994, 304(Pt 1):13–16.
Smith LL, Greenfield BW, Aruffo A, Giachelli CM: CD44 Is Not an adhesive receptor for osteopontin. J Cell Biochem 1999, 73: 20–30. 10.1002/(SICI)1097-4644(19990401)73:1<20::AID-JCB3>3.0.CO;2-3
Brian C, Nielsen MS, Haselmann KF, Petersen TE, Sorensen ES: Post-translationally modified residues of native human osteopontin are located in clusters: identification of 36 phosphorylation and five O-glycosylation sites and their biological implications. Biochem J 2005, 390: 285–292. 10.1042/BJ20050341
Cherezov V, Liu W, Derrick JP, Luan B, Aksimentiev A, Katritch V, Caffrey M: In meso crystal structure and docking simulations suggest an alternative proteoglycan binding site in the OpcA outer membrane adhesin. Proteins 2008, 71: 24–34. 10.1002/prot.21841
Fuller JC, Burgoyne NJ, Jackson RM: Predicting druggable binding sites at the protein–protein interface. Drug Discov Today 2009, 14(3–4):155–161.
Ahmed M, Behera R, Chakraborty G, Jain S, Kumar V, Sharma P, Bulbule A, Kale S, Kumar S, Mishra R, Raja R, Saraswati S, Kaur R, Soundararajan G, Kumar D, Thorat D, Sanyal M, Ramdasi A, Ghosh P, Kundu GC: Osteopontin: a potentially important therapeutic target in cancer. Expert Opin Ther Targets 2011, 15: 1113–1126. 10.1517/14728222.2011.594438
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
Authors SS and SND, carried out sequence analysis and structure prediction studies, participated in docking and molecular dynamic analysis and drafted the manuscript. Both authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Sivakumar, S., Niranjali Devaraj, S. Tertiary structure prediction and identification of druggable pocket in the cancer biomarker – Osteopontin-c. J Diabetes Metab Disord 13, 13 (2014). https://doi.org/10.1186/2251-6581-13-13
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/2251-6581-13-13