Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets

Kumar, Kundan; Prakash, Amresh; Anjum, Farah; Islam, Asimul; Ahmad, Faizan; Hassan, Md. Imtaiyaz

doi:10.1007/s13205-014-0256-3

Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets

Original Article
Open access
Published: 17 October 2014

Volume 5, pages 561–576, (2015)
Cite this article

Download PDF

You have full access to this open access article

3 Biotech Aims and scope Submit manuscript

Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets

Download PDF

Kundan Kumar¹,
Amresh Prakash¹,
Farah Anjum²,
Asimul Islam¹,
Faizan Ahmad¹ &
…
Md. Imtaiyaz Hassan¹

3015 Accesses
18 Citations
Explore all metrics

Abstract

Candida dubliniensis is an emerging pathogenic yeast in humans and infections are usually restricted to mucosal parts of the body. However, its presence in specimens of immunocompromised individuals, especially in HIV-positive patients, is of major medical concern. There is a large fraction of genomes of C. dubliniensis in the database which are uncharacterized for their biochemical, biophysical, and/or cellular functions, and are identified as hypothetical proteins (HPs). Function annotation of Candida genome is, therefore, essentially required to facilitate the understanding of mechanisms of pathogenesis and biochemical pathways important for selecting novel therapeutic target. Here, we carried out an extensive analysis to explain the functional properties of genome, using available protein structure and function analysis tools. We successfully modeled the structures of eight HPs for which a template with moderate sequence similarity was available in the protein data bank. All modeled structures were analyzed and we found that these proteins may act as transporter, kinase, transferase, ketosteroid, isomerase, hydrolase, oxidoreductase, and binding targets for DNA and RNA. Since these unique HPs of Candida showed no homologs in humans, these proteins are expected to be a potential target for future antifungal therapy.

Functional annotation of Candida albicans hypothetical proteins: a bioinformatics approach

Article 23 February 2024

Deepika Tripathi, Arushi Kapoor, … Meenakshi Sharma

Novel and unique domains in aminoacyl-tRNA synthetases from human fungal pathogens Aspergillus niger, Candida albicans and Cryptococcus neoformans

Article Open access 05 December 2014

Manish Datt & Amit Sharma

Draft genome of a commonly misdiagnosed multidrug resistant pathogen Candida auris

Article Open access 07 September 2015

Sharanya Chatterjee, Shuba Varshini Alampalli, … Utpal S. Tatu

Introduction

In recent years, Candida dubliniensis has emerged as a major cause of candidal infection in humans, particularly in HIV-infected patients and other immunocompromised persons (Sullivan and Coleman 1998). Normally, this species of Candida is harmless, but it may become infectious under certain circumstances and is thus also called an opportunistic pathogen (Butler et al. 2009). In fact, C. dubliniensis is substantially less pathogenic than its closely related species C. albicans (Karkowska-Kuleta et al. 2009). However, a longer survival time and adaptability in the host induce virulence (Stokes et al. 2007). Furthermore, the genotypic and phenotypic closeness with C. albicans, misguides the medical practitioners, in routine laboratory diagnosis, and its treatment becomes tougher with their tendency to develop resistance against antifungal agents (Moran et al. 1997). In general, C. dubliniensis infection is restricted to oral mucosa, vagina and lungs. However, it may cause fatal systemic infection (Sullivan et al. 2005). Therefore, this specie is under investigation during fungal infection, especially in the condition of low immunity patient (Achkar and Fries 2010).

The complete genome of C. dubliniensis has been sequenced recently, and it consists of eight chromosomal contigs with 262288 reads of total size 14.6 Mb (Jackson et al. 2009). Extensive analysis of 5,860 open reading frames leads to identification of 1,273 hypothetical proteins (HPs), whose functions have not been determined so far (Galperin and Koonin 2004). Function determination of putative uncharacterized HPs for their possible biological activity has emerged as an important focus for computational biology (Kumar et al. 2014; Loewenstein et al. 2009; Shahbaaz et al. 2013). The primary approach is to assign a function to new genes based on sequence homology (Desler et al. 2009). Various HPs are found to be well conserved among organisms and are involved in important biochemical processes (Dutta et al. 2013). Further, the characterization of protein function in context of their sequence similarity is a primitive approach and may lead to ambiguous function annotation (Chiusano et al. 2008). In various cases, the evolution retains a conserved folding pattern despite having very poor sequence similarity (Chiusano et al. 2008). Therefore, the structural analysis of protein is essential to decipher their biochemical functions that could not be illustrated from sequence data alone (Ebihara et al. 2006; Hassan et al. 2008, 2013). Hence, the three-dimensional structure determination of HPs is an imperative task to illustrate the biological function at the molecular level (Shahbaaz et al. 2014; Sinha et al. 2014).

Recently, we annotated functions of several HPs from Candida dubliniensis (Kumar et al. 2014). To extend this study further, we modeled the structure of those proteins for which we got sufficient sequence similarity and coverage to further improve the function assignment. We have already been involved in structure-based drug design (Hassan et al. 2007a, b; Prakash et al. 2013; Thakur and Hassan 2011; Thakur et al. 2013) and hence we are looking for novel therapeutic targets (Shahbaaz et al. 2013). To validate a potential drug target, protein–ligand docking studies have been proven as one of the appropriate tools (Singh et al. 2014; Tasleem et al. 2014; Thakur et al. 2014; Totrov and Abagyan 2008). Hence, we docked few drug molecules with HPs as well. A precise annotation of HPs from C. dubliniensis may lead to the identification of new functions, and novel pharmacological targets for drug design, discovery and screen to cure the candidal infections.

Materials and methods

Sequence retrieval and analysis

Candida dubliniensis genome encodes 5,860 proteins, wherein functions are uncharacterized for 1,273 proteins and are termed as HPs (O’Connor et al. 2010). Sequences of HPs from C. dubliniensis were retrieved from UniProt database (http://www.uniprot.org/), and BLAST (Altschul et al. 1990) and PSI-BLAST (Altschul et al. 1997) searches were carried out to identify similar sequences with known structures and functions. We extensively analyzed functions using conserved domain database (CDD) (Marchler-Bauer et al. 2011), InterProScan (Quevillon et al. 2005) and superfamily databases (Marchler-Bauer et al. 2011). InterProScan (Quevillon et al. 2005) combines different protein signature recognition methods from the InterPro consortium for motif discovery. In a protein, the motifs are signatures of protein families and can be preferably used to define the protein function, particularly in enzyme where motifs are associated with the catalytic function (Bork and Koonin 1996). We also used SMART (Letunic et al. 2012), ScanProsite (de Castro et al. 2006; Gattiker et al. 2002), CATH (Orengo et al. 1997) and PANTHER (Thomas et al. 2003) to establish the evolutionary relationships and infer the functions of HPs. All bioinformatics tools and databases used for sequence and structure analysis are listed in the Table 1. FungalRV was used to identify adhesins character to describe the virulence factor for C. dubliniensis HPs. (Chaudhuri et al. 2011). This tool is based on the support vector machine (SVM) method and trained by a large number of compositional properties that are used to classify human pathogenic fungal adhesins and adhesins-like protein.

Table 1 List of bioinformatics tools and databases used for sequence and structure-based function annotation

Full size table

Structure prediction

Three-dimensional structures of HPs are determined using MODELLER (Eswar et al. 2006), SWISS-MODEL (Kiefer et al. 2009) and protein homology recognition engine (Phyre) (Kelley and Sternberg 2009). MODELLER uses the spatial restraints to predict structure, and restraints of the quarry sequence are generated using alignment to related structure. SWISS-MODEL predicts the structure of HP by assembly of rigid bodies, in which a model from a small number of rigid bodies is obtained from the aligned protein structures (Schwede et al. 2003). The Phyre server uses a library of known protein structures taken from the Structural Classification of Proteins (SCOP) database and the PDB (Kelley and Sternberg 2009). The top ten scoring alignments were used to construct the three-dimensional structure of each HP. All predicted models from three sources were subjected to energy minimization with CHARMM-22, using Discovery Studio 3.5 (Hirashima and Huang 2008). Side chains of the predicted model were refined by using side chain refinement protocol of the MODELLER of Discovery studio 3.5.

Structure validation

The stereochemical quality of the modeled structure for HPs was validated at structural analysis and with verification server (Laskowski et al. 1993) using PROCHECK (Laskowski et al. 1996). PROCHECK validates the stereochemical quality of a protein structure by analyzing the overall structure and residue-by-residue geometry of proteins. ProQ is a neural network-based predictor server used to define the correctness and quality of predicted structures (Wallner and Elofsson 2003). The model showed higher LG score and MaxSub score that were selected for further analysis. A validation report of each model is listed in Table 2.

Table 2 Validation report of all predicted models

Full size table

Structure analysis

Structure-based functional annotations of proteins are considered to be a more reliable approach than sequence-based function assignment, as the structures of homologous proteins are more conserved in evolution than sequences (Hassan and Ahmad 2011). We used various tools for the analysis of predicted structure of HPs. Results of Pocket-Finder (Laurie and Jackson 2005), information from literature for the selected templates and docking analyses were used to define the catalytic sites. The ProFunc server was used to identify the structural motifs associated with the biological functions (Laskowski et al. 2005). Furthermore, DALI server was used for the function prediction (Holm and Rosenstrom 2010). Docking simulation study was carried out with AutoDock 4.2 (Morris et al. 2009) using standard protocol. PASS method was used to define the active site of HPs, and the center of mass was calculated for the predicted catalytic cavity (Brady and Stouten 2000). The docking simulations end with multiple runs, and cluster analysis of ligands was performed with their corresponding docked energy. Docking solutions with ligand atom root mean square deviation (RMSD) within 2.0 Å of each other were clustered together and ranked by the lowest-energy representative (Maiorov and Crippen 1994). The lowest-energy solution of the lowest ligand all-atom RMSD cluster was accepted as the calculated binding energy. The top-posed docking conformations were obtained and post-docking energy minimization carried out with Discovery Studio 3.5. The resultant structure files were analyzed using PyMOL visualization programs (Lill and Danielson 2011).

Result and discussion

Functional annotation of HPs is essential for better understanding of biological processes at systems level and predicting the behavior of biological system for designing a predictive disease model (Mazandu and Mulder 2012). For this, various novel methods such as neural network model, support vector machine and hidden Markov model-based tools have been developed recently (Rashid et al. 2007). These methods are efficient, intelligent and can complement the classical homology search, to detect functional constraints on genome evolution. However, the characterization of protein function in the context of their structure is more intuitive, reliable and generally more applicable to extract the biochemical or enzymatic function (Haas et al. 2011). Here, we annotated the functions of HPs of C. dubliniensis based on sequence and structure analysis (Table 3). We found that these HPs possess transport, kinase, transferase, ketosteroid, isomerase, hydrolase, oxidoreductase activity and DNA- and RNA-binding properties. A detailed analysis of each structure is presented here separately.

Table 3 Function of HPs of C. dubliniensis

Full size table

HP B9WFH2

Sequence-based analysis suggests that HP B9WFH2 may act as a transporter protein (Table S1). Conserved domain analysis strongly suggests that this protein belongs to the major facilitator superfamily (MFS), has multi-domain for sugar transporter-like protein and may be involved in transport activity (Marchler-Bauer et al. 2011) (Table S2). HHpred also suggests high similarity with α-helical transmembrane protein glucose transporter (Soding et al. 2005) (Table S3). Motif search using InterProScan suggests that HP B9WFH2 sequence possesses a motif involved in sugar/inositol transporter (Quevillon et al. 2005). Based on these observations, we suggest that HP B9WFH2 may function as a transporter protein.

We predicted the three-dimensional structure of HP B9WFH2 using tool MODELLER, SWISS-MODEL and Phyre (Eswar et al. 2006; Kelley and Sternberg 2009; Kiefer et al. 2009). Analyses of these predicted structures indicate that the structure predicted by Phyre is the best among all three models with 83.60 % of the residues found in the allowed region of the Ramachandran (RC) plot (Laskowski et al. 1996) (Table 2). The RMSD of the model with respect to the template (PDB code: 4GBZ) is 2.419 Å, indicating similar functionality (Maiorov and Crippen 1994). The overall structure of HP B9WFH2 comprises 21 α-helices, of which 15 are transmembrane, 4 helices are intracellular and 2 are extracellular (Fig. 1a). Like other MFS transporter protein structure, the N- and C-terminal and seven-helix bundles come together to form a ‘V’-shaped transporter (Sun et al. 2012).

Docking result showed that residues His20, Lys21, Ser24, Asn351, His488 and Gln492 are involved in polar interaction with β-d-glucose (BGC) and hydroxyl group of BGC specifically recognized through the total seven H-bonds (Fig. 1b). The aromatic residues, Trp456 and Phe489, present in the vicinity of the BGC, are surrounded by Val71, Gln346, Asn461 and Leu532. These residues are significant for biological activity and contribute to the spatial fitting BGC (Sun et al. 2012). Hence, this motif is amenable to drug targeting. Structure alignment with the template showed that residues Gln346 and Asn351 are conserved in the model and important for interaction with BGC (Figure S1). The polar interaction between TMs and ICs residues regulated the passage of BGC. H-bond interaction results are consistent with glucose transporter protein (XylE), and residues of TMs (Glu190 and Arg477) and ICs (Ser264, Trp267, Asn580, Gln583, Tyr584, Glu585) are conserved in the protein of this family (Figure S2). It has been reported that mutation in these residues led to a substantial loss in the active uptake of D-xylose into the cell, implicating that intracellular helix domain has a critical role and contributes to proton symport (Sun et al. 2012).

Furthermore, ProFunc (Table 4) server predicted SP, MFS_gen_substrate_transporter and sugar transport-conserved motifs showing close resemblance with that of MFS protein (Laskowski et al. 2005). The other two structural motifs, Val48-Ser51 and Val152-Ile155, are also depicted to be conserved in HP B9WFH2 and may have a similar function. We identified a structurally similar protein to HP B9WFH2 on DALI server (Holm and Rosenstrom 2010). Our results clearly indicate a significant similarity with the glycerol-3-phosphate transporter (Z score = 56.5), d-xylose-proton symporter (Z score = 26.8) and many other proteins listed in Table S4. All these analyses strongly suggest that B9WFH2 may acts as a transporter protein.

Table 4 Sequence and structural motifs present in the HPs

Full size table

To further validate this protein as a potential drug target, we docked a novel glucose transport (GLUT) inhibitor (5-(4-hydroxy-3-trifluoromethylbenzylidene)-3-[4,4,4-trifluoro-2-methyl-2-(2,2,2-trifluoroethyl) butyl]thiazolidine-2,4-dione) in the glucose binding site of HP B9WFH2 (Fig. 1c). This compound has been reported to show antitumor potency by suppressing glucose uptake and occupied similar spatial orientation at HP B9WFH2 and provide steric hindrance to the binding of BGC (Eswar et al. 2006). It is noteworthy that the oxygen of the two keto-groups of thiazole ring of GLUT inhibitor shows polar interaction with the proposed active site residues, Lys21, Asn351, Tyr465 and His488, of HP B9WFH2 (Fig. 1d). Moreover, other residues surround the GLUT inhibitor are Trp456, Phe489 and Gln492. These observations clearly indicate a possibility of HP B9WFH2 as a potential target for antifungal therapy.

HP B9WFH4

Analysis for sub-cellular localization of HP B9WFH4 indicates its presence in the nuclear region. Signal peptide prediction suggests that this is a non-secretory protein without any transmembrane helix (Table S1). Domain analysis results reveal that HP B9WFH4 possesses diacylglycerol (DAG) kinase catalytic domain including sphingosine kinase domain, responsible for the kinase activity (Marchler-Bauer et al. 2011) (Table S2). HHpred also suggests high similarity with DAG kinase protein (Laskowski et al. 2005) (Table S3). InterProScan results show that HP B9WFH4 sequence possesses motif involved in activation of protein kinase C activity by G-protein coupled receptor signaling pathway (Lubec et al. 2005). Based on these observations, we suggest that HP B9WFH4 may function as a kinase protein. The virulence factor analysis shows that HP B9WFH4 has an adhesin character; hence it may be involved in virulence as well (Chaudhuri et al. 2011).

We predicted the model of HP B9WFH4 on SWISS-MODEL with diacylglycerol kinase (PDB ID: 3T5P) as a template (Kiefer et al. 2009). The predicted structure was validated at SAVES server showing 83.80 % of residues are in the allowed region of the RC-plot (Table 2). The RMSD of the model with respect to template was 2.389 Å, which is quite good and indicates similar functionality (Maiorov and Crippen 1994) (Table 2). The overall structure comprises eight α-helices and 14 β-strands, resembling the dinucleotide-binding motif of the Rossmann fold (Wang et al. 2013) (Fig. 2a). Another structurally related protein to HP B9WFH4 is sphingosine kinase 1 (SphK1) (PDB ID: 3VZB), having RMSD 3.712 Å, which is a lipid kinase that catalyzes the conversion of sphingosine to sphingosine-1-phosphate (Wang et al. 2013).

These enzymes play a role in lymphocyte trafficking, angiogenesis and response to apoptotic stimuli (Wang et al. 2013). Therefore, sphingosine is used as ligand for docking study with HP B9WFH4 to explicit active site residues and their functionality. It binds at the C-terminal domain (CTD) of SphK1 which catalyzes sphingosine to sphingosine-1-phosphate. The central role of SphK1 enzyme is in modulating the S1P levels in cells; it emerged as an important regulator for diverse cellular functions and may be a potential target for drug discovery (Wang et al. 2013). SphK1 consists of a two-domain architecture, CTD and NTD (N-terminal domain), and its catalytic site is located in the cleft between these two domains, and a hydrophobic lipid-binding pocket is buried in the CTD. A similar structural topology is observed in HP B9WFH4. The spatial orientation and docking interactions of sphingosine with HP B9WFH4 are consistent with SphK1 enzyme (Fig. 2a). The hydrophilic 2-amino-1,3-diol moiety head of the sphingosine spatially fitted at the cleft between two domains, making hydrogen bond interactions with Asp117, Thr144 and Ser225. The hydrophobic pocket of CTD comprises residues Val205, Ile228 and Leu264, which are involved in hydrophobic interaction with the polycarbon tail of sphingosine. Binding of sphingosine to SphK1 is mediated by anchoring of the hydrophilic head group to the protein surface and accommodation of the hydrophobic alkyl chain in the interior of protein. Further, the active site residue Asp117 is conserved in both proteins. Structural alignment of the HP B9WFH4 with the template structure reveals that Asp117 is conserved and may play a crucial role in the function of the HP B9WFH4 (Figure S3).

To further validate the role of this protein as a drug target, we performed a docking study of HP B9WFH4 with amidine-based sphingosine kinase inhibitor [(S)-1-(4-(4-(3-(2-cyclohexylethyl) phynyl)oxazol-2-yl)benzoyl)pyrrolidine-2carboximidamide] which is a potential drug for leukemia (Kennedy et al. 2011). We found that amidine-based sphingosine kinase inhibitor binds in the same cavity where sphingosine binds (Fig. 2c). The oxygen of oxazole ring of SphK inhibitor is involved in polar interaction with Thr144 and nitrogen of the oxazole ring is involved in polar interaction with Arg224, whereas the polar head of the inhibitor is involved in polar interaction with Asp117, His226 and Asp350 (Fig. 2d). Other important residues present in the vicinity of SphK inhibitor are Ile228 and leu229. These observations clearly indicate a possibility of HP B9WFH4 as a novel therapeutic target because of a close target similarity (Kennedy et al. 2011).

We further compared the nucleotide binding in HP B9WFH4 with SphK1, and docking of ADP was carried out to validate the structure. The active site at NTD comprises five α-helices and four parallel β-sheets. ADP binding showed that adenine ring fits at the cleft consisting of Ser57-Arg64 and Thr91-Ile97, as in the SphK1, and Lys92 at the opening of the active site is involved in H-bond interaction. Both of the ribose hydroxyls are involved in polar interaction with Glu122. The side chain of ADP, α-phosphate is hydrogen bonded with Ser115 and Thr118, whereas β-phosphate forms two H-bond interactions with Gly142 and Lys59 (Fig. 2b). It has been observed that ADP binding site of HP B9WFH4 is consistent with the crystal structure. However, π-stacking interactions with the backbones of Glu55 and Arg56 and with side chain Asn22 are not observed here.

ProFunc analysis identified sphingosine kinase and DAGK motifs (Laskowski et al. 2005). Six other structural motifs were also identified as Asn146-Leu148, Gly209-His211, Gln344-Arg346, Thr95-Ile97, Leu152-Ile154 and Ser167-Lys169 (Table 4). DALI results showed a significant similarity with other kinase protein ‘BmrU protein’ with maximum Z score = 43.8 for the top five hits (Holm and Rosenstrom 2010) (Table S4). These results also suggest the possible kinase activity of HP B9WFH4.

HP B9WFR9

HP B9WFR9 is predicted as a cytoplasmic protein. Domain analysis suggests the similarity with CoA-transferase family III that has multi-domain for acyl-CoA-transferases and carnitine dehydratase (Marchler-Bauer et al. 2011) (Table S2). Results of HHpred showed higher homology with FCOCT (PDB ID: 3UBM), a bacterial formyl-CoA:oxalate CoA-transferase protein (Soding et al. 2005) (Table S3). InterProScan result also suggests that the HP B9WFR9 sequence possesses a motif that is involved in formyl-CoA transferase activity (Quevillon et al. 2005).

We successfully predicted the structure of HP B9WFR9. The best model having 87.30 % of residues in the allowed region of the RC-plot (Table 2) is selected for further analysis. The structure of HP B9WFR9 shows a close resemblance with the template, formyl-coenzyme A (CoA):oxalate CoA-transferase, from the acidophile Acetobacter aceti with an RMSD of 0.178 Å (Mullins et al. 2012). The overall structure of HP B9WFR9 comprises eight α-helices and six parallel β-sheets having very similar backbone topology with the template (Fig. 3). In the template Rossmann fold domain with a conserved small domain contributes to the formation of active site for the catabolic disposal of carboxylic acid (Mullins et al. 2012).

To further validate the functional features, we performed docking study with template ligand CoA. Molecular docking results showed that the binding site of HP B9WFR9 consists of majorly α-helices (α3, α4 and α6) and extended from α3 to α6 to form a cleft-like structure, and the surrounding loops are rich in positively and negatively charged residues (Fig. 3). The predicted active site residue Arg260, Ser283, Pro288, Asp289, Leu306, Lys 309 and Arg332 are found to be involved in H-bonding. Among active site residues, Arg332 and Lys309 are structurally conserved and essential for electrostatic interactions (Mullins et al. 2012) (Fig. 3 and Figure S4).

Functional motif search resulted in the identification of two biologically significant motifs in the HP B9WFR9 (Table S5), CoA-transferase family III (CaiB/BaiF) and α-methylacyl-Coa racemase with two structurally significant motifs, Tyr274-Ala276 and Gly334-Ile336. DALI search result is consistent with ProFunc finding and showed a significant similarity with the formyl-coenzyme-A transferase (Z score = 47.1) (Table S4). These findings clearly indicate the possible transferase activity of B9WFR9 (Holm and Rosenstrom 2010; Laskowski et al. 2005).

HP B9WFS0

HP B9WFS0 was predicted as a cytoplasmic protein and possibly possesses ketosteroid isomerase activity (Table S1 and Table S3). To explore its function, a model of B9WFS0 was generated using MODELLER (Eswar et al. 2006). Procheck results showed that 85.80 % of residues are in the allowed region of the RC-plot and LG and MaxSub score are 2.341 and 0.309, respectively (Laskowski et al. 1996; Wallner and Elofsson 2003). The predicted structure was superimposed with the template (PDB ID: 2RGQ) and showed an RMSD of 0.46 Å (Table 2). The overall structure of HP B9WFS0 comprised three α-helices and eight parallel and anti-parallel β-strands, similar to the crystal structure of Rv0760c protein from M. tuberculosis (Cherney et al. 2008). The active site of HP B9WFS0 is very similar to a related protein of M. tuberculosis and resembles a cone-shaped closed-barrel fold formed by a curved anti-parallel β-sheet composed of seven β-strands and four adjacent anti-parallel α-helices (Fig. 4).

HP B9WFS0 is primarily involved in steroid metabolism and found to be an important target for antibiotic development. The active site of this enzyme lies at the surface, just above the longest helix and has clearly two distinct hydrophobic and hydrophilic binding domains (Fig. 4). The ligand estradiol-17 β-hemisuccinate spatially fitted at the surface and its hydrophilic tail showed polar interaction with Asn11 and Asn80, whereas the hydrophobic head protruded deep to the lower side of the pocket and showed π–π interaction with Tyr22. Other residues Val19, Val77 and Ile18 are involved in hydrophobic interaction. However, the hydroxyl group of benzene at the head of estradiol-17β-hemisuccinate showed polar interaction with Asp26 to form a tri-point attachment with protein. Other protein function analysis showed a significant similarity with the γ-hexachlorocyclohexane dehydrochlorinase and ketosteroid isomerase protein (Eberhardt et al. 2013).

HP B9WFS1

HP B9WFS1 is found to be present in the cytoplasm (Table S1) and is involved in the lysophospholipase activity. This protein belongs to the α/β-hydrolase family and may have hydrolase activity (Table S2). This result is consistent with HHpred and InterProScan results described in Table S3. The structure of HP B9WFS1 was predicted with MODELLER. The refined structure shows 86.60 % of the residues in the allowed region of the RC-plot with LG and MaxSub score of 5.560 and 0.533, respectively. The structure of HP B9WFS1 showed a profound similarity with the aryl esterase from Pseudomonas fluorescence (Cheeseman et al. 2004) with RMSD of 0.296 Å (Fig. 5a). The structure of this model totally resembles along with the catalytic triad, His30, Ser99, Met100 and Asp216 (Fig. 5b). Ser97 is also located in the canonical elbow of the conserved sequence Gly-X₁-Ser-X₂-Gly, similar to esterases and lipases family (Cheeseman et al. 2004). In our model, X₁ is His98 and X₂ is Met100, the same as the template protein; however, Gly96 is substituted by Ser101. The structure alignment of HP B9WFS1 with its template protein depicts that in the acyl-binding pocket, only Val125 is conserved and no other residue is conserved in the alcohol-binding pocket (Fig. 5b).

This type of lipase has been reported in soil bacteria showing activity in response to oxidative stress. Esterase and lipase efficiently act in ester hydrolysis, while the generation of peroxycarboxylic acid is efficiently performed by haloperoxidases. The model closely resembles the Pseudomonas fluorescens aryl esterase structure, which is functionally more similar to the esterases and haloperoxidases. Therefore, we can say that HP B9WFS1 may function as an esterase protein. Further, functional analysis using ProFunc and InterPro database scan revealed the presence of α/β-hydrolase motifs in the HP B9WFS1 (Table 4) with structural motifs Leu86-Leu88. DALI results also showed similarity with arylesterase with Z score of 49.4. All these systematic analyses precisely suggest that HP B9WFS1 may have hydrolase-like activity and be essential for survival of the pathogen.

HP B9WFS6

HP B9WFS6 was found to be distributed in the nuclear region and might be involved in methyltransferase activity (Table S1). There domains are highly conserved with RNA methyltransferase family (Table S2). Similar results are obtained with HHpred analysis, and these show high similarity with rRNA methylase, a methyltransferase (Table S3). HP B9WFS6 also acts in nucleotide binding. These findings are crucial and suggest the role of HP B9WFS6 in methyltransferase and nucleic acid-binding activity (Marchler-Bauer et al. 2011; Quevillon et al. 2005).

Furthermore, we predicted the three-dimensional structure of HP B9WFS6 to explain its function more precisely. The stereochemical quality of refined model show 85.10 % of the residues in the allowed region of the RC-plot (Table 2), with LG score 5.205 and MaxSub score: 0.367. This protein shows structural similarity with MT1 protein from M. thermoautotrophicum (Zarembinski et al. 2003). The overall structure of HP B9WFS6 comprises eight α-helices and seven β-sheets (Fig. 6a). MT1 protein consists of two subunits, the small β-barrel auxiliary domain (MT1-CSD) and larger domain defined as the TIM barrel (MT1-DD). The TIM barrel is essential for interfacial contact to form the homodimer of the MT1 protein. The overall structure of MT1-DD resembles the classical nucleotide-binding Rossmann fold and is highly conserved in HP B9WFS6. The small domain of HP B9WFS6 shows a similar topology to MT1-CSD; however, their sequences are not highly conserved. These two domains are almost conserved in most of the bacteria, yeast and archaebacteria. Interestingly, the TIM barrel domain structure is generally conserved in various organisms, although their sequences are highly divergent and give rise to a plethora of distinct functions (Zarembinski et al. 2003). Structural alignment showed the presence of nucleotide-binding Rossmann fold in HP B9WS6; however, sequences were not conserved. HP B9WS6 has conserved residues Asp230 and Asn351 near the site of knot formation, which is distinct to the MT1 protein (Fig. 6b). The predicted structure of B9WFS6 also contains a β-barrel auxiliary domain, which is structurally similar to CspA protein of E. coli, an RNA chaperone that binds to RNA to prevent hairpin formation for transcription anti-termination. ProFunc server revealed the presence of methyltrn_RNA_3 and nucleic acid-binding motifs in the HP B9WFS6 with one structural motif (Ile212-Glu214) (Table 4). DALI result shows similarity with the methyl transferase from the M. thermoautotrophicum (Z score = 41.0), and E. coli (Z score = 13.5) (Table S4). These findings are quite helpful to assign RNA-binding activity to HP B9WFS6.

HP B9WFU3

Domains of HP B9WFU3 are conserved with the protein disulfide isomerase (PDIa) family. It also contains s multi-domain that shows similarity with protein disulfide oxidoreductases and thioredoxin fold, involved in isomerase activity (Table S2). Similar prediction is obtained with Hpred analysis and shows high similarity with disulfide isomerase and thioredoxin-like fold (Table S3). InterProScan result also suggests the presence of thioredoxin-like fold in HP B9WFU3 which maintains cell redox homeostasis.

The three-dimensional structure of HP B9WFU3 was predicted to understand their function at the molecular level. The refined model shows 76.90 % of the residues in the allowed region of the RC-plot. The ProQ module results are impressive with an LG score of 1.123 and MaxSub score of 0.104, which are considerable scores to carry out further analysis. The overall structure of HP B9WFU3 comprises four α-helices (Fig. 7) and shows a close RMSD 1.877 Å with the template protein, disulfide isomerase ERp72 (Kozlov et al. 2010). This protein belongs to the PDIa family and is responsible for catalyzing the proper oxidation and isomerization of disulfide bonds of newly synthesized proteins in the endoplasmic reticulum. The overall structure of the PDI family proteins possesses a characteristic “U” shape twisted structure formed with four thioredoxin folds. However, our model contains only two thioredoxin folds. The prototypical thioredoxin fold is composed of a five-central β-sheet sandwiched with two α-helices on each of the other sides, whereas the predicted structure has only four β-strands. The active site of the template of HP B9WFU3 is characterized by the C-G-H-C consensus sequences located at the N-terminus of the second α-helix, consistent with PDI family proteins. However, in the predicted structure, GlyL92 and His93 have been mutated by Lys62 and Tyr63. These two cysteines are responsible for oxidoreductase activity (Figure S5). In the oxidized state, the two cysteines form an intramolecular disulfide bond that in principle enables PDI to convert a pair of sulfhydryl groups of the polypeptide substrate into a disulfide bond. Similarly, PDI-like protein ERp57, reported in ER of Saccharomyces cerevisiae, interacts with two lectin chaperons, calnexin and calreticulin, and promotes the oxidative folding of newly synthesized glycoprotein. Other prediction tools also supported the presence of thioredoxin domain in our model (Table 4 and Table S4), suggesting the possible thioredoxin function of HP B9WFU3.

HP B9WFW8

HP B9WFW8 is localized in the nuclear region of the cell (Table S1). CDD result suggests that this protein has a conserved multi-domain similar to cell cycle checkpoint protein Rad17 involved in DNA damage control (Table S3) (Nakai and Horton 1999). HHpred predicted the processivity clamp, DNA sliding clamp and AAA+ polymerase fold in HP B9WFW8 reported to be involved in DNA replication, recombination and restriction (Table S3) (Soding et al. 2005). InterProScan result shows the presence of similar motifs belonging to cell cycle regulatory proteins of yeast. These proteins are crucial in cell cycle regulation and control the DNA damage (Quevillon et al. 2005). Therefore, the structure of HP B9WFW8 is predicted to define their probable functions more precisely.

The overall topology of HP B9WFW8 predicted is similar to the putative methyltransferase protein of S. cerevisiae (Bowman et al. 2004), with an RMSD of 0.334 Å, suggesting a similar function. The overall structure of B9WFW8 comprises 28 α-helices and 5 β-strands (Fig. 8) and has clamp loader complex (replication factor-C, RFC) that binds to the sliding clamp (proliferating cell nuclear antigen, PCNA) (Fig. 8). Sliding clamps are ring-shaped proteins that encircle DNA and confer high processivity on DNA polymerases (Bowman et al. 2004). Tight interfacial coordination of the ATP analog, ATP-γS by RFC, results in a spiral arrangement of the ATPase domains of the clamp loader above the PCNA ring (Figure S6). Placement of a model for primed DNA within the central hole of PCNA reveals a striking correspondence between the RFC spiral and the grooves of the DNA double helix. Further, Profunc result revealed the presence of single motif Rad17 in HP B9WFW8 (Table 4). However, five structural motifs are also present in the structure, These are Gly101–Ser105, Asn70–Thr72, Glu193–Glu195, Gln17–Asp179 and Thr597–Gly599 probably involved in anion and cation binding site formation. DALI server was used to find the homologous structure and in identification of the RFC subunit (Z score = 17.4) and DNA Polymerase Accessory Protein (Z score = 16.9) in HP B9WFW8 (Table S4). These results are decisive for structure-based function determination and suggest the role of HP B9WFW8 in DNA replication.

Conclusions

Function prediction of the putative uncharacterized protein from sequence generated by genome project is a major challenging task. Here, we have used in silico techniques to examine C. dubliniensis genome and exemplify the functions for HPs. Our primary sequence-based analysis led to the identification of eight HPs as biologically significant, which might be involved as enzymes (kinase, hydrolase, transferase, isomerase and oxido-reductase), transporter protein and glucose symporter and involved in cell cycle control mechanisms. Furthermore, we successfully predicted the structure of all eight HPs to describe their functions at the molecular level. The outcome of the present study may facilitate better understanding of the mechanism of virulence, drug resistance, pathogenesis, adaptability to host, tolerance for host immune response and drug discovery for treatment of C. dubliniensis infections.

Abbreviations

HP:: Hypothetical protein
BLAST:: Basic local alignment search tool
PSI-BLAST:: Position-specific iterative basic local alignment search tool
CDD:: Conserved domain database
SMART:: Simple modular architecture research tool
PANTHER:: Protein analysis through evolutionary relationships
SVM:: Support vector machine
PP2C:: Protein phosphatase 2C
MFS:: Major facilitator superfamily
CATH:: Protein structure classification database

References

Achkar JM, Fries BC (2010) Candida infections of the genitourinary tract. Clin Microbiol Rev 23:253–273
Article Google Scholar
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410
Article CAS Google Scholar
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Article CAS Google Scholar
Apweiler R, Attwood TK, Bairoch A, Bateman A, Birney E, Biswas M, Bucher P, Cerutti L, Corpet F, Croning MD et al (2001) The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res 29:37–40
Article CAS Google Scholar
Biasini M, Bienert S, Waterhouse A, Arnold K, Studer G, Schmidt T, Kiefer F, Cassarino TG, Bertoni M, Bordoli L et al (2014) SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res 42:W252–W258
Bork P, Koonin EV (1996) Protein sequence motifs. Curr Opin Struct Biol 6:366–376
Article CAS Google Scholar
Bowman GD, O’Donnell M, Kuriyan J (2004) Structural analysis of a eukaryotic sliding DNA clamp-clamp loader complex. Nature 429:724–730
Article CAS Google Scholar
Brady GP Jr, Stouten PF (2000) Fast prediction and visualization of protein binding pockets with PASS. J Comput Aided Mol Des 14:383–401
Article CAS Google Scholar
Butler G, Rasmussen MD, Lin MF, Santos MA, Sakthikumar S, Munro CA, Rheinbay E, Grabherr M, Forche A, Reedy JL et al (2009) Evolution of pathogenicity and sexual reproduction in eight Candida genomes. Nature 459:657–662
Article CAS Google Scholar
Chaudhuri R, Ansari FA, Raghunandanan MV, Ramachandran S (2011) FungalRV: adhesin prediction and immunoinformatics portal for human fungal pathogens. BMC Genom 12:192
Article Google Scholar
Cheeseman JD, Tocilj A, Park S, Schrag JD, Kazlauskas RJ (2004) Structure of an aryl esterase from Pseudomonas fluorescens. Acta Crystallogr D Biol Crystallogr 60:1237–1243
Article Google Scholar
Cherney MM, Garen CR, James MN (2008) Crystal structure of Mycobacterium tuberculosis Rv0760c at 1.50 A resolution, a structural homolog of Delta(5)-3-ketosteroid isomerase. Biochim Biophys Acta 1784:1625–1632
Article CAS Google Scholar
Chiusano ML, D’Agostino N, Traini A, Licciardello C, Raimondo E, Aversano M, Frusciante L, Monti L (2008) ISOL@: an Italian SOLAnaceae genomics resource. BMC Bioinform 9(Suppl 2):S7
Article Google Scholar
de Castro E, Sigrist CJ, Gattiker A, Bulliard V, Langendijk-Genevaux PS, Gasteiger E, Bairoch A, Hulo N (2006) ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res 34:W362–W365
Article Google Scholar
Desler C, Suravajhala P, Sanderhoff M, Rasmussen M, Rasmussen LJ (2009) In Silico screening for functional candidates amongst hypothetical proteins. BMC Bioinform 10:289
Article Google Scholar
Dutta A, Katarkar A, Chaudhuri K (2013) In-silico structural and functional characterization of a V. cholerae O395 hypothetical protein containing a PDZ1 and an uncommon protease domain. PLoS ONE 8:e56725
Article CAS Google Scholar
Eberhardt RY, Chang Y, Bateman A, Murzin AG, Axelrod HL, Hwang WC, Aravind L (2013) Filling out the structural map of the NTF2-like superfamily. BMC Bioinform 14:327
Article Google Scholar
Ebihara A, Yao M, Masui R, Tanaka I, Yokoyama S, Kuramitsu S (2006) Crystal structure of hypothetical protein TTHB192 from Thermus thermophilus HB8 reveals a new protein family with an RNA recognition motif-like domain. Protein Sci 15:1494–1499
Article CAS Google Scholar
Eswar N, Webb B, Marti-Renom MA, Madhusudhan MS, Eramian D, Shen MY, Pieper U, Sali A (2006) Comparative protein structure modeling using Modeller. Curr Protoc Bioinform Chapter 5, Unit 5 6
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J et al (2014) Pfam: the protein families database. Nucleic Acids Res 42:D222–D230
Article CAS Google Scholar
Galperin MY, Koonin EV (2004) ‘Conserved hypothetical’ proteins: prioritization of targets for experimental study. Nucleic Acids Res 32:5452–5463
Article CAS Google Scholar
Gattiker A, Gasteiger E, Bairoch A (2002) ScanProsite: a reference implementation of a PROSITE scanning tool. Appl Bioinform 1:107–108
CAS Google Scholar
Haas BJ, Zeng Q, Pearson MD, Cuomo CA, Wortman JR (2011) Approaches to fungal genome annotation. Mycology 2:118–141
CAS Google Scholar
Hassan MI, Ahmad F (2011) Structural diversity of class i MHC-like molecules and its implications in binding specificities. In: Advances in protein chemistry and structural biology, pp 223--270
Hassan MI, Kumar V, Singh TP, Yadav S (2007a) Structural model of human PSA: a target for prostate cancer therapy. Chem Biol Drug Des 70:261–267
Article CAS Google Scholar
Hassan MI, Kumar V, Somvanshi RK, Dey S, Singh TP, Yadav S (2007b) Structure-guided design of peptidic ligand for human prostate specific antigen. J Pept Sci 13:849–855
Article CAS Google Scholar
Hassan MI, Bilgrami S, Kumar V, Singh N, Yadav S, Kaur P, Singh TP (2008) Crystal structure of the novel complex formed between zinc α2-glycoprotein (ZAG) and prolactin-inducible protein (PIP) from human seminal plasma. J Mol Biol 384:663–672
Article CAS Google Scholar
Hassan MI, Waheed A, Grubb JH, Klei HE, Korolev S, Sly WS (2013) High resolution crystal structure of human β-glucuronidase reveals structural basis of lysosome targeting. PLoS One 8
Hirashima A, Huang H (2008) Homology modeling, agonist binding site identification, and docking in octopamine receptor of Periplaneta americana. Comput Biol Chem 32:185–190
Article CAS Google Scholar
Hirokawa T, Boon-Chieng S, Mitaku S (1998) SOSUI: classification and secondary structure prediction system for membrane proteins. Bioinformatics 14:378–379
Article CAS Google Scholar
Holm L, Rosenstrom P (2010) Dali server: conservation mapping in 3D. Nucleic Acids Res 38:W545–W549
Article CAS Google Scholar
Jackson AP, Gamble JA, Yeomans T, Moran GP, Saunders D, Harris D, Aslett M, Barrell JF, Butler G, Citiulo F et al (2009) Comparative genomics of the fungal pathogens Candida dubliniensis and Candida albicans. Genome Res 19:2231–2244
Article CAS Google Scholar
Kalev I, Habeck M (2011) HHfrag: HMM-based fragment detection using HHpred. Bioinformatics 27:3110–3116
Article CAS Google Scholar
Karkowska-Kuleta J, Rapala-Kozik M, Kozik A (2009) Fungi pathogenic to humans: molecular bases of virulence of Candida albicans, Cryptococcus neoformans and Aspergillus fumigatus. Acta Biochim Pol 56:211–224
CAS Google Scholar
Kelley LA, Sternberg MJ (2009) Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc 4:363–371
Article CAS Google Scholar
Kennedy AJ, Mathews TP, Kharel Y, Field SD, Moyer ML, East JE, Houck JD, Lynch KR, Macdonald TL (2011) Development of amidine-based sphingosine kinase 1 nanomolar inhibitors and reduction of sphingosine 1-phosphate in human leukemia cells. J Med Chem 54:3524–3548
Article CAS Google Scholar
Kiefer F, Arnold K, Kunzli M, Bordoli L, Schwede T (2009) The SWISS-MODEL repository and associated resources. Nucleic Acids Res 37:D387–D392
Article CAS Google Scholar
Kozlov G, Azeroual S, Rosenauer A, Maattanen P, Denisov AY, Thomas DY, Gehring K (2010) Structure of the catalytic a(0)a fragment of the protein disulfide isomerase ERp72. J Mol Biol 401:618–625
Article CAS Google Scholar
Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580
Article CAS Google Scholar
Kumar K, Prakash A, Tasleem M, Islam A, Ahmad F, Hassan MI (2014) Functional annotation of putative hypothetical proteins from Candida dubliniensis. Gene 543:93–100
Article CAS Google Scholar
Laskowski RA, Moss DS, Thornton JM (1993) Main-chain bond lengths and bond angles in protein structures. J Mol Biol 231:1049–1067
Article CAS Google Scholar
Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM (1996) AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR 8:477–486
Article CAS Google Scholar
Laskowski RA, Watson JD, Thornton JM (2005) ProFunc: a server for predicting protein function from 3D structure. Nucleic Acids Res 33:W89–W93
Article CAS Google Scholar
Laurie AT, Jackson RM (2005) Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites. Bioinformatics 21:1908–1916
Article CAS Google Scholar
Letunic I, Doerks T, Bork P (2012) SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Res 40:D302–D305
Article CAS Google Scholar
Lill MA, Danielson ML (2011) Computer-aided drug design platform using PyMOL. J Comput Aided Mol Des 25:13–19
Article CAS Google Scholar
Loewenstein Y, Raimondo D, Redfern OC, Watson J, Frishman D, Linial M, Orengo C, Thornton J, Tramontano A (2009) Protein function annotation by homology-based inference. Genome Biol 10:207
Article Google Scholar
Lubec G, Afjehi-Sadat L, Yang JW, John JP (2005) Searching for hypothetical proteins: theory and practice based upon original data and literature. Prog Neurobiol 77:90–127
Article CAS Google Scholar
Maiorov VN, Crippen GM (1994) Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. J Mol Biol 235:625–634
Article CAS Google Scholar
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR et al (2011) CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res 39:D225–D229
Article CAS Google Scholar
Mazandu GK, Mulder NJ (2012) Function prediction and analysis of mycobacterium tuberculosis hypothetical proteins. Int J Mol Sci 13:7283–7302
Article CAS Google Scholar
Moran GP, Sullivan DJ, Henman MC, McCreary CE, Harrington BJ, Shanley DB, Coleman DC (1997) Antifungal drug susceptibilities of oral Candida dubliniensis isolates from human immunodeficiency virus (HIV)-infected and non-HIV-infected subjects and generation of stable fluconazole-resistant derivatives in vitro. Antimicrob Agents Chemother 41:617–623
CAS Google Scholar
Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK, Goodsell DS, Olson AJ (2009) AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J Comput Chem 30:2785–2791
Article CAS Google Scholar
Mount DW (2007) Using the basic local alignment search tool (BLAST). CSH Protoc 2007, pdb top17
Mullins EA, Starks CM, Francois JA, Sael L, Kihara D, Kappock TJ (2012) Formyl-coenzyme A (CoA): oxalate CoA-transferase from the acidophile Acetobacter aceti has a distinctive electrostatic surface and inherent acid stability. Protein Sci 21:686–696
Article CAS Google Scholar
Nakai K, Horton P (1999) PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem Sci 24:34–36
Article CAS Google Scholar
O’Connor L, Caplice N, Coleman DC, Sullivan DJ, Moran GP (2010) Differential filamentation of Candida albicans and Candida dubliniensis Is governed by nutrient regulation of UME6 expression. Eukaryot Cell 9:1383–1397
Article Google Scholar
Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM (1997) CATH—a hierarchic classification of protein domain structures. Structure 5:1093–1108
Article CAS Google Scholar
Petersen TN, Brunak S, von Heijne G, Nielsen H (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods 8:785–786
Article CAS Google Scholar
Prakash A, Kumar K, Islam A, Hassan MI, Ahmad F (2013) Receptor chemoprint derived pharmacophore model for development of CAIX inhibitors. J Carcinog Mutagen S8:003
Google Scholar
Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005) InterProScan: protein domains identifier. Nucleic Acids Res 33:W116–W120
Article CAS Google Scholar
Rashid M, Saha S, Raghava GP (2007) Support Vector Machine-based method for predicting subcellular localization of mycobacterial proteins using evolutionary information and motifs. BMC Bioinform 8:337
Article Google Scholar
Sandeep G, Nagasree KP, Hanisha M, Kumar MM (2011) AUDocker LE: a GUI for virtual screening with AUTODOCK Vina. BMC Res Notes 4:445
Article Google Scholar
Schwede T, Kopp J, Guex N, Peitsch MC (2003) SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res 31:3381–3385
Article CAS Google Scholar
Shahbaaz M, Hassan MI, Ahmad F (2013) Functional annotation of conserved hypothetical proteins from Haemophilus influenzae Rd KW20. PLoS ONE 8:e84263
Article Google Scholar
Shahbaaz M, Ahmad F, Hassan MI (2014) Structure-based functional annotation of putative conserved proteins having lyase activity from Haemophilus influenzae. 3Biotech. doi:10.1007/s13205-014-0231-z
Shen HB, Chou KC (2009) Predicting protein fold pattern with functional domain and sequential evolution information. J Theor Biol 256:441–446
Article CAS Google Scholar
Singh A, Thakur PK, Meena M, Kumar D, Bhatnagar S, Dubey AK, Hassan MI (2014) Interaction between basic 7S globulin and leginsulin in soybean [Glycine max]: a structural insight. Lett Drug Des Discov 11:231–239
Article CAS Google Scholar
Sinha A, Ahmad MI, Hassan MI (2014) Structure based functional annotation of putative conserved proteins from treponema pallidum: search for a potential drug target. Lett Drug Des Discov 11. doi:10.2174/1570180811666140806005822
Soding J, Biegert A, Lupas AN (2005) The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248
Article Google Scholar
Sonnhammer EL, von Heijne G, Krogh A (1998) A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol 6:175–182
CAS Google Scholar
Stokes C, Moran GP, Spiering MJ, Cole GT, Coleman DC, Sullivan DJ (2007) Lower filamentation rates of Candida dubliniensis contribute to its lower virulence in comparison with Candida albicans. Fungal Genet Biol 44:920–931
Article CAS Google Scholar
Sullivan D, Coleman D (1998) Candida dubliniensis: characteristics and identification. J Clin Microbiol 36:329–334
CAS Google Scholar
Sullivan DJ, Moran GP, Coleman DC (2005) Candida dubliniensis: ten years on. FEMS Microbiol Lett 253:9–17
Article CAS Google Scholar
Sun L, Zeng X, Yan C, Sun X, Gong X, Rao Y, Yan N (2012) Crystal structure of a bacterial homologue of glucose transporters GLUT1-4. Nature 490:361–366
Article CAS Google Scholar
Tasleem M, Ishrat R, Islam A, Ahmad F, Hassan MI (2014) Structural characterization, homology modeling and docking studies of ARG674 mutation in MyH8gene associated with trismus-pseudocamptodactyly syndrome. Lett Drug Des Discov1 1. doi:10.2174/1570180811666140717190217
Thakur PK, Hassan I (2011) Discovering a potent small molecule inhibitor for gankyrin using de novo drug design approach. Int J Comput Biol Drug Des 4:373–386
Article CAS Google Scholar
Thakur PK, Kumar J, Ray D, Anjum F, Hassan MI (2013) Search of potential inhibitor against New Delhi metallo-beta-lactamase 1 from a series of antibacterial natural compounds. J Nat Sci Biol Med 4:51–56
Article CAS Google Scholar
Thakur PK, Prakash A, Khan P, Fleming RE, Waheed A, Ahmad F, Hassan MI (2014) identification of interfacial residues involved in hepcidin–ferroportin interaction. Lett Drug Des Discov 11:363–374
Article CAS Google Scholar
Thomas PD, Campbell MJ, Kejariwal A, Mi H, Karlak B, Daverman R, Diemer K, Muruganujan A, Narechania A (2003) PANTHER: a library of protein families and subfamilies indexed by function. Genome Res 13:2129–2141
Article CAS Google Scholar
Totrov M, Abagyan R (2008) Flexible ligand docking to multiple receptor conformations: a practical alternative. Curr Opin Struct Biol 18:178–184
Article CAS Google Scholar
Tusnady GE, Simon I (2001) The HMMTOP transmembrane topology prediction server. Bioinformatics 17:849–850
Article CAS Google Scholar
Wallner B, Elofsson A (2003) Can correct protein models be identified? Protein Sci 12:1073–1086
Article CAS Google Scholar
Wang Z, Min X, Xiao SH, Johnstone S, Romanow W, Meininger D, Xu H, Liu J, Dai J, An S et al (2013) Molecular basis of sphingosine kinase 1 substrate recognition and catalysis. Structure 21:798–809
Article CAS Google Scholar
Webb B, Sali A (2014) Protein structure modeling with MODELLER. Methods Mol Biol 1137:1–15
Article CAS Google Scholar
Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, Hochstrasser DF (1999) Protein identification and analysis tools in the ExPASy server. Methods Mol Biol 112:531–552
CAS Google Scholar
Zarembinski TI, Kim Y, Peterson K, Christendat D, Dharamsi A, Arrowsmith CH, Edwards AM, Joachimiak A (2003) Deep trefoil knot implicated in RNA binding found in an archaebacterial protein. Proteins 50:177–183
Article CAS Google Scholar

Download references

Acknowledgments

This work was supported by the Indian Council of Medical Research (BIC/12(04)/2012) to MIH and FA. AP is thankful to the UGC (BSR grant), Delhi, India, for providing the Dr. DS Kothari Postdoctoral Fellowship to carry out this work.

Conflict of interest

The authors declare no conflict of interest regarding any financial and personal relationships with other people or organizations that could inappropriately influence (bias) this work.

Author information

Authors and Affiliations

Center for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, Jamia Nagar, New Delhi, 110025, India
Kundan Kumar, Amresh Prakash, Asimul Islam, Faizan Ahmad & Md. Imtaiyaz Hassan
Female College of Applied Medical Science, Taif University, Al-Taif, Kingdom of Saudi Arabia
Farah Anjum

Authors

Kundan Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Amresh Prakash
View author publications
You can also search for this author in PubMed Google Scholar
Farah Anjum
View author publications
You can also search for this author in PubMed Google Scholar
Asimul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Faizan Ahmad
View author publications
You can also search for this author in PubMed Google Scholar
Md. Imtaiyaz Hassan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Md. Imtaiyaz Hassan.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 2044 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Kumar, K., Prakash, A., Anjum, F. et al. Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets. 3 Biotech 5, 561–576 (2015). https://doi.org/10.1007/s13205-014-0256-3

Download citation

Received: 31 May 2014
Accepted: 30 September 2014
Published: 17 October 2014
Issue Date: August 2015
DOI: https://doi.org/10.1007/s13205-014-0256-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Structure-based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for potential drug targets

Abstract

Similar content being viewed by others

Functional annotation of Candida albicans hypothetical proteins: a bioinformatics approach

Novel and unique domains in aminoacyl-tRNA synthetases from human fungal pathogens Aspergillus niger, Candida albicans and Cryptococcus neoformans

Draft genome of a commonly misdiagnosed multidrug resistant pathogen Candida auris

Introduction