Structure-based function analysis of putative conserved proteins with isomerase activity from Haemophilus influenzae
- 1.3k Downloads
- 6 Citations
Abstract
Haemophilus influenzae, a Gram-negative bacterium and a member of the family Pasteurellaceae, causes chronic bronchitis, bacteremia, meningitis, etc. The H. influenzae is the first organism whose genome was completely sequenced and annotated. Here, we have extensively analyzed the genome of H. influenzae using available proteins structure and function analysis tools. The objective of this analysis is to assign a precise function to hypothetical proteins (HPs) whose functions are not determined so far. Function prediction of these proteins is helpful in precise understanding of mechanisms of pathogenesis and biochemical pathways important for selecting novel therapeutic target. After an extensive analysis of H. Influenzae genome we have found 13 HPs showing high level of sequence and structural similarity to the enzyme isomerase. Consequently, the structures of HPs have been modeled and analyzed to determine their precise functions. We found these HPs are alanine racemase, lysine 2, 3-aminomutase, topoisomerase DNA-binding C4 zinc finger, pseudouridine synthase B, C and E (Rlu B, C and E), hydroxypyruvate isomerase, nucleoside-diphosphate-sugar epimerase, amidophosphoribosyltransferase, aldose-1-epimerase, tautomerase/MIF, Xylose isomerase-like, have TIM barrel domain and sedoheptulose-7-phosphate isomerase like activity, signifying their corresponding functions in the H. influenzae. This work provides a better understanding of the role HPs with isomerase activities in the survival and pathogenesis of H. influenzae.
Keywords
Haemophilus influenza Hypothetical protein Isomerase activity Structure prediction Structure analysis Structure–function relationshipIntroduction
Haemophilus influenzae, a member of family Pasteurellaceae, is a non-motile Gram-negative bacterium (Kuhnert, 2008). It is an obligatory human parasite which causes meningitis, sinusitis, epiglottitis, chronic bronchitis and community acquired pneumonia (Apisarnthanarak and Mundy 2005; Eldika and Sethi 2006). The genome of H. influenzae was successfully sequenced (Fleischmann et al. 1995) which revealed 1,740 protein-coding genes, 2 transfer RNA genes, and 18 other RNA genes in a 1.83 Mb single circular chromosome (Fleischmann et al. 1995). H. influenzae requires β-nicotinamide adenine dinucleotide and heme-related compounds for its growth (Markel et al. 2007; Morton et al. 2004a). Hence, it uses numerous mechanisms to obtain heme (Stojiljkovic and Perkins-Balding 2002) using various heme acquisition proteins like Hup protein (Morton et al. 2004b) and HbpA lipoprotein (Morton et al. 2005). It is also evident that the periplasmic iron-binding protein, FbpA (ferric-ion-binding protein A), plays an essential role in procurement of iron from transferrin in H. influenzae (Khun et al. 1998; Kirby et al. 1997). This shows that iron is important for its survival and virulence (Morton et al. 2004a). Furthermore, there is a strict regulation of iron homeostasis in H. influenzae as indicated by the mechanism for heme acquisition in the organism.
H. influenzae strains comprise high antibiotic resistance, including multidrug resistance to ampicillin and chloramphenicol, make the treatment of meningitis and chronic pneumonia more complex (Campos 2001; Pfeifer et al. 2013; Saha et al. 2008). The antibiotic resistance in H. influenzae was strongly associated with the presence of large conjugative plasmids (Leaves et al. 2000). The antibiotic resistances in H. influenzae occur due to various mechanisms which can affect the empirical treatment of infections (Jorgensen, 1991; Kostyanev and Sechanova, 2012; Tristram et al. 2007). There is an increasing prevalence of resistance to antibiotics like aminopenicillins, macrolides, tetracyclines and fluoroquinolones. This is a major associated problem (Jorgensen, 1991; Kostyanev and Sechanova, 2012; Tristram et al. 2007). An extensive genome analysis of the organism may be helpful to find novel drug targets against multidrug-resistant strains.
Analysis of 102 bacterial genomes of the genomic consortium reflects that 45,110 proteins are prearranged in 7,853 orthologous groups with unknown function (Doerks et al. 2004). These proteins are considered as a “conserved hypothetical proteins (HPs)”, i.e., proteins that have not been functionally characterized and described at biochemical and physiological level in organisms (Galperin and Koonin 2004). The HPs are supposed to be the products of pseudogenes in majority of organisms and comprise a wide fraction of their proteomes (Desler et al. 2012; Galperin 2001). The species-specific phenotypic properties such as pathogenicity in a given organism can be determined by analyzing unique sequences of HPs because these determinants are assumed to be the potent drug targets in pathogenic strains of organisms (Tsoka and Ouzounis 2000). The significance of functional characterization HP can further be understood by recent functional annotation of formerly uncharacterized tRNA modification enzymes (Alexandrov et al. 2002; Jackman et al. 2003; Soma et al. 2003) of the deoxyxylulose pathway (Eisenreich et al. 2001) that plays a central role in cyclic diguanylate bacterial signaling (Galperin 2004; Jenal 2004). We have been working in the area of structure-based rational drug design hence we are searching a novel therapeutic target in pathogenic organism (Hassan et al. 2007a, b; Thakur et al. 2013a). We have successfully annotated the function of HPs from pathogenic organism both at sequence and structure levels (Kumar et al. 2014a, b; Shahbaaz et al. 2014; Sinha et al. 2014).
The biological function cannot be predicted by comparison of sequence similarity alone (Illergard et al. 2009). Structure-based function prediction is often considered as a better tool in comparison to the sequence-based methods. Because in most cases the evolution retains a conserved folding pattern despite of very poor sequence similarity (Hassan and Ahmad 2011; Hassan et al. 2008, 2013; Illergard et al. 2009). Furthermore, identification of binding motifs and catalytic sites is critical for a protein function, which can easily be predicted from the available protein structure (Shapiro and Harris 2000; Singh et al. 2014). Moreover, the process of structure-based rational drug design is completely based on the structural features of a protein molecule (Capdeville et al. 2002; Klebe 2000; Tasleem et al. 2014; Thakur et al. 2013b). Hence, structure analysis of HPs is central to strengthen the process of biological function prediction and development of better therapeutics intervention for the treatment of diseases associated with the pathogen.
List of HPs with isomerase activity from H. influenzae Rd KW
| S.no | Accession no | Gene id | Protein product | Uniprot id | Protein name |
|---|---|---|---|---|---|
| 1. | NC_000907.1 | 950992 | NP_438263.1 | P44506 | HP HI0090 |
| 2. | NC_000907.1 | 949459 | NP_438493.1 | P44641 | HP HI0329 |
| 3. | NC_000907.1 | 949423 | NP_438817.1 | P46494 | HP HI0656.1 |
| 4. | NC_000907.1 | 950211 | NP_438854.1 | P44827 | HP HI0694 |
| 5. | NC_000907.1 | 950733 | NP_439174.1 | Q57151 | HP HI1013 |
| 6. | NC_000907.1 | 950006 | NP_439175.1 | P44094 | HP HI1014 |
| 7. | NC_000907.1 | 950150 | NP_439355.1 | P45104 | HP HI1199 |
| 8. | NC_000907.1 | 950157 | NP_439364.1 | P71373 | HP HI1208 |
| 9. | NC_000907.1 | 950209 | NP_439468.1 | P44160 | HP HI1317 |
| 10. | NC_000907.1 | 950703 | NP_439541.1 | O86237 | HP HI1388.1 |
| 11. | NC_000907.1 | 950784 | NP_439587.1 | Q57152 | HP HI1436 |
| 12. | NC_000907.1 | 950455 | NP_439742.1 | P44268 | HP HI1600 |
| 13. | NC_000907.1 | 950796 | NP_439799.1 | P52606 | HP HI1657 |
Materials and methods
Sequence retrieval
Extensive analysis of H. influenzae genome shows 1,657 proteins which are encoded by its genome (http://www.ncbi.nlm.nih.gov/genome/?term=haemophilus+influenzae). We have already characterized 429 proteins as HP in H. influenzae and their FASTA sequences were retrieved from UniProt (http://www.uniprot.org/) using the “Gene ID” (Shahbaaz et al. 2013). After sequence analysis, we classified all 429 HPs into various classes using the information available in the publically available databases like PDB, Pfam, etc.(Shahbaaz et al. 2013). Here, we selected HPs with isomerase activity for further structure analysis. All tools used in this study are listed in the Table S1.
Sequence analysis
We used several bioinformatics tools such as PSORTb (Yu et al. 2010b), PSLpred (Bhasin et al. 2005) and CELLO (Yu et al. 2006) to identify the subcellular localization of HPs. Furthermore, we also analyzed the presence of signal peptide using SignalP 4.1 (Emanuelsson et al. 2007) and to identify non-classical secretory pathway protein we used SecretomeP (Bendtsen et al. 2005). To characterize a protein to be a membrane protein, the online servers TMHMM (Krogh et al. 2001) and HMMTOP (Tusnady and Simon 2001) were used. Conserved sequence patterns in protein families were used for the prediction of the functions of HPs (Chen and Jeong 2000). The BLASTp (Altschul et al. 1990) and HHpred (Soding et al. 2005) were used for remote homology detection against various available protein databases such as PDB (Bernstein et al. 1978), SCOP (Hubbard et al. 1999) and CATH (Sillitoe et al. 2013). We further performed domain analysis of proteins for more precise function prediction of HPs [47]. The databases such as Pfam (Punta et al. 2011), PANTHER (Mi et al. 2005), SMART (Letunic et al. 2012), SUPERFAMILY (Gough et al. 2001), CATH (Sillitoe et al. 2012), CDART (Geer et al. 2002), SYSTERS (Meinel et al. 2005), ProtoNet (Rappoport et al. 2011) and SVMProt (Cai et al. 2003) were used for precise domain annotation in HPs. Similarly, instead of direct sequence similarity, we also used domain architecture and profile-based methods like CDART and SMART for similarity search.
The annotation of signature protein sequences was performed using the program MOTIF (Kanehisa 1997) and InterProScan (Quevillon et al. 2005). For the identification of motif sequence, we used MEME suite (Bailey et al. 2009). In addition, we also performed virulence factor prediction using VICMpred (Saha and Raghava 2006) and Virulentpred (Garg and Gupta 2008), since virulence factors are considered as potential drug/vaccine targets (Baron and Coombes 2007). We also acknowledge the importance of understanding the protein function using the information of protein–protein interactions. Therefore, to predict the interaction partners of HPs we used STRING (version–9.05) (Szklarczyk et al. 2011a, 2011b).
Structure prediction
For modeling three-dimensional structure of HPs, we used two classes of structure prediction methodologies (Baker and Sali, 2001) (i.e., threading/comparative modeling and de novo or ab initio methods). The MODELLER (Eswar et al. 2006) module of Discovery Studio 3.5 (Accelrys 2013), I-TASSER (Roy et al. 2010) and ROBETTA server (Kim et al. 2004) were used for prediction of a reliable structure of HPs. We used the homology modeling (Marti-Renom et al. 2000) for structure prediction of those HPs where the sequence identity is >30 % between the target and template sequences. We, first, identified templates using sequence similarity search methods like PSI-BLAST (Altschul et al. 1997) present in Discovery Studio 3.5 (Accelrys 2013) for identification of potential templates in protein data bank (PDB). We also used the fold recognition methods like HHpred (Soding et al. 2005) for template identification. The template and query sequences were aligned and used for modeling HPs structures in MODELLER (Eswar et al. 2006).
In case of sequence identity <30 %, we used ab initio modeling protocols for predicting the structure. The I-TASSER (Roy et al. 2010) server uses ab initio algorithms, first generates three-dimensional (3D) atomic models from multiple threading alignments and iterative structural assembly simulations. It inferred function of the HPs using the structural matching of the 3D models with other known proteins and produced outputs contain full-length tertiary as well as secondary structure predictions, ligand-binding sites, Enzyme Commission (EC) numbers, etc. (Roy et al. 2010).
Similarly, ROBETTA server (Kim et al. 2004) also uses ab initio or de novo methods to predict the structure of proteins whose structural analogs do not exist in the PDB. First, it uses the alignment method, called K*Sync, to align the query sequence onto the parent structure. Then it models variable regions by allowing them to explore conformational space with fragments in a fashion similar to the de novo protocol in context of the template. Second, when no structural homolog is accessible, server modeled the domains using Rosetta de novo protocol (Misura et al. 2006), which allows the full length of the domain to explore conformational space via fragment inclusion, generating a sizeable decoy collection from which the concluding models are chosen.
The resulting models are optimized and then energy minimization was carried out using CHARM-22 from Accelrys Discovery Studio 3.5 and the steepest descent algorithm of GROMOS from Deepview (Kaplan and Littlejohn 2001). We further refined the predicted models of HPs using a side chain refinement protocol of Discovery studio 3.5 using force fields, like CHARMM (Brooks et al. 2009), and backbone-dependent rotamer library of SCWRL4 (Krivov et al. 2009) predicts positions of the side chains which are used for refinement of predicted protein structures. The loop refinement protocol of MODELLER (Eswar et al. 2006) is also used for improving the quality of predicted models.
Structure validation
The quality of predicted HPs models were analyzed on SAVES server (Structural Analysis and Verification Server). The modeled protein structures are validated using PROCHECK (Laskowski et al. 1996), WHAT_CHECK (Hooft et al. 1996; Vriend 1990), ERRAT (Colovos and Yeates 1993), VERIFY_3D (Eisenberg et al. 1997; Luthy et al. 1992) and PROVE (Pontius et al. 1996) services present in SAVES server. PROCHECK validated the stereo-chemical quality of a protein structure by analyzing the overall structure and residue-by-residue geometry of the protein. Similarly, WHAT_CHECK also analyzes the stereo-chemical parameters of the residues in HPs. The ERRAT server of UCLA (University of California, Los Angeles) verifies the structures HPs by performing the statistical analysis of the patterns of non-bonded atomic interactions. Further, VERIFY_3D provides a visual analysis of the quality of HPs structures by determining the compatibility of predicted model of HP with its own primary structure. The PyMOL (DeLano 2002), a molecular graphics system, is used for visualization of protein structure and for calculating the r.m.s. deviation between the target HP and the template.
Structure analysis
Structure similarity is more consistent than sequence similarity (Taylor and Orengo 1989). Since the structures of homologous proteins are more conserved than their sequences (Chothia and Lesk 1986). We used varieties of protein structure analysis tools for the prediction of function of HPs. CASP (Critical Assessment of protein Structure Prediction) contains firestar (Lopez et al. 2011), COACH (Yang et al. 2013), COFACTOR (Roy et al. 2012), 3DLigandSite (Wass et al. 2010), TM-SITE (Yang et al. 2013) and S-SITE (Yang et al. 2013), which were used for predicting catalytic and ligand-binding residues in protein sequences. We also used information available in literature about the templates used in protein modeling to identify the catalytic residues in HPs. Furthermore, active pocket sites in the predicted structures of HPs were identified using POCASA (Yu et al. 2010) and Pocket-Finder (Laurie and Jackson, 2005) servers. The PPM server (Lomize et al. 2012) was used for calculating spatial positions in membranes of HPs. The ProFunc (Laskowski et al. 2005) web server was used for structure-based function annotation and for predicting structural motifs associated with catalytic functions. Function predictions of HPs are also complimented by DALI server that compares the target structure with known structure submitted in PDB. The secondary structure elements are computed from atomic resolution protein structures of HPs using the STRIDE web server (Heinig and Frishman 2004).
Result and discussion
Here, we performed sequence and structure analysis of 13 HPs which was predicted to be isomerase such as alanine racemase, lysine 2, 3-aminomutase, topoisomerase DNA-binding C4 zinc finger, pseudouridine synthase B, C and E hydroxypyruvate isomerase, nucleoside-diphosphate-sugar epimerase, amidophosphoribosyl transferase, aldose-1-epimerase, tautomerase/MIF, xylose isomerase-like, TIM barrel domain, sedoheptulose-7-phosphate isomerase-like activity. We predicted the structures of all 13 HPs and analyzed them using available bioinformatics tools. The predicted models of P44506, P44641, P46494, P44827, Q57151, P44094, P45104, P71373, P44160, Q57152, P44268, P52606 show significant validation score on SAVES server. The outcomes of structural analysis for each protein are described here, separately.
HP P44506
List of sequence-based predicted function of HPs with isomerase activity and Motif discovered using MEME of H. influenzae strain Rd KW20
| S.NO | Cluster# | UNIPROT ID | MEME results | Consensusa function | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Motif 1 | Motif 2 | Motif 3 | MAST function prediction | |||||||
| Start | Site | Start | Site | Start | Site | |||||
| 1. | Cluster 102 | P44506 | 151 | ENLPHLCLRGLM | 209 | PSAIKCGSTMV | 76 | EWHFIG | UPF0001 protein | Alanine racemase |
| 2. | Cluster 170 | P44641 | 120 | GCAVNC | 236 | IFAHAM | 88 | GFSTDP | l-lysine 2,3-aminomutase | lysine 2,3-aminomutase |
| 3. | Cluster 152 | P46494 | 76 | FGMFIGCSHYPECDFVV | 1 | MNQSLFHH | 115 | RRGRQGKIFY | No result | Topoisomerase DNA-binding C4 zinc finger |
| 4. | Cluster 80 | P44827 | 84 | VYAAGRLDRDSEGLLILTNNGELQHRLADPKFKTEKTYWVQVEGI | 51 | TKVVLFNKPFDVLTQFTDEQGRATLKD | 178 | WLEIKISEGRNRQVRRMTAHIGFP | Ribosomal large subunit pseudouridine synthase E | ribosomal large subunit pseudouridine synthase E |
| 5. | Cluster 128 | Q57151 | 99 | CPNVHIM | 71 | WGGSAI | 178 | DYFHAQ | Putative hydroxypyruvate isomerase | hydroxypyruvate isomerase |
| 6. | Cluster 162 | P44094 | 149 | MCELLINDYSRKGFVDGIVVRLPTICIRPGKPNKAASSFVSSIMREPLHG | 55 | CPVSEE | 291 | QALALGFKV | No result | Nucleoside-diphosphate-sugar epimerase |
| 7. | Cluster 80 | P45104 | 176 | WIAVGRLDINTSGLLLFTTDGELANRLMHPSREVEREYSVRVFGQ | 140 | CRVLMYYKPEGELCTRSDPEGRATVFD | 256 | WYDVTLMEGRNREVRRLWESQGIQ | Ribosomal large subunit pseudouridine synthase B | ribosomal large subunit pseudouridine synthase B |
| 8. | Cluster 113 | P71373 | 209 | DHSECRGAFNFAAPKSIKQH | 284 | DCENYL | 268 | VVPEKLLNAGFQFQY | Epimerase family protein HI_1208 | Amidophosphoribosyltransferase (Epimerase) |
| 9. | Cluster 38 | P44160 | 86 | QPAHGT | 75 | PICYPW | 29 | CGWNTKNFPC | Putative glucose-6-phosphate 1-epimerase | Aldose 1-epimerase |
| 10. | Cluster 38 | O86237 | 104 | QPAHCW | 48 | DFYYPF | 34 | KGKHAIRFLC | No result | Tautomerase/MIF |
| 11. | Cluster 114 | Q57152 | 51 | WVFIPRM | 72 | AISPYI | 38 | FSIDTM | No result | RNA pseudouridine synthase C |
| 12. | Cluster 196 | P44268 | 249 | KGTVWD | 99 | CECEGH | 35 | ENWSKM | No result | Xylose isomerase-like, TIM barrel domain |
| 13. | Cluster 141 | P52606 | 97 | ELYCHQ | 32 | QMVMQC | 1 | MLQKVK | No result | Sedoheptulose 7-phosphate isomerase |
The sequence of HP P44506 was also annotated in the Unirpot database. We found that pyridoxal 5′-phosphate (PLP) is a cofactor for this protein, clearly indicated its role in the pyridoxal 5′-phosphate binding. It is interesting to note that sequence similarities searches showed that HP P44506 belongs to the uncharacterised protein family UPF0001, which is primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites. Finally, family and domain database search analysis clearly indicates that HP P44506 containing N-terminal alanine racemase domain, PLP-binding barrel, belongs to racemases and epimerases and actis on amino acids and derivatives.
List of structure-based predicted function and validation of HP with isomerase activity in H. influenzae strain Rd KW20
| S. no. | Uniprot id | Template | Identity (%) | RMSD | TM score | Ramachandran plot | Proposed function |
|---|---|---|---|---|---|---|---|
| 1. | P44506 | PLP-binding protein, 1W8G | 61 | 0.223 | 0.6229 | 99.5 % (94.4 % core 5.1 % allow 0.0 % gener 0.5 % disall) | Alanine racemase |
| Pyridoxal phosphate-binding protein, 3SY1 | 57 | 0.243 | 0.5130 | ||||
| Alanine racemase, 4A3Q (HHpred result) | N/A | 3.997 | 0.2651 | ||||
| 2. | P44641 | Lysine-2,3-aminomutase, 2A5H | 34 | 0.241 | 0.3718 | 99 % (90.8 % core 8.2 % allow 0.0 % gener 1.0 % disall) | Lysine 2,3-aminomutase |
| 3. | P46494 (Robetta server) | Rosetta de novo protocol (no template used) | N/A | N/A | N/A | 100 % (86.7 % core 13.3 % allow 0.0 % gener 0.0 % disall) | DNA Topoisomerase,type IA,Zn finger |
| 4. | P44827 | pseudouridine synthase Rlu E, 2OLW | 66 | 0.233 | 0.73306 | 98.1 % (87.3 % core 9.9 % allow 0.9 % gener 1.9 % disall) | Pseudouridine synthase Rlu E |
| pseudouridine synthase Rlu E, 2OML | 66 | 0.604 | 0.74206 | ||||
| Ribosomal small subunit pseudouridine synthase A, 1KSK | 31 | 1.662 | 0.71557 | ||||
| 5. | Q57151 | AP endonuclease, family 2, 3NGF | 54 | 0.206 | 0.8329 | 99.6 % (92.4 % core 7.1 % allow 0.0 % gener 0.4 % disall) | hydroxypyruvate isomerase/D-tagatose 3-epimerase |
| Putative Oxygenase, 1K77 | 60 | 0.751 | 0.9661 | ||||
| L-ribulose 3-epimerase, 3VYL | 26 | 1.759 | 0.4682 | ||||
| 6. | P44094 | Nucleoside-diphosphate-sugar epimerase, 2HRZ | 41 | 0.194 | 0.1688 | 99.6 % (92.8 % core 6.1 % allow 0.7 % gener 0.4 % disall) | Nucleoside-diphosphate-sugar epimerase |
| 7. | P45104 (ITASSER) | Ribosomal large subunit pseudouridine synthase F, 3DH3 | 29 | 0.657 | 0.66428 | 97.8 % (81.9 % core 12.4 % allow 3.5 % gener 2.2 % disall) | Ribosomal large subunit pseudouridine synthase F |
| 8. | P71373 | Nucleoside-diphosphate sugar epimerase (SulA family), 3OH8 | 33 | 0.655 | 0.84610 | 99.6 % (93.2 % core 5.7 % allow 0.8 % gener 0.4 % disall) | Nucleoside-diphosphate sugar epimerase |
| Epimerase family protein SDR39U1, 4B4O | 32 | 0.572 | 0.92240 | ||||
| 9. | P44160 | Crystal Structure Analysis of HI1317, 1JOV | 93 | 0.227 | 0.97926 | 99.6 % (88.8 % core 9.5 % allow 1.2 % gener 0.4 % disall) | Galactose mutarotase (Aldose 1-epimerase) |
| putative mutarotase (YeaD), 2HTA | 33 | 0.640 | 0.94691 | ||||
| Hexose-6-phosphate mutarotase, 2CIR | 26 | 1.583 | 0.84898 | ||||
| 10. | Q57152 (ITASSER) | Solution NMR Structure of protein YqcC, 2HGK | 47 | 0.876 | 0.97784 | 96.9 % (85.4 % core 10.4 % allow 1.0 % gener 3.1 % disall) | Beta-fructofuranosidase/invertase inhibitor |
| 11. | P44268 | Crystal structure of a DUF692 family protein, 3BWW | 73 | 0.478 | 0.79170 | 98.9 % (86.5 % core 11.3 % allow 1.1 % gener 1.1 % disall) | Xylose isomerase-like |
| L-ribulose 3-epimerase, 3VYL | 50 | 3.912 | 0.73310 | ||||
| 12. | P52606 | Crystal structure of Escherichia coli DiaA, 2YVA | 35 | 0.364 | 0.98496 | 99.4 % (92.7 % core 5.6 % allow 1.1 % gener 0.6 % disall) | Sedoheptulose 7-phosphate isomerase |
| Phosphoheptose isomerase, 1X92 | 32 | 0.341 | 0.98528 | ||||
| Phosphoheptose isomerase, 3BJZ | 32 | 0.447 | 0.87983 | ||||
| Phosphoheptose isomerase 1, 1TK9 | 27 | 0.495 | 0.92181 | ||||
| Phosphoheptose Isomerase, 2I2 W | 28 | 0.521 | 0.86321 |
Representation of model structure of HP P44506. a Showing characteristic TIM barrel domain. b Secondary structure prediction of HPs using their three-dimensional structural framework by STRIDE, where α-helix, β-strands, loops, 310 helix and β-bridges are represented in red, green, yellow, blue and pink, respectively (this illustration is applicable for all figures). c Residues present in the active site pocket are illustrated in stick
The DALI server shows high structure similarity of P44506 with proteins with functionality of alanine racemase (Table S6). We observed a significant match with lysine-preferred racemases (Z score = 20.9), alanine racemase (Z score = 20.8), etc. The aligned residues are usually in the range of 221–628 with RMSD in the range of 0.3–3.1 Å2, and similarity usually ranges from 12 to 62 %. We also observed a close structural similarity to d-serine dehydratase. Furthermore, ProFunc (Table S6) server revealed eight motifs in the InterPro (Mulder et al. 2002) database with pyridoxal 5′-phosphate-dependent enzyme motif. An extensive sequence and structure analyses strongly suggest that HP P44506 is a PLP-dependent alanine racemase. Alanine racemase is a PLP-dependent enzyme which is important for bacterial cell wall biosynthesis in which it catalyzes the inter-conversion of alanine enantiomers (Noda et al. 2004).
HP P44641
HP P44641 is localized in cytoplasm and not involved in non-classical secretory pathway and lacking any transmembrane helix (Table S2). The sequence-based function prediction suggests the presence of lysine 2, 3-aminomutase activity in the HP P44641 (Table S3 and S4). The MEME suite also suggests that P44641 may have lysine 2, 3-aminomutase activity. We discovered three sequence motifs, namely 120′-GCAVNC, 236′-IFAHAM, and 88′-GFSTDP (Table 2). This HP is a non-virulent protein (Table S3). The predicted interaction partners of HP P44641 are elongation factor P, lysyl-tRNA synthetase, diaminopimelate decarboxylase, opacity-associated protein, glycogen phosphorylase, biotin synthase, lysyl-tRNA synthetase, acetate CoA-transferase beta subunit, opacity-associated protein OapB and 23S rRNA 5-methyluridine methyltransferase.
The sequence of HP P44641 was also annotated in the Unirpot database to explore its possible function. We found that HP P44641 is annotated as an enzyme l-lysine 2,3-aminomutase which produces (R)-beta-lysine from (S)-alpha-lysine (l-lysine). This protein has several cofactor binding sites including [4Fe–4S] cluster and PLP-binding motif. Family and domain database search analysis indicates that HP P44641 belongs to the radical sam superfamily kama family containing a signature motif CxxxCxxC. The characteristic three-cysteine motif nucleates a [4Fe–4S] cluster, which binds SAM as a ligand to the unique Fe not ligated to a cysteine residue (Frey et al. 2008). The members of this family participate in more than 40 distinct biochemical transformations, and most of the members are not characterized biochemically so far. GO analysis suggest that this is a protein which involved in metabolic process, possesses isomerase like catalytic activity, and a metal-binding protein which specifically binds to the 4 iron and 4 sulfur.
Representation of model structure of HP P44641. a Overall structure is represented in cartoon. b Secondary structure. c Predicted active site residues shown in stick
Furthermore, we observed a significant resemblance with ribosomal RNA large subunit methyltransferase N (Z score = 15.8, RMSD = 2.6 Å2), pyruvate formate-lyase 1-activating enzyme (Z score = 15.0, RMSD = 2.7 Å2), etc. Similarly, ProFunc shows that the predicted structure of P44641 contains nine characteristic motifs with function of lysine-2, 3-aminomutase and four significant ligand-binding templates. Our extensive analysis shows that P44641 contains lysine-2, 3-aminomutase activity. Since lysine is used as a source of energy in bacteria (Lepore et al. 2005). Hence, the lysine-2, 3-aminomutase, a radical SAM-dependent enzyme, performs the inter-conversion of L-α-lysine and L-β-lysine, the elementary step in lysine degradation (Lepore et al. 2005) in the bacteria.
HP P46494
HP P46494 is predicted to be localized in cytoplasm and periplasm as suggested by PSLpred and CELLO, respectively (Table S2). This protein is secretory in nature but lacks signal peptide and transmembrane helix. The function analysis reveals that the HP P46494 comprises DNA topoisomerase activity (Table S3 and S4). The INTERPROSCAN and MOTIF tools identified domain with a function of DNA topoisomerase (type IA, Zn finger). This prediction is further confirmed by MEME suite, which identified three signature sequences in P46494, namely 76′-FGMFIGCSHYPECDFVV, 1′-MNQSLFHH, 115′-RRGRQGKIFY a signature sequence for DNA topoisomerase I, a zinc metalloprotein with three repetitive zinc-binding domains (Tse-Dinh and Beran-Steed 1988). This protein is non-virulent and involved in cellular processes (Table S3). STRING database suggests several interaction partners such as DNA topoisomerase III, shikimate 5-dehydrogenase, ABC transporter ATP-binding protein, DNA-3-methyladenine glycosylase, DNA processing chain A, recombination regulator RecX, peptide deformylase, methionyl-tRNA formyltransferase and recombinase A. Gene ontology analysis suggests that HP P46494 is involved in the DNA binding and causes a topological change in the DNA; hence, it has type 1 DNA topoisomerase-like activity.
Representation of model structure of HP P46494. a Cartoon model showing overall structure with non-bonded sphere describing the membrane. b Secondary structure of P46494. c A detailed description of P46494 active site
We also identified Pro151 is a membrane-embedded residue according to PPM server which calculates rotational and translational positions in a protein structure. The twisting in topoisomerase is essential for its biological activity and Pro151 is one of the essential residues for such conformational changes in this enzyme during catalysis. The structure similarity using DALI server shows a model which is similar to 2GAI only (Z score = 0.4, RMSD = 6.0 Å2) (Table S6). ProFunc has identified three motifs as zf-C4_Topoisom, etc. Further, six ligand-binding templates are also recognized in reference to P46494. These analyses suggest that P46494 is a DNA topoisomerase IA (Zn finger)-like protein. DNA topoisomerase type IA has an exclusive mechanism of strand passage over an enzyme-bridged, ssDNA gate, consequently allowing them to carry out varied reactions in processing structures crucial for replication, recombination and repair (Lee et al. 2013).
HP P44827
HP P44827 is localized in the cytoplasm, lacks any transmembrane helix and is not involved in any secretory pathway (Table S2). HP P44827 contains ribosomal large subunit pseudouridine synthase E activity as suggested by sequence analysis (Table S3 and S4). The MEME suite also predicted a similar function for HP P44827 along with the three annotated motifs 84′-VYAAGRLDRDSEGLLILTNNGELQHRLADPKFKTEKTYWVQVEGI, 51′-TKVVLFNK PFDVLTQFTDEQGRATLKD, and 178′-WLEIKISEGRNRQVRRMTAHIGFP (Table 2). Uniprot has also annotated this HP as ribosomal large subunit pseudouridine synthase E (rluE) which is responsible for synthesis of pseudouridine from uracil-2457 in 23S ribosomal RNA. Such enzymes catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). The domain surface is populated by conserved, charged residues that define a likely RNA-binding site. Further, P44827 is involved in metabolism and a non-virulent protein (Table S3). The STRING database suggests that HP P44827 interacts with lipoprotein E, β-hexosaminidase, 23S rRNA pseudouridylate synthase C, adenylosuccinate lyase, transport protein and tRNA-specific 2-thiouridylase MnmA.
Representation of model structure of HP P44827. a Cartoon diagram showing mixed alpha/beta fold. Collection of non-bonded spheres represents membrane. b Secondary structure evaluation using predicted three-dimensional structure. c Stick representation of P44827 active site, with Asp91 residue is proposed to be involve in nucleic acid binding
Structure similarity searches clearly indicates that HP P44827 has a close structure similarity to the small subunit of pseudouridine synthase (Z score = 23.7, RMSD = 3.3 Å2), and hence this protein may possess pseudouridine synthase-like activity. We found a similar structural pattern with six pseudouridine synthase on ProFunc analysis. These observations suggest that HP P44827 may be a pseudouridine synthase E. There are five characterized subfamilies of Ψ synthases in prokaryotes on the basis of sequence conservation (Gustafsson et al. 1996). The pseudouridine synthase RluE is classified as a member of RsuA family (Del Campo et al. 2001) and modifies the single site Ψ2457 on a stem of 23S RNA.
HP Q57151
The sequence analysis showed that the HP Q57151 is localized in cytoplasm and is not involved in secretory mechanisms (Table S2). Sequence-based function analysis clearly indicates that HP Q57151 is a hydroxypyruvate isomerase and a non-virulent protein (Table S3 and S4). We identified three motif repeats in HP Q57151 as 99′-CPNVHIM, 71′-WGGSAI, 78′-DYFHAQ (Table 2). The predicted functional partners for Q57151 are 3-hydroxyisobutyrate dehydrogenase, putative aldolase, glycerate dehydrogenase, glycerol-3-phosphate regulon repressor, gluconate permease, D-xylose transporter subunit XylF and cAMP-regulatory protein indicating its importance for the survival of the organism.
Uniprot annotation suggests that HP Q57151 is a putative hydroxypyruvate isomerase which catalyzes the reversible isomerization between hydroxypyruvate and 2-hydroxy-3-oxopropanoate. Domain annotation suggests that HP Q57151 contains a structural motif with a beta/alpha TIM barrel which is found in several proteins families including xylose isomerase. Family analysis suggests that HP Q57151 belongs to the hydroxypyruvate isomerase Hyi and possesses hydroxypyruvate isomerase activity.
Representation of model structure of HP Q57151. a Cartoon model showing overall topology described by predicted structure. b Description of secondary structure prediction in the HP. c A detailed description of active site
The predicted structure of HP Q57151 is quite similar to those of D-tagatose 3-epimerase (Z score = 26.1, RMSD = 2.3 Å2), L-ribulose 3-epimerase (Z score = 26.1, RMSD = 2.4 Å2), etc., indicating that this HP may act as an epimerase. Moreover, structure-based function prediction using ProFunc shows this protein may acts as hydroxypyruvate isomerase, xylose isomerase-like, etc. All these finding suggest that the HP Q57151 is actually hydroxypyruvate isomerase which catalyzes a reversible conversion of hydroxypyruvate from tartronate semialdehyde (de Windt and van der Drift 1980).
HP P44094
HP P44094 is a cytoplasmic, non-virulent and non-secretory protein (Table S2). We observed that HP P44094 contains a nucleoside-diphosphate-sugar epimerase domain (Table S3 and S4). MEME suite analysis suggests the presence of three significant motifs in the sequence of Q57151 namely 149′-MCELLINDYSRKGFVDGIVVRLPTICIRPGKPNKAASSFVSSIMREPLHG, 55′-CPVSEE and 291′-QALALGFKV (Table 2). STRING analysis suggests that gluconate permease, putative aldolase, 3-hydroxyisobutyrate dehydrogenase and glycerol-3-phosphate regulon repressor are the functional network partner of HP P44094. Sequence similarities search suggest that HP P44094 belongs to the NAD(P)-dependent epimerase/dehydratase family. However, a detail annotation of this HP is not available at the Uniprot.
Representation of model structure of HP P44094. a Overall structure of P44094 shown in cartoon model with membrane is represented as non-bonded spheres. b Secondary structure of HP P44094. c Representation of the active site residues of P44094 in stick model
HP P45104
HP P45104 is localized in cytoplasm and lacks signal peptide (Table S2). It contains the domain with activity of ribosomal large subunit pseudouridine synthase (Table S3 and S4). The MEME suite analysis shows the presence of three significant motifs namely 176′-WIAVGRLDINTSGLLLFTTDGELANRLMHPSREVEREYSVRV FGQ, 140′-CRVLMYYKPEGELCTRSDPEGRATVFD and 256′-WYDVTLMEGRNREVRRLWESQGIQ, indicating a functional resemblance with ribosomal large subunit pseudouridine synthase B (Table 2). This protein is also annotated as rluB in the Uniprot database and belongs to the pseudouridine synthase RsuA family. Interaction networking partners of HP P45104 are 23S rRNA pseudouridine synthase D, transcriptional regulator CysB, 23S rRNA pseudouridylate synthase C, tRNA pseudouridine synthase B, GTP-binding protein EngA, 30S ribosomal protein S1 and cytidylate kinase. This also confirms its predicted function.
Representation of model structure of HP P45104. a Three-dimensional structure represented in cartoon model with membrane represented as non-bonded spheres. b Representation of secondary structure predicted using STRIDE. c Representation of the active site residues of HP P45104 in stick model
HP P45104 shows a close resemblance with the ribosomal large subunit pseudouridine synthase B and ribosomal large subunit pseudouridine synthase F. These findings are clearly indicating that HP P45104 may functions as a large subunit pseudouridine synthase B. This enzyme catalyzes the conversion of U2605 to pseudouridine (Ψ) in a stem-loop of 23S rRNA (Czudnochowski et al. 2013), while close homolog RluF isomerizes the adjacent nucleotide in the stem, i.e., U2604 (Czudnochowski et al. 2013).
HP P71373
HP P71373 was predicted as a virulent protein localized in the cytoplasm (Table S2). This protein is also a non-secretory protein and lacks transmembrane helix. The function prediction shows that HP P71373 may be an epimerase amidophosphoribosyltransferase (Table S3 and S4). Motif analysis also suggests the presence of epimerase activity in the HP P71373 (Table 2). HP P71373 was also annotated as epimerase family protein HI_1208 in the uniprot database and belongs to the NAD(P)-dependent epimerase/dehydratase family. The STRING predicts arginine repressor, malate dehydrogenase, ferrochelatase, lipoyltransferase, 2-oxoglutarate dehydrogenase E2 component dihydrolipoamide succinyltransferase and dihydrolipoamide acetyltransferase as functional networking partners.
Representation of model structure of HP P71373. a Cartoon model representation of overall structure in which membrane is represented by non-bonded atoms. b Predicted secondary structure. c Representation of the active site residues of HP P71373 in stick model
HP P44160
HP P44160 is a secretory protein present in the cytoplasm (Table S2). There is no transmembrane helix present in the sequence of P44160. The motif and domain analysis suggests that the HP P44160 is an aldose 1-epimerase enzyme which is important for metabolic pathways like glycolysis and gluconeogenesis (Chittori et al. 2007) (Table S3 and S4). Uniprot annotation has also indicated that HP P44160 is a putative glucose-6-phosphate 1-epimerase which converts α-d-glucose 6-phosphate to β-d-glucose 6-phosphate. Furthermore, GO analysis indicated that this protein is involved in the carbohydrate metabolic process. Interestingly, sequence similarity search also suggest that this HP belongs to the glucose-6-phosphate 1-epimerase family. The HP P44160 is a virulent protein involved in cellular process. It contains three motifs predicted by MEME suite, namely 86′-QPAHGT, 75′-PICYPW and 29′-CGWNTKNFPC (Table 2). The predicted partners for P44160 are glucose-6-phosphate isomerase, glucose-specific PTS system component, keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase, transaldolase B, deoxyribose-phosphate aldolase, transketolase, fructose-bisphosphate aldolase, aldose 1-epimerase and UDP-glucose 4-epimerase, indicating the role HP P44160 in carbohydrate metabolism.
Representation of model structure of HP P44160. a Showing characteristic β-sandwich topology. b Detailed description of secondary structure using STRIDE. c The active site pocket is illustrated in stick representation
HP O86237
The HP O86237 is a cytoplasmic protein showing tautomerase/MIF activity (Table S2 and Table 3). GO annotation has also indicated that HP O86237 is involved in the cellular aromatic compound metabolic process and possesses isomerase activity. Interestingly, family and domain database search has also indicated that this HP belongs to the 4-oxalocrotonate_tautomerase family. These predictions were further confirmed by understanding the interaction network of O86237 using STRING database which shows HP O86237 interacts with anthranilate phosphoribosyl transferase, bifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase, anthranilate synthase component II and anthranilate synthase component I.
Representation of model structure of HP O86237. a Overall structure is represented in cartoon form. b Secondary structure of HP O86237. c Detailed description of active site of O86237
HP Q57152
The PSLpred server shows that HP Q57152 is localized in periplasm, while CELLO suggests cytoplasmic localization (Table S2). This is a virulent protein involved in cellular processes with tRNA pseudouridine synthase C activity (Table S3 and S4). Uniprot annotation has also indicated that HP Q57152 is similar to the N-terminal of E. carotovora exoenzyme regulation regulon ORF1 and the C-terminal part is colinear with YqcB. YqcC-like structural domain found in the N-terminal of some tRNA pseudouridine synthase C proteins, as well as other uncharacterised proteins.
Results were validated by STRING which shows interaction network contains putative sulfate transport protein CysZ, penicillin-binding protein 1B, N-acetylmuramic acid-6-phosphate etherase, opacity protein, anhydro-N-acetylmuramic acid kinase, ATP-dependent helicase along with Q57152. These predictions are further validated by MEME suite which identified three sequence-based motifs namely, 51′-WVFIPRM, 72′-AISPYI and 38′-FSIDTM.
Representation of model structure of HP Q57152. a Describing all alpha helix topology in carton form. b Secondary structure prediction and assignment of STRIDE. c Description of active site residues in stick form
HP P44268
Subcellular localization of HP P44268 suggests that it is localized in cytoplasm with no transmembrane helix and is not involved in any secretory pathways (Table S2). Sequence-based function predictions show that HP P44268 may possess xylose isomerase activity (Table S3 and S4). Uniprot annotation has also indicated that HP P44268 contains a Xyl_isomerase-like TIM barrel domain and belongs to the UPF0276 family that functionally uncharacterised. We further validated our prediction by analyzing the interaction network of P44268 which primarily includes RNA polymerase sigma factor and phosphate transport regulator. We identified three sequence-based motifs in the HP P44268 are 249′-KGTVWD, 99′-CECEGH and 35′-ENWSKM which are helpful in validating the annotation results.
Representation of model structure of HP P44268. a Showing characteristic TIM barrel topology. b Secondary structure prediction of HP P44268. c Representation of active site residues in stick
HP P52606
Sequence analysis of HP P52606 indicates that this protein is localized in the cytoplasm (Table S2). Sequence similarity search and domain analysis show that P51606 may have sedoheptulose 7-phosphate isomerase like activity (Table S3 and S4). Uniprot annotation has also indicated that HP P52606 is involved in carbohydrate metabolic process and acts as isomerase. Furthermore, sequence similarity search suggest that HP P52606 contains a SIS, a phosphosugar-binding domain and belongs to the DiaA subfamily, a DnaA initiator-associating protein DiaA which is required for the timely initiation of chromosomal replication via direct interactions with the DnaA initiator protein, required for DNA replication. We further validated the result using protein–protein interaction analysis that shows that this protein interacted with bifunctional heptose 7-phosphate kinase/heptose 1-phosphate adenyltransferase, antigen, chromosomal replication initiation protein, imidazole glycerol-phosphate dehydratase/histidinol phosphatase and D-heptose 1, 7-bisphosphate phosphatase.
Representation of model structure of HP P52606. a Illustration of beta sandwich topology of P52606. b Predicted Secondary structural elements in P52606. c Stick representation of active site residues
Conclusions
The isomerases have an important role in the virulence of pathogens such as UDP N-acetylgalactosamine 4-epimerase which is found to be involved in the formation of smooth lipopolysaccharide and is essential for the virulence of mesophilic aeromonas hydrophila serotype O34 (Canals et al. 2006). Similarly, UDP-glucose 4-epimerase encoded by galE gene plays an important part in lipopolysaccharide biosynthesis which is one of the main virulence factors of bacterial pathogens (Fry et al. 2000). Our extensive analysis of structures of 13 isomerases characterized from 429 HPs of H. influenzae is helpful in identification of putative drug targets for better drug design. HP P71373 annotated as nucleoside-diphosphate-sugar epimerase. Four HPs were identified as a virulent protein which can be used to better understand the virulence mechanism of H. influenzae and search for a potential target for therapeutic intervention. Isomerases clearly play a central role in the relationship between bacteria and the host. Our structure-based function elucidation provides an insight how microbes interact with the hosts and will contribute significantly to our understanding of both the isomerase molecule and bacterial pathogenesis in the future.
Notes
Acknowledgments
Authors sincerely thank Indian Council of Medical Research for financial assistance (Project No. BIC/12(04)/2012).
Conflict of interest
Authors declare no conflict of interest regarding any financial and personal relationships with other people or organizations that could inappropriately influence (bias) this work.
Supplementary material
References
- Accelrys (2013) Discovery studio modeling environment, Release 3.5., San Diego, Accelrys Software IncGoogle Scholar
- Alexandrov A, Martzen MR, Phizicky EM (2002) Two proteins that form a complex are required for 7-methylguanosine modification of yeast tRNA. RNA 8:1253–1266CrossRefGoogle Scholar
- Almrud JJ, Kern AD, Wang SC, Czerwinski RM, Johnson WH Jr, Murzin AG, Hackert ML, Whitman CP (2002) The crystal structure of YdcE, a 4-oxalocrotonate tautomerase homologue from Escherichia coli, confirms the structural basis for oligomer diversity. Biochemistry 41:12010–12024CrossRefGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410CrossRefGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402CrossRefGoogle Scholar
- Apisarnthanarak A, Mundy LM (2005) Etiology of community-acquired pneumonia. Clin Chest Med 26:47–55CrossRefGoogle Scholar
- Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS (2009) MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208CrossRefGoogle Scholar
- Baker D, Sali A (2001) Protein structure prediction and structural genomics. Science 294:93–96CrossRefGoogle Scholar
- Baron C, Coombes B (2007) Targeting bacterial secretion systems: benefits of disarmament in the microcosm. Infect Disord Drug Targets 7:19–27CrossRefGoogle Scholar
- Bendtsen JD, Kiemer L, Fausboll A, Brunak S (2005) Non-classical protein secretion in bacteria. BMC Microbiol 5:58CrossRefGoogle Scholar
- Bernstein FC, Koetzle TF, Williams GJ, Meyer EF Jr, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M (1978) The Protein Data Bank: a computer-based archival file for macromolecular structures. Arch Biochem Biophys 185:584–591CrossRefGoogle Scholar
- Bhasin M, Garg A, Raghava GP (2005) PSLpred: prediction of subcellular localization of bacterial proteins. Bioinformatics 21:2522–2524CrossRefGoogle Scholar
- Bjornson HS (1984) Enzymes associated with the survival and virulence of gram-negative anaerobes. Rev Infect Dis 6(Suppl 1):S21–S24CrossRefGoogle Scholar
- Brooks BR, Brooks CL 3rd, Mackerell AD Jr, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S et al (2009) CHARMM: the biomolecular simulation program. J Comput Chem 30:1545–1614CrossRefGoogle Scholar
- Cai CZ, Han LY, Ji ZL, Chen X, Chen YZ (2003) SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence. Nucleic Acids Res 31:3692–3697CrossRefGoogle Scholar
- Campos J (2001) Haemophilus influenzae: from the post-vaccination era to antibiotic resistance. Clin Microbiol Infect 7:287–290CrossRefGoogle Scholar
- Canals R, Jimenez N, Vilches S, Regue M, Merino S, Tomas JM (2006) The UDP N-acetylgalactosamine 4-epimerase gene is essential for mesophilic Aeromonas hydrophila serotype O34 virulence. Infect Immun 74:537–548CrossRefGoogle Scholar
- Capdeville R, Buchdunger E, Zimmermann J, Matter A (2002) Glivec (STI571, imatinib), a rationally developed, targeted anticancer drug. Nat Rev Drug Discov 1:493–502CrossRefGoogle Scholar
- Champoux JJ (2001) DNA topoisomerases: structure, function, and mechanism. Annu Rev Biochem 70:369–413CrossRefGoogle Scholar
- Chen R, Jeong SS (2000) Functional prediction: identification of protein orthologs and paralogs. Protein Sci 9:2344–2353CrossRefGoogle Scholar
- Chittori S, Simanshu DK, Savithri HS, Murthy MR (2007) Structure of the putative mutarotase YeaD from Salmonella typhimurium: structural comparison with galactose mutarotases. Acta Crystallogr D Biol Crystallogr 63:197–205CrossRefGoogle Scholar
- Chothia C, Lesk AM (1986) The relation between the divergence of sequence and structure in proteins. EMBO J 5:823–826Google Scholar
- Colovos C, Yeates TO (1993) Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci 2:1511–1519CrossRefGoogle Scholar
- Czudnochowski N, Ashley GW, Santi DV, Alian A, Finer-Moore J, Stroud RM (2013) The mechanism of pseudouridine synthases from a covalent complex with RNA, and alternate specificity for U2605 versus U2604 between close homologs. Nucleic Acids Res 42(3):2037–2048CrossRefGoogle Scholar
- de Windt FE, van der Drift C (1980) Purification and some properties of hydroxypyruvate isomerase of Bacillus fastidiosus. Biochim Biophys Acta 613:556–562CrossRefGoogle Scholar
- Del Campo M, Kaya Y, Ofengand J (2001) Identification and site of action of the remaining four putative pseudouridine synthases in Escherichia coli. RNA 7:1603–1615Google Scholar
- DeLano WL (ed) (2002) The PyMOL molecular graphics system. L Schrödinger, DeLano Scientific, San CarlosGoogle Scholar
- Desler C, Durhuus JA, Rasmussen LJ (2012) Genome-wide screens for expressed hypothetical proteins. Methods Mol Biol 815:25–38CrossRefGoogle Scholar
- Doerks T, von Mering C, Bork P (2004) Functional clues for hypothetical proteins based on genomic context analysis in prokaryotes. Nucleic Acids Res 32:6321–6326CrossRefGoogle Scholar
- Donn RP, Ray DW (2004) Macrophage migration inhibitory factor: molecular, cellular and genetic aspects of a key neuroendocrine molecule. J Endocrinol 182:1–9CrossRefGoogle Scholar
- Dormann P, Benning C (1998) The role of UDP-glucose epimerase in carbohydrate metabolism of Arabidopsis. Plant J 13:641–652CrossRefGoogle Scholar
- Eisenberg D, Luthy R, Bowie JU (1997) VERIFY3D: assessment of protein models with three-dimensional profiles. Methods Enzymol 277:396–404CrossRefGoogle Scholar
- Eisenreich W, Rohdich F, Bacher A (2001) Deoxyxylulose phosphate pathway to terpenoids. Trends Plant Sci 6:78–84CrossRefGoogle Scholar
- Eldika N, Sethi S (2006) Role of nontypeable Haemophilus influenzae in exacerbations and progression of chronic obstructive pulmonary disease. Curr Opin Pulm Med 12:118–124CrossRefGoogle Scholar
- Emanuelsson O, Brunak S, von Heijne G, Nielsen H (2007) Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc 2:953–971CrossRefGoogle Scholar
- Eswar N, Webb B, Marti-Renom MA, Madhusudhan MS, Eramian D, Shen MY, Pieper U, Sali A (2006) Comparative protein structure modeling using Modeller. Curr Protoc Bioinformatics Chapter 5: Unit 5 6Google Scholar
- Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM et al (1995) Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512CrossRefGoogle Scholar
- Frey PA, Hegeman AD, Ruzicka FJ (2008) The radical SAM superfamily. Crit Rev Biochem Mol Biol 43:63–88CrossRefGoogle Scholar
- Fry BN, Feng S, Chen YY, Newell DG, Coloe PJ, Korolik V (2000) The galE gene of Campylobacter jejuni is involved in lipopolysaccharide synthesis and virulence. Infect Immun 68:2594–2601CrossRefGoogle Scholar
- Galperin MY (2001) Conserved ‘hypothetical’ proteins: new hints and new puzzles. Comp Funct Genomics 2:14–18CrossRefGoogle Scholar
- Galperin MY (2004) Bacterial signal transduction network in a genomic perspective. Environ Microbiol 6:552–567CrossRefGoogle Scholar
- Galperin MY, Koonin EV (2004) ‘Conserved hypothetical’ proteins: prioritization of targets for experimental study. Nucleic Acids Res 32:5452–5463CrossRefGoogle Scholar
- Garg A, Gupta D (2008) VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens. BMC Bioinformatics 9:62CrossRefGoogle Scholar
- Geer LY, Domrachev M, Lipman DJ, Bryant SH (2002) CDART: protein homology by domain architecture. Genome Res 12:1619–1623CrossRefGoogle Scholar
- Gerlt JA, Raushel FM (2003) Evolution of function in (beta/alpha)8-barrel enzymes. Curr Opin Chem Biol 7:252–264CrossRefGoogle Scholar
- Gough J, Karplus K, Hughey R, Chothia C (2001) Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol 313:903–919CrossRefGoogle Scholar
- Graille M, Baltaze JP, Leulliot N, Liger D, Quevillon-Cheruel S, van Tilbeurgh H (2006) Structure-based functional annotation: yeast ymr099c codes for a D-hexose-6-phosphate mutarotase. J Biol Chem 281:30175–30185CrossRefGoogle Scholar
- Gustafsson C, Reid R, Greene PJ, Santi DV (1996) Identification of new RNA modifying enzymes by iterative genome search using known modifying enzymes as probes. Nucleic Acids Res 24:3756–3762CrossRefGoogle Scholar
- Hassan MI, Ahmad F (2011) Structural diversity of class I MHC-like molecules and its implications in binding specificities. Adv Protein Chem Struct Biol 83:223–270CrossRefGoogle Scholar
- Hassan MI, Kumar V, Singh TP, Yadav S (2007a) Structural model of human PSA: a target for prostate cancer therapy. Chem Biol Drug Des 70:261–267CrossRefGoogle Scholar
- Hassan MI, Kumar V, Somvanshi RK, Dey S, Singh TP, Yadav S (2007b) Structure-guided design of peptidic ligand for human prostate specific antigen. J Pept Sci 13:849–855CrossRefGoogle Scholar
- Hassan MI, Bilgrami S, Kumar V, Singh N, Yadav S, Kaur P, Singh T (2008) Crystal structure of the novel complex formed between zinc α2-glycoprotein (ZAG) and prolactin-inducible protein (PIP) from human seminal plasma. J Mol Biol 384:663–672CrossRefGoogle Scholar
- Hassan MI, Waheed A, Grubb JH, Klei HE, Korolev S, Sly WS (2013) High resolution crystal structure of human β-Glucuronidase reveals structural basis of lysosome targeting. PLoS One 8:e79687CrossRefGoogle Scholar
- Heinig M, Frishman D (2004) STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res 32:W500–W502CrossRefGoogle Scholar
- Hooft RW, Vriend G, Sander C, Abola EE (1996) Errors in protein structures. Nature 381:272CrossRefGoogle Scholar
- Hubbard TJ, Ailey B, Brenner SE, Murzin AG, Chothia C (1999) SCOP: a Structural Classification of Proteins database. Nucleic Acids Res 27:254–256CrossRefGoogle Scholar
- Illergard K, Ardell DH, Elofsson A (2009) Structure is three to ten times more conserved than sequence–a study of structural response in protein cores. Proteins 77:499–508CrossRefGoogle Scholar
- Jackman JE, Montange RK, Malik HS, Phizicky EM (2003) Identification of the yeast gene encoding the tRNA m1G methyltransferase responsible for modification at position 9. RNA 9:574–585CrossRefGoogle Scholar
- Jenal U (2004) Cyclic di-guanosine-monophosphate comes of age: a novel secondary messenger involved in modulating cell surface structures in bacteria? Curr Opin Microbiol 7:185–191CrossRefGoogle Scholar
- Jorgensen JH (1991) Global perspective on antimicrobial resistance in Haemophilus influenzae. J Chemother 3(Suppl 1):155–157Google Scholar
- Kanehisa M (1997) Linking databases and organisms: genomeNet resources in Japan. Trends Biochem Sci 22:442–444CrossRefGoogle Scholar
- Kaplan W, Littlejohn TG (2001) Swiss-PDB viewer (deep view). Brief Bioinform 2:195–197CrossRefGoogle Scholar
- Khun HH, Kirby SD, Lee BC (1998) A Neisseria meningitidis fbpABC mutant is incapable of using nonheme iron for growth. Infect Immun 66:2330–2336Google Scholar
- Kim DE, Chivian D, Baker D (2004) Protein structure prediction and analysis using the Robetta server. Nucleic Acids Res 32:W526–W531CrossRefGoogle Scholar
- Kirby SD, Gray-Owen SD, Schryvers AB (1997) Characterization of a ferric-binding protein mutant in Haemophilus influenzae. Mol Microbiol 25:979–987CrossRefGoogle Scholar
- Klebe G (2000) Recent developments in structure-based drug design. J Mol Med (Berl) 78:269–281CrossRefGoogle Scholar
- Kostyanev TS, Sechanova LP (2012) Virulence factors and mechanisms of antibiotic resistance of haemophilus influenzae. Folia Med (Plovdiv) 54:19–23Google Scholar
- Krivov GG, Shapovalov MV, Dunbrack RL Jr (2009) Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77:778–795CrossRefGoogle Scholar
- Krogh A, Larsson B, von Heijne G, Sonnhammer EL (2001) Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol 305:567–580CrossRefGoogle Scholar
- Kuhnert P (2008) Pasteurellaceae: biology, genomics and molecular aspects. Caister Academic PressGoogle Scholar
- Kumar K, Prakash A, Islam A, Ahmad F, Hassan MI (2014a) Structure based functional annotation of hypothetical proteins from Candida dubliniensis: a quest for novel drug target. 3 Biotech. doi: 10.1007/s13205-014-0256-3 Google Scholar
- Kumar K, Prakash A, Tasleem M, Islam A, Ahmad F, Hassan MI (2014b) Functional annotation of putative hypothetical proteins from Candida dubliniensis. Gene 543:93–100CrossRefGoogle Scholar
- Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM (1996) AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR 8:477–486Google Scholar
- Laskowski RA, Watson JD, Thornton JM (2005) ProFunc: a server for predicting protein function from 3D structure. Nucleic Acids Res 33:W89–W93CrossRefGoogle Scholar
- Laurie AT, Jackson RM (2005) Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites. Bioinformatics 21:1908–1916CrossRefGoogle Scholar
- Leaves NI, Dimopoulou I, Hayes I, Kerridge S, Falla T, Secka O, Adegbola RA, Slack MP, Peto TE, Crook DW (2000) Epidemiological studies of large resistance plasmids in Haemophilus. J Antimicrob Chemother 45:599–604CrossRefGoogle Scholar
- Lee SH, Siaw GE, Willcox S, Griffith JD, Hsieh TS (2013) Synthesis and dissolution of hemicatenanes by type IA DNA topoisomerases. Proc Natl Acad Sci USA 110:E3587–E3594CrossRefGoogle Scholar
- Lepore BW, Ruzicka FJ, Frey PA, Ringe D (2005) The x-ray crystal structure of lysine-2,3-aminomutase from Clostridium subterminale. Proc Natl Acad Sci USA 102:13819–13824CrossRefGoogle Scholar
- Letunic I, Doerks T, Bork P (2012) SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Res 40:D302–D305CrossRefGoogle Scholar
- Lomize MA, Pogozheva ID, Joo H, Mosberg HI, Lomize AL (2012) OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Res 40:D370–D376CrossRefGoogle Scholar
- Lopez G, Maietta P, Rodriguez JM, Valencia A, Tress ML (2011) firestar–advances in the prediction of functionally important residues. Nucleic Acids Res 39:W235–W241CrossRefGoogle Scholar
- Luthy R, Bowie JU, Eisenberg D (1992) Assessment of protein models with three-dimensional profiles. Nature 356:83–85CrossRefGoogle Scholar
- Markel TA, Crisostomo PR, Wang M, Herring CM, Meldrum KK, Lillemoe KD, Meldrum DR (2007) The struggle for iron: gastrointestinal microbes modulate the host immune response during infection. J Leukoc Biol 81:393–400CrossRefGoogle Scholar
- Marti-Renom MA, Stuart AC, Fiser A, Sanchez R, Melo F, Sali A (2000) Comparative protein structure modeling of genes and genomes. Annu Rev Biophys Biomol Struct 29:291–325CrossRefGoogle Scholar
- Meinel T, Krause A, Luz H, Vingron M, Staub E (2005) The SYSTERS Protein Family Database in 2005. Nucleic Acids Res 33:D226–D229CrossRefGoogle Scholar
- Mi H, Lazareva-Ulitsky B, Loo R, Kejariwal A, Vandergriff J, Rabkin S, Guo N, Muruganujan A, Doremieux O, Campbell MJ et al (2005) The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res 33:D284–D288CrossRefGoogle Scholar
- Misura KM, Chivian D, Rohl CA, Kim DE, Baker D (2006) Physically realistic homology models built with ROSETTA can be more accurate than their templates. Proc Natl Acad Sci USA 103:5361–5366CrossRefGoogle Scholar
- Morton DJ, Bakaletz LO, Jurcisek JA, VanWagoner TM, Seale TW, Whitby PW, Stull TL (2004a) Reduced severity of middle ear infection caused by nontypeable Haemophilus influenzae lacking the hemoglobin/hemoglobin-haptoglobin binding proteins (Hgp) in a chinchilla model of otitis media. Microb Pathog 36:25–33CrossRefGoogle Scholar
- Morton DJ, Smith A, Ren Z, Madore LL, VanWagoner TM, Seale TW, Whitby PW, Stull TL (2004b) Identification of a haem-utilization protein (Hup) in Haemophilus influenzae. Microbiology 150:3923–3933CrossRefGoogle Scholar
- Morton DJ, Madore LL, Smith A, Vanwagoner TM, Seale TW, Whitby PW, Stull TL (2005) The heme-binding lipoprotein (HbpA) of Haemophilus influenzae: role in heme utilization. FEMS Microbiol Lett 253:193–199CrossRefGoogle Scholar
- Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Biswas M, Bradley P, Bork P, Bucher P et al (2002) InterPro: an integrated documentation resource for protein families, domains and functional sites. Brief Bioinform 3:225–235CrossRefGoogle Scholar
- Noda M, Matoba Y, Kumagai T, Sugiyama M (2004) Structural evidence that alanine racemase from a D-cycloserine-producing microorganism exhibits resistance to its own product. J Biol Chem 279:46153–46161CrossRefGoogle Scholar
- Pfeifer Y, Meisinger I, Brechtel K, Grobner S (2013) Emergence of a multidrug-resistant Haemophilus influenzae strain causing chronic pneumonia in a patient with common variable immunodeficiency. Microb Drug Resist 19:1–5CrossRefGoogle Scholar
- Pontius J, Richelle J, Wodak SJ (1996) Deviations from standard atomic volumes as a quality measure for protein crystal structures. J Mol Biol 264:121–136CrossRefGoogle Scholar
- Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J et al (2011) The Pfam protein families database. Nucleic Acids Res 40:D290–D301CrossRefGoogle Scholar
- Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R (2005) InterProScan: protein domains identifier. Nucleic Acids Res 33:W116–W120CrossRefGoogle Scholar
- Ramachandran GN, Ramakrishnan C, Sasisekharan V (1963) Stereochemistry of polypeptide chain configurations. J Mol Biol 7:95–99CrossRefGoogle Scholar
- Rappoport N, Karsenty S, Stern A, Linial N, Linial M (2011) ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree. Nucleic Acids Res 40:D313–D320CrossRefGoogle Scholar
- Reffuveille F, Connil N, Sanguinetti M, Posteraro B, Chevalier S, Auffray Y, Rince A (2012) Involvement of peptidylprolyl cis/trans isomerases in Enterococcus faecalis virulence. Infect Immun 80:1728–1735CrossRefGoogle Scholar
- Ren P, Rossettini A, Chaturvedi V, Hanes SD (2005) The Ess1 prolyl isomerase is dispensable for growth but required for virulence in Cryptococcus neoformans. Microbiology 151:1593–1605CrossRefGoogle Scholar
- Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5:725–738CrossRefGoogle Scholar
- Roy A, Yang J, Zhang Y (2012) COFACTOR: an accurate comparative algorithm for structure-based protein function annotation. Nucleic Acids Res 40:W471–W477CrossRefGoogle Scholar
- Saha S, Raghava GP (2006) VICMpred: an SVM-based method for the prediction of functional proteins of Gram-negative bacteria using amino acid patterns and composition. Genomics Proteomics Bioinform 4:42–47CrossRefGoogle Scholar
- Saha SK, Darmstadt GL, Baqui AH, Islam N, Qazi S, Islam M, El Arifeen S, Santosham M, Black RE, Crook DW (2008) Direct detection of the multidrug resistance genome of Haemophilus influenzae in cerebrospinal fluid of children: implications for treatment of meningitis. Pediatr Infect Dis J 27:49–53CrossRefGoogle Scholar
- Sanchez S, Smiley KL (1975) Properties of D-xylose isomerase from Streptomyces albus. Appl Microbiol 29:745–750Google Scholar
- Shahbaaz M, Hassan MI, Ahmad F (2013) Functional annotation of conserved hypothetical proteins from haemophilus influenzae Rd KW20. PLoS ONE 8:e84263CrossRefGoogle Scholar
- Shahbaaz M, Ahmad F, Imtaiyaz Hassan M (2014) Structure-based functional annotation of putative conserved proteins having lyase activity from Haemophilus influenzae. Biotech. doi: 10.1007/s13205-014-0231-z:1-20 Google Scholar
- Shapiro L, Harris T (2000) Finding function through structural genomics. Curr Opin Biotechnol 11:31–35CrossRefGoogle Scholar
- Sillitoe I, Cuff AL, Dessailly BH, Dawson NL, Furnham N, Lee D, Lees JG, Lewis TE, Studer RA, Rentzsch R et al (2012) New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res 41:D490–D498CrossRefGoogle Scholar
- Sillitoe I, Cuff AL, Dessailly BH, Dawson NL, Furnham N, Lee D, Lees JG, Lewis TE, Studer RA, Rentzsch R et al (2013) New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res 41:D490–D498CrossRefGoogle Scholar
- Singh A, Kumar Thakur P, Meena M, Kumar D, Bhatnagar S, Dubey AK, Hassan I (2014) Interaction between Basic 7S Globulin and Leginsulin in Soybean [Glycine max]: a structural insight. Lett Drug Des Discovery 11:231–239CrossRefGoogle Scholar
- Sinha A, Ahmad F, Hassan MI (2014) Structure based functional annotation of putative conserved proteins from treponema pallidum: search for a potential drug target. Lett Drug Des Discovery 12:46–59CrossRefGoogle Scholar
- Soding J, Biegert A, Lupas AN (2005) The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33:W244–W248CrossRefGoogle Scholar
- Soma A, Ikeuchi Y, Kanemasa S, Kobayashi K, Ogasawara N, Ote T, Kato J, Watanabe K, Sekine Y, Suzuki T (2003) An RNA-modifying enzyme that governs both the codon and amino acid specificities of isoleucine tRNA. Mol Cell 12:689–698CrossRefGoogle Scholar
- Stojiljkovic I, Perkins-Balding D (2002) Processing of heme and heme-containing proteins by bacteria. DNA Cell Biol 21:281–295CrossRefGoogle Scholar
- Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P et al (2011a) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39:D561–D568CrossRefGoogle Scholar
- Szklarczyk D, Franceschini A, Kuhn M, Simonovic M, Roth A, Minguez P, Doerks T, Stark M, Muller J, Bork P et al (2011b) The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored. Nucleic Acids Res 39:D561–D568CrossRefGoogle Scholar
- Tasleem M, Ishrat R, Islam A, Ahmad F, Hassan MI (2014) Structural characterization, homology modeling and docking studies of ARG674 Mutation in MyH8 Gene associated with trismus-pseudocamptodactyly syndrome. Lett Drug Des Discov 11:1177–1187CrossRefGoogle Scholar
- Taylor WR, Orengo CA (1989) Protein structure alignment. J Mol Biol 208:1–22CrossRefGoogle Scholar
- Taylor PL, Blakely KM, de Leon GP, Walker JR, McArthur F, Evdokimova E, Zhang K, Valvano MA, Wright GD, Junop MS (2008) Structure and function of sedoheptulose-7-phosphate isomerase, a critical enzyme for lipopolysaccharide biosynthesis and a target for antibiotic adjuvants. J Biol Chem 283:2835–2845CrossRefGoogle Scholar
- Thakur PK, Kumar J, Ray D, Anjum F, Hassan MI (2013a) Search of potential inhibitor against New Delhi metallo-beta-lactamase 1 from a series of antibacterial natural compounds. J Nat Sci Biol Med 4:51–56CrossRefGoogle Scholar
- Thakur PK, Prakash A, Khan P, Fleming RE, Waheed A, Ahmad F, Hassan MI (2013b) Identification of interfacial residues involved in hepcidin-ferroportin interaction. Lett Drug Des Discov 11:363–374CrossRefGoogle Scholar
- Tristram S, Jacobs MR, Appelbaum PC (2007) Antimicrobial resistance in Haemophilus influenzae. Clin Microbiol Rev 20:368–389CrossRefGoogle Scholar
- Tse-Dinh YC, Beran-Steed RK (1988) Escherichia coli DNA topoisomerase I is a zinc metalloprotein with three repetitive zinc-binding domains. J Biol Chem 263:15857–15859Google Scholar
- Tsoka S, Ouzounis CA (2000) Recent developments and future directions in computational genomics. FEBS Lett 480:42–48CrossRefGoogle Scholar
- Tusnady GE, Simon I (2001) The HMMTOP transmembrane topology prediction server. Bioinformatics 17:849–850CrossRefGoogle Scholar
- Vriend G (1990) WHAT IF: a molecular modeling and drug design program. J Mol Graph 8(52–56):29Google Scholar
- Wass MN, Kelley LA, Sternberg MJ (2010) 3DLigandSite: predicting ligand-binding sites using similar structures. Nucleic Acids Res 38:W469–W473CrossRefGoogle Scholar
- Wierenga RK (2001) The TIM-barrel fold: a versatile framework for efficient enzymes. FEBS Lett 492:193–198CrossRefGoogle Scholar
- Yang J, Roy A, Zhang Y (2013) Protein-ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment. Bioinformatics 29:2588–2595CrossRefGoogle Scholar
- Yu CS, Chen YC, Lu CH, Hwang JK (2006) Prediction of protein subcellular localization. Proteins 64:643–651CrossRefGoogle Scholar
- Yu J, Zhou Y, Tanaka I, Yao M (2010a) Roll: a new algorithm for the detection of protein pockets and cavities with a rolling probe sphere. Bioinformatics 26:46–52CrossRefGoogle Scholar
- Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Sahinalp SC, Ester M, Foster LJ et al (2010b) PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics 26:1608–1615CrossRefGoogle Scholar
Copyright information
Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.












