Protein Bioinformatics Databases and Resources

  • Chuming ChenEmail author
  • Hongzhan Huang
  • Cathy H. Wu
Part of the Methods in Molecular Biology book series (MIMB, volume 694)


In the past decades, a variety of publicly available data repositories and resources have been developed to support protein related information management, data-driven hypothesis generation and biological knowledge discovery. However, there is also an increasing confusion for the researchers who are trying to quickly find the appropriate resources to help them solve their problems. In this chapter, we present a comprehensive review (with categorization and description) of major protein bioinformatics databases and resources that are relevant to comparative proteomics research. We conclude the chapter by discussing the challenges and opportunities for developing new protein bioinformatics databases.

Key words

Bioinformatics Database Protein sequence Protein family Protein structure Protein function Proteomics Data integration Comparative analysis 



We would like to thank Dr. Winona C. Barker for reviewing the manuscript and providing constructive comments.


  1. 1.
    Ridley, M. (2006) Genome. Harper Perennial, New York.Google Scholar
  2. 2.
    Velculescu, V. E., Zhang, L., Zhou, W., Vogelstein, J., Basrai, M. A., Bassett, D. E. Jr, Hieter, P., Vogelstein, B., Kinzler, K. W. (1997) Characterization of the yeast transcriptome. Cell 2, 243–251.CrossRefGoogle Scholar
  3. 3.
    Anderson, N. L., Anderson, N. G. (1998) Proteome and proteomics: new technologies, new concepts, and new words. Electrophoresis 11, 1853–1861.CrossRefGoogle Scholar
  4. 4.
    Hye, A., Lynham, S., Thambisetty, M., Causevic, M., Campbell, J., Byers, H. L., Hooper, C., Rijsdijk, F., Tabrizi, S. J., Banner, S., Shaw, C. E., Foy, C., Poppe, M., Archer, N., Hamilton, G., Powell, J., Brown, R. G., Sham, P., Ward, M., Lovestone, S. (2006) Proteome-based plasma biomarkers for Alzheimer’s disease. Brain 11, 3042–3050.CrossRefGoogle Scholar
  5. 5.
    Decramer, S., Wittke, S., Mischak, H., Zürbig, P., Walden, M., Bouissou, F., Bascands, J. L., Schanstra, J. P. (2006) Predicting the clinical outcome of congenital unilateral ureteropelvic junction obstruction in newborn by urinary proteome analysis. Nat. Med. 4, 398–400.CrossRefGoogle Scholar
  6. 6.
    Savidor, A., Donahoo, R. S., Hurtado-Gonzales, O., Land, M. L., Shah, M. B., Lamour, K. H., McDonald, W. H. (2008) Cross-species global proteomics reveals conserved and unique processes in Phytophthora sojae and Phytophthora ramorum. Mol. Cell Proteomics 8, 1501–1516.Google Scholar
  7. 7.
    Huang, M., Chen, T., Chan, Z. (2006) An evaluation for cross-species proteomics research by publicly available expressed sequence tag database search using tandem mass spectral data. Rapid Commun. Mass Spectrom. 18, 2635–2640.CrossRefGoogle Scholar
  8. 8.
    Ishii, A., Dutta, R., Wark, G. M., Hwang, S. I., Han, D. K., Trapp, B. D., Pfeiffer, S. E., Bansal, R. (2009) Human myelin proteome and comparative analysis with mouse myelin. Proc. Natl. Acad. Sci. U. S. A. 34, 14605–14610.CrossRefGoogle Scholar
  9. 9.
    Irmler, M., Hartl, D., Schmidt, T., Schuchhardt, J., Lach, C., Meyer, H. E., Hrabé, de Angelis M., Klose, J., Beckers, J. (2008) An approach to handling and interpretation of ambiguous data in transcriptome and proteome comparisons. Proteomics 6, 1165–1169.CrossRefGoogle Scholar
  10. 10.
    Galperin, M. Y., Cochrane, G. R. (2009) Nucleic acids research annual database issue and the NAR online molecular biology database collection in 2009. Nucleic Acids Res. 37, D1–D4.PubMedCrossRefGoogle Scholar
  11. 11.
    Pruitt, K. D., Tatusova, T., Maglott, D. R. (2007) NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 35, D61–D65.PubMedCrossRefGoogle Scholar
  12. 12.
    Benson, D. A., Karsch-Mizrachi, I., Lipman, D. J., Ostell, J., Wheeler, D. L. (2008) GeneBank. Nucleic Acids Res. 36, D25–D30.PubMedCrossRefGoogle Scholar
  13. 13.
    The UniProt Consortium. (2010) The universal protein resource (UniProt) in 2010. Nucleic Acids Res. 38, D142–D148.CrossRefGoogle Scholar
  14. 14.
    Leinonen, R., Diez, F. G., Binns, D., Fleischmann, W., Lopez, R., Apweiler, R. (2004) UniProt archive. Bioinformatics 20, 3236–3237.PubMedCrossRefGoogle Scholar
  15. 15.
    Suzek, B. E., Huang, H., McGarvey, P., Mazumder, R., Wu, C. H. (2007) UniRef: comprehensive and non-redundant UniProt reference clusters. Bioinformatics 23, 1282–1288.PubMedCrossRefGoogle Scholar
  16. 16.
    Yooseph, S., Sutton, G., Rusch, D. B., Halpern, A. L., Williamson, S. J., Remington, K., Eisen, J. A., Heidelberg, K. B., Manning, G., Li, W., Jaroszewski, L., Cieplak, P., Miller, C. S., Li, H., Mashiyama, S. T., Joachimiak, M. P., van Belle, C., Chandonia, J. M., Soergel, D. A., Zhai, Y., Natarajan, K., Lee, S., Raphael, B. J., Bafna, V., Friedman, R., Brenner, S. E., Godzik, A., Eisenberg, D., Dixon, J. E., Taylor, S. S., Strausberg, R. L., Frazier, M., Venter, J. C. (2007) The Sorcerer II global ocean sampling expedition: expanding the universe of protein families. PLoS Biol. 5, e16.PubMedCrossRefGoogle Scholar
  17. 17.
    Patient, S., Wieser, D., Kleen, M., Kretschmann, E., Martin, M. J., Apweiler, R. (2008) UniProtJAPI: a remote API for accessing UniProt data. Bioinformatics 24, 1321–1322.PubMedCrossRefGoogle Scholar
  18. 18.
    Nikolskaya, A. N., Arighi, C. N., Huang, H., Barker, W. C., Wu, C. H. (2006) PIRSF family classification system for protein functional and evolutionary analysis. Evol. Bioinform. Online 2, 197–209.Google Scholar
  19. 19.
    Finn, R. D., Tate, J., Mistry, J., Coggill, P. C., Sammut, S. J., Hotz, H. R., Ceric, G., Forslund, K., Eddy, S. R., Sonnhammer, E. L., Bateman, A. (2008) The Pfam protein families database. Nucleic Acids Res. 36, D281–D288.PubMedCrossRefGoogle Scholar
  20. 20.
    Wheeler, D. L., Barrett, T., Benson, D. A., Bryant, S. H., Canese, K., Chetvernin, V., Church, D. M., DiCuccio, M., Edgar, R., Federhen, S., Geer, L. Y., Kapustin, Y., Khovayko, O., Landsman, D., Lipman, D. J., Madden, T. L., Maglott, D. R., Ostell, J., Miller, V., Pruitt, K. D., Schuler, G. D., Sequeira, E., Sherry, S. T., Sirotkin, K., Souvorov, A., Starchenko, G., Tatusov, R. L., Tatusova, T. A., Wagner, L., Yaschenko, E. (2007) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 35, D5–D12.PubMedCrossRefGoogle Scholar
  21. 21.
    Bru, C., Courcelle, E., Carrère, S., Beausse, Y., Dalmar, S., Kahn, D. (2005) The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Res. 33, D212–D215.PubMedCrossRefGoogle Scholar
  22. 22.
    Andreeva, A., Howorth, D., Chandonia, J. M., Brenner, S. E., Hubbard, T. J., Chothia, C., Murzin, A. G. (2008) Data growth and its impact on the SCOP database: new developments. Nucleic Acids Res. 36, D419–D425.PubMedCrossRefGoogle Scholar
  23. 23.
    Berman, H., Henrick, K., Nakamura, H., Markley, J. L. (2007) The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 35, D301–D303.PubMedCrossRefGoogle Scholar
  24. 24.
    Hulo, N., Bairoch, A., Bulliard, V., Cerutti, L., Cuche, B. A., de Castro, E, Lachaize, C., Langendijk-Genevaux, P. S., Sigrist, C. J. (2008) The 20 years of PROSITE. Nucleic Acids Res. 36, D245–D249.PubMedCrossRefGoogle Scholar
  25. 25.
    Sigrist, C. J. A., Cerutti, L., Hulo, N., Gattiker, A., Falquet, L., Pagni, M., Bairoch, A., Bucher, P. (2002) PROSITE: a documented database using patterns and profiles as motif descriptors. Brief. Bioinform. 3, 265–274.PubMedCrossRefGoogle Scholar
  26. 26.
    De Castro, E., Sigrist, C. J. A., Gattiker, A., Bulliard, V., Langendijk-Genevaux, P. S., Gasteiger, E., Bairoch, A., Hulo, N. (2006) ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins. Nucleic Acids Res. 34, W362–W365.PubMedCrossRefGoogle Scholar
  27. 27.
    Hunter, S., Apweiler, R., Attwood, T. K., Bairoch, A., Bateman, A., Binns, D., Bork, P., Das, U., Daugherty, L., Duquenne, L., Finn, R. D., Gough, J., Haft, D., Hulo, N., Kahn, D., Kelly, E., Laugraud, A., Letunic, I., Lonsdale, D., Lopez, R., Madera, M., Maslen, J., McAnulla, C., McDowall, J., Mistry, J., Mitchell, A., Mulder, N., Natale, D., Orengo, C., Quinn, A. F., Selengut, J. D., Sigrist, C. J., Thimma, M., Thomas, P. D., Valentin, F., Wilson, D., Wu, C. H., Yeats, C. (2009) InterPro: the integrative protein signature database. Nucleic Acids Res. 37, D211–D215.CrossRefGoogle Scholar
  28. 28.
    Yeats, C., Lees, J., Reid, A., Kellam, P., Martin, N., Liu, X., Orengo, C. (2008) Gene3D: comprehensive structural and functional annotation of genomes. Nucleic Acids Res. 36, D414–D418.PubMedCrossRefGoogle Scholar
  29. 29.
    Mi, H., Guo, N., Kejariwal, A., Thomas, P. D. (2007) PANTHER version 6: protein sequence and function evolution data with expanded representation of biological pathways. Nucleic Acids Res. 35, D247–D252.PubMedCrossRefGoogle Scholar
  30. 30.
    Attwood, T. K. (2002) The PRINTS database: a resource for identification of protein families. Brief. Bioinform. 3, 252–263.PubMedCrossRefGoogle Scholar
  31. 31.
    Letunic, I., Doerks, T., Bork, P. (2009) SMART 6: recent updates and new developments. Nucleic Acids Res. 37, D229–D232.PubMedCrossRefGoogle Scholar
  32. 32.
    Wilson, D., Pethica, R., Zhou, Y., Talbot, C., Vogel, C., Madera, M., Chothia, C., Gough, J. (2009) SUPERFAMILY – sophisticated comparative genomics, data mining, visualization and phylogeny. Nucleic Acids Res. 37, D380–D386.PubMedCrossRefGoogle Scholar
  33. 33.
    Haft, D. H., Selengut, J. D., White, O. (2003) The TIGRFAMs database of protein families. Nucleic Acids Res. 31, D371–D373.CrossRefGoogle Scholar
  34. 34.
    Mulder, N., Apweiler, R. (2007) InterPro and InterProScan: tools for protein sequence classification and comparison. Methods Mol Biol. 396, 59–70.PubMedCrossRefGoogle Scholar
  35. 35.
    Smedley, D., Haider, S., Ballester, B., Holland, R., London, D., Thorisson, G., Kasprzyk, A. (2009) BioMart – biological queries made easy. BMC Genomics 10, 22.PubMedCrossRefGoogle Scholar
  36. 36.
    Westbrook, J., Ito, N., Nakamura, H., Henrick, K., Berman, H. M. (2005) PDBML: the representation of archival macromolecular structure data in XML. Bioinformatics 21, 988–992.PubMedCrossRefGoogle Scholar
  37. 37.
    Cuff, A. L., Sillitoe, I., Lewis, T., Redfern, O. C., Garratt, R., Thornton, J., Orengo, C. A. (2009) The CATH classification revisited – architectures reviewed and new ways to characterize structural divergence in superfamilies. Nucleic Acids Res. 37, D310–D314.PubMedCrossRefGoogle Scholar
  38. 38.
    Fulton, K. F., Bate, M. A., Faux, N. G., Mahmood, K., Betts, C., Buckle, A. M. (2007) Protein Folding Database (PFD 2.0): an online environment for the International Foldeomics Consortium. Nucleic Acids Res. 35, D304–D307.PubMedCrossRefGoogle Scholar
  39. 39.
    Maxwell, K. L., Wildes, D., Zarrine-Afsar, A., De Los Rios, M. A., Brown, A. G., Friel, C. T., Hedberg, L., Horng, J. C., Bona, D., Miller, E. J., Vallée-Bélisle, A., Main, E. R., Bemporad, F., Qiu, L., Teilum, K., Vu, N. D., Edwards, A. M., Ruczinski, I., Poulsen, F. M., Kragelund, B. B., Michnick, S. W., Chiti, F., Bai, Y., Hagen, S. J., Serrano, L., Oliveberg, M., Raleigh, D. P., Wittung-Stafshede, P., Radford, S. E., Jackson, S. E., Sosnick, T. R., Marqusee, S., Davidson, A. R., Plaxco, K. W. (2005) Protein folding: defining a “standard” set of experimental conditions and a preliminary kinetic data set of two-state proteins. Protein Sci. 14, 602–616.PubMedCrossRefGoogle Scholar
  40. 40.
    Zanzoni, A., Ausiello, G., Via, A., Gherardini, P. F., Helmer-Citterich, M. (2007) Phospho3D: a database of three-dimensional structures of protein phosphorylation sites. Nucleic Acids Res. 35, D229–D231.PubMedCrossRefGoogle Scholar
  41. 41.
    Diella, F., Cameron, S., Gemünd, C., Linding, R., Via, A., Kuster, B., Sicheritz-Pontén, T., Blom, N., Gibson, T. J. (2004) Phospho.ELM: a database of experimentally verified phosphorylation sites in eukaryotic proteins. BMC Bioinformatics 5, 79.PubMedCrossRefGoogle Scholar
  42. 42.
    Aranda, B., Achuthan, P., Alam-Faruque, Y., Armean, I., Bridge, A., Derow, C., Feuermann, M., Ghanbarian, A. T., Kerrien, S., Khadake, J., Kerssemakers, J., Leroy, C., Menden, M., Michaut, M., Montecchi-Palazzi, L., Neuhauser, S. N., Orchard, S., Perreau, V., Roechert, B., van Eijk, K., Hermjakob, H. (2010) The IntAct molecular interaction database in 2010. Nucleic Acids Res. 38, D525–D531.PubMedCrossRefGoogle Scholar
  43. 43.
    Orchard, S., Kerrien, S., Jones, P., Ceol, A., Chatr-Aryamontri, A., Salwinski, L., Nerothin, J., Hermjakob, H. (2007) Submit your interaction data the IMEx way: a step by step guide to trouble-free deposition. Proteomics 7 Suppl 1, 28–34.PubMedCrossRefGoogle Scholar
  44. 44.
    Orchard, S., Salwinski, L., Kerrien, S., Montecchi-Palazzi, L., Oesterheld, M., Stümpflen, V., Ceol, A., Chatr-aryamontri, A., Armstrong, J., Woollard, P., Salama, J. J., Moore, S., Wojcik, J., Bader, G. D., Vidal, M., Cusick, M. E., Gerstein, M., Gavin, A. C., Superti-Furga, G., Greenblatt, J., Bader, J., Uetz, P., Tyers, M., Legrain, P., Fields, S,, Mulder, N., Gilson, M., Niepmann, M., Burgoon, L., De Las Rivas, J., Prieto, C., Perreau, V. M., Hogue, C., Mewes, H. W., Apweiler, R., Xenarios, I., Eisenberg, D., Cesareni, G., Hermjakob, H. (2007) The minimum information required for reporting a molecular interaction experiment (MIMIx). Nat. Biotechnol. 25, 894–898.PubMedCrossRefGoogle Scholar
  45. 45.
    Kerrien, S., Orchard, S., Montecchi-Palazzi, L., Aranda, B., Quinn, A. F., Vinod, N., Bader, G. D., Xenarios, I., Wojcik, J., Sherman, D., Tyers, M., Salama, J. J., Moore, S., Ceol, A., Chatr-Aryamontri, A., Oesterheld, M., Stümpflen, V., Salwinski, L., Nerothin, J., Cerami, E., Cusick, M. E., Vidal, M., Gilson, M., Armstrong, J., Woollard, P., Hogue, C., Eisenberg, D., Cesareni, G., Apweiler, R., Hermjakob, H. (2007) Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol. 5, 44.PubMedCrossRefGoogle Scholar
  46. 46.
    Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29.CrossRefGoogle Scholar
  47. 47.
    Matthews, L., Gopinath, G., Gillespie, M., Caudy, M., Croft, D., de Bono, B., Garapati, P., Hemish, J., Hermjakob, H., Jassal, B., Kanapin, A., Lewis, S., Mahajan, S., May, B., Schmidt, E., Vastrik, I., Wu, G., Birney, E., Stein, L., D’Eustachio, P. (2009) Reactome knowledgebase of human biological pathways and processes. Nucleic Acids Res. 37, D619–D622.PubMedCrossRefGoogle Scholar
  48. 48.
    Hucka, M., Finney, A., Sauro, H. M., Bolouri, H., Doyle, J. C., Kitano, H., Arkin, A. P., Bornstein, B. J., Bray, D., Cornish-Bowden, A., Cuellar, A. A., Dronov, S., Gilles, E. D., Ginkel, M., Gor, V., Goryanin, II., Hedley, W. J., Hodgman, T. C., Hofmeyr, J. H., Hunter, P. J., Juty, N. S., Kasberger, J. L., Kremling, A., Kummer, U., Le Novère, N., Loew, L. M., Lucio, D., Mendes, P., Minch, E., Mjolsness, E. D., Nakayama, Y., Nelson, M. R., Nielsen, P. F., Sakurada, T., Schaff, J. C., Shapiro, B. E., Shimizu, T. S., Spence, H. D., Stelling, J., Takahashi, K., Tomita, M., Wagner, J., Wang, J., SBML Forum. (2003) The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 19, 524–531.PubMedCrossRefGoogle Scholar
  49. 49.
    Noy, N. F., Crubezy, M., Fergerson, R. W., Knublauch, H., Tu, S. W., Vendetti, J., Musen, M. A. (2003) Protégé-2000: an open-source ontology-development and knowledge-acquisition environment. AMIA. Annu Symp Proc. 953.Google Scholar
  50. 50.
    Cline, M. S., Smoot, M., Cerami, E., Kuchinsky, A., Landys, N., Workman, C., Christmas, R., Avila-Campilo, I., Creech, M., Gross, B., Hanspers, K., Isserlin, R., Kelley, R., Killcoyne, S., Lotia, S., Maere, S., Morris, J., Ono, K., Pavlovic, V., Pico, A. R., Vailaya, A., Wang, P. L., Adler, A., Conklin, B. R., Hood, L., Kuiper, M., Sander, C., Schmulevich, I., Schwikowski, B., Warner, G. J., Ideker, T., Bader, G. D. (2007) Integration of biological networks and gene expression data using Cytoscape. Nat. Protoc. 2, 2366–2382.PubMedCrossRefGoogle Scholar
  51. 51.
    Caspi, R., Foerster, H., Fulcher, C. A., Kaipa, P., Krummenacker, M., Latendresse, M., Paley, S., Rhee, S. Y., Shearer, A., Tissier, C., Walk, T. C., Zhang, P. and Karp, P. D. (2008) The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res. 36, D623–D631.PubMedCrossRefGoogle Scholar
  52. 52.
    Hoogland, C., Mostaguir, K., Appel, R. D., Lisacek, F. (2008) The World-2DPAGE Constellation to promote and publish gel-based proteomics data through the ExPASy server. J. Proteomics 71, 245–248.PubMedCrossRefGoogle Scholar
  53. 53.
    Mostaguir, K., Hoogland, C., Binz, P. A., Appel, R. D. (2003) The Make 2D-DB II package: conversion of federated two-dimensional gel electrophoresis databases into a relational format and interconnection of distributed databases. Proteomics 3, 1441–1444.PubMedCrossRefGoogle Scholar
  54. 54.
    Vizcaíno, J. A., Côté, R., Reisinger, F., Barsnes, H., Foster, J. M., Rameseder, J., Hermjakob, H., Martens, L. (2009) The proteomics identifications database: 2010 update. Nucleic Acids Res. 38, D736–D742.PubMedCrossRefGoogle Scholar
  55. 55.
    Côté, R. G., Jones, P., Martens, L., Kerrien, S., Reisinger, F., Lin, Q., Leinonen, R., Apweiler, R., Hermjakob, H. (2007) The protein identifier cross-referencing (PICR) service: reconciling protein identifiers across multiple source databases. BMC Bioinformatics 8, 401–414.PubMedCrossRefGoogle Scholar
  56. 56.
    Burgun, A., Bodenreider, O. (2008) Accessing and integrating data and knowledge for biomedical research. Yearb Med Inform. 91–101.Google Scholar
  57. 57.
    Hwang, D., Rust, A. G., Ramsey, S., Smith, J. J., Leslie, D. M., Weston, A. D., de Atauri, P., Aitchison, J. D., Hood, L., Siegel, A. F., Bolouri, H. (2005) A data integration methodology for systems biology. Proc. Natl Acad. Sci. U. S. A. 102, 17296–17301.PubMedCrossRefGoogle Scholar
  58. 58.
    Mathew, J. P., Taylor, B. S., Bader, G. D., Pyarajan, S., Antoniotti, M., Chinnaiyan, A. M., Sander, C., Burakoff, S. J., Mishra, B. (2007) From bytes to bedside: data integration and computational biology for translational cancer research. PLoS Comput. Biol. 3, e12.PubMedCrossRefGoogle Scholar
  59. 59.
    McGarvey, P. B., Huang, H., Mazumder, R., Zhang, J., Chen, Y., Zhang, C., Cammer, S., Will, R., Odle, M., Sobral, B., Moore, M., Wu, C. H. (2009) Systems integration of biodefense omics data for analysis of pathogen–host interactions and identification of potential targets. PLoS One 4, e7162.PubMedCrossRefGoogle Scholar
  60. 60.
    Stevens, R., Zhao, J., Goble, C. (2007) Using provenance to manage knowledge of in silico experiments. Brief. Bioinform. 8, 183–194.PubMedCrossRefGoogle Scholar
  61. 61.
    Pedrioli, P. G., Eng, J. K., Hubley, R., Vogelzang, M., Deutsch, E. W., Raught, B., Pratt, B., Nilsson, E., Angeletti, R. H., Apweiler, R., Cheung, K., Costello, C. E., Hermjakob, H., Huang, S., Julian, R. K., Kapp, E., McComb, M. E., Oliver, S. G., Omenn, G., Paton, N. W., Simpson, R., Smith, R., Taylor, C. F., Zhu, W., Aebersold, R. (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat. Biotechnol. 22, 1459–1466.PubMedCrossRefGoogle Scholar
  62. 62.
    Orchard, S., Montechi-Palazzi, L., Deutsch, E. W., Binz, P. A., Jones, A. R., Paton, N., Pizarro, A., Creasy, D. M., Wojcik, J., Hermjakob, H. (2007) Five years of progress in the standardization of proteomics data 4(th) annual spring workshop of the HUPO-proteomics standards initiative April 23–25, 2007 Ecole Nationale Supérieure (ENS), Lyon, France. Proteomics 7, 3436–3440.PubMedCrossRefGoogle Scholar
  63. 63.
    Taylor, C. F., Paton, N. W., Lilley, K. S., Binz, P. A., Julian, R. K. Jr, Jones, A. R., Zhu, W., Apweiler, R., Aebersold, R., Deutsch, E. W., Dunn, M. J., Heck, A. J., Leitner, A., Macht, M., Mann, M., Martens, L., Neubert, T. A., Patterson, S. D., Ping, P., Seymour, S. L., Souda, P., Tsugita, A., Vandekerckhove, J., Vondriska, T. M., Whitelegge, J. P., Wilkins, M. R., Xenarios, I., Yates, J. R. 3rd, Hermjakob, H. (2007) The minimum information about a proteomics experiment (MIAPE). Nat. Biotechnol. 25, 887–893.PubMedCrossRefGoogle Scholar
  64. 64.
    Tatusov, R. L., Fedorova, N. D., Jackson, J. D., Jacobs, A. R., Kiryutin, B., Koonin, E. V., Krylov, D. M., Mazumder, R., Mekhedov, S. L., Nikolskaya, A. N., Rao, B. S., Smirnov, S., Sverdlov, A. V., Vasudevan, S., Wolf, Y. I., Yin, J. J., Natale, D. A. (2003) The COG database: an updated version includes eukaryotes. BMC Bioinformatics 4, 41–54.PubMedCrossRefGoogle Scholar
  65. 65.
    Kaplan, N., Sasson, O., Inbar, U., Friedlich, M., Fromer, M., Fleischer, H., Portugaly, E., Linial, N., Linial, M. (2005) ProtoNet 4.0: a hierarchical classification of one million protein sequences. Nucleic Acids Res. 33, D216–D218.PubMedCrossRefGoogle Scholar
  66. 66.
    Marchler-Bauer, A., Anderson, J. B., Chitsaz, F., Derbyshire, M. K., DeWeese-Scott, C., Fong, J. H., Geer, L. Y., Geer, R. C., Gonzales, N. R., Gwadz, M., He, S., Hurwitz, D. I., Jackson, J. D., Ke, Z., Lanczycki, C. J., Liebert, C. A., Liu, C., Lu, F., Lu, S., Marchler, G. H., Mullokandov, M., Song, J. S., Tasneem, A., Thanki, N., Yamashita, R. A., Zhang, D., Zhang, N., Bryant, S. H. (2009) CDD: specific functional annotation with the conserved domain database. Nucleic Acids Res. 37, D205–D210.PubMedCrossRefGoogle Scholar
  67. 67.
    Wang, Y., Addess, K. J., Chen, J., Geer, L. Y., He, J., He, S., Lu, S., Madej, T., Marchler-Bauer, A., Thiessen, P. A., Zhang, N., Bryant, S. H. (2007) MMDB: annotating protein sequences with Entrez’s 3D-structure database. Nucleic Acids Res. 35, D298–D300.PubMedCrossRefGoogle Scholar
  68. 68.
    Pieper, U., Eswar, N., Webb, B. M., Eramian, D., Kelly, L., Barkan, D. T., Carter, H., Mankoo, P., Karchin, R., Marti-Renom, M. A., Davis, F. P., Sali, A. (2009) MODBASE, a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 37, D347–D354.PubMedCrossRefGoogle Scholar
  69. 69.
    Kiefer, F., Arnold, K., Künzli, M., Bordoli, L., Schwede, T. (2009) The SWISS-MODEL repository and associated resources. Nucleic Acids Res. 37, D387–D392.PubMedCrossRefGoogle Scholar
  70. 70.
    Bogatyreva, N. S., Osypov, A. A., Ivankov, D. N. (2009) KineticDB: a database of protein folding kinetics. Nucleic Acids Res. 37, D342–D346.PubMedCrossRefGoogle Scholar
  71. 71.
    Garavelli, J. S. (2004) The RESID database of protein modifications as a resource and annotation tool. Proteomics 4, 1527–1533.PubMedCrossRefGoogle Scholar
  72. 72.
    Salwinski, L., Miller, C. S., Smith, A. J., Pettit, F. K., Bowie, J. U., Eisenberg, D. (2004) The database of interacting proteins: 2004 update. Nucleic Acids Res. 32, D449–D451.PubMedCrossRefGoogle Scholar
  73. 73.
    Breitkreutz, B. J., Stark, C., Reguly, T., Boucher, L., Breitkreutz, A., Livstone, M., Oughtred, R., Lackner, D. H., Bähler, J., Wood, V., Dolinski, K., Tyers, M. (2008) The BioGRID interaction database: 2008 update. Nucleic Acids Res. 36, D637–D640.PubMedCrossRefGoogle Scholar
  74. 74.
    Kanehisa, M., Goto, S. (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30.PubMedCrossRefGoogle Scholar
  75. 75.
    Tarcea, V. G., Weymouth, T., Ade, A., Bookvich, A., Gao, J., Mahavisno, V., Wright, Z., Chapman, A., Jayapandian, M., Ozgür, A., Tian, Y., Cavalcoli, J., Mirel, B., Patel, J., Radev, D., Athey, B., States, D., Jagadish, H. V. (2009) Michigan molecular interactions r2: from interacting proteins to pathways. Nucleic Acids Res. 37, D642–D646.PubMedCrossRefGoogle Scholar
  76. 76.
    Craig, R., Cortens, J. C., Fenyo, D., Beavis, R. C. (2006) Using annotated peptide mass spectrum libraries for protein identification. J. Proteome Res. 5, 1843–1849.CrossRefGoogle Scholar
  77. 77.
    Deutsch, E. W., Lam, H., Aebersold, R. (2008) PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows. EMBO Rep. 9, 429–434.PubMedCrossRefGoogle Scholar
  78. 78.
    Slotta, D. J., Barrett, T., Edgar, R. (2009) NCBI peptidome: a new public repository for mass spectrometry peptide identifications. Nat. Biotechnol. 27, 600–601.PubMedCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  1. 1.Department of Computer and Information SciencesUniversity of DelawareNewarkUSA

Personalised recommendations