, Volume 51, Issue 4, pp 587–591 | Cite as

InterPro as a new tool for complete genome analysis: An example of comparative analysis

  • N. J. Mulder
  • W. Fleischmann
  • A. Kanapin
  • R. Apweiler
Molecular Biophysics


InterPro, an integrated documentation resource for protein families, protein domains, and functional sites, was developed to amalgamate the individual efforts of the PROSITE, PRINTS, Pfam, and ProDom databases. InterPro can be used for the computational functional classification of newly determined amino acid sequences that lack biochemical characterization and for comparative genome analysis. InterPro contains over 3500 entries for more than 1 000 000 hits in SWISS-PROT and TrEMBL. The database is accessible for text-and sequence-based searches at InterPro was used for the complete analysis of the proteome of the pathogenic microorganism Mycobacterium tuberculosis and the comparison with the predicted protein-coding sequences of the complete genomes of Bacillus subtilis and Escherichia coli. It was found that 64.8% of proteins in the proteome of M. tuberculosis matched InterPro entries and can be classified by their functions. The comparison with B. subtilis and E. coli provided information on the most common protein families and domains and on the most highly represented protein families in each organism. Thus, InterPro is a useful tool for general comparison of complete proteomes and their compositions.

Key words

database protein domain function protein family proteome 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    R. Apweiler, T. K. Attwood, A. Bairoch, et al., Nucl. Acids Res. 291, 37 (2001).CrossRefGoogle Scholar
  2. 2.
    T. K. Attwood, M. D. R. Croning, D. R. Flower, et al., Nucl. Acids Res. 28, 225 (2000).CrossRefGoogle Scholar
  3. 3.
    K. Hofmann, P. Bucher, L. Falquet, and A. Bairoch, Nucl. Acids Res. 27, 215 (1999).CrossRefGoogle Scholar
  4. 4.
    A. Bateman, E. Birney, R. Durbin, et al., Nucl. Acids Res. 28, 263 (2000).CrossRefGoogle Scholar
  5. 5.
    F. Corpet, F. Servant, J. Gouzy, and D. Kahn, Nucl. Acids Res. 28, 267 (2000).CrossRefGoogle Scholar
  6. 6.
    S. T. Cole, R. Brosch, J. Parkhill, et al., Nature 11(393), 537 (1998).ADSGoogle Scholar
  7. 7.
    A. Bairoch and R. Apweiler, Nucl. Acids Res. 28, 45 (2000).CrossRefGoogle Scholar
  8. 8.
    R. Apweiler, M. Biswas, W. Fleischmann, et al., Nucl. Acids Res. 29, 44 (2001).CrossRefGoogle Scholar

Copyright information

© Pleiades Publishing, Inc. 2006

Authors and Affiliations

  • N. J. Mulder
    • 1
  • W. Fleischmann
    • 1
  • A. Kanapin
    • 1
  • R. Apweiler
    • 1
  1. 1.EMBL Outstation—European Bioinformatics InstituteWellcome Trust Genome CampusHinxton, CambridgeUK

Personalised recommendations