Abstract
InterPro, an integrated documentation resource for protein families, protein domains, and functional sites, was developed to amalgamate the individual efforts of the PROSITE, PRINTS, Pfam, and ProDom databases. InterPro can be used for the computational functional classification of newly determined amino acid sequences that lack biochemical characterization and for comparative genome analysis. InterPro contains over 3500 entries for more than 1 000 000 hits in SWISS-PROT and TrEMBL. The database is accessible for text-and sequence-based searches at http://www.ebi.ac.uk/interpro/. InterPro was used for the complete analysis of the proteome of the pathogenic microorganism Mycobacterium tuberculosis and the comparison with the predicted protein-coding sequences of the complete genomes of Bacillus subtilis and Escherichia coli. It was found that 64.8% of proteins in the proteome of M. tuberculosis matched InterPro entries and can be classified by their functions. The comparison with B. subtilis and E. coli provided information on the most common protein families and domains and on the most highly represented protein families in each organism. Thus, InterPro is a useful tool for general comparison of complete proteomes and their compositions.
Similar content being viewed by others
References
R. Apweiler, T. K. Attwood, A. Bairoch, et al., Nucl. Acids Res. 291, 37 (2001).
T. K. Attwood, M. D. R. Croning, D. R. Flower, et al., Nucl. Acids Res. 28, 225 (2000).
K. Hofmann, P. Bucher, L. Falquet, and A. Bairoch, Nucl. Acids Res. 27, 215 (1999).
A. Bateman, E. Birney, R. Durbin, et al., Nucl. Acids Res. 28, 263 (2000).
F. Corpet, F. Servant, J. Gouzy, and D. Kahn, Nucl. Acids Res. 28, 267 (2000).
S. T. Cole, R. Brosch, J. Parkhill, et al., Nature 11(393), 537 (1998).
A. Bairoch and R. Apweiler, Nucl. Acids Res. 28, 45 (2000).
R. Apweiler, M. Biswas, W. Fleischmann, et al., Nucl. Acids Res. 29, 44 (2001).
Author information
Authors and Affiliations
Additional information
Original Russian Text © N.J. Mulder, W. Fleischmann, A. Kanapin, R. Apweiler, 2006, published in Biofizika, 2006, Vol. 51, No. 4, pp. 656–660.
Rights and permissions
About this article
Cite this article
Mulder, N.J., Fleischmann, W., Kanapin, A. et al. InterPro as a new tool for complete genome analysis: An example of comparative analysis. BIOPHYSICS 51, 587–591 (2006). https://doi.org/10.1134/S0006350906040117
Received:
Issue Date:
DOI: https://doi.org/10.1134/S0006350906040117