A Tutorial on Protein Ontology Resources for Proteomic Studies
The Protein Ontology (PRO) is designed as a formal and well-principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships at the full-length level, to the representation of the multiple protein forms of a gene, such as those resulting from alternative splicing, cleavage, and/or posttranslational modifications, and to protein complexes. As an ontology, PRO differs from a database in that it describes protein types and the relationships among them. In addition, the representation of specific protein types, such as a phosphorylated protein form, allows precise definition of objects in pathways, complexes, or disease modeling. This is useful for proteomics studies, where isoforms and modified forms must be differentiated, and for biological pathway/network representation, where the cascade of events often depends on a specific protein modification. PRO is manually curated, starting with content derived from the scientific literature. Only annotation with experimental evidence is included, and it is expressed as relationships to other ontologies. In this tutorial, you will learn how to use the PRO resources to gain information about proteins of interest, such as finding conserved isoforms (ortho-isoforms) and different modified forms and their attributes. In addition, the tutorial provides some details on how you can contribute to the ontology via the rapid annotation interface RACE-PRO.
Key words: Biomedical ontology, Protein ontology, Community annotation, Protein
PRO Consortium participants: Protein Information Resource, The Jackson Laboratory, Reactome, and the New York State Center of Excellence in Bioinformatics and Life Sciences. PRO is funded by NIH grant #R01 GM080646-01.