Abstract
The ever increasing volumes of proteomic data now being produced by laboratories across the world have resulted in major issues in data storage and accessibility. The further demands of multilaboratory initiatives has highlighted issues when collaborators cannot import data generated within the same project but generated by different hardware types and processed by laboratory-specific work flows and analyses packages. There is an increasing need for common data standards that will allow the interchange of data between different instrumentation, search engines, and between laboratory databases. This could then lead to the establishment of data repositories from where benchmark datasets could be accessed and reanalyzed.
The Human Proteome Organization is currently supporting efforts to establish such standards. The work of the Proteomics Standards Initiative has lead to the development of the mzData XML interchange standard and is now broadening its scope to produce a spectral analysis output format, mzIdent. Accompanying controlled vocabularies allow the accurate, while systematic, representation of metadata throughout both schema.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Orchard, S., Hermjakob, H., Binz, P. A., et al. (2005) Further steps towards data standardisation: the Proteomic Standards Initiative HUPO 3rd annual congress. October 25–27, 2004 Beijing, China. Proteomics 5, 337–339.
Hanash, S. (2004) HUPO initiatives relevant to clinical proteomics. Mol. Cell Proteomics 3, 298–301.
Bairoch, A., Apweiler, R., Wu, C. H., et al. (2005) The universal protein resource (UniProt). Nucleic Acids Res. 33, 154–159.
Boeckmann, B., Bairoch, A., Apweiler, R., et al. (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370.
Wu, C. H., Yeh, L. S., Huang, H., et al. (2003) The Protein Information Resource. Nucleic Acids Res. 31, 345–347.
Hermjakob, H., Montecchi-Palazzi, L., Lewington, C., et al. (2004) IntAct: an open source molecular interaction database. Nucleic Acids Res. 32, D452–D455.
Kersey, P. J., Duarte, J., Williams, A., Karavidopoulou, Y., Birney, E., and Apweiler, R. (2004) The International Protein Index: an integrated database for proteomics experiments. Proteomics 4, 1985–1988.
Pruitt, K. D., Tatusova, T., and Maglott, D. R. (2005) NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, 501–504.
Birney, E., Andrews, T. D., Bevan, P., et al. (2004) An overview of Ensembl. Genome Res. 14, 925–928.
Rhee, S. Y., Beavis, W., Berardini, T. Z., et al. (2003) The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community. Nucleic Acids Res. 31, 224–228.
Orchard, S., Montecchi-Palazzi, L., Hermjakob, H., and Apweiler, R. (2005) The use of common ontologies and controlled vocabularies to enable data exchange and deposition for complex proteomic experiments. Pac. Symp. Biocomput. 186–196.
Aitken, J. S., Webber, B. L., and Bard, J. B. (2004) Part-of relations in anatomy ontologies: a proposal for RDFS and OWL formalisations. Pac. Symp. Biocomput. 166–177.
Pedrioli, P. G., Eng, J. K., Hubley, R., et al. (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat. Biotechnol. 22, 1459–1466.
Garavelli, J. S. (2004) The RESID Database of protein modifications as a resource and annotation tool. Proteomics 4, 1527–1533.
Creasy, D. M. and Cottrell, J. S. (2004) Unimod: protein modifications for mass spectrometry. Proteomics 4, 1534–1536.
Martens, L., Hermjakob, H., Jones, P., et al. (2005) PRIDE: The Proteomics IDEntifications database. Proteomics 5, 3537–3545.
Omenn, G. S., States, D. J., Adamski, M., et al. (2005) Overview of the HUPO Plasma Proteome Project: results from the pilot phase with 35 collaborating laboratories and multiple analytical groups, generating a core dataset of 3020 proteins and a publicly-available database. Proteomics 5, 3226–3245.
Orchard, S., Taylor, C. F., Hermjakob, H., Weimin-Zhu, Julian, R. K. Jr., and Apweiler, R. (2004) Advances in the development of common interchange standards for proteomic data. Proteomics 4, 2363–2365.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Humana Press Inc., Totowa, NJ
About this protocol
Cite this protocol
Orchard, S. et al. (2007). Proteomic Data Exchange and Storage. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 367. Humana Press. https://doi.org/10.1385/1-59745-275-0:261
Download citation
DOI: https://doi.org/10.1385/1-59745-275-0:261
Publisher Name: Humana Press
Print ISBN: 978-1-58829-563-7
Online ISBN: 978-1-59745-275-5
eBook Packages: Springer Protocols