Tranche Distributed Repository and ProteomeCommons.org

Smith, Bryan E.; Hill, James A.; Gjukich, Mark A.; Andrews, Philip C.

doi:10.1007/978-1-60761-987-1_8

Bryan E. Smith⁴,
James A. Hill⁴,
Mark A. Gjukich⁴ &
…
Philip C. Andrews⁴

Part of the book series: Methods in Molecular Biology ((MIMB,volume 696))

3073 Accesses
31 Citations

Abstract

Tranche is a distributed repository designed to redundantly store and disseminate data sets for the proteomics community. It has several important features for researchers, including support for large data files, prepublication access controls, licensing options, and ensuring both data provenance and integrity. Tranche tightly integrates with ProteomeCommons.org, an online community resource that offers a variety of useful tools for proteomics researchers, including project management and data annotation. In this chapter, we discuss the development of Tranche and ProteomeCommons.org, paying particular attention to why it is desirable that data be publicly available and unrestricted as well as the challenges facing data archiving and open access. We then provide a technical overview of Tranche and ProteomeCommons.org as well as step-by-step instructions for using these resources, including the graphical user interface (GUI ), command-line tools, and Application Programmer Interface (API). We end with a brief discussion of current and future development efforts and collaborations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 159.00; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Falkner JA, Ulintz PJ, Andrews PC (2006) A code and data archival and dissemination tool for the proteomics community. Am Biotechnol Lab 38:28–30
Google Scholar
Toronto International Data Release Workshop Authors (2009) Prepublication data sharing. Nature 461:168–170
Article Google Scholar
Schofield PN, Bubela T, Weaver T, Portilla L et al (2009) Post-publication sharing of data and tools. Nature 461:171–173
Article CAS PubMed Google Scholar
Editorial (2009) Data’s shameful neglect. Nature 461:145
Google Scholar
Salo D (2008) Innkeeper at the roach motel. Libr Trends 57:98–123
Article Google Scholar
Heidorn PB (2008) Shedding light on the dark data in the long tail of science. Libr Trends 57:280–299
Article Google Scholar
Wiley S (2009) Why don’t we share data? The Scientist 23:33
Google Scholar
Deutsch EW, Lam H, Aebersold R (2008) PeptideAtlas: a resource for target selection for emerging targeted proteomics workflows. EMBO Rep 9:429–434
Article CAS PubMed Google Scholar
Craig R, Cortens JP, Beavis RC (2004) An open source system for analyzing, validating and storing protein identification data. Proteome Res 3:1234–1242
Article CAS Google Scholar
Martens L, Hermjakob H, Jones P, Taylor C et al (2005) The PRoteomics IDEntification database. Proteomics 5:3537–3545
Article CAS PubMed Google Scholar
Prasad TS, Goel R, Kandasamy K, Keerthikumar S et al (2009) Human Protein Reference Database – 2009 update. Nucleic Acids Res 37:D767–D772
Article CAS Google Scholar
Slotta DJ, Barrett T, Edgar R (2009) NCBI Peptidome: a new public repository for mass spectrometry peptide identifications. Nat Biotechnol 27:600–601
Article CAS PubMed Google Scholar
(2007) Publication guidelines for the analysis and documentation of peptide and protein identifications. Mol Cell Proteomics (http://www.mcponline.org/misc/ParisReport_Final.dtl) accessed on July 13 2009.
Editorial (2007) Democratizing proteomics data. Nat Biotechnol 25:262
Google Scholar
(2008) Instructions to authors. Proteomics (http://www3.interscience.wiley.com/cgi-bin/jabout/76510741/2120_instruc.pdf) accessed on July 13 2009.
(2003) Final NIH statement on sharing research data. (http://grants.nih.gov/grants/guide/notice-files/NOT-OD-03-032.html) accessed on July 13 2009
Howe D, Costanzo M, Fey P, Gojobori T et al (2008) The future of biocuration. Nature 455:47–50
Article CAS PubMed Google Scholar
Martin DB, Nelson PS (2001) From genomics to proteomics: techniques and applications in cancer research. Trends Cell Biol 11:61–65
Google Scholar
Tyshenko MG (2005) Current trends in publicly available genetic databases. Health Inform J 11:295–308
Article Google Scholar
(2009) About CC0--“No Rights Reserved”. (http://creativecommons.org/about/cc0) accessed on July 13 2009
Prince JT, Carlson MW, Wang R, Lu P, Marcotte EM (2004) The need for a public proteomics repository. Nat Biotechnol 22:471–472
Article CAS PubMed Google Scholar
Why tumor samples are so important for research. (http://www.pediatricgist.cancer.gov/Source/Research/ResearchArticles/TumorSampleImpArticle.aspx)
Schweitzer MH, Suo Z, Avci R, Asara JM et al (2007) Analyses of soft tissue from Tyrannosaurus rex suggest the presence of protein. Science 316:277–280
Article CAS PubMed Google Scholar
Schweitzer MH, Zheng W, Organ CL, Avci R et al (2009) Biomolecular characterization and protein sequences of the Campanian hadrosaur B. canadensis. Science 324:626–631
Article CAS PubMed Google Scholar
Taylor CF, Paton NW, Lilley KS, Binz P et al (2007) The minimum information about a proteomics experiment (MIAPE). Nat Biotechnol 25:887–893
Article CAS PubMed Google Scholar
Pedrioli PGA, Eng JK, Hubley R, Vogelzang M et al (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 22:1459–1466
Article CAS PubMed Google Scholar
Hamacher M, Stephan C, Meyer HE, Eisenacher M (2009) Data handling and processing in proteomics. Expert Rev Proteomics 6, 217–219. (2006) The mzData Standard. (http://www.psidev.info/index.php?q=node/80#mzdata)
Google Scholar
Orchard S, Taylor C, Hermjakob H, Zhu W et al (2004) Current status of proteomic standards development. Expert Rev Proteomics 1:179–183
Article CAS PubMed Google Scholar
Deutsch E (2008) mzML: a single, unifying data format for mass spectrometer output. Proteomics 8:2776–2777
Article CAS PubMed Google Scholar
Bayer R (1971) Binary B-trees for virtual memory. ACM-SIGFIDET Workshop 1971:219–235
Google Scholar
Martens L, Deutsch E, Hermjakob H, Omenn G (2009) Proteomics data submission strategy for ProteomeExchange. (http://proteomexchange.org/doc/ProteomExchange_data_submission_strategy_final.pdf)

Download references

Acknowledgments

Special thanks to Jayson Falkner, who led the initial development for both Tranche and ProteomeCommons.org. The authors would also like to thank Peter Ulintz, Jared Falkner, Brian Maso, and Panagiotis Papoulias for their contributions. The ProteomeCommons.org and Tranche Repository community resources are primarily sponsored by NCRR grant #P41-RR018627 and the NCI CPTC subcontract #27XS115. We also thank all the users of Tranche who have provided invaluable feedback and suggestions for the Tranche Project and ProteomeCommons.org.

Author information

Authors and Affiliations

Departments of Biological Chemistry, Bioinformatics and Chemistry, University of Michigan, Ann Arbor, MI, USA
Bryan E. Smith, James A. Hill, Mark A. Gjukich & Philip C. Andrews

Authors

Bryan E. Smith
View author publications
You can also search for this author in PubMed Google Scholar
James A. Hill
View author publications
You can also search for this author in PubMed Google Scholar
Mark A. Gjukich
View author publications
You can also search for this author in PubMed Google Scholar
Philip C. Andrews
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bryan E. Smith .

Editor information

Editors and Affiliations

Lead Discovery Center GmbH, Emil-Figge-Straße 76a76a, Dortmund, 44227, Germany
Michael Hamacher
Medizinisches Proteom-Center, Ruhr-Universität Bochum, Universitätsstraße 150, Bochum, 44801, Germany
Martin Eisenacher
Medizinisches Proteom-Center, Ruhr-Universität Bochum, Universitätsstraße 150, Bochum, 44801, Germany
Christian Stephan

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Smith, B.E., Hill, J.A., Gjukich, M.A., Andrews, P.C. (2011). Tranche Distributed Repository and ProteomeCommons.org. In: Hamacher, M., Eisenacher, M., Stephan, C. (eds) Data Mining in Proteomics. Methods in Molecular Biology, vol 696. Humana Press. https://doi.org/10.1007/978-1-60761-987-1_8

Download citation

DOI: https://doi.org/10.1007/978-1-60761-987-1_8
Published: 13 October 2010
Publisher Name: Humana Press
Print ISBN: 978-1-60761-986-4
Online ISBN: 978-1-60761-987-1
eBook Packages: Springer Protocols

Publish with us

Policies and ethics