Skip to main content

Simple Proteomics Data Analysis in the Object-Oriented PowerShell

  • Protocol
  • First Online:
Book cover Mass Spectrometry Data Analysis in Proteomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1007))

Abstract

Scripting languages such as Perl and Python are appreciated for solving simple, everyday tasks in bioinformatics. A more recent, object-oriented command shell and scripting language, Windows PowerShell, has many attractive features: an object-oriented interactive command line, fluent navigation and manipulation of XML files, ability to consume Web services from the command line, consistent syntax and grammar, rich regular expressions, and advanced output formatting. The key difference between classical command shells and scripting languages, such as bash, and object-oriented ones, such as PowerShell, is that in the latter the result of a command is a structured object with inherited properties and methods rather than a simple stream of characters. Conveniently, PowerShell is included in all new releases of Microsoft Windows and therefore already installed on most computers in classrooms and teaching labs. In this chapter we demonstrate how PowerShell in particular allows easy interaction with mass spectrometry data in XML formats, connection to Web services for tools such as BLAST, and presentation of results as formatted text or graphics. These features make PowerShell much more than “yet another scripting language.”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Stajich JE (2007) An introduction to BioPerl. Methods Mol Biol 406:535–548

    PubMed  CAS  Google Scholar 

  2. Cock PJ, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B, de Hoon MJ (2009) Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25:1422–1423

    Article  PubMed  CAS  Google Scholar 

  3. Reimers M, Carey VJ (2006) Bioconductor: an open source framework for bioinformatics and computational biology. Methods Enzymol 411:119–134

    Article  PubMed  CAS  Google Scholar 

  4. Schmitt T, Messina DN, Schreiber F, Sonnhammer EL (2011) Letter to the editor: SeqXML and OrthoXML: standards for sequence and orthology information. Brief Bioinform 12:485–488

    Article  PubMed  Google Scholar 

  5. NeXML—phylogenetic data as xml, http://www.nexml.org

  6. Han MV, Zmasek CM (2009) phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10:356

    Article  PubMed  Google Scholar 

  7. Spellman PT, Miller M, Stewart J, Troup C, Sarkans U, Chervitz S, Bernhart D, Sherlock G, Ball C, Lepage M, Swiatek M, Marks WL, Goncalves J, Markel S, Iordan D, Shojatalab M, Pizarro A, White J, Hubley R, Deutsch E, Senger M, Aronow BJ, Robinson A, Bassett D, Stoeckert CJ Jr, Brazma A (2002) Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol 3:RESEARCH0046

    Article  PubMed  Google Scholar 

  8. Picardi E, D'Antonio M, Carrabino D, Castrignano T, Pesole G (2011) ExpEdit: a webserver to explore human RNA editing in RNA-Seq experiments. Bioinformatics 27:1311–1312

    Article  PubMed  CAS  Google Scholar 

  9. SRA Format, http://www.ebi.ac.uk/ena/about/sra_format

  10. Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 22:1459–1466

    Article  PubMed  CAS  Google Scholar 

  11. Deutsch E (2008) mzML: a single, unifying data format for mass spectrometer output. Proteomics 8:2776–2777

    Article  PubMed  CAS  Google Scholar 

  12. Martens L, Chambers M, Sturm M, Kessner D, Levander F, Shofstahl J, Tang WH, Rompp A, Neumann S, Pizarro AD, Montecchi-Palazzi L, Tasman N, Coleman M, Reisinger F, Souda P, Hermjakob H, Binz PA, Deutsch EW (2011) mzML—a community standard for mass spectrometry data. Mol Cell Proteomics 10(R110):000133

    PubMed  Google Scholar 

  13. Keller A, Eng J, Zhang N, Li XJ, Aebersold R (2005) A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol 1(2005):0017

    PubMed  Google Scholar 

  14. Eisenacher M (2011) mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms. Methods Mol Biol 696:161–177

    Article  PubMed  CAS  Google Scholar 

  15. Pash—PowerShell open source cross-platform reimplementation, http://sourceforge.net/projects/pash

  16. Fourment M, Gillings MR (2008) A comparison of common programming languages used in bioinformatics. BMC Bioinformatics 9:82

    Article  PubMed  Google Scholar 

  17. Dudley JT, Butte AJ (2009) A quick guide for developing effective bioinformatics programming skills. PLoS Comput Biol 5:e1000589

    Article  PubMed  Google Scholar 

  18. Hay RT (2005) SUMO: a history of modification. Mol Cell 18:1–12

    Article  PubMed  CAS  Google Scholar 

  19. Aasland R, Abrams C, Ampe C, Ball LJ, Bedford MT, Cesareni G, Gimona M, Hurley JH, Jarchau T, Lehto VP, Lemmon MA, Linding R, Mayer BJ, Nagai M, Sudol M, Walter U, Winder SJ (2002) Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett 513:141–144

    Article  PubMed  CAS  Google Scholar 

  20. MSChart Control, http://www.microsoft.com/download/en/details.aspx?id=14422

  21. Charting with PowerShell (2011) http://blogs.technet.com/b/richard_macdonald/archive/2009/04/28/3231887.aspx

  22. Lam H, Deutsch EW, Eddes JS, Eng JK, King N, Stein SE, Aebersold R (2007) Development and validation of a spectral library searching method for peptide identification from MS/MS. Proteomics 7:655–667

    Article  PubMed  CAS  Google Scholar 

  23. Web Services at the EBI, http://www.ebi.ac.uk/Tools/webservices/

  24. SOAP-based BLAST Web Service (2011) http://www.ncbi.nlm.nih.gov/books/NBK55699/

  25. Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M, Carver T, Glover K, Pocock MR, Wipat A, Li P (2004) Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20:3045–3054

    Article  PubMed  CAS  Google Scholar 

  26. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410

    PubMed  CAS  Google Scholar 

  27. NCBI BLAST (SOAP) (2011) http://www.ebi.ac.uk/Tools/webservices/services/sss/ncbi_blast_soap

  28. Peptide VAALEGDVLGSYQHGAR from E. coli EF-Ts http://peptide.nist.gov/browser/peptide_stat.php?description=IT&organism=e_coli&pep_seq=VAALEGDVLGSYQHGAR

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Mohammed, Y., Palmblad, M. (2013). Simple Proteomics Data Analysis in the Object-Oriented PowerShell. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 1007. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-392-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-1-62703-392-3_17

  • Published:

  • Publisher Name: Humana Press, Totowa, NJ

  • Print ISBN: 978-1-62703-391-6

  • Online ISBN: 978-1-62703-392-3

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics