Abstract
Scripting languages such as Perl and Python are appreciated for solving simple, everyday tasks in bioinformatics. A more recent, object-oriented command shell and scripting language, Windows PowerShell, has many attractive features: an object-oriented interactive command line, fluent navigation and manipulation of XML files, ability to consume Web services from the command line, consistent syntax and grammar, rich regular expressions, and advanced output formatting. The key difference between classical command shells and scripting languages, such as bash, and object-oriented ones, such as PowerShell, is that in the latter the result of a command is a structured object with inherited properties and methods rather than a simple stream of characters. Conveniently, PowerShell is included in all new releases of Microsoft Windows and therefore already installed on most computers in classrooms and teaching labs. In this chapter we demonstrate how PowerShell in particular allows easy interaction with mass spectrometry data in XML formats, connection to Web services for tools such as BLAST, and presentation of results as formatted text or graphics. These features make PowerShell much more than “yet another scripting language.”
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Stajich JE (2007) An introduction to BioPerl. Methods Mol Biol 406:535–548
Cock PJ, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, Friedberg I, Hamelryck T, Kauff F, Wilczynski B, de Hoon MJ (2009) Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25:1422–1423
Reimers M, Carey VJ (2006) Bioconductor: an open source framework for bioinformatics and computational biology. Methods Enzymol 411:119–134
Schmitt T, Messina DN, Schreiber F, Sonnhammer EL (2011) Letter to the editor: SeqXML and OrthoXML: standards for sequence and orthology information. Brief Bioinform 12:485–488
NeXML—phylogenetic data as xml, http://www.nexml.org
Han MV, Zmasek CM (2009) phyloXML: XML for evolutionary biology and comparative genomics. BMC Bioinformatics 10:356
Spellman PT, Miller M, Stewart J, Troup C, Sarkans U, Chervitz S, Bernhart D, Sherlock G, Ball C, Lepage M, Swiatek M, Marks WL, Goncalves J, Markel S, Iordan D, Shojatalab M, Pizarro A, White J, Hubley R, Deutsch E, Senger M, Aronow BJ, Robinson A, Bassett D, Stoeckert CJ Jr, Brazma A (2002) Design and implementation of microarray gene expression markup language (MAGE-ML). Genome Biol 3:RESEARCH0046
Picardi E, D'Antonio M, Carrabino D, Castrignano T, Pesole G (2011) ExpEdit: a webserver to explore human RNA editing in RNA-Seq experiments. Bioinformatics 27:1311–1312
SRA Format, http://www.ebi.ac.uk/ena/about/sra_format
Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R (2004) A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 22:1459–1466
Deutsch E (2008) mzML: a single, unifying data format for mass spectrometer output. Proteomics 8:2776–2777
Martens L, Chambers M, Sturm M, Kessner D, Levander F, Shofstahl J, Tang WH, Rompp A, Neumann S, Pizarro AD, Montecchi-Palazzi L, Tasman N, Coleman M, Reisinger F, Souda P, Hermjakob H, Binz PA, Deutsch EW (2011) mzML—a community standard for mass spectrometry data. Mol Cell Proteomics 10(R110):000133
Keller A, Eng J, Zhang N, Li XJ, Aebersold R (2005) A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol 1(2005):0017
Eisenacher M (2011) mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms. Methods Mol Biol 696:161–177
Pash—PowerShell open source cross-platform reimplementation, http://sourceforge.net/projects/pash
Fourment M, Gillings MR (2008) A comparison of common programming languages used in bioinformatics. BMC Bioinformatics 9:82
Dudley JT, Butte AJ (2009) A quick guide for developing effective bioinformatics programming skills. PLoS Comput Biol 5:e1000589
Hay RT (2005) SUMO: a history of modification. Mol Cell 18:1–12
Aasland R, Abrams C, Ampe C, Ball LJ, Bedford MT, Cesareni G, Gimona M, Hurley JH, Jarchau T, Lehto VP, Lemmon MA, Linding R, Mayer BJ, Nagai M, Sudol M, Walter U, Winder SJ (2002) Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett 513:141–144
MSChart Control, http://www.microsoft.com/download/en/details.aspx?id=14422
Charting with PowerShell (2011) http://blogs.technet.com/b/richard_macdonald/archive/2009/04/28/3231887.aspx
Lam H, Deutsch EW, Eddes JS, Eng JK, King N, Stein SE, Aebersold R (2007) Development and validation of a spectral library searching method for peptide identification from MS/MS. Proteomics 7:655–667
Web Services at the EBI, http://www.ebi.ac.uk/Tools/webservices/
SOAP-based BLAST Web Service (2011) http://www.ncbi.nlm.nih.gov/books/NBK55699/
Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M, Carver T, Glover K, Pocock MR, Wipat A, Li P (2004) Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20:3045–3054
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410
NCBI BLAST (SOAP) (2011) http://www.ebi.ac.uk/Tools/webservices/services/sss/ncbi_blast_soap
Peptide VAALEGDVLGSYQHGAR from E. coli EF-Ts http://peptide.nist.gov/browser/peptide_stat.php?description=IT&organism=e_coli&pep_seq=VAALEGDVLGSYQHGAR
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Mohammed, Y., Palmblad, M. (2013). Simple Proteomics Data Analysis in the Object-Oriented PowerShell. In: Matthiesen, R. (eds) Mass Spectrometry Data Analysis in Proteomics. Methods in Molecular Biology, vol 1007. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-392-3_17
Download citation
DOI: https://doi.org/10.1007/978-1-62703-392-3_17
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-62703-391-6
Online ISBN: 978-1-62703-392-3
eBook Packages: Springer Protocols