OpenKnowledge for peer-to-peer experimentation in protein identification by MS/MS
Traditional scientific workflow platforms usually run individual experiments with little evaluation and analysis of performance as required by automated experimentation in which scientists are being allowed to access numerous applicable workflows rather than being committed to a single one. Experimental protocols and data under a peer-to-peer environment could potentially be shared freely without any single point of authority to dictate how experiments should be run. In such environment it is necessary to have mechanisms by which each individual scientist (peer) can assess, locally, how he or she wants to be involved with others in experiments. This study aims to implement and demonstrate simple peer ranking under the OpenKnowledge peer-to-peer infrastructure by both simulated and real-world bioinformatics experiments involving multi-agent interactions.
A simulated experiment environment with a peer ranking capability was specified by the Lightweight Coordination Calculus (LCC) and automatically executed under the OpenKnowledge infrastructure. The peers such as MS/MS protein identification services (including web-enabled and independent programs) were made accessible as OpenKnowledge Components (OKCs) for automated execution as peers in the experiments. The performance of the peers in these automated experiments was monitored and evaluated by simple peer ranking algorithms.
Peer ranking experiments with simulated peers exhibited characteristic behaviours, e.g., power law effect (a few dominant peers dominate), similar to that observed in the traditional Web. Real-world experiments were run using an interaction model in LCC involving two different types of MS/MS protein identification peers, viz., peptide fragment fingerprinting (PFF) and de novo sequencing with another peer ranking algorithm simply based on counting the successful and failed runs. This study demonstrated a novel integration and useful evaluation of specific proteomic peers and found MASCOT to be a dominant peer as judged by peer ranking.
The simulated and real-world experiments in the present study demonstrated that the OpenKnowledge infrastructure with peer ranking capability can serve as an evaluative environment for automated experimentation.
- Robertson D: Multi-agent coordination as distributed logic programming. Proceedings of the International Conference on Logic Programming 2004. Sant-Malo
- Hull D, Wolstencroft K, Stevens R, Goble C, Pocock M, Li P, Oinn T: Taverna: a tool for building and running workflows of services. Nucl Acids Res 2006, 34:w729-w732. CrossRef
- De Roure D, Goble C: Software design for empowering scientists. IEEE Software 2009,26(1):88–95. CrossRef
- Shukla AK, Futrell JK: Tandem mass spectrometry: dissociation of ions by collisional activation. J Mass Spectrom 2000,35(9):1069–90. CrossRef
- Perkins DN, Pappin DJC, Creasy DM, Cottrell JS: Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 1999,20(18):3551–3567. CrossRef
- Geer LY, Markey SP, Kowalak JA, Wagner L, Xu M, Maynard DM, Yang X, Shi W, Bryant SH: Open mass spectrometry search algorithm. J Proteome Res 2004,3(5):958–64. CrossRef
- Eng JK, McCormack AL, Yates JR: An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J Am Soc Mass Spec 1994, 5:976–989. CrossRef
- Dancik V, Addona TA, Clauser KR, Vath JE, Pevzner PA: De novo peptide sequencing via tandem mass spectrometry. J Comput Biol 1999,6(3/4):327–342. CrossRef
- Hernandez P, Muller M, Appel RD: Automated protein identification by tandem mass spectrometry: issues and strategies. Mass Spectrom Rev 2006, 25:235–254. CrossRef
- Page L: Methods for node ranking in a linked database. US Patent 2001. No. 6,285,999
- Frank A, Pevzner P: PepNovo: de novo peptide sequencing via probabilistic network modelling. Anal Chem 2005, 77:964–973. CrossRef
- Johnson RS, Taylor JA: Searching sequence databases via de novo peptide sequencing by tandem mass spectrometry. Mol Biotechnology 2002,22(3):301–315. CrossRef
- Shevchenko A, Sunyaev S, Loboda A, Shevchenko A, Bork P, Ens W, Standing KG: Charting the proteomes of organisms with unsequenced genomes by MALDI-quadrupole time-of-flight mass spectrometry and BLAST homology searching. Anal Chem 2001,73(9):1917–26. CrossRef
- Habermann B, Oegema J, Sunyaev S, Shevchenko A: The power and the limitations of cross-species protein identification by mass spectrometry-driven sequence similarity searches. Mol Cell Proteomics 2004, 3:238–249. CrossRef
- Xiong L, Liu L: PeerTrust: Supporting Reputation-Based Trust for Peer-to-Peer Electronic Communities. IEEE Transactions of Knowledge and Data Engineering 2004,16(7):843–857. CrossRef
- OpenKnowledge for peer-to-peer experimentation in protein identification by MS/MS
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
- Online Date
- December 2011
- Online ISSN
- BioMed Central
- Additional Links
- Author Affiliations
- 1. School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK
- 2. State Key Laboratory of Quality Research in Chinese Medicine, University of Macau, Macao SAR, China
- 3. Institute of Chinese Medical Sciences, University of Macau, Macao SAR, China
- 4. Division of Ecology and Evolution, Imperial College London, London, SW7 2AZ, UK
- 5. Department of Biomolecular Engineering, University of California, Santa Cruz, CA, 95064, USA