Abstract
Protein peptide identification from a tandem mass spectrum (MS/MS) is a challenging task. Previous approaches for peptide identification with database search are time consuming due to huge search space. De novo sequencing approaches which derive a peptide sequence directly from a MS/MS spectrum usually are of high complexities and the accuracies of the approaches highly depend on the quality of the spectra. In this paper, we developed an accurate and efficient algorithm for peptide identification. Our work consisted of the following steps. Firstly, we found a pair of complementary mass peaks that are b-ion and y-ion, respectively. We then used the two mass peaks as two tree nodes and extend the trees such that in the end the nodes of the trees are elements of a b-ion set and a y-ion set, respectively. Secondly, we applied breadth first search to the trees to generate peptide sequence tags. Finally, we designed a weight function to evaluate the reliabilities of the tags and rank the tags. Our experiment on 2620 experimental MS/MS spectra with one PTM showed that our algorithm achieved better accuracy than other approaches with higher efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wilkins, M.R., Gasteiger, E., Gooley, A.A., Herbert, B.R., Molloy, M.P., Binz, P.A., Ou, K., Sanchez, J.C., Bairoch, A., Williams, K.L., Hochstrasser, D.F.: High-throughput Mass Spectrometric Discovery of Protein Post-Translational Modifications. Journal of Molecular Biology 289, 645–657 (1999)
Walsh, C.T.: Posttranslational Modification of Proteins: Expanding Nature’s Inventory. Roberts & Company Publishers, Englewood, Colorado (2005)
Mann, M., Jensen, O.N.: Proteomic Analysis of Post-Translational Modifications. Nat. Biotechnol. 21, 255–261 (2003)
Han, Y., Ma, B., Zhang, K.: SPIDER: Software for Protein Identification from Sequence Tags Containing Sequencing Error. Journal of Bioinformatics and Computational Biology, 97–716 (2005)
Yan, B., Zhou, T., Wang, P., Liu, Z., Emanuele II, V.A., Olman, V., Xu, Y.: A Point-Process Model for Rapid Identification of Post-Translational Modifications. In: Proceedings of 2006 Pacific Symposium on Biocomputing, pp. 327–338 (2006)
Liu, C., Yan, B., Song, Y., Xu, Y., Cai, L.: Fast De Novo Peptide Sequencing and Spectral Alignment via Tree Decomposition. In: Proceedings of the 11th International Pacific Symposium on Biocomputing (PSB 2006), pp. 255–266 (2006)
Fu, Y., Jia, W., Lu, Z., Wang, H., Yuan, Z., Chi, H., Li, Y., Xiu, L., Wang, W., Liu, C., et al.: Efficient Discovery of Abundant Post-Translational Modifications and Spectral Pairs Using Peptide Mass and Retention Time Differences. BMC Bioinformatics 10(Suppl 1), S50 (2009)
Ma, B., Zhang, K., Hendrie, C., Liang, C., Li, M., Doherty-Kirby, A., Lajoie, G.: PEAKS: Powerful Software for Peptide De Novo Sequencing by Tandem Mass Spectrometry. Rapid Communication in Mass Spectrometry 17, 2337–2342 (2003)
Searle, B.C., Dasari, S., Turner, M., Reddy, A.P., Choi, D., Wilmarth, P.A., McCormack, A.L., David, L.L., Nagalla, S.R.: High-Throughput Identification of Proteins and Unanticipated Sequence Modifications Using a Mass-Based Alignment Algorithm for MS/MS De Novo Sequencing Results. Anal. Chem. 76, 2220–2230 (2004)
Yan, B., Pan, C., Olman, V.N., Hettich, R.L., Xu, Y.: A Graph-Theoretic Approach for the Separation of b and y Ions in Tandem Mass Spectrometry. Bioinformatics 21, 563–574 (2005)
Perkins, D.N., Pappin, D.J., Creasy, D.M., Cottrell, J.S.: Probability-based Protein Identification by Searching Sequence Databases Using Mass Spectrometry Data. Electrophoresis 20, 3551–3567 (1999)
Eng, J.K., McCormack, A.L., Yates III, J.R.: An Approach to Correlate Tandem Mass Spectral Data of Peptides with Amino Acid Sequences in A Protein Database. Journal of the American Society of Mass Spectrometry 5, 976–989 (1994)
Frank, A., Tanner, S., Pevzner, P.: Peptide Sequence Tags for Fast Database Search in Mass-Spectrometry. Journal of Proteome Research 4, 1287–1295 (2005)
Frank, A., Pevzner, P.: PepNovo: De Novo Peptide Sequencing via Probabilistic Network Modeling. Anal. Chem. 77, 964–973 (2005)
Tabb, D.L., Saraf, A., Ates, J.R.: GutenTag: High-Throughput Sequence Tagging via an Empirically Derived Fragmentation Model. Analytical Chemistry 75, 6415–6421 (2003)
Tsur, D., Tanner, S., Zandi, E., Bafna, V., Pevzner, P.: Identification of Post-translational Modifications by Blind Search of Mass Spectra. Nature Biotechnology 23, 1562–1567 (2005)
Tanner, S., Shu, H., Frank, A., Wang, L.C., Zandi, E., Mumby, M., Pevzner, P.A., Bafna, V.: InsPecT: Identification of Posttranslationally Modified Peptides from Tandem Mass Spectra. Analytical Chemistry 77, 4626–4639 (2005)
Liu, C., Yan, B., Song, Y., Xu, Y., Cai, L.: Peptide Sequence Tag-Based Blind Identification of Post-Translational Modifications with Point Process Model. In: The 14th International Conference on Intelligent Systems for Molecular Biology (ISMB 2006), Fortaleza, Brazil (2006)
Liu, C., Yan, B., Song, Y., Xu, Y., Cai, L.: Graph Tree Decomposition Based Fast Peptide Sequencing and Spectral Alignment. International Journal of Computational Science, Special Issue on Bioinformatics and Computational Biology 2, 632–645 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, H., Scott, L., Liu, C., Rwebangira, M., Burge, L., Southerland, W. (2011). Rapid and Accurate Generation of Peptide Sequence Tags with a Graph Search Approach. In: Chen, J., Wang, J., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2011. Lecture Notes in Computer Science(), vol 6674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21260-4_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-21260-4_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21259-8
Online ISBN: 978-3-642-21260-4
eBook Packages: Computer ScienceComputer Science (R0)