Model-Based Analysis of Quantitative Proteomics Data with Data Independent Acquisition Mass Spectrometry

Chen, Gengbo; Teo, Guo Shou; Teo, Guo Ci; Choi, Hyungwon

doi:10.1007/978-3-319-45809-0_7

Gengbo Chen⁸,
Guo Shou Teo⁸,
Guo Ci Teo⁸ &
…
Hyungwon Choi⁸

Part of the book series: Frontiers in Probability and the Statistical Sciences ((FROPROSTAS))

2988 Accesses

Abstract

In shotgun proteomics, more abundant peptides are selected for MS/MS fragmentation leading to sequence assignment and their quantitative abundance is computed from peak area of the extracted ion chromatogram. This analysis framework is called data dependent acquisition (DDA). However, the bias towards abundant peptides limits reproducible extraction of peptide signals for a large proportion of the proteome. Recent advances in next generation mass spectrometers enabled implementation of an alternative approach called data independent acquisition (DIA), which improves data quality in terms of dynamic range, measurement precision, and more importantly, reproducible detection. In this chapter, we review the process of generating quantitative proteomics data with DIA, and present a computational tool mapDIA designed for data processing and statistical analysis of the DIA proteomics data. Using an example of renal cancer data set, we demonstrate that fragment intensity data from DIA provide a reliable repeated measure of peptide abundance after careful filtering, and direct modeling of the hierarchical data (protein → peptide → fragment) improves the detection of differentially expressed proteins compared to the analysis using protein intensity data derived by summation of fragment intensities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Besag, J. (1986). On the statistical analysis of dirty pictures. Journal of the Royal Statistical Society: Series B, 48, 259–302.
MathSciNet MATH Google Scholar
Choi, H., Kim, S., Fermin, D., Tsou, C. C., & Nesvizhskii, A. I. (2015). QPROT: Statistical method for testing differential expression using protein-level intensity data in label-free quantitative proteomics. Journal of Proteomics, 129, 121–126.
Article Google Scholar
Clough, T., Key, M., Ott, I., Ragg, S., Schadow, G., & Vitek, O. (2009). Protein quantification in label-free LC-MS experiments. Journal of Proteome Research, 8(11), 5275–5284.
Article Google Scholar
Collins, B. C., Gillet, L. C., Rosenberger, G., Röst, H. L., Vichalkovski, A., Gstaiger, M., et al. (2013). Quantifying protein interaction dynamics by SWATH mass spectrometry: Application to the 14-3-3 system. Nature Methods, 10(12), 1246–1253.
Article Google Scholar
Craig, R., & Beavis, R. C. (2003). A method for reducing the time required to match protein sequences with tandem mass spectra. Rapid Communications in Mass Spectrometry, 17, 2310–2316.
Article Google Scholar
Egertson, J. D., Kuehn, A., Merrihew, G. E., Bateman, N. W., MacLean, B. X., Ting, Y. S., et al. (2013). Multiplexed MS/MS for improved data-independent acquisition. Nature Methods, 10, 744–746.
Article Google Scholar
Gillet, L. C., Navarro, P., Tate, S., Röst, H. L., Selevsek, N., Reiter, L., et al. (2012). Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: A new concept for consistent and accurate proteome analysis. Molecular & Cellular Proteomics, 11(6), O111.016717.
Article Google Scholar
Guo, T., Kouvonen, P., Koh, C. C., Gillet, L. C., Wolski, W., Röst, H. L., et al. (2015). Rapid mass spectrometric conversion of tissue biopsy samples into permanent quantitative digital proteome maps. Nature Medicine, 21(4), 407–413.
Article Google Scholar
Karpievitch, Y., Stanley, J., Taverner, T., Huang, J., Adkins, J. N., Ansong, C., et al. (2009). A statistical framework for protein quantitation in bottom-up MS-based proteomics. Bioinformatics, 25(16), 2028–2034.
Article Google Scholar
Lambert, J. P., Ivosev, G., Couzens, A. L., Larsen, B., Taipale, M., Lin, Z. Y., et al. (2013). Mapping differential interactomes by affinity purification coupled with data-independent mass spectrometry acquisition. Nature Methods, 10(12), 1239–1245.
Article Google Scholar
MacLean, B. X., Tomazela, D. M., Shulman, N., Chambers, M., Finney, G. L., Frewen, B., et al. (2010). Skyline: An open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics, 26(7), 966–968.
Article Google Scholar
Newton, M. A., Noueiry, A., Sarkar, D., & Ahlquist, P. (2004). Detecting differential gene expression with a semiparametric hierarchical mixture method. Biostatistics, 5(2), 155–176.
Article MATH Google Scholar
Panchaud, A., Scherl, A., Shaffer, S. A., von Haller, P. D., Kulasekara, H. D., Miller, S. I., et al. (2009). PAcIFIC: How to dive deeper into the proteomics ocean. Analytical Chemistry, 81(15), 6481–6488.
Article Google Scholar
Prakash, A., Peterman, S., Ahmad, S., Sarracino, D., Frewen, B., Vogelsang, M., et al. (2013). Hybrid data acquisition and processing strategies with increased throughput and selectivity: pSMART analysis for global qualitative and quantitative analysis. Journal of Proteome Research, 12, 5415–5430.
Google Scholar
Röst, H. L., Rosenberger, G., Navarro, P., Gillet, L., Miladinović, S. M., Schubert, O. T., et al. (2014). Openswath enables automated, targeted analysis of data-independent acquisition MS data. Nature Biotechnology, 32(3), 219–223.
Article Google Scholar
Schubert, O. T., Gillet, L. C., Collins, B. C., Navarro, P., Rosenberger, G., Wolski, W., et al. (2015). Building high-quality assay libraries for targeted analysis of SWATH MS data. Nature Protocols, 10(3), 426–441.
Article Google Scholar
Silva, J. C., Gorenstein, M. V., Li, G. Z., Vissers, J. P. C., & Geromanos, S. J. (2006). Absolute quantification of proteins by LCMSE: A virtue of parallel ms acquisition. Molecular & Cellular Proteomics, 5, 144–156.
Article Google Scholar
Steen, H., & Mann, M. (2004). The abc’s (and xyz’s) of peptide sequencing. Nature Reviews Molecular Cell Biology, 5(9), 699–711.
Article Google Scholar
Teo, G. S., Kim, S., Tsou, C.-C., Gingras, A.-C., Nesvizhskii, A. I., & Choi, H. (2015). mapDIA: Preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry. Journal of Proteomics, 129, 108–120.
Article Google Scholar
Tsou, C.-C., Avtonomov, D., Larsen, B., Tucholska, M., Choi, H., Gingras, A.-C., et al. (2015). DIA-Umpire: Comprehensive computational framework for data independent acquisition proteomics. Nature Methods, 12(3), 258–264.
Article Google Scholar
Venable, J. D., Dong, M. Q., Wohlschlegel, J., Dillin, A., & Yatesm, J. R. (2004). Automated approach for quantitative analysis of complex peptide mixtures from tandem mass spectra. Nature Methods, 1(1), 39–45.
Article Google Scholar
Wei, Z., & Li, H. (2007). A Markov random field model for network-based analysis of genomic data. Bioinformatics, 23(12), 1537–1544.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Saw Swee Hock School of Public Health, National University of Singapore, Singapore, Singapore
Gengbo Chen, Guo Shou Teo, Guo Ci Teo & Hyungwon Choi

Authors

Gengbo Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guo Shou Teo
View author publications
You can also search for this author in PubMed Google Scholar
Guo Ci Teo
View author publications
You can also search for this author in PubMed Google Scholar
Hyungwon Choi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hyungwon Choi .

Editor information

Editors and Affiliations

Department of Biostatistics, University of Florida, Gainesville, Florida, USA
Susmita Datta
Department of Medical Statistics and Bioinformatics, Leiden University Medical Centre, RC Leiden, The Netherlands
Bart J. A. Mertens

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chen, G., Teo, G.S., Teo, G.C., Choi, H. (2017). Model-Based Analysis of Quantitative Proteomics Data with Data Independent Acquisition Mass Spectrometry. In: Datta, S., Mertens, B. (eds) Statistical Analysis of Proteomics, Metabolomics, and Lipidomics Data Using Mass Spectrometry. Frontiers in Probability and the Statistical Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-45809-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-45809-0_7
Published: 16 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45807-6
Online ISBN: 978-3-319-45809-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics