Skip to main content

Efficient and Accurate Algorithm for Cleaved Fragments Prediction (CFPA) in Protein Sequences Dataset Based on Consensus and Its Variants: A Novel Degradomics Prediction Application

  • Protocol
  • First Online:
Neuroproteomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1598))

Abstract

Degradomics is a novel discipline that involves determination of the proteases/substrate fragmentation profile, called the substrate degradome, and has been recently applied in different disciplines. A major application of degradomics is its utility in the field of biomarkers where the breakdown products (BDPs) of different protease have been investigated. Among the major proteases assessed, calpain and caspase proteases have been associated with the execution phases of the pro-apoptotic and pro-necrotic cell death, generating caspase/calpain-specific cleaved fragments. The distinction between calpain and caspase protein fragments has been applied to distinguish injury mechanisms. Advanced proteomics technology has been used to identify these BDPs experimentally. However, it has been a challenge to identify these BDPs with high precision and efficiency, especially if we are targeting a number of proteins at one time. In this chapter, we present a novel bioinfromatic detection method that identifies BDPs accurately and efficiently with validation against experimental data. This method aims at predicting the consensus sequence occurrences and their variants in a large set of experimentally detected protein sequences based on state-of-the-art sequence matching and alignment algorithms. After detection, the method generates all the potential cleaved fragments by a specific protease. This space and time-efficient algorithm is flexible to handle the different orientations that the consensus sequence and the protein sequence can take before cleaving. It is O(mn) in space complexity and O(Nmn) in time complexity, with N number of protein sequences, m length of the consensus sequence, and n length of each protein sequence. Ultimately, this knowledge will subsequently feed into the development of a novel tool for researchers to detect diverse types of selected BDPs as putative disease markers, contributing to the diagnosis and treatment of related disorders.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. McQuibban GA, Gong JH, Tam EM, McCulloch CA, Clark-Lewis I, Overall CM (2000) Inflammation dampened by gelatinase A cleavage of monocyte chemoattractant protein-3. Science (New York) 289:1202–1206

    Article  CAS  Google Scholar 

  2. Shen Y, Tolic N, Liu T, Zhao R, Petritis BO, Gritsenko MA, Camp DG, Moore RJ, Purvine SO, Esteva FJ, Smith RD (2010) Blood peptidome-degradome profile of breast cancer. PLoS One 5:e13133

    Article  PubMed  PubMed Central  Google Scholar 

  3. Lopez-Otin C, Overall CM (2002) Protease degradomics: a new challenge for proteomics. Nat Rev Mol Cell Biol 3:509–519

    Article  CAS  PubMed  Google Scholar 

  4. Pellikainen JM, Ropponen KM, Kataja VV, Kellokoski JK, Eskelinen MJ, Kosma VM (2004) Expression of matrix metalloproteinase (MMP)-2 and MMP-9 in breast cancer with a special reference to activator protein-2, HER2, and prognosis. Clin Cancer Res 10:7621–7628

    Article  CAS  PubMed  Google Scholar 

  5. Itoh Y, Nagase H (2002) Matrix metalloproteinases in cancer. Essays Biochem 38:21–36

    Article  CAS  PubMed  Google Scholar 

  6. Fuhrman-Luck RA, Silva ML, Dong Y, Irving-Rodgers H, Stoll T, Hastie ML, Loessner D, Gorman JJ, Clements JA (2014) Proteomic and other analyses to determine the functional consequences of deregulated kallikrein-related peptidase (KLK) expression in prostate and ovarian cancer. Proteomics Clin Appl 8:403–415

    Article  CAS  PubMed  Google Scholar 

  7. Vihinen P, Kahari VM (2002) Matrix metalloproteinases in cancer: prognostic markers and therapeutic targets. Int J Cancer 99:157–166

    Article  CAS  PubMed  Google Scholar 

  8. Butler GS, Overall CM (2009) Updated biological roles for matrix metalloproteinases and new "intracellular" substrates revealed by degradomics. Biochemistry 48:10830–10845

    Article  CAS  PubMed  Google Scholar 

  9. Cauwe B, Martens E, Proost P, Opdenakker G (2009) Multidimensional degradomics identifies systemic autoantigens and intracellular matrix proteins as novel gelatinase B/MMP-9 substrates. Integr Biol (Camb) 1:404–426

    Article  CAS  Google Scholar 

  10. Patterson NL, Iyer RP, de Castro Bras LE, Li Y, Andrews TG, Aune GJ, Lange RA, Lindsey ML (2013) Using proteomics to uncover extracellular matrix interactions during cardiac remodeling. Proteomics Clin Appl 7:516–527

    Article  CAS  PubMed  Google Scholar 

  11. Clark TG, Conway SJ, Scott IC, Labosky PA, Winnier G, Bundy J, Hogan BL, Greenspan DS (1999) The mammalian Tolloid-like 1 gene, Tll1, is necessary for normal septation and positioning of the heart. Development 126:2631–2642

    CAS  PubMed  Google Scholar 

  12. Berry R, Jowitt TA, Ferrand J, Roessle M, Grossmann JG, Canty-Laird EG, Kammerer RA, Kadler KE, Baldock C (2009) Role of dimerization and substrate exclusion in the regulation of bone morphogenetic protein-1 and mammalian tolloid. Proc Natl Acad Sci U S A 106:8561–8566

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Wang KK (2000) Calpain and caspase: can you tell the difference? Trends Neurosci 23:20–26

    Article  PubMed  Google Scholar 

  14. Kobeissy FH, Liu MC, Yang Z, Zhang Z, Zheng W, Glushakova O, Mondello S, Anagli J, Hayes RL, Wang KK (2015) Degradation of betaII-spectrin protein by calpain-2 and caspase-3 under neurotoxic and traumatic brain injury conditions. Mol Neurobiol 52:696–709

    Article  CAS  PubMed  Google Scholar 

  15. Kobeissy FH, Ottens AK, Zhang Z, Liu MC, Denslow ND, Dave JR, Tortella FC, Hayes RL, Wang KK (2006) Novel differential neuroproteomics analysis of traumatic brain injury in rats. Mol Cell Proteomics 5:1887–1898

    Article  CAS  PubMed  Google Scholar 

  16. Kobeissy FH, Sadasivan S, Liu J, Gold MS, Wang KK (2008) Psychiatric research: psychoproteomics, degradomics and systems biology. Expert Rev Proteomics 5:293–314

    Article  CAS  PubMed  Google Scholar 

  17. Alawieh A, Sabra Z, Nokkari A, El-Assaad A, Mondello S, Zaraket F, Fadlallah B, Kobeissy FH (2014) Bioinformatics approach to understanding interacting pathways in neuropsychiatric disorders. Methods Mol Biol 1168:157–172

    Article  PubMed  Google Scholar 

  18. Godovac-Zimmermann J (2012) The 9th Siena meeting: from genome to proteome: open innovations. Expert Rev Proteomics 9:591–594

    Article  CAS  PubMed  Google Scholar 

  19. Huesgen PF, Overall CM (2012) N- and C-terminal degradomics: new approaches to reveal biological roles for plant proteases from substrate identification. Physiol Plant 145:5–17

    Article  CAS  PubMed  Google Scholar 

  20. Doucet A, Overall CM (2008) Protease proteomics: revealing protease in vivo functions using systems biology approaches. Mol Aspects Med 29:339–358

    Article  CAS  PubMed  Google Scholar 

  21. Overall CM, Dean RA (2006) Degradomics: systems biology of the protease web. Pleiotropic roles of MMPs in cancer. Cancer Metastasis Rev 25:69–75

    Article  PubMed  Google Scholar 

  22. Glantz SB, Cianci CD, Iyer R, Pradhan D, Wang KK, Morrow JS (2007) Sequential degradation of alphaII and betaII spectrin by calpain in glutamate or maitotoxin-stimulated cells. Biochemistry 46:502–513

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410

    Article  CAS  PubMed  Google Scholar 

  24. Knuth D, Morris J, Pratt V (1977) Fast Pattern Matching in Strings. SIAM J Comput 6

    Google Scholar 

  25. Lipman DJ, Pearson WR (1985) Rapid and sensitive protein similarity searches. Science 227:1435–1441

    Article  CAS  PubMed  Google Scholar 

  26. Ning Z, Cox AJ, Mullikin JC (2001) SSAHA: a fast search method for large DNA databases. Genome Res 11:1725–1729

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Ma B, Tromp J, Li M (2002) PatternHunter: faster and more sensitive homology search. Bioinformatics 18:440–445

    Article  CAS  PubMed  Google Scholar 

  28. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL (2004) Versatile and open software for comparing large genomes. Genome Biol 5:R12

    Article  PubMed  PubMed Central  Google Scholar 

  29. Lecroq T (2007) Fast exact string matching algorithms. Inf Process Lett 102:229–235

    Article  Google Scholar 

  30. Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48:443–453

    Article  CAS  PubMed  Google Scholar 

  31. Srikantha A, Bopardikar AS, Kaipa KK, Venkataraman P, Lee K, Ahn T, Narayanan R (2010) A fast algorithm for exact sequence search in biological sequences using polyphase decomposition. Bioinformatics 26:i414–i419

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Li H, Durbin R (2010) Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26:589–595

    Article  PubMed  PubMed Central  Google Scholar 

  33. Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J Mol Biol 147:195–197

    Article  CAS  PubMed  Google Scholar 

  34. Blundell TL, Srinivasan N (1996) Symmetry, stability, and dynamics of multidomain and multicomponent protein systems. Proc Natl Acad Sci U S A 93:14243–14248

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgment

This work was supported by a grant from the National Council for Scientific Research (CNRS) in Lebanon.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Firas H. Kobeissy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media LLC

About this protocol

Cite this protocol

El-Assaad, A., Dawy, Z., Nemer, G., Hajj, H., Kobeissy, F.H. (2017). Efficient and Accurate Algorithm for Cleaved Fragments Prediction (CFPA) in Protein Sequences Dataset Based on Consensus and Its Variants: A Novel Degradomics Prediction Application. In: Kobeissy, F., Stevens, Jr., S. (eds) Neuroproteomics. Methods in Molecular Biology, vol 1598. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-6952-4_17

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-6952-4_17

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-6950-0

  • Online ISBN: 978-1-4939-6952-4

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics