Skip to main content

Identifying Driver Potential in Passenger Genes Using Chemical Properties of Mutated and Surrounding Amino Acids

  • Chapter
  • First Online:
Computational Intelligence and Big Data Analytics

Part of the book series: SpringerBriefs in Applied Sciences and Technology ((BRIEFSFOMEBI))

  • 612 Accesses

Abstract

One of the major challenging tasks today is to understand uncontrolled growth of tumor. Generally, two types of genes and mutations are observed in tumor cells. The driver mutations, within genes, confer a selective growth advantage and are responsible for causing the cancer. The passenger mutations are observed in those genes that, generally, do not provide growth advantage of cells in tumor. In tumor genome, more frequently mutated genes are considered as driver, whereas less frequent mutations are known as passenger genes. The prime aim of the present article is to identify the set of passenger genes that may have driver potential. The current analysis completely deals with the amino acid sequence and embedded chemical properties of amino acids present in both driver and passenger proteins. We picked up mutated and surrounding 21 amino acids, i.e. one mutated non-synonymous amino acid in the middle and 10 amino acids on the both sides of the mutated amino acid, in driver proteins and compared the presence of this length of amino acids having the same mutated amino acid, in passenger protein sequence. In this comparison of pairwise alignment, we generated similarity score between driver and passenger proteins. Based on the similarity index (i.e., alignment score) above the median value, we considered a set of passenger genes as having driver potential. Some of these passenger genes also possess reported biological functions and are found in the pathways of cancer development. So, these passenger genes or proteins, with diver potential, may play crucial role for cancer development.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 59.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Sjöblom T, Jones S, Wood LD, Parsons DW, Lin J, Barber TD, Mandelker D, Leary RJ, Ptak J, Silliman N et al (2006) The consensus coding sequences of human breast and colorectal cancers. Science 314(5797):268–274

    Article  Google Scholar 

  2. Greenman C, Stephens P, Smith R, Dalgliesh GL, Hunter C, Bignell G, Davies H, Teague J, Butler A, Stevens C et al (2007) Patterns of somatic mutation in human cancer genomes. Nature 446(7132):153–158

    Article  Google Scholar 

  3. Stratton MR, Campbell PJ, Futreal PA (2009) The cancer genome. Nature 458(7239):719–724

    Article  Google Scholar 

  4. Pon JR, Marra MA (2015) Driver and passenger mutations in cancer. Annu Rev Pathol Mech Dis 10:25–50

    Article  Google Scholar 

  5. Luo SY, Lam DC (2013) Oncogenic driver mutations in lung cancer. Transl Respir Med 1(1):6

    Google Scholar 

  6. Walczak AM, Nicolaisen LE, Plotkin JB, Desai MM (2011) The structure of genealogies in the presence of purifying selection: A “fitness-class coalescent”. Genetics 111

    Google Scholar 

  7. Fröhling S, Scholl C, Levine RL, Loriaux M, Boggon TJ, Bernard OA, Berger R, Döhner H, Döhner K, Ebert BL et al (2007) Identification of driver and passenger mutations of FLT3 by high-throughput DNA sequence analysis and functional assessment of candidate alleles. Cancer Cell 12(6):501–513

    Article  Google Scholar 

  8. Dimitrakopoulos CM, Beerenwinkel N (2017) Computational approaches for the identification of cancer genes and pathways. Wiley Interdisc Rev Syst Biol Med 9(1):e1364

    Article  Google Scholar 

  9. Carter H, Chen S, Isik L, Tyekucheva S, Velculescu VE, Kinzler KW, Vogelstein B, Karchin R (2009) Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations. Cancer Res 69(16):6660–6667

    Article  Google Scholar 

  10. Nei M, Gojobori T (1986) Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol 3(5):418–426

    Google Scholar 

  11. https://www.ncbi.nlm.nih.gov

  12. Das JK, Majumder A, Choudhury PP, Mukhopadhyay B (2016) Understanding of genetic code degeneracy and new way of classifying of protein family: a mathematical approach. In: 2016 IEEE 6th international conference on advanced computing (IACC). IEEE, pp 262–267

    Google Scholar 

  13. https://teaching.ncl.ac.uk/bms/wiki/index.php

  14. Meister A et al (1957) Biochemistry of the amino acids

    Google Scholar 

  15. Das JK, Das P, Ray KK, Choudhury PP, Jana SS (2016) Mathematical characterization of protein sequences using patterns as chemical group combinations of amino acids. PloS One 11(12):0167651

    Article  Google Scholar 

  16. Das JK, Choudhury PP (2017) Chemical property based sequence characterization of PpcA and its homolog proteins PpcB-E: a mathematical approach. PloS One 12(3):0175031

    Google Scholar 

  17. Weathers EA, Paulaitis ME, Woolf TB, Hoh JH (2004) Reduced amino acid alphabet is sufficient to accurately recognize intrinsically disordered protein. FEBS Lett 576(3):348–352

    Article  Google Scholar 

  18. Murphy LR, Wallqvist A, Levy RM (2000) Simplified amino acid alphabets for protein fold recognition and implications for folding. Protein Eng 13(3):149–152

    Article  Google Scholar 

  19. Li T, Fan K, Wang J, Wang W (2003) Reduction of protein sequence complexity by residue grouping. Protein Eng 16(5):323–330

    Article  Google Scholar 

  20. Basak P, Maitra-Majee S, Das JK, Mukherjee A, Dastidar SG, Choudhury PP, Majumder AL (2017) An evolutionary analysis identifies a conserved pentapeptide stretch containing the two essential lysine residues for rice L-myo-inositol 1-phosphate synthase catalytic activity. PloS One 12(9):0185351

    Article  Google Scholar 

  21. Van Rechem C, Whetstine JR (2014) Examining the impact of gene variants on histone lysine methylation. Biochim Biophys Acta (BBA) Gene Regul Mech 1839(12):1463–1476

    Article  Google Scholar 

  22. Wu H-T, Hajirasouliha I, Raphael BJ (2014) A combinatorial algorithm to identify independent and recurrent copy number aberrations across cancer types. In: 2014 IEEE 4th international conference on computational advances in bio and medical sciences (ICCABS). IEEE, p 1

    Google Scholar 

  23. Baur B, Bozdag S (2017) Processdriver: a computational pipeline to identify copy number drivers and associated disrupted biological processes in cancer. Genomics 109(3):233–240

    Article  Google Scholar 

  24. Wei P-J, Zhang D, Xia J, Zheng C-H (2016) LNDriver: identifying driver genes by integrating mutation and expression data based on gene-gene interaction network. BMC Bioinform 17(17):467

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jayanta Kumar Das .

Rights and permissions

Reprints and permissions

Copyright information

© 2019 The Author(s)

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Das, J.K., Singh, R., Pal Choudhury, P., Roy, B. (2019). Identifying Driver Potential in Passenger Genes Using Chemical Properties of Mutated and Surrounding Amino Acids. In: Computational Intelligence and Big Data Analytics. SpringerBriefs in Applied Sciences and Technology(). Springer, Singapore. https://doi.org/10.1007/978-981-13-0544-3_10

Download citation

Publish with us

Policies and ethics