Computational Management of Alignment of Multiple Protein Sequences Using ClustalW

  • Riddhi SharmaEmail author
  • Sanjay Kumar Dubey
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1045)


The astounding and multidisciplinary field of bioinformatics has had its origin in the year 1900, a decade before the concept of DNA sequencing became feasible and applicable to researchers. The era of the “new biology” that has a subtle link with the modern-day computers emerged and was accompanied by the birth as well as the development of other streams of sciences such as bioinformatics (it conceptualizes biology in reference to molecules and then applies “informatics techniques” (derived from computer science in this paper) to organize and understand the information that is related to these molecules. The several categories in bioinformatics focus on genomics (gene study), proteomics (protein study: which will be discussed in this paper), metabolomics (study of metabolic pathway within a cell), etc. and computational biology. Bioinformatics focuses on the application of IT and statistics. It is very significant to note here that bioinformatics does not have to do anything in the real world. Rather, it creates a virtual image of our living system having computerized biological knowledge, sequence, and structural information fed in the CPU and by using virtual-based techniques like mathematical modeling and through stimulation that helps in basic principles leading to its practical application. In bioinformatics, a sequence is subjected (or susceptible towards) to certain range of analytical methods in order to understand its function, or features, or structure, or evolution. Analysis of sequences can be very efficiently used to assign roles to certain genes and proteins as a result of the study of the various similarities between the compared sequences. In this research paper, comparison and alignment of multiple protein sequences are provided. It will help to generate a phylogenetic tree and also draw the relationships between the aligned sequences.


Multiple sequence alignment Dynamic programming Pairwise alignment ClustalW Phylogenetic tree 


  1. 1.
    Vijayarani, S., & Deepa, M.S.: Protein sequence classification in data mining—a study. Int. J. Emerg. Technol. Comput. Sci. Electron. (IJETCSE) 23(7) (2014)Google Scholar
  2. 2.
    Behera, N., Jeevitesh, M.S., Jose, J., Kant, K., Dey, A., Mazher, J.: Higher accuracy protein multiple sequence alignments by genetic algorithm. Procedia Comput. Sci. 108, 1135–1144 (2017)CrossRefGoogle Scholar
  3. 3.
    Rout, S.B., Dehury, S., Mishra, B.S.P.: Protein structure prediction using genetic algorithm. Int. J. Comput. Sci. Mobile Comput. 2(6), 187–192 (2013)Google Scholar
  4. 4.
    Gupta, O.P.: Study and analysis of various bioinformatics applications using protein BLAST: an overview. Adv. Comput. Sci. Technol. 10(8), 2587–2601 (2017)Google Scholar
  5. 5.
    Diniz, W.J.S., Canduri, F.: Bioinformatics: an overview and its applications. Genet. Mol. Res. 16(1) (2017)Google Scholar
  6. 6.
    Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T.J., Higgins, D.G., Thompson, J.D.: Multiple sequence alignment with the Clustal series of programs. Nucl. Acids Res. 31(13), 3497–3500 (2003)CrossRefGoogle Scholar
  7. 7.
    Luthy, R., Xenarios, I., & Bucher, P.: Protein Sci. 3, 139–146 (1994), Russell, R.B., Barton, G.J.: Proteins 14, 309–323 (1992)Google Scholar
  8. 8.
    Gouy, M., Guindon, S., Gascuel, O.: SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol. Biol. Evol. 27(2), 221–224 (2009)CrossRefGoogle Scholar
  9. 9.
    Feng, D.F., Doolittle, R.F.: Progressive sequence alignment as a pre requisite to correct phylogenetic trees. J. Mol. Evol. 25(4), 351–360 (1987)CrossRefGoogle Scholar
  10. 10.
    Lassmann, T., Sonnhammer, E.L.: Kalign—an accurate and fast multiple sequence alignment algorithm. BMC Bioinform. 6(1), 298 (2005)CrossRefGoogle Scholar
  11. 11.
    Kumar, S., Tamura, K., Nei, M.: MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinform. 5(2), 150–163 (2004)CrossRefGoogle Scholar
  12. 12.
    Reguant, R., Antipin, Y., Sheridan, R., Luna, A., Sander, C.: Alignment Viewer: Sequence Analysis of Large Protein Families. bioRxiv, 269720 (2018)Google Scholar
  13. 13.
    Jeanmougin, F., Thompson, J.D., Guoy, M., Higgins, D.G., Gibson, T.J.: Multiple sequence alignment with Clustal X. Trends Biochem. Sci. 403–405 (1998)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2020

Authors and Affiliations

  1. 1.Amity UniversityNoidaIndia

Personalised recommendations