Abstract
In this paper, 12 accession numbers of rice has been used. The accession numbers have been taken from the article Cho et al. where it has already been used for other studies. The accession number for DNA, i.e., A, C, G and T along with the gap character (–) have been converted into alignment matrix with 5 rows and 7473 columns. The alignment has been done using ClustalX software. The 7473 columns have been alienated into 5 parts with different dimensions. Later for each part scoring has been done separately. Highest scores from all the 5 parts have been noted down. To minimize the data, the common regions between these 5 parts have been taken into consideration. Later one way ANOVA (Huck and McLean in Psychological Bulletin, 82(4), 511–518,1975; Mukhopadhyay in Applied statistics. Books and Allied (P) Ltd., Kolkata, 2011) has been constructed and conclusions are drawn accordingly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cho, Y. G., Ishii, T., Temnykh, S., Chen, X., Lipovich, L., McCouch, R. S., Park, D. W., Ayres, N., & Cartinhour, S. (2000). Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice. (Oryza sativa L.) Theor Appl Genet, 100, 713–722. Springer-Verlag.
Hertz, Z. G., & Stormo, D. G. (1999). Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics, 15(7/8), 563–577.
Huck, W. S., & McLean, A. R. (1975). Using a repeated measures ANOVA to analyze the data from a pretest-posttest design: A potentially confusing task. Psychological Bulletin, 82(4), 511–518.
Pei, J. (2008). Multiple protein sequence alignment. In Current opinion in structural biology (Vol. 18, pp. 382–386). Elsevier.
Shu, J. J., Yong, Y. K., & Chang, K. W. (2012). An improved scoring matrix for multiple sequence alignment. In Mathematical problems in engineering (Vol. 2012, no. 490649, pp. 1–9).
Mukhopadhyay, P. (2011). Applied statistics. Books and Allied (P) Ltd.
Wallace, M. I., Blackshields, G., & Higgins, G. D. (2005). Multiple sequence alignments. In Current opinion in structural biology (Vol. 15, p. 261–266). Elsevier.
Williams, J. L., & Abdi, H. (2010). Fisher’s least significant difference (LSD) test. In N. Salkind (ed.), Encyclopedia of research design (pp. 1–6).
Acknowledgements
The author Miss. Anamika Dutta thank to Department of Science and Technology (DST), India for providing financial assistance for carrying out this work as an INSPIRE Fellow. Also we thank the reviewer for their thorough review and highly appreciate the comments and suggestions which substantially contributed to improving the class of the paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
The alignment of matrix (Hertz and Stormo 1999) has been shown with an example. Let us take some DNA sequences of different length say:
-
A – A C G T T C C
-
A C A C G T A C A
-
G C A A G A T – C
-
A C A C G T T C C
Gap character (–) come to view when ClustalX software is used. It happens due to multiple sequence alignment.
The above alignment has been created by ClustalX software. Now from the above DNA sequences, the alignment matrix can be formed which has been shown below:
Weight matrix using for the above example is given by:
The highest weights of the above weight matrix are:
Hence the score of the above matrix is:
This was a counter example of alignment and weight matrix.
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dutta, A., Das, K.K. (2018). A Study on DNA Sequence of Rice Using Scoring Matrix Method and ANOVA Technique. In: Chattopadhyay, A., Chattopadhyay, G. (eds) Statistics and its Applications. PJICAS 2016. Springer Proceedings in Mathematics & Statistics, vol 244. Springer, Singapore. https://doi.org/10.1007/978-981-13-1223-6_2
Download citation
DOI: https://doi.org/10.1007/978-981-13-1223-6_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1222-9
Online ISBN: 978-981-13-1223-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)