Extracting Molecular Diversity Between Populations Through Sequence Alignments
The use of sequence alignments for establishing protein homology relationships has an extensive tradition in the field of bioinformatics, and there is an increasing desire for more statistical methods in the data analysis. We present statistical methods and algorithms that are useful when the protein alignments can be divided into two or more populations based on known features or traits. The algorithms are considered valuable for discovering differences between populations at a molecular level. The approach is illustrated with examples from real biological data sets, and we present experimental results in applying our work on bacterial populations of Vibrio, where the populations are defined by optimal growth temperature, T opt .
Keywordssequence analysis structural analysis physicochemical properties extremophiles Fisher’s exact test Wilcoxon test
Unable to display preview. Download preview PDF.
- 3.Pe’er, I., Felder, C.E., Man, O., Silman, I., Sussman, J.L., Beckmann, J.S.: Proteomic signatures: Amino acid and oligopeptide compositions differentiate among phyla. Proteins-Structure Function and Genetics 54(1), 20–40 (2004)Google Scholar
- 5.Nikolova, N., Jaworska, J.: Approaches to measure chemical similarity - A review. QSAR & Combinatorial Science 9-10, 1006–1026 (2004)Google Scholar
- 6.Kearsley, S.K., Sallamack, S., Fluder, E.M., Andose, J.D., Mosley, R.T., Sheridan, R.P.: Chemical Similarity Using Physicochemical Property Descriptors. J. Chem. Inf. Comput. Sci. 36, 118–127 (1996)Google Scholar
- 7.Basak, S.C., Grunwald, G.D.: Molecular Similarity And Estimation of Molecular- Properties. J. Chem. Inf. Comput. Sci. 35, 366–372 (1995)Google Scholar
- 8.Dayhoff, M.O., Schwartz, R.M., Orcutt, B.C.: A model of evolutionary change in proteins. Atlas of Protein Sequences and Structure 5 (suppl. 3), 345–352 (1978)Google Scholar
- 10.Jones, D.T., Taylor, W.R., Thornton, J.M.: The rapid generation of mutation data matrices from protein sequences. Computer Applications in the Biosciences 8(3), 275–282 (1992)Google Scholar
- 14.Conover, W.J.: Practical Nonparametric Statistics, 3rd edn. John Wiley & Sons, Chichester (1999)Google Scholar
- 15.Oppenheim, A.V., Schafer, R.W.: Discrete-Time Signal Processing, 2nd edn. Prentice-Hall, Englewood Cliffs (1999)Google Scholar