, Volume 131, Issue 2, pp 151–156

Understanding relationship between sequence and functional evolution in yeast proteins

Original Paper


The underlying relationship between functional variables and sequence evolutionary rates is often assessed by partial correlation analysis. However, this strategy is impeded by the difficulty of conducting meaningful statistical analysis using noisy biological data. A recent study suggested that the partial correlation analysis is misleading when data is noisy and that the principal component regression analysis is a better tool to analyze biological data. In this paper, we evaluate how these two statistical tools (partial correlation and principal component regression) perform when data are noisy. Contrary to the earlier conclusion, we found that these two tools perform comparably in most cases. Furthermore, when there is more than one ‘true’ independent variable, partial correlation analysis delivers a better representation of the data. Employing both tools may provide a more complete and complementary representation of the real data. In this light, and with new analyses, we suggest that protein length and gene dispensability play significant, independent roles in yeast protein evolution.


Partial correlation Principal component regression Functional genomic data Yeast protein evolution 

Supplementary material

10709_2006_9125_MOESM1_ESM.pdf (186 kb)
ESM (PDF 186 kb)

Copyright information

© Springer Science+Business Media, Inc. 2006

Authors and Affiliations

  1. 1.School of BiologyGeorgia Institute of TechnologyAtlantaUSA
  2. 2.Division of BiostatisticsSchool of Medicine, Indiana UniversityIndianapolisUSA

Personalised recommendations