Abstract
The concept of morphological integration describes the pattern and the amount of correlation between morphological traits. Integration is relevant in evolutionary biology as it imposes constraint on the variation that is exposed to selection, and is at the same time often based on heritable genetic correlations. Several measures have been proposed to assess the amount of integration, many using the distribution of eigenvalues of the correlation matrix. In this paper, we analyze the properties of eigenvalue variance as a much applied measure. We show that eigenvalue variance scales linearly with the square of the mean correlation and propose the standard deviation of the eigenvalues as a suitable alternative that scales linearly with the correlation. We furthermore develop a relative measure that is independent of the number of traits and can thus be readily compared across datasets. We apply this measure to examples of phenotypic correlation matrices and compare our measure to several other methods. The relative standard deviation of the eigenvalues gives similar results as the mean absolute correlation (W.P. Cane, Evol Int J Org Evol 47:844–854, 1993) but is only identical to this measure if the correlation matrix is homogenous. For heterogeneous correlation matrices the mean absolute correlation is consistently smaller than the relative standard deviation of eigenvalues and may thus underestimate integration. Unequal allocation of variance due to variation among correlation coefficients is captured by the relative standard deviation of eigenvalues. We thus suggest that this measure is a better reflection of the overall morphological integration than the average correlation.
Similar content being viewed by others
References
Berg, R. L. (1960). The ecological significance of correlation pleiades. Evolution; International Journal of Organic Evolution, 14, 171–180. doi:10.2307/2405824.
Cane, W. P. (1993). The ontogeny of post-cranial integration in the common tern, Sterna hirundo. Evolution; International Journal of Organic Evolution, 47, 844–854. doi:10.2307/2409980.
Chetverikov, S. S., Barker, M., & Lerner, I. M. (1961). On certain aspects of the evolutionary process from the standpoint of modern genetics. Proceedings of the American Philosophical Society, 105(2), 167–195.
Cheverud, J. M. (1982). Relationships among ontogenetic, static, and evolutionary allometry. American Journal of Physical Anthropology, 59(2), 139–149. doi:10.1002/ajpa.1330590204.
Cheverud, J. M. (1984). Quantitative genetics and developmental constraints on evolution by selection. Journal of Theoretical Biology, 110(2), 155–171. doi:10.1016/S0022-5193(84)80050-8.
Cheverud, J. M. (1988). A comparison of genetic and phenotypic correlations. Evolution; International Journal of Organic Evolution, 42, 958–968. doi:10.2307/2408911.
Cheverud, J. M. (1996). Developmental integration and the evolution of pleiotropy. American Zoologist, 36, 44–50.
Cheverud, J. M. (2001). A simple correction for multiple comparisons in interval mapping genome scans. Heredity, 87(1), 52–58. doi:10.1046/j.1365-2540.2001.00901.x.
Cheverud, J. M., Rutledge, J. J., & Atchley, W. R. (1983). Quantitative genetics of development: Genetic correlations among age-specific trait values and the evolution of ontogeny. Evolution; International Journal of Organic Evolution, 37(5), 895–905. doi:10.2307/2408405.
Cheverud, J. M., Wagner, G. P., & Dow, M. M. (1989). Methods for the comparative analysis of variation patterns. Systematic Zoology, 38(3), 201–213. doi:10.2307/2992282.
Darwin, C. (1895). On the origin of species. London: John Murray.
Fisher, R. A. (1928). The general sampling distribution of the multiple correlation coefficient. Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character, 121, 654–673. doi:10.1098/rspa.1928.0224.
Hallgrimsson, B., Brown, J. J., Ford-Hutchinson, A. F., Sheets, H. D., Zelditch, M. L., & Jirik, F. R. (2006). The brachymorph mouse and the developmental-genetic basis for canalization and morphological integration. Evolution & Development, 8(1), 61–73. doi:10.1111/j.1525-142X.2006.05075.x.
Hallgrimsson, B., Dorval, C. J., Zelditch, M. L., & German, R. Z. (2004). Craniofacial variability and morphological integration in mice susceptible to cleft lip and palate. Journal of Anatomy, 205(6), 501–517. doi:10.1111/j.0021-8782.2004.00356.x.
Hansen, T. F., & Houle, D. (2008). Measuring and comparing evolvability and constraint in multivariate characters. Journal of Evolutionary Biology, 21(5), 1201–1219. doi:10.1111/j.1420-9101.2008.01573.x.
Herrera, C. M., Cerda, X., Garcia, M. B., Guitan, J., Medrano, M., Rey, P. J., et al. (2002). Floral integration, phenotypic covariance structure and pollinator variation in bumblebee-pollinated Helleborus foetidus. Journal of Evolutionary Biology, 15, 108–121. doi:10.1046/j.1420-9101.2002.00365.x.
Hutchison, D. W., & Cheverud, J. M. (1995). Fluctuating asymmetry in tamarin (Saguinus) cranial morphology: Intra- and interspecific comparisons between taxa with varying levels of genetic heterozygosity. The Journal of Heredity, 86(4), 280–288.
Jones, A. G., Arnold, S. J., & Burger, R. (2007). The mutation matrix and the evolution of evolvability. Evolution; International Journal of Organic Evolution, 61(4), 727–745. doi:10.1111/j.1558-5646.2007.00071.x.
Jorjani, H., Klei, L., & Emanuelson, U. (2003). A simple method for weighted bending of genetic (co)variance matrices. Journal of Dairy Science, 86(2), 677–679.
Kenney-Hunt, J. P., Wang, B., Norgard, E. A., Fawcett, G., Falk, D., Pletscher, S. L., et al. (2008). Pleiotropic patterns of quantitative trait loci for 70 murine skeletal traits. Genetics, 178(4), 2275–2288. doi:10.1534/genetics.107.084434.
Kirkpatrick, M. (2008). Patterns of quantitative genetic variation in multiple dimensions. Genetica, in press.
Kotz, S., Pearn, W. L., & Wichern, D. W. (1984). Eigenvalue-eigenvector analysis for a class of patterned correlation matrices with an application. Statistics & Probability Letters, 2, 119–125. doi:10.1016/0167-7152(84)90001-4.
Lande, R. (1979). Quantitative genetic analysis of multivariate evolution, applied to brain: Body size allometry. Evolution, 33, 402–416.
Lande, R. (1980). The genetic covariance between characters maintained by pleiotropic mutations. Genetics, 94, 203–215.
Lande, R., & Arnold, S. J. (1983). The measurement of selection on correlated characters. Evolution; International Journal of Organic Evolution, 37(6), 1210–1226. doi:10.2307/2408842.
Magwene, P. M. (2008). Using correlation proximity graphs to study phenotypic integration. Evolutionary Biology, 35, 191–198. doi:10.1007/s11692-008-9030-y.
Morrison, D. F. (1976). Multivariate Statistical Methods (2nd ed.). New York: McGraw-Hill, Inc.
Nyholt, D. R. (2004). A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. American Journal of Human Genetics, 74(4), 765–769. doi:10.1086/383251.
Olson, E. C., & Miller, R. L. (1958). Morphological integration. Chicago: University of Chicago Press.
Parsons, K. J., & Robinson, B. W. (2006). Replicated evolution of integrated plastic responses during early adaptive divergence. Evolution; International Journal of Organic Evolution, 60(4), 801–813.
Pearson, K. (1903). Mathematical contributions to the theory of evolution. XI. On the influence of natural selection on the variability and correlation of organs. Philosophical Transactions of the Royal Society of London. Series A: Mathematical and Physical Sciences, 200, 1–66. doi:10.1098/rsta.1903.0001.
Peres-Neto, P. R., & Magnan, P. (2004). The influence of swimming demand on phenotypic plasticity and morphological integration: A comparison of two polymorphic charr species. Oecologia, 140(1), 36–45. doi:10.1007/s00442-004-1562-y.
Revell, L. J. (2007). The G matrix under fluctuating correlational mutation and selection. Evolution; International Journal of Organic Evolution, 61(8), 1857–1872. doi:10.1111/j.1558-5646.2007.00161.x.
Searle, S. R. (1961). Phenotypic, genetic and environmental correlations. Biometrics, 17, 474–480.
Terentjev, P. V. (1931). Biometrische Untersuchungen ueber die morphologischen Merkmale von Rana ridibunda Pall. (Amphibia, Salientia). Biometrika, 23, 23–51.
Van Valen, L. (1965). The study of morphological integration. Evolution; International Journal of Organic Evolution, 19(3), 347–349. doi:10.2307/2406444.
Wagner, G. P. (1984). On the eigenvalue distribution of genetic and phenotypic dispersion matrices: Evidence for a nonrandom organization for quantitative character variation. Journal of Mathematical Biology, 21, 77–95.
Wagner, G. P., Kenney-Hunt, J. P., Pavlicev, M., Peck, J. R., Waxman, D., & Cheverud, J. M. (2008). Pleiotropic scaling of gene effects and the ‘cost of complexity’. Nature, 452(7186), 470–472. doi:10.1038/nature06756.
Young, N. M. (2006). Function, ontogeny and canalization of shape variance in the primate scapula. Journal of Anatomy, 209(5), 623–636. doi:10.1111/j.1469-7580.2006.00639.x.
Young, N. M., & Hallgrimsson, B. (2005). Serial homology and the evolution of mammalian limb covariation structure. Evolution; International Journal of Organic Evolution, 59(12), 2691–2704.
Acknowledgements
We thank Hans L. Nemeschkal for discussion in the initial forming of ideas, Thomas F. Hansen for comments that improved the paper, and Jason Wolf for help with the polynomial. We also thank Jane P. Kenney-Hunt and Remo Probst for kindly allowing the use of the mouse skeleton and Accipiter wing phenotypic data, respectively. MP was supported by Postdoctoral Fellowship from the Austrian Science Fund (FWF). JMC acknowledges support from NSF BCS0725068 and GPW by a grant from the John Templeton Foundation, #12793 Genetics and the Origin of Organismal Complexity; results presented here do not necessarily reflect the views of the John Templeton Foundation.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Appendices
Appendix 1
Let R be a homogenous correlation matrix of size N with \( {\mathbf{R}}_{ij} = r \) for all i ≠ j. If 1 = (1,…,1) is an eigenvector of R and λ 1 = 1 + (N − 1)r the corresponding eigenvalue, then \( {\mathbf{R1}} = \lambda_{1} {\mathbf{1}} \). Considering this equation component wise:
we see that the right hand side of the latter equation is simply the row sum of the i-th row. Because of the symmetry of R this sum is the same for all rows in R, and which has (N − 1) terms of size r and one equal to 1:
The other eigenvalues of R can be obtained by considering the constraint \( \sum\nolimits_{i = 1}^{N} {\lambda_{i} } = \text{tr} {\mathbf{R}} = N \), from which we see that
Due to the symmetry of the R matrix the N − 1 non-leading eigenvalues are all the same and λ i > 1 = 1 − r.
Appendix 2
To derive the maximum variance corresponding to a particular \( \bar{r} \), we consider maximal symmetrical dispersal of r. Maximum dispersal is reached when half of the sample size is located at outermost values, in equal distances to both sides from the \( \bar{r} \). This can be expressed as
where r imax, r imin are the maximal and minimal values of r i , given the symmetrical distribution. For example, if \( \bar{r} = 0.2 \), the r max = 1 and r min = 0.2 − (1 − 0.2) = −0.6. In general:
And therefore, the correlation variance of the maximal symmetrical dispersion for a given average r is:
Appendix 3
Here we investigate the eigenvalue variance in two special cases of non-hierarchical modular matrices. General equation for eigenvalue variance in matrices with k homogeneous submodules, each of size N i , and of no correlation between modules is:
It is helpful to consider a case where all modules are of the same size, \( N_{i} = \frac{N}{k}\hbox{; }{\text{and}}\;\sum\limits_{i = 1}^{k} {N_{i} } = N \), and equal within-module correlation r w . In this case the equation simplifies to
N > k, thus eigenvalue variance (i.e., integration) is lower than in a homogeneous (unimodular) matrix by a factor
To compare it with integration estimated from the overall mean correlation, we calculate overall mean correlation:
And therefore
Here, the two integration levels differ by a factor
the factor is smaller than 1, indicating that the estimate by the mean squared correlation is lower than the true integration of modular matrix. In fact we can see that the smaller the module size (and the greater thus the k), the smaller this factor will be.
The second case considered here is a case with different-sized modules of equal within-module correlation.
In which case only the term of the eigenvalue variance describing the effect of submodule size is averaged.
Appendix 4
Eigenvalue Variance for the Three-Trait Case with Unsymmetrical Distribution of Correlation
The correlation coefficients in matrices underlying this consideration are normally distributed on a Fisher’s z-transformed scale (Fisher 1928), but not on the Pearson’s correlation scale −1 < r < 1. Note that in the following we refer to the result of the normalizing Fisher’s z-transformation as to the z-scale. This is not to be confused with z-standardization of normal distribution. The Pearson’s product–moment correlations (r) are referred to as being on the r-scale.
Consider a given mean correlation on r-scale \( (\bar{r}) \) and given maximum deviation from the mean (d) on z-scale. The \( \bar{r} \) has a corresponding value when transformed to z-scale, and the deviation d is added and subtracted to gain the lowest and the highest values of the correlation coefficients on the z-scale. The two values were back-transformed onto the r-scale and the deviations ε1, ε2 from \( \bar{r} \) were calculated as differences of the extreme values from the mean. The third correlation coefficient is e 3 = 0 − ε1 − ε2 due to the constraint of \( \bar{r} \), i.e., \( \sum {\varepsilon_{i} } = 0 \). The three off-diagonal elements of the correlation matrix are now \( (\bar{r} + \varepsilon_{1} ),(\bar{r} - \varepsilon_{2} ),(\bar{r} + \varepsilon_{3} ) \). The corresponding eigenvalue polynomial (as in previous symmetrical case, see text), substituting x for (1 − λ) and multiplying out reduces to
Figure 6 shows plot of a family of polynomials with different combinations of parameters \( \bar{r} \) and d; the three solutions representing the three x i (=1 − λ i ). Bundles of polynomials share the same mean correlation of matrix \( (\bar{r}) \), but differ in dispersion of correlations (d).
Analogous to the matrices with symmetrical distribution of correlation coefficients (main part of the text), we do not present the exact solutions, but rather approximate the three solutions here. Note that the three solutions follow the same pattern as above in symmetrical matrices. The following can be concluded from the plot: (i) when allowing the asymmetrical distribution of correlation, the heterogeneity affects all three eigenvalues; (ii) the effect of heterogeneity of matrix on first eigenvalue is smaller at higher \( \overline{\left| r \right|} \); (iii) the first eigenvalue increases with increased heterogeneity (i.e., x 1 decreases) while the remaining two diverge.
Approximating the three solutions in a way similar to symmetrical situation, but noting that all three eigenvalues vary depending on the heterogeneity, rather than only the second and the third, we consider the following solutions:
and the resulting eigenvalues are
If we substitute these into the equation for eigenvalue variance, it yields for three traits:
And after some rearrangement for a general case:
This version corresponds to the eigenvalue variance equation for the heterogeneous matrices with symmetrical distribution of correlations, but involves an additional term (that yields zero if the distribution is symmetrical). This equation also shows that the eigenvalue variance in heterogeneous matrices will be greater than the corresponding eigenvalue variance of the homogeneous matrix of the same average correlation.
Appendix 5
Phenotypic Datasets
Tamarin Cranial Dataset
This dataset is comprised of 16 linear lateral cranial measurements on the skull of 275 individuals of the pure bred cotton-top tamarin (Saguinus oedipus). Detailed description of the population, measurement method and the list of measurements can be found in Hutchison and Cheverud (1995). The linear measurements were derived from 3D coordinate data for 16 landmarks per each side of the skull. All landmarks were taken twice, to estimate the measurement error and improve repeatability of short measurements by using their average. The average repeatability per trait of these data was 0.788 on the right and 0.776 on the left side.
Bird Wing Dataset
The lengths of seven wing feathers have been measured on 64 museum specimens of Eurasian Goshawk (Accipiter gentilis gentilis, data are available in online supplementary material, Table S1). This dataset comprises the first five primary—(‘hand’) as well as the third and the fifth secondary (“forearm”) flight feathers, counted from distal end of the wing towards the body. The feathers were measured as a distance from bend of the wing (i.e., wrist joint) to the tip of the feather along the shaft, measured over the folded wing. The sample size consists of individuals covering most of the subspecies range. All measurements were taken three-fold. The average repeatability of single measurements in these data is 0.978.
Mouse Skeletal Dataset
Phenotypic traits comprise 70 skeletal measurements representing the cranium, axial and appendicular skeleton, as well as body weight at 10 weeks and at necropsy. The experimental population results from an intercross of inbred mouse strains LG/J and SM/J, selected for large and small body weight at 60 days of age, respectively. The data stems from two consecutive analogous intercrosses, comprising a total of 1040 F2 mice. For the list of measurements and details on measurement techniques see Kenney-Hunt et al. (2008). Extreme outliers were identified by being more than 2.7 standard deviations from the sample mean (SYSTAT 10.2) and were eliminated to avoid biasing the data. The data were corrected for the effects of dam, litter size, experimental block, sex, age at necropsy, and intercross. The correction was done by regressing each of the phenotypic trait measurements on all of the respective predictor variables (dam, litter size, sex, etc.) and taking the residuals as corrected scores for the trait. The repeatability was assessed for each trait by repeated measurements on 30–50 individuals and amounts on average 0.93, with the range from 0.82 to 0.992 for single traits.
Appendix 6
Several Other Indices of Integration
Some of the listed measures require the matrices to be non-negative, or even positive definite. Errorless phenotypic matrices are always at least non-negative definite, however due to error, either sampling or measurement error, empirical matrices are often negative. To apply the methods 2 and 5 to the empirical datasets in the paper, we used only dimensions with eigenvalues greater than zero, in case the matrices were not positive definite.
-
1.
Van Valen (1965) suggested calculating the average Fisher’s z-transformed correlation, and transforming it back to the scale 0–1.
-
2.
Cheverud et al. (1983) defined the index of integration as one minus the geometric mean of eigenvalues of a correlation matrix:
$$ I = 1 - \sqrt[N]{{\prod\limits_{i = 1}^{N} {\lambda_{i} } }} $$ -
3.
Wagner (1984) suggested the variance of eigenvalues of a correlation matrix. We provide also the relative eigenvalue variance, as described in the main body of this paper:
$$ \text{Var}_{\text{rel}} (\lambda ) = \frac{{\text{Var} (\lambda )}}{{\max \text{Var} (\lambda )}} = \frac{{\text{Var} (\lambda )}}{N - 1} $$ -
4.
Cane (1993) used an average absolute pair wise correlation
-
5.
Hansen and Houle (2008) suggested a measure that is based on covariance matrix and applies the harmonic mean of eigenvalues (H(λ)):
$$ i = 1 - \left[ {\frac{H\left( \lambda \right)}{{\overline{\lambda } }}\left[ {1 + \frac{{1 + I\left( {\frac{1}{\lambda }} \right) + I(\lambda ) - \frac{H(\lambda )}{{\overline{\lambda } }}}}{k + 1}} \right]} \right] $$where
$$ I\left( {\frac{1}{\lambda }} \right) = \frac{{{\text{Var}}\left( {\frac{1}{{\lambda_{i} }}} \right)}}{{\left( {\overline{{\frac{1}{{\lambda_{i} }}}} } \right)^{2} }}\quad I(\lambda ) = \frac{{{\text{Var}}(\lambda )}}{{(\overline{\lambda } )^{2} }} $$
The data for this procedure need to be on similar scale, which can be accomplished by variance or mean standardization. We used variance standardization in the examples, which corresponds to correlation.
Rights and permissions
About this article
Cite this article
Pavlicev, M., Cheverud, J.M. & Wagner, G.P. Measuring Morphological Integration Using Eigenvalue Variance. Evol Biol 36, 157–170 (2009). https://doi.org/10.1007/s11692-008-9042-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11692-008-9042-7