Linear Phenotypic Eigen Selection Index Methods

Céron-Rojas, J. Jesus; Crossa, José

doi:10.1007/978-3-319-91223-3_7

J. Jesus Céron-Rojas⁴ &
José Crossa⁴

4844 Accesses
1 Citations

Abstract

Based on the canonical correlation, on the singular value decomposition (SVD), and on the linear phenotypic selection indices theory, we describe the eigen selection index method (ESIM), the restricted ESIM (RESIM), and the predetermined proportional gain ESIM (PPG-ESIM), which use only phenotypic information to predict the net genetic merit. The ESIM is an unrestricted linear selection index, but the RESIM and PPG-ESIM are linear selection indices that allow null and predetermined restrictions respectively to be imposed on the expected genetic gains of some traits, whereas the rest remain without any restrictions. The aims of the three indices are to predict the unobservable net genetic merit values of the candidates for selection, maximize the selection response, and the accuracy, and provide the breeder with an objective rule for evaluating and selecting several traits simultaneously. Their main characteristics are: they do not require the economic weights to be known, the first multi-trait heritability eigenvector is used as its vector of coefficients; and because of the properties associated with eigen analysis, it is possible to use the theory of similar matrices to change the direction and proportion of the expected genetic gain values without affecting the accuracy. We describe the foregoing three indices and validate their theoretical results using real and simulated data.

You have full access to this open access chapter, Download chapter PDF

A singular value decomposition Bayesian multiple-trait and multiple-environment genomic model

Article Open access 17 August 2018

The use of a genetic relationship matrix biases the best linear unbiased prediction

Article 07 October 2020

Biometrical Approaches for Analysis of Phenotypic Data of Complex Traits

7.1 The Linear Phenotypic Eigen Selection Index Method

The conditions described in Chap. 2 for the linear phenotypic selection index (LPSI) are necessary and sufficient for constructing the linear phenotypic eigen selection index method (ESIM). The ESIM index can be written as I = b′y, where b′ = [b₁ b₂ ⋯ b_t] is the unknown index vector of coefficients, t is the number of traits, and $ {\mathbf{y}}^{\prime }=\left[{y}_1\kern0.5em {y}_2\kern0.5em \cdots \kern0.5em {y}_t\right] $ is a known vector of trait phenotypic values. The objectives of ESIM are:

1.
To predict the net genetic merit H = w′g, where $ {\mathbf{g}}^{\prime }=\left[{g}_1\kern0.5em {g}_2\kern0.5em \dots \kern0.5em {g}_t\right] $ is the unknown vector of true breeding values for an individual and $ {\mathbf{w}}^{\prime }=\left[{w}_1\kern0.5em {w}_2\kern0.5em \dots \kern0.5em {w}_t\right] $ is a vector of unknown economic weights.
2.
To maximize the ESIM selection response and the accuracy.
3.
To select individuals with the highest H values in each selection cycle as parents of the next generation.
4.
To provide the breeder with an objective rule for evaluating and selecting several traits simultaneously.

Although in the context of the LPSI w is a known and fixed vector of economic weights, in the ESIM w is fixed, but unknown and its values must be estimated in each selection cycle. This latter assumption is the fundamental difference between the ESIM and the LPSI and implies that the ESIM is more general than the LPSI. Thus, when w is known, the LPSI and ESIM give the same results.

7.1.1 The ESIM Parameters

The theoretical ESIM selection response can be written as

$$ {R}_I={k}_I{\sigma}_H{\rho}_{HI}, $$

(7.1)

where k_I is the standardized selection differential (or selection intensity), $ {\sigma}_H=\sqrt{{\mathbf{w}}^{\prime}\mathbf{Cw}} $ is the standard deviation of H, $ {\rho}_{HI}=\frac{{\mathbf{w}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{w}}^{\prime}\mathbf{Cw}}\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} $ is the correlation, and w′Cb = σ_HI the covariance between H and I respectively, $ {\sigma}_I=\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}} $ is the standard deviation of I, C is the covariance matrix of the true breeding values (g), and P is the covariance matrix of the trait phenotypic values (y).

In the ESIM, it is assumed that k_I and σ_H are fixed, and that C and P are known; thus, to maximize Eq. (7.1), it is necessary to maximize $ {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} $ with respect to vectors b and w under the restrictions $ {\sigma}_H^2={\mathbf{w}}^{\prime}\mathbf{Cw} $, $ {\sigma}_I^2={\mathbf{b}}^{\prime}\mathbf{Pb} $, and 0 < $ {\sigma}_H^2 $,$ {\sigma}_I^2 $<∞, where $ {\sigma}_H^2={\mathbf{w}}^{\prime}\mathbf{Cw} $ is the variance of H = w′g and $ {\sigma}_I^2={\mathbf{b}}^{\prime}\mathbf{Pb} $ is the variance of I = b′y. That is, it is necessary to maximize the function

$$ f\left(\mathbf{b},\mathbf{w},\mu, \phi \right)={\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2-\mu \left({\mathbf{b}}^{\prime}\mathbf{Pb}-{\sigma}_I^2\right)-\phi \left({\mathbf{w}}^{\prime}\mathbf{Cw}-{\sigma}_H^2\right) $$

(7.2)

with respect to b, w, μ, and ϕ, where μ and ϕ are Lagrange multipliers. The derivative results of Eq. (7.2) with respect to b, w, μ, and ϕ are:

$$ \left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)\mathbf{Cw}-\mu \mathbf{Pb}=\mathbf{0}, $$

(7.3)

$$ \left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)\mathbf{Cb}-\phi \mathbf{Cw}=\mathbf{0}, $$

(7.4)

$$ {\mathbf{b}}^{\prime}\mathbf{Pb}={\sigma}_I^2\ \mathrm{and}\ {\mathbf{w}}^{\prime}\mathbf{Cw}={\sigma}_H^2, $$

(7.5)

respectively, where Eq. (7.5) denotes the restrictions imposed for maximizing $ {\rho}_{HI}^2 $. It can be shown that $ {\mathbf{w}}^{\prime}\mathbf{Cb}=\sqrt{{\mu \sigma}_I^2}=\sqrt{{\phi \sigma}_H^2}={\theta}^{1/2} $; then, Eqs. (7.3) and (7.4) can be written as

$$ {\theta}^{1/2}\mathbf{Cw}-\frac{\theta }{\sigma_I^2}\mathbf{Pb}=\mathbf{0} $$

(7.6)

and

$$ {\theta}^{1/2}\mathbf{Cb}-\frac{\theta }{\sigma_H^2}\mathbf{Cw}=\mathbf{0}, $$

(7.7)

respectively. Equation (7.6) is equal to $ \mathbf{Cw}=\frac{\theta^{1/2}}{\sigma_I^2}\mathbf{Pb} $; then, vector w can be written as

$$ {\mathbf{w}}_E=\frac{\theta^{1/2}}{\sigma_I^2}{\mathbf{C}}^{-1}\mathbf{Pb}. $$

(7.8)

By the result of Eq. (7.8), the net genetic merit in the ESIM context is $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $ and the correlation between H_E and I is $ {\rho}_{H_EI}=\frac{{\mathbf{w}}_E^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E}\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}=\frac{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}{\sqrt{{\mathbf{b}}^{\prime }{\mathbf{PC}}^{-1}\mathbf{Pb}}} $. Now, it is necessary to find the vector b that maximizes $ {\rho}_{H_EI} $, which should be the ESIM index vector of coefficients. Substituting w with w_E in Eq. (7.7), we get

$$ \mathbf{Cb}-\frac{{\left({\mathbf{w}}_E^{\prime}\mathbf{Cb}\right)}^2}{\sigma_I^2{\sigma}_{H_E}^2}\mathbf{Pb}=\mathbf{0}, $$

(7.9)

where $ \frac{{\left({\mathbf{w}}_E^{\prime}\mathbf{Cb}\right)}^2}{\sigma_I^2{\sigma}_{H_E}^2}={\rho}_{H_EI}^2 $ is the square of the correlation between ESIM and $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $. Let $ {\rho}_{H_EI}^2={\lambda}_E^2 $, then Eq. (7.9) can be written as

$$ \left({\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_E^2\mathbf{I}\right){\mathbf{b}}_E=\mathbf{0}, $$

(7.10)

and the optimized ESIM index is $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $. Note that in Eq. (7.10) P⁻¹C is the multi-trait heritability. By Eqs. (7.8) and (7.10), the maximized correlation between $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $ and $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $ (or ESIM accuracy) can be written as

$$ {\rho}_{H_E{I}_E}=\frac{\sigma_{I_E}}{\sigma_{H_E}}, $$

(7.11)

where $ {\sigma}_{I_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} $ is the standard deviation of the variance of $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $, and $ {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} $ is the standard deviation of the variance of $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $. Hereafter, we write Eq. (7.11) as $ {\rho}_E={\rho}_{H_E{I}_E} $ or $ {\lambda}_E={\rho}_{H_E{I}_E} $to simplify the notation.

An additional restriction on Eq. (7.10) is b′b = 1, because $ {\rho}_{H_E{I}_E} $ is invariant to the scale change and because if b_E is an eigenvector of the multi-trait heritability matrix P⁻¹C, vector αb_E is also an eigenvector of P⁻¹C for all real values of α (Mardia et al. 1982). This means that in the ESIM the magnitude of an eigenvector is unimportant; only the direction matters (Watkins 2002). Equation (7.10) can also be written as $ {\mathbf{Cb}}_E={\lambda}_E^2{\mathbf{Pb}}_E $, which is called the generalized eigenvalue problem (Watkins 2002). In the latter case, b_E is called a generalized eigenvector and $ {\lambda}_E^2 $ a generalized eigenvalue. The generalized eigenvalues may not exist; that is, they may be infinite. However, if P is positive definite and has the same size as C, all eigenvalues of P⁻¹C exist and are finite (Gentle 2007). Matrix P is symmetric and positive definite and its eigenvalues are different with a probability of 1 if the number of genotypes is higher than the number of traits (Okamoto 1973).

If the heritability of the ESIM is $ {h}_I^2=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{{\mathbf{b}}^{\prime}\mathbf{Pb}} $, then another way of writing Eq. (7.1) is

$$ {R}_I={k}_I{\sigma}_I{h}_I^2={k}_I\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}, $$

(7.12)

which is similar to the univariate breeder’s equation (see Chap. 2, Eq. 2.4). All the parameters of Eq. (7.12) were defined earlier.

The derivative of the ratio $ \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} $ (Eq. 7.12) with respect to b can be written as 2(b′Pb)^1/2Cb − (b′Pb)^−1/2(b′Cb)Pb = 0, and, except by a proportionality constant, the result is

$$ \left({\mathbf{P}}^{-1}\mathbf{C}-{h}_{I_E}^2\mathbf{I}\right){\mathbf{b}}_E=\mathbf{0}, $$

(7.13)

where $ {h}_{I_E}^2==\frac{{\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E}{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} $ is the maximized ESIM heritability. Let $ {\lambda}_E^2={\rho}_E^2={h}_{I_E}^2 $, then Eq. (7.13) is equal to Eq. (7.10) and can be written as $ {\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E={\lambda}_E^2{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E $, whence the maximized $ {\rho}_E^2 $ in terms of $ {h}_{I_E}^2 $ is

$$ {\rho}_E^2=\frac{{\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E}{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E}, $$

(7.14)

which should give a equivalent result to that of Eq. (7.11).

By Eq. (7.11) and $ {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} $, the maximized ESIM selection response and expected genetic gain per trait can be written as

$$ {R}_E={k}_I\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} $$

(7.15)

and

$$ {\mathbf{E}}_E={k}_I\frac{{\mathbf{Cb}}_E}{\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E}}, $$

(7.16)

respectively. Equations (7.15) and (7.16) do not require the economic weights to be known. In the original derivation of the ESIM, Cerón-Rojas et al. (2008) imposed the restrictions $ {\sigma}_{H_E}^2=1 $ and $ {\sigma}_{I_E}^2=1 $. Under these restrictions, $ {\lambda}_E={\mathbf{w}}_E^{\prime }{\mathbf{Cb}}_E $ and Eq. (7.15) can be written as R_E = k_Iλ_E. When $ {\sigma}_{H_E}^2\ne 1 $ Eq. (7.15) is equal to $ {R}_E={k}_I{\sigma}_{H_E}{\lambda}_E $, where $ {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} $ and $ {\lambda}_E^2={\rho}_E^2={h}_{I_E}^2 $.

Let T = P⁻¹C and $ {\lambda}_E^2={h}_{I_E}^2 $; then, Eq. (7.13) can be written as $ {\mathbf{TIb}}_E={\lambda}_E^2{\mathbf{Ib}}_E $, where I = F⁻¹F is an identity matrix of size t × t (t= number of traits), and $ \mathbf{F}=\mathit{\operatorname{diag}}\left\{{f}_1\kern0.5em {f}_1\kern0.5em \cdots \kern0.5em {f}_t\right\} $ is a diagonal matrix with values equal to any real number, except zero values. Thus, another way of writing Eqs. (7.10) and (7.13) is

$$ \left({\mathbf{T}}_2-{\lambda}_E^2\mathbf{I}\right)\boldsymbol{\upbeta} =\mathbf{0}, $$

(7.17)

where T₂ = FTF⁻¹ and β = Fb_E; T and T₂ = FTF⁻¹ are similar matrices and both have the same eigenvalues but different eigenvectors (Harville 1997). When the F values are only 1s, vector b_E is not affected; when the F values are only −1s, vector b_E changes its direction, and if the F values are different from 1 and −1, matrix F changes the proportional values of b_E. In practice, b_E is first obtained from Eq. (7.13) and then multiplied by matrix F to obtain β = Fb_E, that is, β is a linear transformation of b_E. Matrix T₂ = FTF⁻¹ is called the similarity transformation, and matrix F is called the transforming matrix (Watkins 2002). Cerón-Rojas et al. (2006) introduced an alternative procedure for modifying the b_E signs that is a particular case of Eq. (7.17). Vector β = Fb_E can substitute b_E in Eqs. (7.15) and (7.16); and in this case, the optimized ESIM index should be written as I_E = β′y.

7.1.2 Statistical ESIM Properties

The ratio of the index accuracies and the variance of the predicted error (VPE) are good criteria for comparing the index efficiencies for predicting the net genetic merit (see Chap. 2 for details). In Eq. (7.11), we obtained the accuracy of the ESIM; now, we derive the VPE of the ESIM.

The variance of $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $ ($ {\sigma}_{I_E}^2 $) and the covariance between $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $ and $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $($ {\sigma}_{H_E{I}_E} $) are the same, that is,

$$ {\sigma}_{I_E}^2={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E\ \mathrm{and}\ {\sigma}_{H_E{I}_E}={\mathbf{w}}_E^{\prime }{\mathbf{Cb}}_E={\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Cb}}_E={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E, $$

(7.18)

respectively; that is, $ {\sigma}_{I_E}^2={\sigma}_{H_E{I}_E} $. By Eq. (7.18), the VPE of the ESIM can be written as

$$ E\left[{\left({H}_E-{I}_E\right)}^2\right]={\sigma}_{H_E}^2+{\sigma}_{I_E}^2-2{\sigma}_{H_E{I}_E}={\sigma}_{H_E}^2-{\sigma}_{I_E}^2=\left(1-{\rho}_E^2\right){\sigma}_{H_E}^2. $$

(7.19)

The relative effectiveness of $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $ in predicting $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $ is the ratio of $ \left(1-{\rho}_E^2\right){\sigma}_{H_E}^2 $ over $ {\sigma}_{H_E}^2 $, i.e., $ 1-{\rho}_E^2 $; thus, the greater $ {\rho}_E^2 $ is, the more effective $ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $ is at predicting $ {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} $. The mean squared effect of I_E on H_E, or the total variance of H_E explained by I_E is

$$ {\sigma}_{I_E}^2={\rho}_E^2{\sigma}_{H_E}^2, $$

(7.20)

and the relative mean squared effect can be measured by $ {\rho}_E^2 $ (Anderson 2003). If in Eq. (7.20) $ {\rho}_E^2=1 $, $ {\sigma}_{I_E}^2={\sigma}_{H_E}^2 $, and if $ {\rho}_E^2=0 $, $ {\sigma}_{I_E}^2=0 $. That is, the variance of H_E explained by I_E is proportional to $ {\rho}_E^2 $, and when $ {\rho}_E^2 $ is close to 1, $ {\sigma}_{I_E}^2 $ is close to $ {\sigma}_{H_E}^2 $, and if $ {\rho}_E^2 $ is close to 0, $ {\sigma}_{I_E}^2 $ is close to 0. All these results are valid for any index associated with the ESIM, such as the restricted ESIM (RESIM) and the predetermined proportional gains ESIM (PPG-ESIM), which are described in the following sections of this chapter.

7.1.3 The ESIM and the Canonical Correlation Theory

Canonical correlation theory describes the associations between two sets of variables (Hotelling 1935, 1936) and searches for linear combinations, called canonical variables, of each of two sets of variables having maximal correlation. The vector of coefficient of these linear combinations is called the canonical vector and the correlations between the canonical variables is called the canonical correlation (Wilms and Croux 2016).

To see how the ESIM and the canonical correlation theory are related, note that vectors y and g (Eq. 7.1) can be ordered in a new vector x as $ {\mathbf{x}}^{\prime }=\left[{\mathbf{y}}^{\prime}\kern0.5em {\mathbf{g}}^{\prime}\right] $, whence the covariance matrix of x is $ \left[\begin{array}{cc}\mathbf{P}& \mathbf{C}\\ {}\mathbf{C}& \mathbf{C}\end{array}\right] $. One measure of the association between the jth linear combination of y($ {I}_E={\mathbf{b}}_{E_j}^{\prime}\mathbf{y} $) and the jth linear combination of g($ {H}_E={\mathbf{w}}_{E_j}^{\prime}\mathbf{g} $) is the jth canonical correlation (λ_j) value obtained from equation $ \left({\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_j^2\mathbf{I}\right){\mathbf{b}}_{Ej}=\mathbf{0} $, where b_Ej is the jth canonical vector (j = 1, 2⋯, t) of matrix P⁻¹C, and $ {\mathbf{w}}_{E_j}={\mathbf{C}}^{-1}{\mathbf{Pb}}_{E_j} $. Thus, in the canonical correlation context, $ {I}_E={\mathbf{b}}_{E_j}^{\prime}\mathbf{y} $ and $ {H}_E={\mathbf{w}}_{E_j}^{\prime}\mathbf{g} $ are canonical variables.

In the ESIM, the first eigenvector ($ {\mathbf{b}}_{E_1} $) of matrix P⁻¹C should be used on $ {I}_E={\mathbf{b}}_{E_1}^{\prime}\mathbf{y} $; the first eigenvalue ($ {\lambda}_1^2 $) and $ {\mathbf{b}}_{E_1} $of P⁻¹C should be used on the ESIM selection response and on the ESIM expected genetic gain per trait, because, in this case, the ESIM has maximum accuracy compared with other indices, such as the LPSI. The latter results in this subsection imply that the sampling statistical properties associated with the canonical correlation theory are also valid for the ESIM.

7.1.4 Estimated ESIM Parameters and Their Sampling Properties

The estimated covariance matrix of the true breeding values (C) and that of the trait phenotypic values (P) are denoted as $ \widehat{\mathbf{C}} $ and $ \widehat{\mathbf{P}} $ respectively; they can be obtained by restricted maximum likelihood using Eqs. (2.22) to (2.24) described in Chap. 2. With matrices $ \widehat{\mathbf{C}} $ and $ \widehat{\mathbf{P}} $, we constructed matrix $ \widehat{\mathbf{T}}={\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ and equation

$$ \left(\widehat{\mathbf{T}}-{\widehat{\lambda}}_{Ej}^2\mathbf{I}\right){\widehat{\mathbf{b}}}_{Ej}=\mathbf{0}, $$

(7.21)

j = 1, 2, ⋯, t, where t is the number of traits in the ESIM index. Note that $ {\widehat{\lambda}}_{Ej}^2 $ is positive only if $ \widehat{\mathbf{P}} $ is positive definite (all eigenvalues positive) and $ \widehat{\mathbf{C}} $ is positive semidefinite (no negative eigenvalues); in addition, as $ {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ is an asymmetric matrix, the values of $ {\widehat{\mathbf{b}}}_{Ej} $ and $ {\widehat{\lambda}}_{Ej}^2 $ should be obtained using the singular value decomposition (SVD) theory (Anderson 2003).

Matrix $ \widehat{\mathbf{T}} $ is square and asymmetric of order t × t and rank q≤ minimum (p, c), where p and c denote the rank of $ {\widehat{\mathbf{P}}}^{-1} $ and $ \widehat{\mathbf{C}} $ respectively; the rank of $ \widehat{\mathbf{T}} $ is equal to c only if $ \widehat{\mathbf{C}} $ is square and nonsingular. Thus, matrix $ \widehat{\mathbf{T}} $ has a maximum of q eigenvalues different from zero (Rao 2002). In addition, $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ and $ {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} $ are symmetric matrices, but $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ ≠ $ {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} $. Using the SVD theory, matrix $ \widehat{\mathbf{T}} $ can be written as

$$ \widehat{\mathbf{T}}={\mathbf{V}}_1{\mathbf{L}}^{1/2}{\mathbf{V}}_2^{\prime }, $$

(7.22)

where V₁ ($ {\mathbf{V}}_1^{\prime }{\mathbf{V}}_1={\mathbf{V}}_1{\mathbf{V}}_1^{\prime }={\mathbf{I}}_q $) and V₂ ($ {\mathbf{V}}_2^{\prime }{\mathbf{V}}_2={\mathbf{V}}_2{\mathbf{V}}_2^{\prime }={\mathbf{I}}_q $) are matrices with the eigenvectors of matrices $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ and $ {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} $ respectively; L^1/2 is a diagonal matrix with the square root of the eigenvalues ($ {\widehat{\lambda}}_{E_1}^2\ge {\widehat{\lambda}}_{E_2}^2\ge \cdots \ge {\widehat{\lambda}}_{E_q}^2>0 $) of either $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ or $ {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} $ (the eigenvalues of $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ and $ {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} $ are the same). The entries $ {\widehat{\lambda}}_{E_1}^2\ge {\widehat{\lambda}}_{E_2}^2\ge \cdots \ge {\widehat{\lambda}}_{E_q}^2>0 $ of L^1/2 are uniquely determined, and they are called the singular values of $ \widehat{\mathbf{T}} $. The columns of V₁ are orthonormal vectors called left singular vectors of $ \widehat{\mathbf{T}} $, and the columns of V₂ are called right singular vectors (Watkins 2002).

Estimators $ {\widehat{\mathbf{b}}}_{E_1} $ and $ {\widehat{\lambda}}_{E_1}^2 $ of the first eigenvector $ {\mathbf{b}}_{E_1} $ and the first eigenvalue $ {\lambda}_{E_1}^2 $ respectively are the first column of matrix V₁ and the first diagonal element of matrix L^1/2. Thus, because $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } $ is a symmetric matrix, the maximum likelihood estimators $ {\widehat{\lambda}}_{E_1}^2 $ and $ {\widehat{\mathbf{b}}}_{E_1} $ in the ESIM context can be obtained from

$$ \left(\widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }-{\widehat{\mu}}_j\mathbf{I}\right){\widehat{\mathbf{b}}}_{E_j}=\mathbf{0}, $$

(7.23)

where $ {\widehat{\mu}}_j={\widehat{\lambda}}_{E_j}^4 $, j= 1, 2, …, t. In the asymptotic context, $ {\widehat{\lambda}}_{E_1}^2 $ and $ {\widehat{\mathbf{b}}}_{E_1} $ are consistent and unbiased estimators (Anderson 2003).

The latter results allow the ESIM index ($ {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} $) as $ {\widehat{I}}_E={\widehat{{\mathbf{b}}^{\prime}}}_{E_1}\mathbf{y} $ to be estimated. The estimator of the maximized ESIM selection response and expected genetic gain per trait are $ {\widehat{R}}_E={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}} $ and $ {\widehat{\mathbf{E}}}_E={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{E_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}} $ respectively, whereas the estimator of the maximized ESIM accuracy is $ {\widehat{\lambda}}_{E_1} $, which should be similar to the estimator of the square root of the maximized ESIM heritability.

In the asymptotic context, the estimator of b_Ej ($ {\widehat{\mathbf{b}}}_{Ej} $) has multivariate normal distribution with expectation $ E\left({\widehat{\mathbf{b}}}_{Ej}\right)={\mathbf{b}}_{Ej} $ and variance

$$ Var\left({\widehat{\mathbf{b}}}_{Ej}\right)=\frac{1}{2n}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime }+\frac{1}{n}\left(1-{\lambda}_{Ej}^2\right)\sum \limits_{i\ne j}^t\frac{\lambda_{Ej}^2+{\lambda}_{Ei}^2-2{\lambda}_{Ei}^2{\lambda}_{Ej}^2}{{\left({\lambda}_{Ei}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ei}{\mathbf{b}}_{Ei}^{\prime }, $$

(7.24)

and, for i ≠ j, the covariance between $ {\widehat{\mathbf{b}}}_{Ei} $ and $ {\widehat{\mathbf{b}}}_{Ej} $ can be written as

$$ Cov\left({\widehat{\mathbf{b}}}_{Ei},{\widehat{\mathbf{b}}}_{Ej}\right)=\frac{\left(1-{\lambda}_{Ej}^2\right)\left(1-{\lambda}_{Ei}^2\right)\left({\lambda}_{Ei}^2+{\lambda}_{Ej}^2\right)}{n{\left({\lambda}_{Ei}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ei}^{\prime }, $$

(7.25)

where n is the number of individuals or genotypes (Anderson 1999). The variance of $ {\widehat{\mathbf{b}}}_{Ej} $ and the covariance between $ {\widehat{\mathbf{b}}}_{Ei} $ and $ {\widehat{\mathbf{b}}}_{Ej} $ depend not only on n, but also on eigenvalues $ {\lambda}_{Ei}^2 $ and $ {\lambda}_{Ej}^2 $. Suppose that $ {\lambda}_{Ej}^2>{\lambda}_{Ei}^2 $; then, when $ {\lambda}_{Ej}^2 $ is very close to 1, $ Var\left({\widehat{\mathbf{b}}}_{Ej}\right)\approx \frac{1}{2n}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime } $ (“≈”denotes an approximation) and $ Cov\left({\widehat{\mathbf{b}}}_{Ei},{\widehat{\mathbf{b}}}_{Ej}\right) $ is very close to 0. By the result of Eq. (7.24), the variance of the first eigenvector ($ {\widehat{\mathbf{b}}}_{E1} $) of $ {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ can be written as $ Var\left({\widehat{\mathbf{b}}}_{E1}\right)=\frac{1}{2n}{\mathbf{b}}_{E1}{\mathbf{b}}_{E1}^{\prime }+\frac{1}{n}\left(1-{\lambda}_{E1}^2\right)\sum \limits_{j=2}^t\frac{\lambda_{E1}^2+{\lambda}_{Ej}^2-2{\lambda}_{E1}^2{\lambda}_{Ej}^2}{{\left({\lambda}_{E1}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime } $. If the first eigenvalue $ {\lambda}_{E1}^2 $ of P⁻¹C is very close to 1 ($ {\lambda}_{E1}^2\approx 1 $), $ Var\left({\widehat{\mathbf{b}}}_{E1}\right)=\frac{1}{2n}{\mathbf{b}}_{E1}{\mathbf{b}}_{E1}^{\prime } $ and $ Cov\left({\widehat{\mathbf{b}}}_{E1},{\widehat{\mathbf{b}}}_{Ej}\right)\approx 0 $.

In the asymptotic context, the jth estimator ($ {\widehat{\lambda}}_{Ej} $) of the canonical correlations has normal distribution with expectation $ E\left({\widehat{\lambda}}_{Ej}\right)\approx {\lambda}_{Ej} $ and variance

$$ Var\left({\widehat{\lambda}}_{Ej}\right)\approx \frac{{\left(1-{\lambda}_{Ej}^2\right)}^2}{n}, $$

(7.26)

whereas the jth estimator of the square of the canonical correlations $ {\widehat{\lambda}}_{Ej}^2 $ has normal distribution with expectation $ E\left({\widehat{\lambda}}_{Ej}^2\right)\approx {\lambda}_{Ej}^2 $ and variance

$$ Var\left({\widehat{\lambda}}_j^2\right)\approx \frac{4{\lambda}_{Ej}^2{\left(1-{\lambda}_{Ej}^2\right)}^2}{n}. $$

(7.27)

In addition, for i ≠ j, the correlation between $ {\widehat{\lambda}}_{Ej}^2 $ and $ {\widehat{\lambda}}_{Ei}^2 $ is zero, i.e., $ Corr\left({\widehat{\lambda}}_{Ei}^2,{\widehat{\lambda}}_{Ej}^2\right)=0 $ (Bilodeau and Brenner 1999; Muirhead 2005).

Equation (7.26) implies that under the restrictions $ {\sigma}_H^2=1 $ and $ {\sigma}_I^2=1 $, the expectation and variance of $ {\widehat{R}}_E={k}_I{\widehat{\lambda}}_{E1} $ are $ E\left({\widehat{R}}_E\right)\approx {k}_I{\lambda}_{E1} $ and $ Var\left({\widehat{R}}_E\right)\approx \frac{k_I^2{\left(1-{\lambda}_{E1}^2\right)}^2}{n} $ respectively. However, obtaining the expectation and variance of $ {\widehat{R}}_E={k}_I{\widehat{\sigma}}_H{\widehat{\lambda}}_{E1} $ or $ {\widehat{R}}_E={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E1}} $ is more difficult, because in both equations there are two estimators: $ {\widehat{\sigma}}_H $ and $ {\widehat{\lambda}}_1 $ in the first one, and $ \widehat{\mathbf{P}} $ and $ {\widehat{\mathbf{b}}}_{E1} $ in the second one.

7.1.5 Numerical Examples

We compare ESIM efficiency versus LPSI efficiency using a real data set from commercial egg poultry lines obtained from Akbar et al. (1984). The estimated phenotypic ($ \widehat{\mathbf{P}} $) and genetic ($ \widehat{\mathbf{C}} $) covariance matrices among the rate of lay (RL, number of eggs), age at sexual maturity (SM, days) and egg weight (EW, kg), were $ \widehat{\mathbf{P}}=\left[\begin{array}{ccc}240.57& -95.62& 2.07\\ {}-95.62& 167.20& 4.58\\ {}2.07& 4.58& 22.80\end{array}\right] $ and $ \widehat{\mathbf{C}}=\left[\begin{array}{ccc}29.86& -17.90& -4.13\\ {}-17.90& 18.56& 1.49\\ {}-4.13& 1.49& 9.24\end{array}\right] $ respectively. The number of genotypes and the vector of economic weights were n= 3330 and $ {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] $ respectively, whereas the selection intensity was 10% (k_I = 1.755) for both indices.

The estimated LPSI vector of coefficients was $ {\widehat{\mathbf{b}}}_S^{\prime }={\mathbf{w}}^{\prime }{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[1.82\kern0.5em -1.38\kern0.5em 3.25\right] $, whereas the estimated selection response, expected genetic gain per trait, accuracy, and heritability of the LPSI were $ {\widehat{R}}_S=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}=74.91 $, $ {\widehat{\mathbf{E}}}_S^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_S^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}=\left[2.70\kern0.5em -2.20\kern0.5em 0.84\right] $, $ {\widehat{\rho}}_S=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.362 $, and $ {\widehat{h}}_S^2=\frac{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_S}{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}=0.143 $ respectively.

Note that because in the ESIM context $ {\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{b}}}_E=1 $, the best way of comparing ESIM results versus LPSI results is when the LPSI coefficient vector is normalized, i.e., when the LPSI coefficient vector is equal to $ {{\widehat{\mathbf{b}}}_S}^{\ast }={\widehat{\mathbf{b}}}_S/{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $ and then $ {\widehat{{\mathbf{b}}^{\prime}}}_S^{\ast^{\prime }}{\widehat{\mathbf{b}}}_S^{\ast }=1 $; however, it can be shown that the normalization process only affects the estimated LPSI selection response because in that case, $ {\widehat{R}}_S=74.91 $ is divided by $ {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $. For example, for this data set result, $ {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S=15.76 $; then, the estimated LPSI selection response using $ {{\widehat{\mathbf{b}}}_S}^{\ast }={\widehat{\mathbf{b}}}_S/{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $ is $ {\widehat{R}}_S=\frac{74.91}{15.74}=4.75 $, whereas the rest of the estimated LPSI parameters are the same. When $ 0<{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S<1 $ and $ 1<{\widehat{R}}_S $, the values of $ {\widehat{R}}_S $ increase, but when $ 1<{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $, the values of $ {\widehat{R}}_S $ decrease, as in the example.

The product $ {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $ does not affect $ {\widehat{\rho}}_S $ because it is invariant to scale change. Also, $ {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $ does not affect $ {\widehat{h}}_S^2 $ and $ {\widehat{\mathbf{E}}}_S $ because $ {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S $ appears in the numerator and denominator of both estimated parameters.

In the ESIM, the sign and proportion of the expected genetic gain values for traits RL, SM, and EW should be in accordance with the breeder’s interest. For example, if the breeder’s interest is that the expected genetic gain per trait for RL should be positive and negative for SM, the sign and proportion of the values of the first eigenvector should be modified using a linear combination of the estimated first eigenvector $ {\widehat{\mathbf{b}}}_{E_1} $, i.e., $ \widehat{\boldsymbol{\upbeta}}=\mathbf{F}{\widehat{\mathbf{b}}}_{E_1} $, to achieve expected genetic gain per trait values in RL and SM according to the breeder’s interest.

The information needed to obtain the estimated ESIM parameters are matrices $ \widehat{\mathbf{T}}={\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1102& -0.0405& -0.0280\\ {}-0.0390& 0.0864& -0.0184\\ {}-0.1833& 0.0517& 0.4115\end{array}\right] $ and $ \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }=\left[\begin{array}{ccc}0.0146& -0.0073& -0.0338\\ {}-0.0073& 0.0093& 0.0041\\ {}-0.0338& 0.0041& 0.2056\end{array}\right] $. We need to find the eigenvalues and eigenvectors of equation $ \left(\widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }-{\widehat{\mu}}_j\mathbf{I}\right){\widehat{\mathbf{b}}}_{E_j}=\mathbf{0} $, where $ {\widehat{\mu}}_j={\widehat{\lambda}}_{E_j}^4 $, to obtain matrices V₁ and L^1/2, which form matrix $ \widehat{\mathbf{T}}={\mathbf{V}}_1{\mathbf{L}}^{1/2}{\mathbf{V}}_2^{\prime } $.

Matrix V₁ is equal to $ {\mathbf{V}}_1=\left[\begin{array}{ccc}-0.1701& 0.6818& 0.7115\\ {}0.0259& -0.7187& 0.6948\\ {}0.9851& 0.1366& 0.1046\end{array}\right] $, whereas the diagonal elements of matrix L are 0.2115, 0.0155, and 0.0025, that is, matrix $ {\mathbf{L}}^{1/2}=\left[\begin{array}{ccc}0.4599& 0& 0\\ {}0& 0.1244& 0\\ {}0& 0& 0.0498\end{array}\right] $. Thus, $ {\widehat{\mu}}_1={\widehat{\lambda}}_{E_1}^4=0.2115 $, $ {\widehat{\lambda}}_{E_1}^2=0.4599 $, and the estimated ESIM accuracy was $ {\widehat{\lambda}}_{E_1}=0.6782 $. The estimated ESIM eigenvector of coefficients is the first column of matrix V₁, i.e., $ {\widehat{\mathbf{b}}}_{E_1}^{\prime }=\left[-0.1701\kern0.5em 0.0259\kern0.5em 0.9851\right] $, and the estimated ESIM index can be constructed as $ {\widehat{I}}_E=-0.1701\mathrm{RL}+0.0259\mathrm{SM}+0.9851\mathrm{EW} $.

The estimated ESIM selection response and expected genetic gain per trait were $ {\widehat{R}}_E=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}=9.54 $ and $ {\widehat{\mathbf{E}}}_E^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_{E_1}^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}}=\left[-3.10\kern0.5em 1.61\kern0.5em 3.18\right] $ respectively. Because the estimated LPSI selection response was $ {\widehat{R}}_S=\frac{74.91}{15.74}=4.75 $, the estimated ESIM selection response was higher than the estimated LPSI response. In addition, the estimated LPSI expected genetic gain per trait was $ {\widehat{\mathbf{E}}}_S^{\prime }=\left[2.70\kern0.5em -2.20\kern0.5em 0.84\right] $. Now, suppose that the breeder’s interest is to increase RL and decrease SM; then, $ {\widehat{\mathbf{E}}}_S^{\prime } $ is a good result but $ {\widehat{\mathbf{E}}}_E^{\prime } $ is wrong.

We can change the sign and proportion of $ {\widehat{\mathbf{E}}}_E^{\prime } $ by transforming $ {\widehat{\mathbf{b}}}_{E_1} $ into $ \widehat{\boldsymbol{\upbeta}}=\mathbf{F}{\widehat{\mathbf{b}}}_{E_1} $ using a convenient matrix F such as $ \mathbf{F}=\left[\begin{array}{ccc}-9& 0& 0\\ {}0& 1& 0\\ {}0& 0& 1\end{array}\right] $. In such a case $ {\widehat{\boldsymbol{\upbeta}}}^{\prime }={\widehat{\mathbf{b}}}_{E_1}^{\prime}\mathbf{F}=\left[1.531\kern0.5em 0.026\kern0.5em 0.981\right] $, $ {\widehat{R}}_E=1.755\sqrt{{\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}=42.44 $, and $ {\widehat{\mathbf{E}}}_E^{\prime }=1.755\frac{\widehat{{\boldsymbol{\upbeta}}^{\prime }}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\boldsymbol{\upbeta}}^{\prime }}\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}}=\left[2.990\kern0.5em -1.85\kern0.5em 0.205\right] $. However, vector $ {\widehat{\boldsymbol{\upbeta}}}^{\prime } $ was not normalized. To normalize $ {\widehat{\boldsymbol{\upbeta}}}^{\prime } $ we need to divide it by $ {\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\boldsymbol{\upbeta}}=3.314 $, but $ {\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\boldsymbol{\upbeta}} $ should only affect $ {\widehat{R}}_E=42.44 $, which should be divided by 3.314, that is, $ {\widehat{R}}_E=\frac{42.44}{3.314}=12.806 $. According to the theory of similar matrices (Harville 1997), the estimated maximized ESIM accuracy, $ {\widehat{\lambda}}_{E_1}=0.6782 $, should not be affected by matrix F.

We can compare ESIM efficiency versus LPSI efficiency to predict the net genetic merit using the ratio of the estimated ESIM accuracy $ {\widehat{\lambda}}_{E_1}=0.6782 $ to LPSI accuracy $ {\widehat{\rho}}_S=0.362 $, i.e., $ \frac{{\widehat{\lambda}}_{E_1}}{{\widehat{\rho}}_S}=\frac{0.6782}{0.362}=1.873 $, or in percentage terms, $ {\widehat{p}}_E=100\left(1.873-1\right)=87.3 $ (see Chap. 5, Eq. 5.17). According to the latter result, the ESIM is a better predictor of the net genetic merit and its efficiency is 87.3% higher than that of the LPSI for this data set.

Now, we compare ESIM efficiency versus LPSI efficiency using the data set described in Sect. 2.8.1 of Chap. 2. From this data set, we ran five phenotypic selection cycles, each with four traits (T₁, T₂, T₃, and T₄), 500 genotypes, and four replicates for each genotype. The economic weights for T₁, T₂, T₃, and T₄ were 1, −1, 1, and 1 respectively. In this case, matrix F is an identity matrix of size 4 × 4 for all five selection cycles.

Table 7.1 presents the estimated LPSI, the restricted LPSI (RLPSI), and the predetermined proportional gain LPSI (PPG-LPSI) selection response (the latter two for one, two, and three restrictions) for five simulated selection cycles when their vectors of coefficients are normalized. Table 7.1 also presents the estimated ESIM, the RESIM and the PPG-ESIM selection response for one, two, and three restrictions for five simulated selection cycles. The selection intensity was 10% (k_I = 1.755) for all five selection cycles. In this subsection, we compare only LPSI results versus ESIM results. The estimated LPSI selection response when the vector of coefficients was not normalized was described in Chap. 2 (Table 2.4). The averages of the estimated LPSI and ESIM selection responses were 4.70 and 6.31 respectively.

Table 7.1 Estimated linear phenotypic selection index (LPSI), restricted null LPSI (RLPSI), and predetermined proportional gains LPSI (PPG-LPSI) selection responses when their vectors of coefficients are normalized; estimated eigen selection index method (ESIM), restricted null ESIM (RESIM), and predetermined proportional gain ESIM (PPG-ESIM) selection responses for one, two, and three restrictions for five simulated selection cycles

Full size table

Table 7.2 presents the estimated ESIM expected genetic gain per trait, accuracy ($ {\widehat{\rho}}_E $), and the values $ {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) $, where $ {\widehat{\lambda}}_E={\widehat{\rho}}_E/{\widehat{\rho}}_S $ is the ratio of $ {\widehat{\rho}}_E $ to the estimated LPSI accuracy ($ {\widehat{\rho}}_S $), expressed as percentages. Table 7.2 also presents the accuracy of the PPG-ESIM and the estimated ratio ($ {\widehat{p}}_{PE} $) of the estimated PPG-ESIM accuracy to the estimated PPG-LPSI accuracy, expressed as percentages, for one, two, and three predetermined restrictions for five simulated selection cycles. In this subsection, we use only the estimated ESIM expected genetic gain per trait and $ {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) $ to compare ESIM efficiency versus LPSI efficiency.

Table 7.2 Estimated eigen selection index method (ESIM) expected genetic gain per trait, accuracy ($ {\widehat{\rho}}_E $), and ratio of $ {\widehat{\rho}}_E $ to the estimated LPSI (data not presented) accuracy ($ {\widehat{\rho}}_S $), expressed in percentage terms, $ {\widehat{\rho}}_E=100\left({\widehat{\lambda}}_E-1\right) $ (where $ {\widehat{\lambda}}_E={\widehat{\rho}}_E/{\widehat{\rho}}_S $)

Full size table

The estimated LPSI expected genetic gains per trait were presented in Chap. 2, Table 2.4. According to the results shown in Table 2.4, the averages of the estimated LPSI expected genetic gain per trait T1, T2, T3, and T4 for five simulated selection cycles were 7.26, −3.52, 2.78, and 1.58, whereas according to the results of Table 7.2, the averages of the estimated ESIM expected genetic gains per trait were 5.67, −2.67, 1.81, and 2.9 respectively. This means that the estimated LPSI expected genetic gain for traits T1, T2, and T3 was higher than the estimated ESIM expected genetic gain for those traits.

The average of the $ {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) $ values was 9.76 for all five selection cycles (Table 7.2). The latter result is not in accordance with the LPSI and ESIM expected genetic gain per trait; however, note that the $ {\widehat{p}}_E $ values are associated with the estimated LPSI and ESIM selection responses (Table 7.1), not with the expected genetic gain per trait, because $ {\widehat{\lambda}}_E=\frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\approx \frac{{\widehat{R}}_E}{{\widehat{R}}_S} $, where $ {\widehat{R}}_E $ and $ {\widehat{R}}_S $ are the estimated ESIM and LPSI selection responses respectively. Thus, the $ {\widehat{p}}_E $values indicate that the efficiency of the ESIM and that of the LPSI were very similar because the former was only 9.76% higher than the latter for this data set.

The equality $ \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}=\frac{{\widehat{R}}_E}{{\widehat{R}}_S} $ is true only when the denominators of both estimated correlations are the same, as in the linear selection indices described in Chaps. 2–6. Note that $ {\widehat{\rho}}_S=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}} $ and $ {\widehat{\rho}}_E=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_E}}{\sqrt{{\mathbf{w}}_E^{\prime}\widehat{\mathbf{C}}{\mathbf{w}}_E}} $, whereas $ {\widehat{R}}_S=\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S} $ and $ {\widehat{R}}_E=\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_E} $; this means that if $ \sqrt{{\mathbf{w}}_E^{\prime}\widehat{\mathbf{C}}{\mathbf{w}}_E}\ne \sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}} $, $ \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\ne \frac{{\widehat{R}}_E}{{\widehat{R}}_S} $. For the Akbar et al. (1984) data, $ {\widehat{R}}_E=9.54 $ and $ {\widehat{R}}_S=4.75 $, then $ \frac{{\widehat{R}}_E}{{\widehat{R}}_S}=2.0 $ but $ \frac{{\widehat{\lambda}}_{E_1}}{{\widehat{\rho}}_S}=1.873 $; that is, $ \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\approx \frac{{\widehat{R}}_E}{{\widehat{R}}_S} $, where “≈” indicates an approximation.

Figure 7.1 presents the frequency distribution of 500 estimated ESIM values for cycle 2 (Fig. 7.1a) and cycle 5 (Fig. 7.1b), obtained from one selection cycle for 500 genotypes and four traits simulated in one environment. Figure 7.1a, b indicates that the frequency distribution of the estimated ESIM values approaches normal distribution.

7.2 The Linear Phenotypic Restricted Eigen Selection Index Method

Similar to the RLPSI (see Chap. 2), the objective of the RESIM is to fix r of t (r < t) traits by predicting only the genetic gains of (t − r) of them. Let H = w′g be the net genetic merit and I = b′y the ESIM index. In Chap. 2, we showed that Cov(I, g) = Cb is the covariance between the breeding value vector (g) and I = b′y. Thus, to fix r of t traits, we need r covariances between the linear combinations of g (U′g) and I = b′y to be zero, i.e., Cov(I, U′g) = U′Cb = 0, where U′ is a matrix with 1s and 0s (1 indicates that the trait is restricted and 0 that the trait has no restrictions). In the RESIM, it is possible to solve this problem by maximizing $ {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} $ with respect to vectors b and w under the restrictions U′Cb = 0, b′b = 1, w′Cw = 1, and b′Pb = 1, where w′Cw is the variance of H = w′g and b′Pb is the variance of I = b′y. Also, the RESIM problem can be solved by maximizing $ \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} $ (Eq. 7.12) with respect to vectors b only under the restrictions U′Cb = 0 and b′b = 1, as we did to obtain Eq. (7.13). Both approaches give the same result, but it is easier to work with the second approach than with the first one.

7.2.1 The RESIM Parameters

To obtain the RESIM vector of coefficients that maximizes the RESIM selection response and the expected genetic gain per trait, we need to maximize the function

$$ f\left(\mathbf{b},{\mathbf{v}}^{\prime}\right)=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}-{\mathbf{v}}^{\prime }{\mathbf{U}}^{\prime}\mathbf{Cb} $$

(7.28a)

with respect to b and v′, where $ {\mathbf{v}}^{\prime }=\left[{v}_1\kern0.5em {v}_2\kern0.5em \cdots \kern0.5em {v}_{r-1}\right] $ is a vector of Lagrange multipliers. The derivatives of Eq. (7.28a) with respect to b and v′ can be written as

$$ 2{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{1/2}\mathbf{Cb}-{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{-1/2}\left({\mathbf{b}}^{\prime}\mathbf{Cb}\right)\mathbf{Pb}-\mathbf{CUv}=\mathbf{0} $$

(7.28b)

and

$$ {\mathbf{U}}^{\prime}\mathbf{Cb}=\mathbf{0}, $$

(7.29)

respectively, where Eq. (7.29) denotes the restriction imposed for maximizing Eq. (7.28a). Using algebraic methods on Eq. (7.28b) similar to those used to obtain Eqs. (7.10) and (7.13), we get

$$ \left({\mathbf{KP}}^{-1}\mathbf{C}-{h}_{I_R}^2{\mathbf{I}}_t\right){\mathbf{b}}_R=\mathbf{0}, $$

(7.30)

where K = [I_t − Q_R], I_t is an identity matrix of size t × t, Q_R = P⁻¹CU(U′CP⁻¹CU)⁻¹U′C, and $ {h}_{I_R}^2=\frac{{\mathbf{b}}_R^{\prime }{\mathbf{Cb}}_R}{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R} $ is the maximized RESIM heritability obtained under the restriction U′Cb = 0; $ {h}_{I_R}^2 $ is also the square of the maximized correlation between the net genetic merit and $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $, that is, $ {h}_{I_R}^2={\lambda}_R^2 $. This means that Eq. (7.30) can be written as

$$ \left({\mathbf{KP}}^{-1}\mathbf{C}-{\lambda}_R^2{\mathbf{I}}_t\right){\mathbf{b}}_R=\mathbf{0}. $$

(7.31)

Thus, the optimized RESIM index is $ I={\mathbf{b}}_R^{\prime}\mathbf{y} $. The only difference between Eqs. (7.31) and (7.13) is matrix K. Equation (7.31) was obtained by Cerón-Rojas et al. (2008) by maximizing $ {\rho}_{HI}^2 $ (Eq. 7.1) with respect to vectors b and w under the restriction U′Cb = 0, b′b = 1, w′Cw = 1 and b′Pb = 1 in a similar manner to the canonical correlation theory. The RESIM expected genetic gain per trait uses the first eigenvector (b_R) of matrix KP⁻¹C, whereas the RESIM selection response uses b_R and the first eigenvalue ($ {\lambda}_R^2 $) of matrix KP⁻¹C. When U′ is a null matrix, b_R = b_E (the vector of the ESIM coefficients); thus, the RESIM is more general than the ESIM and includes the ESIM as a particular case.

In the RESIM context, vector w can be obtained (Cerón-Rojas et al. 2008) as

$$ {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[{\lambda}_R{\mathbf{Pb}}_R+\boldsymbol{\Psi} \mathbf{v}\right], $$

(7.32)

where λ_R and b_R are the square roots of the first eigenvalue ($ {\lambda}_R^2 $) and the first eigenvector of matrix KP⁻¹C respectively; Ψ = CU and $ \mathbf{v}={\lambda}_R^{-1}{\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_R $. Let $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ be the net genetic merit in the RESIM context; then, because the correlation between $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $ and $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ is not affected by scale change, λ_R and $ {\lambda}_R^{-1} $ can be considered proportional constants and then Ψv can be written as $ \boldsymbol{\Psi} \mathbf{v}=\boldsymbol{\Psi} {\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_R={\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R $, where $ {\mathbf{Q}}_R^{\prime } $ is the transpose of matrix Q_R described in Eq. (7.30). Thus, another way of writing Eq. (7.32) is

$$ {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[\mathbf{P}+{\mathbf{Q}}_R^{\prime}\mathbf{C}\right]{\mathbf{b}}_R. $$

(7.33)

By Eq. (7.33) and the restriction b′Ψ = 0, the covariance between $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $ and $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ ($ {\sigma}_{H_R{I}_R} $) can be written as

$$ {\sigma}_{H_R{I}_R}={\mathbf{w}}_R^{\prime }{\mathbf{Cb}}_R={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R+{\mathbf{b}}_R^{\prime }{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R, $$

(7.34)

where $ {\mathbf{b}}_R^{\prime }{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R=0 $ according to the restriction b′Ψ = 0. Equation (7.34) indicates that the covariance between I_R and H_R ($ {\sigma}_{H_R{I}_R} $) is equal to the variance of I_R ($ {\sigma}_{I_R}^2={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R $).

The maximized correlation between I_R and H_R (or RESIM accuracy) can be written as

$$ {\rho}_{H_R{I}_R}=\frac{\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}}{\sqrt{{\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R}}, $$

(7.35)

where $ {\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R={\sigma}_{H_R}^2 $ is the variance of H_R, $ {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[\mathbf{P}+{\mathbf{Q}}_R^{\prime}\mathbf{C}\right]{\mathbf{b}}_R $, $ {\mathbf{Q}}_R^{\prime }=\boldsymbol{\Psi} {\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1} $, and Ψ = CU. When U′ is a null matrix, $ {\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R={\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E={\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E $, the variance of H_E, and $ {\sigma}_{I_R}^2={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E={\sigma}_{I_E}^2 $, the variance of I_E. Hereafter, to simplify the notation, we write Eq. (7.35) as ρ_R or λ_R.

The maximized selection response (R_R) and expected genetic gain per trait (E_R) of the RESIM can be written as

$$ {R}_R={k}_I\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R} $$

(7.36)

and

$$ {\mathbf{E}}_R={k}_I\frac{{\mathbf{Cb}}_R}{\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}}, $$

(7.37)

respectively, where $ \sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}={\sigma}_{I_R} $ is the standard deviation of the variance of $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $. If vector b_R is transformed as β_R = Fb_R, where matrix F was defined earlier, vector b_R should be changed by β_R in Eqs. (7.36) and (7.37), and in $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $.

Equation (7.36) can also be written as $ {R}_R={k}_I{\sigma}_{H_R}{\lambda}_R $, where $ {\sigma}_{H_R}=\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_R+{\mathbf{b}}_R^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R} $ is the standard deviation of the variance of H_R, and $ {\lambda}_R={\rho}_{H_R{I}_R} $ is the first canonical correlation between $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ and $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $. When $ {\sigma}_{H_R}=1 $, λ_R is the covariance between $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ and $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $, and then Eq. (7.36) can be written as R_R = k_Iλ_R. This last result was presented by Cerón-Rojas et al. (2008) in their original paper.

The ratio of the index accuracies and the VPE are also valid in the RESIM context. In Eq. (7.34) we showed that the covariance between $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $ and $ {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} $ ($ {\sigma}_{H_R{I}_R} $) is equal to the variance of $ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $ ($ {\sigma}_{I_R}^2 $). This means that the VPE of the RESIM can be written as

$$ E\left[{\left({H}_R-{I}_R\right)}^2\right]={\sigma}_{H_R}^2+{\sigma}_{I_R}^2-2{\sigma}_{H_R{I}_R}={\sigma}_{H_R}^2-{\sigma}_{I_R}^2=\left(1-{\rho}_R^2\right){\sigma}_{H_R}^2. $$

(7.38)

Statistical properties associated with the ESIM and described in Sect. 7.1.2 are also valid for the RESIM.

7.2.2 Estimating the RESIM Parameters

We can estimate the RESIM parameters in a similar manner to the ESIM parameters in Sect. 7.1.4. With matrices $ \widehat{\mathbf{C}} $ and $ \widehat{\mathbf{P}} $, we constructed matrix $ {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ and equation

$$ \left({\widehat{\mathbf{S}}}_R{\widehat{\mathbf{S}}}_R^{\prime }-{\widehat{\mu}}_{Rj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{R_j}=\mathbf{0}, $$

(7.39)

where $ {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 $, j= 1, 2, …, t. The estimated RESIM index ($ {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} $) is $ {\widehat{I}}_R={\widehat{{\mathbf{b}}^{\prime}}}_{R_1}\mathbf{y} $ and the estimator of the maximized RESIM selection response and its expected genetic gain per trait can be denoted as $ {\widehat{R}}_R={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}} $ and $ {\widehat{\mathbf{E}}}_R={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{R_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}} $ respectively, whereas the estimator of the maximized RESIM accuracy is $ {\widehat{\lambda}}_{R_1} $.

7.2.3 Numerical Examples

We compare the RLPSI results with those of the RESIM using the Akbar et al. (1984) data described in Sect. 7.1.5. We restrict the trait RL (number of eggs) in both indices. In Chap. 3, Sect. 3.1.3, we indicated how to construct matrix U′ and, in Sect. 3.1.4 of the same chapter, we described how to obtain matrix $ \widehat{\mathbf{K}}=\left[{\mathbf{I}}_t-\widehat{\mathbf{Q}}\right] $ for one and two restrictions. Matrix $ \widehat{\mathbf{K}} $ is the same for the RLPSI and the RESIM. Thus, in this subsection we omit the steps needed to construct matrices U′ and $ \widehat{\mathbf{K}} $.

First, we estimate the RLPSI parameters. Assume a selection intensity of 10% (k_I = 1.755) and a vector of economic weights $ {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] $. The estimated RLPSI vector of coefficients for one restriction was $ {\widehat{\mathbf{b}}}^{\prime }=\left[0.29\kern0.5em -0.84\kern0.5em 5.78\right] $, and the estimated selection response, expected genetic gain per trait, accuracy, and heritability of the RLPSI were $ \widehat{R}=1.755\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=53.01 $, $ {\widehat{\mathbf{E}}}^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}^{\prime}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}=\left[0\kern0.5em -0.71\kern0.5em 2.96\right] $, $ \widehat{\rho}=\frac{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.26 $, and $ {\widehat{h}}^2=\frac{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{C}}\widehat{\mathbf{b}}}{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=0.33 $ respectively. In this case, $ \widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{b}}=34.25 $; then, the estimated RLPSI selection response using the normalized RLPSI vector of coefficients was $ \widehat{R}=\frac{53.01}{34.25}=1.55 $, and the rest of the estimated RLPSI parameters were the same.

In the RESIM, matrix F was an identity matrix of size 3 × 3; that is, we did not use matrix F to transform the RESIM vector of coefficients. In Sect. 7.1.5 we obtained matrix $ {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1102& -0.0405& -0.0280\\ {}-0.0390& 0.0864& -0.0184\\ {}-0.1833& 0.0517& 0.4115\end{array}\right] $, and we have indicated that matrix $ \widehat{\mathbf{K}} $ is the same for the RLPSI and the RESIM. In the RESIM, we need matrix $ {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ to solve equation $ \left({\widehat{\mathbf{S}}}_R{\widehat{{\mathbf{S}}^{\prime}}}_R-{\widehat{\mu}}_{Rj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{R_j}=\mathbf{0} $, where $ {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 $, whence we shall obtain the eigenvalues and eigenvectors that form matrices $ {\mathbf{L}}_R^{1/2} $, V_R1, and $ {\widehat{\mathbf{S}}}_R={\mathbf{V}}_{R1}{\mathbf{L}}_R^{1/2}{\mathbf{V}}_{R2}^{\prime } $.

For one null restriction, matrix $ {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0& 0.0285& 0.0232\\ {}0& 0.0620& -0.0365\\ {}0& -0.0630& 0.3263\end{array}\right] $. This means that $ {\widehat{\mathbf{S}}}_R $ reflects the trait restrictions imposed on the covariance between the RESIM and the vector of genotypic values; thus, if r traits are restricted, r columns of $ {\widehat{\mathbf{S}}}_R $ are equal to zero. Matrix $ {\widehat{\mathbf{S}}}_R{\widehat{{\mathbf{S}}^{\prime}}}_R=\left[\begin{array}{ccc}0.0013& 0.0009& 0.0058\\ {}0.0009& 0.0052& -0.0158\\ {}0.0058& -0.0158& 0.1104\end{array}\right] $ and $ {\mathbf{V}}_{R_1}=\left[\begin{array}{ccc}0.0500& 0.5216& -0.8517\\ {}-0.1446& 0.8476& 0.5106\\ {}0.9882& 0.0976& 0.1178\end{array}\right] $, whereas the $ {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 $ values were 0.1130, 0.0039, and 0.0, whence $ {\mathbf{L}}_R^{1/2}=\left[\begin{array}{ccc}0.3362& 0& 0\\ {}0& 0.0626& 0\\ {}0& 0& 0.0\end{array}\right] $. Thus, $ {\widehat{\mu}}_{R_1}={\widehat{\lambda}}_{R_1}^4=0.1130 $, $ {\widehat{\lambda}}_{R_1}^2=0.3362 $, and the estimated RESIM accuracy was $ {\widehat{\lambda}}_{E_1}=0.5798 $. The estimated RESIM eigenvector, index, the selection response, and expected genetic gain per trait were $ {\widehat{\mathbf{b}}}_{R_1}^{\prime }=\left[0.0500\kern0.5em -0.1446\kern0.5em 0.9882\right] $, $ {\widehat{I}}_R=0.0500\mathrm{RL}-0.1446\mathrm{SM}+0.9882\mathrm{EW} $, $ {\widehat{R}}_R=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}=9.06 $, and $ {\widehat{\mathbf{E}}}_R^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_{R_1}^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}}=\left[0\kern0.5em -0.72\kern0.5em 2.96\right] $ respectively.

The estimated RLPSI selection response was $ \widehat{R}=\frac{53.01}{34.25}=1.55 $; thus, the estimated RESIM selection response was higher than the estimated RLPSI response. In addition, the estimated RLPSI expected genetic gain per trait was $ {\widehat{\mathbf{E}}}^{\prime }=\left[0\kern0.5em -0.71\kern0.5em 2.96\right] $, which is the same as the estimated RESIM expected genetic gain per trait.

We can compare RESIM efficiency versus RLPSI efficiency to predict the net genetic merit using the ratio of the estimated RESIM accuracy $ {\widehat{\lambda}}_{E_1}=0.5798 $ to the RLPSI accuracy $ \widehat{\rho}=0.26 $, i.e., $ \frac{{\widehat{\lambda}}_{R_1}}{{\widehat{\rho}}_S}=\frac{0.5798}{0.26}=2.23 $, or in percentage terms, $ {\widehat{p}}_E=100\left(2.23-1\right)=123 $ (see Chap. 5, Eq. 5.17). That is, the RESIM is a better predictor of the net genetic merit and its efficiency was 123% higher than the RLPSI efficiency for this data set.

Now, we compare RESIM efficiency versus RLPSI efficiency using the simulated data set described in Sect. 2.8.1 of Chap. 2 for five phenotypic selection cycles, each with four traits (T₁, T₂, T₃, and T₄), 500 genotypes, and four replicates for each genotype. The economic weights for T₁, T₂, T₃, and T₄ were 1, −1, 1, and 1 respectively. For this data set, matrix F was equal to an identity matrix of size 4 × 4 for all five selection cycles.

The first and second parts of columns 3, 4, and 5 of Table 7.1 present the estimated RLPSI and RESIM selection responses respectively for one, two, and three null restrictions for five simulated selection cycles, where the selection intensity was 10% (k_I = 1.755) for all five selection cycles. The averages of the estimated RLPSI selection response for each null restriction were 4.43, 4.30, and 4.92, whereas the averages of the estimated RESIM selection response were 4.54, 4.42, and 4.38 respectively. These results indicate that the estimated RLPSI selection response was greater than the estimated RESIM selection response only for three null restrictions.

The first part of Table 7.3 presents the estimated RESIM expected genetic gain per trait for one, two, and three restrictions for five simulated selection cycles. The estimated RLPSI expected genetic gains per trait for one, two, and three restrictions are given in Chap. 3 (Table 3.3). According to the results shown in Table 3.3 (Chap. 3), the averages of the estimated RLPSI expected genetic gains per trait for five simulated selection cycles were −2.52, 2.25, and 2.26 for one restriction; 2.84 and 2.65 for two restrictions; and 3.90 for three restrictions. According to the results shown in Table 7.3, the averages of the estimated RESIM expected genetic gains per trait for five simulated selection cycles were −0.43, −0.75, and 3.90 for one restriction; −0.59 and 3.89 for two restrictions; and 3.90 for three restrictions. This means that the RESIM and RLPSI were the same only for three restrictions, whereas for one and two restrictions, the average of the estimated RESIM expected genetic gains per trait was higher than that of the estimated RLPSI expected genetic gains per trait only for trait 4.

Table 7.3 Estimated RESIM and PPG-ESIM expected genetic gain per trait for one, two, and three restrictions for five simulated selection cycles

Full size table

Figure 7.2 presents the estimated accuracy of the RLPSI and the RESIM for one, two, and three null restrictions for five simulated selection cycles. In all five selection cycles, the estimated RESIM accuracy was greater than the RLPSI accuracy. This means that the RESIM is a better predictor of the net genetic merit than the RLPSI. Additional results associated with the frequency distribution of the estimated RESIM values are presented in Fig. 7.3. Figure 7.3a presents the frequency distribution of the estimated RESIM values with one null restriction for cycle 2, whereas Fig. 7.3b presents the frequency distribution of the estimated RESIM values with two null restrictions for cycle 5; both figures indicate that the estimated RESIM values approach normal distribution.

Finally, in Chap. 10 we present the results of comparing the ESIM with the LPSI and the RESIM with the RLPSI for many selection cycles. Such results are similar to those obtained in this chapter.

7.3 The Linear Phenotypic Predetermined Proportional Gain Eigen Selection Index Method

In a similar manner to the PPG-LPSI (see Chap. 3), in the PPG-ESIM the breeder pre-sets optimal levels (predetermined proportional gains) on certain traits before the selection is carried out. Let $ {\mathbf{d}}^{\prime }=\left[{d}_1\kern0.5em {d}_2\kern0.5em \cdots \kern0.5em {d}_r\right] $ be the vector of the PPGs (predetermined proportional gains) imposed by the breeder on r traits and assume that μ_q is the population mean of the qth trait before selection. The objective of the PPG-ESIM is to change μ_q to μ_q + d_q, where d_q is a predetermined change in μ_q (in the RESIM, d_q = 0, q = 1, 2, ⋯, r, where r is the number of PPGs). That is, the PPG-ESIM attempts to make some traits change their expected genetic gain values based on a predetermined level, whereas the rest of the traits remain without restrictions.

The simplest way to solve the foregoing problem is by maximizing the PPG-ESIM heritability under the restriction D′U′Cb = 0, where $ {\mathbf{D}}^{\prime }=\left[\begin{array}{ccccc}{d}_r& 0& \cdots & 0& -{d}_1\\ {}0& {d}_r& \cdots & 0& -{d}_2\\ {}\vdots & \vdots & \ddots & \vdots & \vdots \\ {}0& 0& \cdots & {d}_r& -{d}_{r-1}\end{array}\right] $ (see Chap. 3 for details) is a matrix (r − 1) × r, r is the number of PPGs, d_q (q = 1, 2…, r) is the qth element of vector d^′, U′ is the RLPSI matrix of restrictions of 1s and 0s, and C is the covariance matrix of genotypic values. Matrix D′ is a Mallard (1972) matrix of PPGs used to impose predetermined restrictions.

The Mallard (1972) matrix of predetermined restrictions can be written as M^′ = D′Ψ′, where Ψ′ = U′C and U′ is the Kempthorne and Nordskog (1959) matrix of restrictions of 1s and 0s (1 indicates that the trait is restricted, i.e., d_q = 0, and 0 that the trait has no restrictions).

To find the PPG-ESIM vector of coefficients that maximizes the PPG-ESIM selection response and expected genetic gain per trait, we can maximize $ {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} $ with respect to vectors b and w under the restrictions M^′b = 0, b′b = 1, w′Cw = 1, and b′Pb = 1, where w′Cw is the variance of H = w′g and b′Pb is the variance of I = b′y, as did Cerón-Rojas et al. (2016) according to the canonical correlation theory, or we can solve this problem by maximizing $ \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} $ (Eq. 7.12) only with respect to vectors b under the restriction M^′b = 0 and b′b = 1, as we did to obtain the RESIM vector of coefficients. Both approaches give the same result, but we use the latter approach because it is easier to work with.

7.3.1 The PPG-ESIM Parameters

To obtain the PPG-ESIM vector of coefficients, we need to maximize the function

$$ f\left(\mathbf{b},{\mathbf{v}}^{\prime}\right)=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}-{\mathbf{v}}^{\prime }{\mathbf{M}}^{\prime}\mathbf{b} $$

(7.40)

with respect to vectors b and v′, where $ {\mathbf{v}}^{\prime }=\left[{v}_1\kern0.5em {v}_2\kern0.5em \cdots \kern0.5em {v}_{r-1}\right] $ is a vector of Lagrange multipliers. The derivatives of Eq. (7.40) with respect to b and v′ were:

$$ 2{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{1/2}\mathbf{Cb}-{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{-1/2}\left({\mathbf{b}}^{\prime}\mathbf{Cb}\right)\mathbf{Pb}-\mathbf{Mv}=\mathbf{0} $$

(7.41)

and

$$ {\mathbf{M}}^{\prime}\mathbf{b}=\mathbf{0}, $$

(7.42)

respectively, where Eq. (7.42) denotes the restriction imposed for maximizing Eq. (7.40). By using algebraic methods on Eq. (7.41) similar to those used to obtain Eq. (7.10) we get

$$ \left({\mathbf{K}}_P{\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_P^2{\mathbf{I}}_t\right){\mathbf{b}}_P=\mathbf{0}, $$

(7.43)

where K_P = [I_t − Q_P], Q_P = P⁻¹ΨD(D′Ψ′P⁻¹ΨD)⁻¹D′Ψ′, Ψ′ = U′C, I_t is an identity matrix t × t, $ {\lambda}_P^2={h}_{I_P}^2 $, and b_P are the first eigenvalue and the first eigenvector of matrix K_PP⁻¹C respectively. Note that $ {h}_{I_P}^2 $ is PPG-ESIM heritability and λ_P is the maximum correlation between $ {I}_{\mathrm{P}}={\mathbf{b}}_P^{\prime}\mathbf{y} $ and H = w′g. When D′ = U′, b_P = b_R (the vector of coefficients of the RESIM), and when U′ is a null matrix, b_P = b_E (the vector of coefficients of the ESIM). That is, the PPG-ESIM is more general than the RESIM and the ESIM and includes the latter two indices as particular cases. Matrices K_P = [I_t − Q_P] and Q_P = P⁻¹ΨD(D′Ψ′P⁻¹ΨD)⁻¹D′Ψ′ are the same as those obtained in the PPG-LPSI (see Chap. 3). Also, vector b_P can be transformed as β_P = Fb_P; matrix F was defined earlier.

Let S_P = Ψ′P⁻¹Ψ; then, under the assumption D′d = 0, it is possible to show that $ \mathbf{D}{\left({\mathbf{D}}^{\prime }{\mathbf{S}}_P\mathbf{D}\right)}^{-1}{\mathbf{D}}^{\prime }={\mathbf{S}}_P^{-1}-{\mathbf{S}}_P^{-1}\mathbf{d}{\left({\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1}\mathbf{d}\right)}^{-1}{\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1} $ (see Chap. 3), whence by substituting $ {\mathbf{S}}_P^{-1}-{\mathbf{S}}_P^{-1}\mathbf{d}{\left({\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1}\mathbf{d}\right)}^{-1}{\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1} $ for D(D′S_PD)⁻¹D′ in matrix Q_P = P⁻¹ΨD(D′Ψ′P⁻¹ΨD)⁻¹D′Ψ′, matrix K_PP⁻¹C can be written as

$$ {\mathbf{K}}_P{\mathbf{P}}^{-1}\mathbf{C}=\left[{\mathbf{I}}_t-{\mathbf{P}}^{-1}{\boldsymbol{\Psi} \mathbf{S}}^{-1}{\boldsymbol{\Psi}}^{\prime}\right]{\mathbf{P}}^{-1}\mathbf{C}+{\mathbf{A}}_P, $$

(7.44)

where Ψ′ = U′C, A_P = δα^′, δ = P⁻¹Ψ(Ψ′P⁻¹Ψ)⁻¹d, and $ {\boldsymbol{\upalpha}}^{\prime }=\frac{{\mathbf{d}}^{\prime }{\mathbf{S}}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\mathbf{C}}{{\mathbf{d}}^{\prime }{\mathbf{S}}^{-1}\mathbf{d}} $. When A_P is a null matrix, K_PP⁻¹C = KP⁻¹C (matrix of the RESIM), and if U′ is a null matrix, K_PP⁻¹C = P⁻¹C (matrix of the ESIM), this means that Eq. (7.44) is a mathematical equivalent form of matrix K_PP⁻¹C and that Eq. (7.44) does not require matrix D′. The easiest way to obtain b_P and λ_P is to use matrix [I_t − P⁻¹ΨS⁻¹Ψ′]P⁻¹C + A_P in Eq. (7.43) instead of matrix K_PP⁻¹C.

In the PPG-ESIM context, vector w can be obtained as

$$ {\mathbf{w}}_P={\mathbf{C}}^{-1}\left[{\lambda}_P{\mathbf{Pb}}_P+{\mathbf{Mv}}_P\right], $$

(7.45)

whence H = w′g can be written as $ {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} $. In Eq. (7.45), λ_P is the maximum correlation between $ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $ and $ {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} $, b_P is the first eigenvector of matrix K_PP⁻¹C, $ {\mathbf{v}}_P={\lambda}_P^{-1}{\left({\mathbf{M}}^{\prime }{\mathbf{P}}^{-1}\mathbf{M}\right)}^{-1}{\mathbf{M}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_P $, M^′ = D′Ψ′, and Ψ′ = U′C. In a similar manner to the RESIM context, we can assume that λ_P and $ {\lambda}_P^{-1} $ are proportionality constants and it can be shown that the covariance between $ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $ and $ {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} $ ($ {\sigma}_{H_P{I}_P} $) is equal to the variance of $ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $ ($ {\sigma}_{I_P}^2={\mathbf{b}}_p^{\prime }{\mathbf{Pb}}_P $), that is, $ {\sigma}_{H_P{I}_P}={\mathbf{w}}_P^{\prime }{\mathbf{Cb}}_P={\mathbf{b}}_p^{\prime }{\mathbf{Pb}}_P $.

The accuracy of the PPG-ESIM can also be written as

$$ {\rho}_{H_P{I}_P}=\frac{\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P}}{\sqrt{{\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P}}, $$

(7.46)

where $ {\sigma}_{H_P}^2={\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_P+{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_P^{\prime }{\mathbf{Cb}}_P $ is the variance of H_P. When D′ = U′, $ {\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R $ (the variance of H_R), and when U′ is a null matrix, $ {\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E $ (the variance of H_E). Hereafter, to simplify the notation, we write Eq. (7.46) as ρ_P or λ_P.

Let β_P = Fb_P be the PPG-ESIM transformed vector of coefficients by matrix F. By Eqs. (7.1) and (7.46), the maximized selection response (R_P) and expected genetic gain per trait (E_P) of the PPG-ESIM can be written as

$$ {R}_P={k}_I\sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P} $$

(7.47)

and

$$ {\mathbf{E}}_P={k}_I\frac{{\mathbf{C}\boldsymbol{\upbeta}}_P}{\sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P}}, $$

(7.48)

respectively, where $ \sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P}={\sigma}_{I_P} $ is the standard deviation of the variance of $ {I}_P={\boldsymbol{\upbeta}}_P^{\prime}\mathbf{y} $. Equations (7.47) and (7.48) do not require economic weights. When F is an identity matrix, β_P = b_P, $ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $, $ {R}_P={k}_I\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P} $, and $ {\mathbf{E}}_P={k}_I\frac{{\mathbf{Cb}}_P}{\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P}} $.

Equation (7.47) can also be written as $ {R}_P={k}_I{\sigma}_{H_P}{\lambda}_P $, where $ {\sigma}_{H_P}=\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_P+{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_P^{\prime }{\mathbf{Cb}}_P} $ is the standard deviation of the variance of H_P, and λ_P is the canonical correlation between H_P and $ {I}_P={\boldsymbol{\upbeta}}_P^{\prime}\mathbf{y} $. When $ {\sigma}_{H_P}=1 $, Eq. (7.47) can be written as R_P = k_Iλ_P, where λ_P is the covariance between $ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $ and $ H={\mathbf{w}}_P^{\prime}\mathbf{g} $.

The prediction efficiency of the PPG-ESIM can be obtained in a similar manner to the ESIM and RESIM. The accuracy of the PPG-ESIM (Eq. 7.46) can be used to construct the ratio of index accuracies. The PPG-ESIM mean square error or the VPE can be obtained as

$$ E\left[{\left({H}_P-{I}_P\right)}^2\right]={\sigma}_{H_P}^2+{\sigma}_{I_P}^2-2{\sigma}_{H_P{I}_P}={\sigma}_{H_P}^2-{\sigma}_{I_P}^2=\left(1-{\rho}_P^2\right){\sigma}_{H_P}^2. $$

(7.49)

Additional properties associated with the ESIM are also valid for the PPG-ESIM.

7.3.2 Estimating PPG-ESIM Parameters

The procedure used to estimate PPG-ESIM parameters is the same as that described for RESIM. Let $ \widehat{\mathbf{C}} $ and $ \widehat{\mathbf{P}} $ be the estimated matrices of C and P. In the PPG-ESIM context, we use matrix $ \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ to obtain the estimated eigenvalues and eigenvectors of equation

$$ \left(\widehat{\mathbf{S}}-{\widehat{\lambda}}_{Pj}^2{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{Pj}=\mathbf{0}, $$

(7.50)

j = 1, 2, ⋯, t, where t is the number of traits in the PPG-ESIM index, $ {\widehat{\mathbf{K}}}_P=\left[{\mathbf{I}}_t-{\widehat{\mathbf{Q}}}_P\right] $, I_t is an identity matrix of size t × t and $ {\widehat{\mathbf{Q}}}_P={\widehat{\mathbf{P}}}^{-1}\widehat{\boldsymbol{\Psi}}\mathbf{D}{\left({\mathbf{D}}^{\prime}\widehat{{\boldsymbol{\Psi}}^{\prime }}{\widehat{\mathbf{P}}}^{-1}\widehat{\boldsymbol{\Psi}}\mathbf{D}\right)}^{-1}{\mathbf{D}}^{\prime}\widehat{{\boldsymbol{\Psi}}^{\prime }} $. As $ \widehat{\mathbf{S}} $ is an asymmetric matrix, the values of $ {\widehat{\mathbf{b}}}_{Pj} $ and $ {\widehat{\lambda}}_{Pj}^2 $ should be obtained using SVD (singular value decomposition).

According to SVD, we need to solve equation

$$ \left(\widehat{\mathbf{S}}{\widehat{\mathbf{S}}}^{\prime }-{\widehat{\mu}}_{Pj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{P_j}=\mathbf{0}, $$

(7.51)

where $ {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 $ (j= 1, 2, …, t). By Eq. (7.51), the estimated PPG-ESIM index ($ {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} $) is $ {\widehat{I}}_P={\widehat{{\mathbf{b}}^{\prime}}}_{P_1}\mathbf{y} $. The estimator of the maximized PPG-ESIM selection response, and its expected genetic gain per trait, can be denoted as $ {\widehat{R}}_P={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{P_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{P_1}} $ and $ {\widehat{\mathbf{E}}}_P={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{P_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{P_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{P_1}}} $ respectively, whereas the estimator of the maximized accuracy of the PPG-ESIM is $ {\widehat{\lambda}}_{P_1} $.

7.3.3 Numerical Examples

We compare the results of the PPG-LPSI and the PPG-ESIM using the Akbar et al. (1984) data described earlier. We restrict traits RL and SM, on both indices using the PPG vector $ {\mathbf{d}}^{\prime }=\left[3\kern0.5em -1\right] $. In Chap. 3, Sect. 3.1.4, we indicated how to construct matrix U′ and, in Sect. 3.2.4 of the same chapter, we described how to obtain matrix $ {\widehat{\mathbf{K}}}_P $ for one and two restrictions. Matrix $ {\widehat{\mathbf{K}}}_P $ is the same for the PPG-LPSI and the PPG-ESIM. Thus, we omit the steps for constructing matrices U′ and $ {\widehat{\mathbf{K}}}_P $.

Assume a selection intensity of 10% (k_I = 1.755) and that the vector of economic weights is $ {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] $. The estimated PPG-LPSI vector of coefficients for two predetermined restrictions was $ {\widehat{\mathbf{b}}}^{\prime }=\left[1.70\kern0.5em 1.04\kern0.5em 2.93\right] $, and its estimated selection response, expected genetic gain per trait, accuracy, and heritability were $ \widehat{R}=1.755\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=49.02 $, $ {\widehat{\mathbf{E}}}^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}^{\prime}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}=\left[1.25\kern0.5em -0.42\kern0.5em 1.36\right] $, $ \widehat{\rho}=\frac{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.24 $, and $ {\widehat{h}}^2=\frac{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{C}}\widehat{\mathbf{b}}}{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=0.12 $ respectively. In this case, $ \widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{b}}=12.57 $; then, the estimated PPG-LPSI selection response using the normalized PPG-LPSI vector of coefficients was $ \widehat{R}=\frac{49.02}{12.57}=3.90 $, whereas the rest of the estimated PPG-LPSI parameters were the same.

In the PPG-ESIM, we need matrix $ \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} $ to obtain the eigenvalues and eigenvectors of $ \left(\widehat{\mathbf{S}}{\widehat{\mathbf{S}}}^{\prime }-{\widehat{\mu}}_{Pj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{P_j}=\mathbf{0} $ that make up matrices $ {\mathbf{L}}_P^{1/2} $, $ {\mathbf{V}}_{P_1} $, and $ \widehat{\mathbf{S}}={\mathbf{V}}_{P_1}{\mathbf{L}}_P^{1/2}{\mathbf{V}}_{P_2}^{\prime } $, where $ {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 $. It can be shown that $ \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1047& -0.0349& -0.0279\\ {}0.0678& -0.0226& -0.0213\\ {}-0.1970& 0.0657& 0.4119\end{array}\right] $, $ \widehat{\mathbf{S}}\widehat{{\mathbf{S}}^{\prime }}=\left[\begin{array}{ccc}0.0130& 0.0085& -0.0344\\ {}0.0085& 0.0056& -0.0236\\ {}-0.0344& -0.0236& 0.2118\end{array}\right] $, and $ {\mathbf{V}}_{P_1}=\left[\begin{array}{ccc}-0.1663& 0.8292& 0.5336\\ {}-0.1138& 0.5214& -0.8457\\ {}0.9795& 0.2014& -0.0076\end{array}\right] $, whereas the $ {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 $values were 0.2214, 0.0099, and 0.0, whence $ {\mathbf{L}}_P^{1/2}=\left[\begin{array}{ccc}0.4705& 0& 0\\ {}0& 0.0997& 0\\ {}0& 0& 0.0\end{array}\right] $. Thus, $ {\widehat{\mu}}_{P_1}={\widehat{\lambda}}_{P_1}^4=0.2214 $, $ {\widehat{\lambda}}_{P_1}^2=0.4705 $, and the estimated maximized PPG-ESIM accuracy was $ {\widehat{\lambda}}_{P_1}=0.6859 $.

We transformed the first eigenvector $ {\widehat{\mathbf{b}}}_{p_1}^{\prime }=\left[-0.1663\kern0.5em -0.1138\kern0.5em 0.9795\right] $ using matrix $ \mathbf{F}=\left[\begin{array}{ccc}-9& 0& 0\\ {}0& 1& 0\\ {}0& 0& 1\end{array}\right] $ to obtain vector $ {\widehat{\boldsymbol{\upbeta}}}_P={\widehat{\mathbf{b}}}_{p_1}^{\prime}\mathbf{F}=\left[1.4968\kern0.5em -0.1138\kern0.5em 0.9795\right] $ and $ {\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\boldsymbol{\upbeta}}}_P=3.21 $, whence the estimates of the index, the selection response, and expected genetic gain per trait of the PPG-ESIM were $ {\widehat{I}}_P=1.4968\mathrm{RL}-0.1138\mathrm{SM}+0.9795\mathrm{EW} $, $ {\widehat{R}}_P=\frac{1.755\sqrt{{\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}_P}}{{\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\boldsymbol{\upbeta}}}_P}=\frac{43.01}{3.21}=13.39 $, and $ {\widehat{\mathbf{E}}}_P^{\prime }=1.755\frac{{\widehat{\boldsymbol{\upbeta}}}_p^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{\boldsymbol{\upbeta}}}_p^{\prime }{\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}_p}}=\left[3.05\kern0.5em -1.96\kern0.5em 0.19\right] $ respectively. The estimated PPG-LPSI selection response was $ \widehat{R}=\frac{49.02}{12.57}=3.90 $, which means that the estimated PPG-ESIM selection response was greater than the estimated PPG-LPSI response.

We compared PPG-ESIM efficiency versus LPSI efficiency to predict the net genetic merit using the ratio of the estimated PPG-ESIM accuracy ($ {\widehat{\lambda}}_{P_1}=0.6859 $) to PPG-LPSI accuracy ($ \widehat{\rho}=0.24 $), i.e., $ \frac{{\widehat{\lambda}}_{P_1}}{\widehat{\rho}}=\frac{0.6859}{0.24}=2.858 $ or, in percentage terms, $ {\widehat{p}}_P=100\left(2.858-1\right)=185.80 $. Then, the PPG-ESIM was a better predictor of the net genetic merit and its efficiency was 185.80% higher than that of the PPG-LPSI for this data set.

Now, we compare PPG-ESIM efficiency versus PPG-LPSI efficiency using the data set described in Sect. 2.8.1 of Chap. 2 for five phenotypic selection cycles, each with four traits (T₁, T₂, T₃, and T₄), 500 genotypes, and four replicates for each genotype. The economic weights for T₁, T₂, T₃, and T₄ were 1, −1, 1, and 1 respectively. For this data set, matrix F was an identity matrix of size 4 × 4 for all five selection cycles.

The first and second parts of columns 6, 7, and 8 in Table 7.1 present the estimated PPG-LPSI and PPG-ESIM selection responses for one, two, and three predetermined restrictions for five simulated selection cycles. The selection intensity was 10% (k_I = 1.755) and the vectors of PPG for each predetermined restriction were $ {\mathbf{d}}_1^{\prime }=7 $, $ {\mathbf{d}}_2^{\prime }=\left[7\kern0.5em -3\right] $, and $ {\mathbf{d}}_3^{\prime }=\left[7\kern0.5em -3\kern0.5em 5\right] $ respectively, for all five selection cycles. The estimated PPG-LPSI selection response when the vector of coefficients was not normalized was presented in Chap. 3 (Table 3.5). The averages of the estimated PPG-LPSI selection response for each predetermined restriction were 4.70, 4.91, and 3.14, whereas the averages of the estimated PPG-ESIM selection response were 6.31, 6.28, and 6.75 respectively. These results indicate that the estimated PPG-ESIM selection response was greater than the estimated PPG-LPSI selection response for all predetermined restrictions.

The second part of Table 7.2 presents the estimated PPG-ESIM accuracy ($ {\widehat{\rho}}_P $) and the ratio of $ {\widehat{\rho}}_P $ to the estimated PPG-LPSI accuracy ($ \widehat{\rho} $), expressed in percentage terms, $ {\widehat{p}}_P=100\left({\widehat{\lambda}}_P-1\right) $, where $ {\widehat{\lambda}}_P={\widehat{\rho}}_P/\widehat{\rho} $, for one, two, and three predetermined restrictions for five simulated selection cycles. The estimated PPG-LPSI accuracies were presented in Chap. 3 (Table 3.6). The average estimated PPG-ESIM efficiency for each restriction was 9.76%, 11.71%, and 29.03% greater than the PPG-LPSI efficiency for this data set in all five selection cycles.

The second part of Table 7.3 presents the estimated PPG-ESIM expected genetic gain per trait for one, two, and three predetermined restrictions for five simulated selection cycles. The estimated PPG-LPSI expected genetic gains per trait for one, two, and three predetermined restrictions were presented in Chap. 3, Table 3.5, where it can be seen that the averages of the estimated PPG-LPSI expected genetic gains per trait for five simulated selection cycles were 6.85, −3.25, 2.62 and 1.48 for one restriction; 6.93, −2.97, 2.65 and 1.45 for two restrictions; and 5.20, −2.23, 3.72 and 1.43 for three restrictions, whereas for the same set of restrictions, the averages of the estimated PPG-ESIM expected genetic gain per trait were 5.67, −2.67, 1.81, and 2.97 for one restriction; 5.89, −2.52, 2.04, and 2.83 for two restrictions; and 5.71, −2.45, 4.08, and 0.82 for three restrictions (Table 7.3). Because the vectors of predetermined proportional gains for each predetermined restriction were $ {\mathbf{d}}_1^{\prime }=7 $, $ {\mathbf{d}}_2^{\prime }=\left[7\kern0.5em -3\right] $, and $ {\mathbf{d}}_3^{\prime }=\left[7\kern0.5em -3\kern0.5em 5\right] $, the averages of the estimated PPG-LPSI expected genetic gains per trait were closer than those of the estimated PPG-ESIM expected genetic gains per trait for one and two predetermined restrictions, whereas for three restrictions, the results of both selection indices were similar.

References

Akbar MK, Lin CY, Gyles NR, Gavora JS, Brown CJ (1984) Some aspects of selection indices with constraints. Poult Sci 63:1899–1905
Article Google Scholar
Anderson TW (1999) Asymptotic theory for canonical correlation analysis. J Multivar Anal 70:1–29
Article CAS Google Scholar
Anderson TW (2003) An introduction to multivariate statistical analysis, 3rd edn. Wiley, New Jersey
Google Scholar
Bilodeau M, Brenner D (1999) Theory of multivariate statistics. Springer, New York
Google Scholar
Cerón-Rojas JJ, Crossa J, Sahagún-Castellanos J, Castillo-González F, Santacruz-Varela A (2006) A selection index method based on eigen analysis. Crop Sci 46:1711–1721
Article Google Scholar
Cerón-Rojas JJ, Sahagún-Castellanos J, Castillo-González F, Santacruz-Varela A, Crossa J (2008) A restricted selection index method based on eigenanalysis. J Agric Biol Environ Stat 13(4):421–438
Article Google Scholar
Cerón-Rojas JJ, Crossa J, Toledo FH, Sahagún-Castellanos J (2016) A predetermined proportional gains eigen selection index method. Crop Sci 56:2436–2447
Article Google Scholar
Gentle JE (2007) Matrix algebra theory, computations, and applications in statistics. Springer, New York
Google Scholar
Harville DA (1997) Matrix algebra from a statistician’s perspective. Springer, New York
Book Google Scholar
Hotelling H (1935) The most predictable criterion. J Educ Psychol 26:139–142
Article Google Scholar
Hotelling H (1936) Relations between two sets of variables. Biometrika 28:321–377
Article Google Scholar
Kempthorne O, Nordskog AW (1959) Restricted selection indices. Biometrics 15:10–19
Article Google Scholar
Mallard J (1972) The theory and computation of selection indices with constraints: a critical synthesis. Biometrics 28:713–735
Article Google Scholar
Mardia KV, Kent JT, Bibby JM (1982) Multivariate analysis. Academic, New York
Google Scholar
Muirhead RJ (2005) Aspects of multivariate statistical theory. Wiley, Hoboken
Google Scholar
Okamoto M (1973) Distinctness of the eigenvalues of a quadratic form in a multivariate sample. Ann Stat 1(4):763–765
Article Google Scholar
Rao CR (2002) Linear statistical inference and its applications, 2nd edn. Wiley, New York
Google Scholar
Watkins DS (2002) Fundamentals of matrix computations, 2nd edn. Wiley, New York
Book Google Scholar
Wilms I, Croux C (2016) Robust sparse canonical correlation analysis. BMC Syst Biol 10:72
Article Google Scholar

Download references

Author information

Authors and Affiliations

Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Mexico, Mexico
J. Jesus Céron-Rojas & José Crossa

Authors

J. Jesus Céron-Rojas
View author publications
You can also search for this author in PubMed Google Scholar
José Crossa
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Céron-Rojas, J.J., Crossa, J. (2018). Linear Phenotypic Eigen Selection Index Methods. In: Linear Selection Indices in Modern Plant Breeding. Springer, Cham. https://doi.org/10.1007/978-3-319-91223-3_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-91223-3_7
Published: 27 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91222-6
Online ISBN: 978-3-319-91223-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

Linear Phenotypic Eigen Selection Index Methods

Abstract

Similar content being viewed by others

A singular value decomposition Bayesian multiple-trait and multiple-environment genomic model

The use of a genetic relationship matrix biases the best linear unbiased prediction

Biometrical Approaches for Analysis of Phenotypic Data of Complex Traits

7.1 The Linear Phenotypic Eigen Selection Index Method

7.1.1 The ESIM Parameters

7.1.2 Statistical ESIM Properties

7.1.3 The ESIM and the Canonical Correlation Theory

7.1.4 Estimated ESIM Parameters and Their Sampling Properties

7.1.5 Numerical Examples

7.2 The Linear Phenotypic Restricted Eigen Selection Index Method

7.2.1 The RESIM Parameters

7.2.2 Estimating the RESIM Parameters

7.2.3 Numerical Examples

7.3 The Linear Phenotypic Predetermined Proportional Gain Eigen Selection Index Method

7.3.1 The PPG-ESIM Parameters

7.3.2 Estimating PPG-ESIM Parameters

7.3.3 Numerical Examples

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Linear Phenotypic Eigen Selection Index Methods

Abstract

Similar content being viewed by others

A singular value decomposition Bayesian multiple-trait and multiple-environment genomic model

The use of a genetic relationship matrix biases the best linear unbiased prediction

Biometrical Approaches for Analysis of Phenotypic Data of Complex Traits

7.1 The Linear Phenotypic Eigen Selection Index Method

7.1.1 The ESIM Parameters

7.1.2 Statistical ESIM Properties

7.1.3 The ESIM and the Canonical Correlation Theory

7.1.4 Estimated ESIM Parameters and Their Sampling Properties

7.1.5 Numerical Examples

7.2 The Linear Phenotypic Restricted Eigen Selection Index Method

7.2.1 The RESIM Parameters

7.2.2 Estimating the RESIM Parameters

7.2.3 Numerical Examples

7.3 The Linear Phenotypic Predetermined Proportional Gain Eigen Selection Index Method

7.3.1 The PPG-ESIM Parameters

7.3.2 Estimating PPG-ESIM Parameters

7.3.3 Numerical Examples

References

Author information

Authors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation