7.1 The Linear Phenotypic Eigen Selection Index Method

The conditions described in Chap. 2 for the linear phenotypic selection index (LPSI) are necessary and sufficient for constructing the linear phenotypic eigen selection index method (ESIM). The ESIM index can be written as I = by, where b′ = [b1b2 ⋯ bt] is the unknown index vector of coefficients, t is the number of traits, and \( {\mathbf{y}}^{\prime }=\left[{y}_1\kern0.5em {y}_2\kern0.5em \cdots \kern0.5em {y}_t\right] \) is a known vector of trait phenotypic values. The objectives of ESIM are:

  1. 1.

    To predict the net genetic merit H = wg, where \( {\mathbf{g}}^{\prime }=\left[{g}_1\kern0.5em {g}_2\kern0.5em \dots \kern0.5em {g}_t\right] \) is the unknown vector of true breeding values for an individual and \( {\mathbf{w}}^{\prime }=\left[{w}_1\kern0.5em {w}_2\kern0.5em \dots \kern0.5em {w}_t\right] \) is a vector of unknown economic weights.

  2. 2.

    To maximize the ESIM selection response and the accuracy.

  3. 3.

    To select individuals with the highest H values in each selection cycle as parents of the next generation.

  4. 4.

    To provide the breeder with an objective rule for evaluating and selecting several traits simultaneously.

Although in the context of the LPSI w is a known and fixed vector of economic weights, in the ESIM w is fixed, but unknown and its values must be estimated in each selection cycle. This latter assumption is the fundamental difference between the ESIM and the LPSI and implies that the ESIM is more general than the LPSI. Thus, when w is known, the LPSI and ESIM give the same results.

7.1.1 The ESIM Parameters

The theoretical ESIM selection response can be written as

$$ {R}_I={k}_I{\sigma}_H{\rho}_{HI}, $$
(7.1)

where kI is the standardized selection differential (or selection intensity), \( {\sigma}_H=\sqrt{{\mathbf{w}}^{\prime}\mathbf{Cw}} \) is the standard deviation of H, \( {\rho}_{HI}=\frac{{\mathbf{w}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{w}}^{\prime}\mathbf{Cw}}\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} \) is the correlation, and wCb = σHI the covariance between H and I respectively, \( {\sigma}_I=\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}} \) is the standard deviation of I, C is the covariance matrix of the true breeding values (g), and P is the covariance matrix of the trait phenotypic values (y).

In the ESIM, it is assumed that kI and σH are fixed, and that C and P are known; thus, to maximize Eq. (7.1), it is necessary to maximize \( {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} \) with respect to vectors b and w under the restrictions \( {\sigma}_H^2={\mathbf{w}}^{\prime}\mathbf{Cw} \), \( {\sigma}_I^2={\mathbf{b}}^{\prime}\mathbf{Pb} \), and 0 < \( {\sigma}_H^2 \),\( {\sigma}_I^2 \)<∞, where \( {\sigma}_H^2={\mathbf{w}}^{\prime}\mathbf{Cw} \) is the variance of H = wg and \( {\sigma}_I^2={\mathbf{b}}^{\prime}\mathbf{Pb} \) is the variance of I = by. That is, it is necessary to maximize the function

$$ f\left(\mathbf{b},\mathbf{w},\mu, \phi \right)={\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2-\mu \left({\mathbf{b}}^{\prime}\mathbf{Pb}-{\sigma}_I^2\right)-\phi \left({\mathbf{w}}^{\prime}\mathbf{Cw}-{\sigma}_H^2\right) $$
(7.2)

with respect to b, w, μ, and ϕ, where μ and ϕ are Lagrange multipliers. The derivative results of Eq. (7.2) with respect to b, w, μ, and ϕ are:

$$ \left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)\mathbf{Cw}-\mu \mathbf{Pb}=\mathbf{0}, $$
(7.3)
$$ \left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)\mathbf{Cb}-\phi \mathbf{Cw}=\mathbf{0}, $$
(7.4)
$$ {\mathbf{b}}^{\prime}\mathbf{Pb}={\sigma}_I^2\ \mathrm{and}\ {\mathbf{w}}^{\prime}\mathbf{Cw}={\sigma}_H^2, $$
(7.5)

respectively, where Eq. (7.5) denotes the restrictions imposed for maximizing \( {\rho}_{HI}^2 \). It can be shown that \( {\mathbf{w}}^{\prime}\mathbf{Cb}=\sqrt{{\mu \sigma}_I^2}=\sqrt{{\phi \sigma}_H^2}={\theta}^{1/2} \); then, Eqs. (7.3) and (7.4) can be written as

$$ {\theta}^{1/2}\mathbf{Cw}-\frac{\theta }{\sigma_I^2}\mathbf{Pb}=\mathbf{0} $$
(7.6)

and

$$ {\theta}^{1/2}\mathbf{Cb}-\frac{\theta }{\sigma_H^2}\mathbf{Cw}=\mathbf{0}, $$
(7.7)

respectively. Equation (7.6) is equal to \( \mathbf{Cw}=\frac{\theta^{1/2}}{\sigma_I^2}\mathbf{Pb} \); then, vector w can be written as

$$ {\mathbf{w}}_E=\frac{\theta^{1/2}}{\sigma_I^2}{\mathbf{C}}^{-1}\mathbf{Pb}. $$
(7.8)

By the result of Eq. (7.8), the net genetic merit in the ESIM context is \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \) and the correlation between HE and I is \( {\rho}_{H_EI}=\frac{{\mathbf{w}}_E^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E}\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}=\frac{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}{\sqrt{{\mathbf{b}}^{\prime }{\mathbf{PC}}^{-1}\mathbf{Pb}}} \). Now, it is necessary to find the vector b that maximizes \( {\rho}_{H_EI} \), which should be the ESIM index vector of coefficients. Substituting w with wE in Eq. (7.7), we get

$$ \mathbf{Cb}-\frac{{\left({\mathbf{w}}_E^{\prime}\mathbf{Cb}\right)}^2}{\sigma_I^2{\sigma}_{H_E}^2}\mathbf{Pb}=\mathbf{0}, $$
(7.9)

where \( \frac{{\left({\mathbf{w}}_E^{\prime}\mathbf{Cb}\right)}^2}{\sigma_I^2{\sigma}_{H_E}^2}={\rho}_{H_EI}^2 \) is the square of the correlation between ESIM and \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \). Let \( {\rho}_{H_EI}^2={\lambda}_E^2 \), then Eq. (7.9) can be written as

$$ \left({\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_E^2\mathbf{I}\right){\mathbf{b}}_E=\mathbf{0}, $$
(7.10)

and the optimized ESIM index is \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \). Note that in Eq. (7.10) P−1C is the multi-trait heritability. By Eqs. (7.8) and (7.10), the maximized correlation between \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \) and \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \) (or ESIM accuracy) can be written as

$$ {\rho}_{H_E{I}_E}=\frac{\sigma_{I_E}}{\sigma_{H_E}}, $$
(7.11)

where \( {\sigma}_{I_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} \) is the standard deviation of the variance of \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \), and \( {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} \) is the standard deviation of the variance of \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \). Hereafter, we write Eq. (7.11) as \( {\rho}_E={\rho}_{H_E{I}_E} \) or \( {\lambda}_E={\rho}_{H_E{I}_E} \)to simplify the notation.

An additional restriction on Eq. (7.10) is bb = 1, because \( {\rho}_{H_E{I}_E} \) is invariant to the scale change and because if bE is an eigenvector of the multi-trait heritability matrix P−1C, vector αbE is also an eigenvector of P−1C for all real values of α (Mardia et al. 1982). This means that in the ESIM the magnitude of an eigenvector is unimportant; only the direction matters (Watkins 2002). Equation (7.10) can also be written as \( {\mathbf{Cb}}_E={\lambda}_E^2{\mathbf{Pb}}_E \), which is called the generalized eigenvalue problem (Watkins 2002). In the latter case, bE is called a generalized eigenvector and \( {\lambda}_E^2 \) a generalized eigenvalue. The generalized eigenvalues may not exist; that is, they may be infinite. However, if P is positive definite and has the same size as C, all eigenvalues of P−1C exist and are finite (Gentle 2007). Matrix P is symmetric and positive definite and its eigenvalues are different with a probability of 1 if the number of genotypes is higher than the number of traits (Okamoto 1973).

If the heritability of the ESIM is \( {h}_I^2=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{{\mathbf{b}}^{\prime}\mathbf{Pb}} \), then another way of writing Eq. (7.1) is

$$ {R}_I={k}_I{\sigma}_I{h}_I^2={k}_I\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}, $$
(7.12)

which is similar to the univariate breeder’s equation (see Chap. 2, Eq. 2.4). All the parameters of Eq. (7.12) were defined earlier.

The derivative of the ratio \( \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} \) (Eq. 7.12) with respect to b can be written as 2(bPb)1/2Cb − (bPb)−1/2(bCb)Pb = 0, and, except by a proportionality constant, the result is

$$ \left({\mathbf{P}}^{-1}\mathbf{C}-{h}_{I_E}^2\mathbf{I}\right){\mathbf{b}}_E=\mathbf{0}, $$
(7.13)

where \( {h}_{I_E}^2==\frac{{\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E}{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} \) is the maximized ESIM heritability. Let \( {\lambda}_E^2={\rho}_E^2={h}_{I_E}^2 \), then Eq. (7.13) is equal to Eq. (7.10) and can be written as \( {\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E={\lambda}_E^2{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E \), whence the maximized \( {\rho}_E^2 \) in terms of \( {h}_{I_E}^2 \) is

$$ {\rho}_E^2=\frac{{\mathbf{b}}_E^{\prime }{\mathbf{Cb}}_E}{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E}, $$
(7.14)

which should give a equivalent result to that of Eq. (7.11).

By Eq. (7.11) and \( {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} \), the maximized ESIM selection response and expected genetic gain per trait can be written as

$$ {R}_E={k}_I\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E} $$
(7.15)

and

$$ {\mathbf{E}}_E={k}_I\frac{{\mathbf{Cb}}_E}{\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E}}, $$
(7.16)

respectively. Equations (7.15) and (7.16) do not require the economic weights to be known. In the original derivation of the ESIM, Cerón-Rojas et al. (2008) imposed the restrictions \( {\sigma}_{H_E}^2=1 \) and \( {\sigma}_{I_E}^2=1 \). Under these restrictions, \( {\lambda}_E={\mathbf{w}}_E^{\prime }{\mathbf{Cb}}_E \) and Eq. (7.15) can be written as RE = kIλE. When \( {\sigma}_{H_E}^2\ne 1 \) Eq. (7.15) is equal to \( {R}_E={k}_I{\sigma}_{H_E}{\lambda}_E \), where \( {\sigma}_{H_E}=\sqrt{{\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E} \) and \( {\lambda}_E^2={\rho}_E^2={h}_{I_E}^2 \).

Let T = P−1C and \( {\lambda}_E^2={h}_{I_E}^2 \); then, Eq. (7.13) can be written as \( {\mathbf{TIb}}_E={\lambda}_E^2{\mathbf{Ib}}_E \), where I = F−1F is an identity matrix of size t × t (t= number of traits), and \( \mathbf{F}=\mathit{\operatorname{diag}}\left\{{f}_1\kern0.5em {f}_1\kern0.5em \cdots \kern0.5em {f}_t\right\} \) is a diagonal matrix with values equal to any real number, except zero values. Thus, another way of writing Eqs. (7.10) and (7.13) is

$$ \left({\mathbf{T}}_2-{\lambda}_E^2\mathbf{I}\right)\boldsymbol{\upbeta} =\mathbf{0}, $$
(7.17)

where T2 = FTF−1 and β = FbE; T and T2 = FTF−1 are similar matrices and both have the same eigenvalues but different eigenvectors (Harville 1997). When the F values are only 1s, vector bE is not affected; when the F values are only −1s, vector bE changes its direction, and if the F values are different from 1 and −1, matrix F changes the proportional values of bE. In practice, bE is first obtained from Eq. (7.13) and then multiplied by matrix F to obtain β = FbE, that is, β is a linear transformation of bE. Matrix T2 = FTF−1 is called the similarity transformation, and matrix F is called the transforming matrix (Watkins 2002). Cerón-Rojas et al. (2006) introduced an alternative procedure for modifying the bE signs that is a particular case of Eq. (7.17). Vector β = FbE can substitute bE in Eqs. (7.15) and (7.16); and in this case, the optimized ESIM index should be written as IE = βy.

7.1.2 Statistical ESIM Properties

The ratio of the index accuracies and the variance of the predicted error (VPE) are good criteria for comparing the index efficiencies for predicting the net genetic merit (see Chap. 2 for details). In Eq. (7.11), we obtained the accuracy of the ESIM; now, we derive the VPE of the ESIM.

The variance of \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \) (\( {\sigma}_{I_E}^2 \)) and the covariance between \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \) and \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \)(\( {\sigma}_{H_E{I}_E} \)) are the same, that is,

$$ {\sigma}_{I_E}^2={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E\ \mathrm{and}\ {\sigma}_{H_E{I}_E}={\mathbf{w}}_E^{\prime }{\mathbf{Cb}}_E={\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Cb}}_E={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E, $$
(7.18)

respectively; that is, \( {\sigma}_{I_E}^2={\sigma}_{H_E{I}_E} \). By Eq. (7.18), the VPE of the ESIM can be written as

$$ E\left[{\left({H}_E-{I}_E\right)}^2\right]={\sigma}_{H_E}^2+{\sigma}_{I_E}^2-2{\sigma}_{H_E{I}_E}={\sigma}_{H_E}^2-{\sigma}_{I_E}^2=\left(1-{\rho}_E^2\right){\sigma}_{H_E}^2. $$
(7.19)

The relative effectiveness of \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \) in predicting \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \) is the ratio of \( \left(1-{\rho}_E^2\right){\sigma}_{H_E}^2 \) over \( {\sigma}_{H_E}^2 \), i.e., \( 1-{\rho}_E^2 \); thus, the greater \( {\rho}_E^2 \) is, the more effective \( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \) is at predicting \( {H}_E={\mathbf{w}}_E^{\prime}\mathbf{g} \). The mean squared effect of IE on HE, or the total variance of HE explained by IE is

$$ {\sigma}_{I_E}^2={\rho}_E^2{\sigma}_{H_E}^2, $$
(7.20)

and the relative mean squared effect can be measured by \( {\rho}_E^2 \) (Anderson 2003). If in Eq. (7.20) \( {\rho}_E^2=1 \), \( {\sigma}_{I_E}^2={\sigma}_{H_E}^2 \), and if \( {\rho}_E^2=0 \), \( {\sigma}_{I_E}^2=0 \). That is, the variance of HE explained by IE is proportional to \( {\rho}_E^2 \), and when \( {\rho}_E^2 \) is close to 1, \( {\sigma}_{I_E}^2 \) is close to \( {\sigma}_{H_E}^2 \), and if \( {\rho}_E^2 \) is close to 0, \( {\sigma}_{I_E}^2 \) is close to 0. All these results are valid for any index associated with the ESIM, such as the restricted ESIM (RESIM) and the predetermined proportional gains ESIM (PPG-ESIM), which are described in the following sections of this chapter.

7.1.3 The ESIM and the Canonical Correlation Theory

Canonical correlation theory describes the associations between two sets of variables (Hotelling 1935, 1936) and searches for linear combinations, called canonical variables, of each of two sets of variables having maximal correlation. The vector of coefficient of these linear combinations is called the canonical vector and the correlations between the canonical variables is called the canonical correlation (Wilms and Croux 2016).

To see how the ESIM and the canonical correlation theory are related, note that vectors y and g (Eq. 7.1) can be ordered in a new vector x as \( {\mathbf{x}}^{\prime }=\left[{\mathbf{y}}^{\prime}\kern0.5em {\mathbf{g}}^{\prime}\right] \), whence the covariance matrix of x is \( \left[\begin{array}{cc}\mathbf{P}& \mathbf{C}\\ {}\mathbf{C}& \mathbf{C}\end{array}\right] \). One measure of the association between the jth linear combination of y(\( {I}_E={\mathbf{b}}_{E_j}^{\prime}\mathbf{y} \)) and the jth linear combination of g(\( {H}_E={\mathbf{w}}_{E_j}^{\prime}\mathbf{g} \)) is the jth canonical correlation (λj) value obtained from equation \( \left({\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_j^2\mathbf{I}\right){\mathbf{b}}_{Ej}=\mathbf{0} \), where bEj is the jth canonical vector (j = 1, 2⋯, t) of matrix P−1C, and \( {\mathbf{w}}_{E_j}={\mathbf{C}}^{-1}{\mathbf{Pb}}_{E_j} \). Thus, in the canonical correlation context, \( {I}_E={\mathbf{b}}_{E_j}^{\prime}\mathbf{y} \) and \( {H}_E={\mathbf{w}}_{E_j}^{\prime}\mathbf{g} \) are canonical variables.

In the ESIM, the first eigenvector (\( {\mathbf{b}}_{E_1} \)) of matrix P−1C should be used on \( {I}_E={\mathbf{b}}_{E_1}^{\prime}\mathbf{y} \); the first eigenvalue (\( {\lambda}_1^2 \)) and \( {\mathbf{b}}_{E_1} \)of P−1C should be used on the ESIM selection response and on the ESIM expected genetic gain per trait, because, in this case, the ESIM has maximum accuracy compared with other indices, such as the LPSI. The latter results in this subsection imply that the sampling statistical properties associated with the canonical correlation theory are also valid for the ESIM.

7.1.4 Estimated ESIM Parameters and Their Sampling Properties

The estimated covariance matrix of the true breeding values (C) and that of the trait phenotypic values (P) are denoted as \( \widehat{\mathbf{C}} \) and \( \widehat{\mathbf{P}} \) respectively; they can be obtained by restricted maximum likelihood using Eqs. (2.22) to (2.24) described in Chap. 2. With matrices \( \widehat{\mathbf{C}} \) and \( \widehat{\mathbf{P}} \), we constructed matrix \( \widehat{\mathbf{T}}={\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) and equation

$$ \left(\widehat{\mathbf{T}}-{\widehat{\lambda}}_{Ej}^2\mathbf{I}\right){\widehat{\mathbf{b}}}_{Ej}=\mathbf{0}, $$
(7.21)

j = 1, 2, ⋯, t, where t is the number of traits in the ESIM index. Note that \( {\widehat{\lambda}}_{Ej}^2 \) is positive only if \( \widehat{\mathbf{P}} \) is positive definite (all eigenvalues positive) and \( \widehat{\mathbf{C}} \) is positive semidefinite (no negative eigenvalues); in addition, as \( {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) is an asymmetric matrix, the values of \( {\widehat{\mathbf{b}}}_{Ej} \) and \( {\widehat{\lambda}}_{Ej}^2 \) should be obtained using the singular value decomposition (SVD) theory (Anderson 2003).

Matrix \( \widehat{\mathbf{T}} \) is square and asymmetric of order t × t and rank q≤ minimum (p, c), where p and c denote the rank of \( {\widehat{\mathbf{P}}}^{-1} \) and \( \widehat{\mathbf{C}} \) respectively; the rank of \( \widehat{\mathbf{T}} \) is equal to c only if \( \widehat{\mathbf{C}} \) is square and nonsingular. Thus, matrix \( \widehat{\mathbf{T}} \) has a maximum of q eigenvalues different from zero (Rao 2002). In addition, \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) and \( {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} \) are symmetric matrices, but \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) ≠ \( {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} \). Using the SVD theory, matrix \( \widehat{\mathbf{T}} \) can be written as

$$ \widehat{\mathbf{T}}={\mathbf{V}}_1{\mathbf{L}}^{1/2}{\mathbf{V}}_2^{\prime }, $$
(7.22)

where V1 (\( {\mathbf{V}}_1^{\prime }{\mathbf{V}}_1={\mathbf{V}}_1{\mathbf{V}}_1^{\prime }={\mathbf{I}}_q \)) and V2 (\( {\mathbf{V}}_2^{\prime }{\mathbf{V}}_2={\mathbf{V}}_2{\mathbf{V}}_2^{\prime }={\mathbf{I}}_q \)) are matrices with the eigenvectors of matrices \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) and \( {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} \) respectively; L1/2 is a diagonal matrix with the square root of the eigenvalues (\( {\widehat{\lambda}}_{E_1}^2\ge {\widehat{\lambda}}_{E_2}^2\ge \cdots \ge {\widehat{\lambda}}_{E_q}^2>0 \)) of either \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) or \( {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} \) (the eigenvalues of \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) and \( {\widehat{\mathbf{T}}}^{\prime}\widehat{\mathbf{T}} \) are the same). The entries \( {\widehat{\lambda}}_{E_1}^2\ge {\widehat{\lambda}}_{E_2}^2\ge \cdots \ge {\widehat{\lambda}}_{E_q}^2>0 \) of L1/2 are uniquely determined, and they are called the singular values of \( \widehat{\mathbf{T}} \). The columns of V1 are orthonormal vectors called left singular vectors of \( \widehat{\mathbf{T}} \), and the columns of V2 are called right singular vectors (Watkins 2002).

Estimators \( {\widehat{\mathbf{b}}}_{E_1} \) and \( {\widehat{\lambda}}_{E_1}^2 \) of the first eigenvector \( {\mathbf{b}}_{E_1} \) and the first eigenvalue \( {\lambda}_{E_1}^2 \) respectively are the first column of matrix V1 and the first diagonal element of matrix L1/2. Thus, because \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime } \) is a symmetric matrix, the maximum likelihood estimators \( {\widehat{\lambda}}_{E_1}^2 \) and \( {\widehat{\mathbf{b}}}_{E_1} \) in the ESIM context can be obtained from

$$ \left(\widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }-{\widehat{\mu}}_j\mathbf{I}\right){\widehat{\mathbf{b}}}_{E_j}=\mathbf{0}, $$
(7.23)

where \( {\widehat{\mu}}_j={\widehat{\lambda}}_{E_j}^4 \), j= 1, 2, …, t. In the asymptotic context, \( {\widehat{\lambda}}_{E_1}^2 \) and \( {\widehat{\mathbf{b}}}_{E_1} \) are consistent and unbiased estimators (Anderson 2003).

The latter results allow the ESIM index (\( {I}_E={\mathbf{b}}_E^{\prime}\mathbf{y} \)) as \( {\widehat{I}}_E={\widehat{{\mathbf{b}}^{\prime}}}_{E_1}\mathbf{y} \) to be estimated. The estimator of the maximized ESIM selection response and expected genetic gain per trait are \( {\widehat{R}}_E={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}} \) and \( {\widehat{\mathbf{E}}}_E={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{E_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}} \) respectively, whereas the estimator of the maximized ESIM accuracy is \( {\widehat{\lambda}}_{E_1} \), which should be similar to the estimator of the square root of the maximized ESIM heritability.

In the asymptotic context, the estimator of bEj (\( {\widehat{\mathbf{b}}}_{Ej} \)) has multivariate normal distribution with expectation \( E\left({\widehat{\mathbf{b}}}_{Ej}\right)={\mathbf{b}}_{Ej} \) and variance

$$ Var\left({\widehat{\mathbf{b}}}_{Ej}\right)=\frac{1}{2n}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime }+\frac{1}{n}\left(1-{\lambda}_{Ej}^2\right)\sum \limits_{i\ne j}^t\frac{\lambda_{Ej}^2+{\lambda}_{Ei}^2-2{\lambda}_{Ei}^2{\lambda}_{Ej}^2}{{\left({\lambda}_{Ei}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ei}{\mathbf{b}}_{Ei}^{\prime }, $$
(7.24)

and, for i ≠ j, the covariance between \( {\widehat{\mathbf{b}}}_{Ei} \) and \( {\widehat{\mathbf{b}}}_{Ej} \) can be written as

$$ Cov\left({\widehat{\mathbf{b}}}_{Ei},{\widehat{\mathbf{b}}}_{Ej}\right)=\frac{\left(1-{\lambda}_{Ej}^2\right)\left(1-{\lambda}_{Ei}^2\right)\left({\lambda}_{Ei}^2+{\lambda}_{Ej}^2\right)}{n{\left({\lambda}_{Ei}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ei}^{\prime }, $$
(7.25)

where n is the number of individuals or genotypes (Anderson 1999). The variance of \( {\widehat{\mathbf{b}}}_{Ej} \) and the covariance between \( {\widehat{\mathbf{b}}}_{Ei} \) and \( {\widehat{\mathbf{b}}}_{Ej} \) depend not only on n, but also on eigenvalues \( {\lambda}_{Ei}^2 \) and \( {\lambda}_{Ej}^2 \). Suppose that \( {\lambda}_{Ej}^2>{\lambda}_{Ei}^2 \); then, when \( {\lambda}_{Ej}^2 \) is very close to 1, \( Var\left({\widehat{\mathbf{b}}}_{Ej}\right)\approx \frac{1}{2n}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime } \) (“≈”denotes an approximation) and \( Cov\left({\widehat{\mathbf{b}}}_{Ei},{\widehat{\mathbf{b}}}_{Ej}\right) \) is very close to 0. By the result of Eq. (7.24), the variance of the first eigenvector (\( {\widehat{\mathbf{b}}}_{E1} \)) of \( {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) can be written as \( Var\left({\widehat{\mathbf{b}}}_{E1}\right)=\frac{1}{2n}{\mathbf{b}}_{E1}{\mathbf{b}}_{E1}^{\prime }+\frac{1}{n}\left(1-{\lambda}_{E1}^2\right)\sum \limits_{j=2}^t\frac{\lambda_{E1}^2+{\lambda}_{Ej}^2-2{\lambda}_{E1}^2{\lambda}_{Ej}^2}{{\left({\lambda}_{E1}^2-{\lambda}_{Ej}^2\right)}^2}{\mathbf{b}}_{Ej}{\mathbf{b}}_{Ej}^{\prime } \). If the first eigenvalue \( {\lambda}_{E1}^2 \) of P−1C is very close to 1 (\( {\lambda}_{E1}^2\approx 1 \)), \( Var\left({\widehat{\mathbf{b}}}_{E1}\right)=\frac{1}{2n}{\mathbf{b}}_{E1}{\mathbf{b}}_{E1}^{\prime } \) and \( Cov\left({\widehat{\mathbf{b}}}_{E1},{\widehat{\mathbf{b}}}_{Ej}\right)\approx 0 \).

In the asymptotic context, the jth estimator (\( {\widehat{\lambda}}_{Ej} \)) of the canonical correlations has normal distribution with expectation \( E\left({\widehat{\lambda}}_{Ej}\right)\approx {\lambda}_{Ej} \) and variance

$$ Var\left({\widehat{\lambda}}_{Ej}\right)\approx \frac{{\left(1-{\lambda}_{Ej}^2\right)}^2}{n}, $$
(7.26)

whereas the jth estimator of the square of the canonical correlations \( {\widehat{\lambda}}_{Ej}^2 \) has normal distribution with expectation \( E\left({\widehat{\lambda}}_{Ej}^2\right)\approx {\lambda}_{Ej}^2 \) and variance

$$ Var\left({\widehat{\lambda}}_j^2\right)\approx \frac{4{\lambda}_{Ej}^2{\left(1-{\lambda}_{Ej}^2\right)}^2}{n}. $$
(7.27)

In addition, for i ≠ j, the correlation between \( {\widehat{\lambda}}_{Ej}^2 \) and \( {\widehat{\lambda}}_{Ei}^2 \) is zero, i.e., \( Corr\left({\widehat{\lambda}}_{Ei}^2,{\widehat{\lambda}}_{Ej}^2\right)=0 \) (Bilodeau and Brenner 1999; Muirhead 2005).

Equation (7.26) implies that under the restrictions \( {\sigma}_H^2=1 \) and \( {\sigma}_I^2=1 \), the expectation and variance of \( {\widehat{R}}_E={k}_I{\widehat{\lambda}}_{E1} \) are \( E\left({\widehat{R}}_E\right)\approx {k}_I{\lambda}_{E1} \) and \( Var\left({\widehat{R}}_E\right)\approx \frac{k_I^2{\left(1-{\lambda}_{E1}^2\right)}^2}{n} \) respectively. However, obtaining the expectation and variance of \( {\widehat{R}}_E={k}_I{\widehat{\sigma}}_H{\widehat{\lambda}}_{E1} \) or \( {\widehat{R}}_E={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E1}} \) is more difficult, because in both equations there are two estimators: \( {\widehat{\sigma}}_H \) and \( {\widehat{\lambda}}_1 \) in the first one, and \( \widehat{\mathbf{P}} \) and \( {\widehat{\mathbf{b}}}_{E1} \) in the second one.

7.1.5 Numerical Examples

We compare ESIM efficiency versus LPSI efficiency using a real data set from commercial egg poultry lines obtained from Akbar et al. (1984). The estimated phenotypic (\( \widehat{\mathbf{P}} \)) and genetic (\( \widehat{\mathbf{C}} \)) covariance matrices among the rate of lay (RL, number of eggs), age at sexual maturity (SM, days) and egg weight (EW, kg), were \( \widehat{\mathbf{P}}=\left[\begin{array}{ccc}240.57& -95.62& 2.07\\ {}-95.62& 167.20& 4.58\\ {}2.07& 4.58& 22.80\end{array}\right] \) and \( \widehat{\mathbf{C}}=\left[\begin{array}{ccc}29.86& -17.90& -4.13\\ {}-17.90& 18.56& 1.49\\ {}-4.13& 1.49& 9.24\end{array}\right] \) respectively. The number of genotypes and the vector of economic weights were n= 3330 and \( {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] \) respectively, whereas the selection intensity was 10% (kI = 1.755) for both indices.

The estimated LPSI vector of coefficients was \( {\widehat{\mathbf{b}}}_S^{\prime }={\mathbf{w}}^{\prime }{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[1.82\kern0.5em -1.38\kern0.5em 3.25\right] \), whereas the estimated selection response, expected genetic gain per trait, accuracy, and heritability of the LPSI were \( {\widehat{R}}_S=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}=74.91 \), \( {\widehat{\mathbf{E}}}_S^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_S^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}=\left[2.70\kern0.5em -2.20\kern0.5em 0.84\right] \), \( {\widehat{\rho}}_S=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.362 \), and \( {\widehat{h}}_S^2=\frac{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_S}{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}=0.143 \) respectively.

Note that because in the ESIM context \( {\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{b}}}_E=1 \), the best way of comparing ESIM results versus LPSI results is when the LPSI coefficient vector is normalized, i.e., when the LPSI coefficient vector is equal to \( {{\widehat{\mathbf{b}}}_S}^{\ast }={\widehat{\mathbf{b}}}_S/{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \) and then \( {\widehat{{\mathbf{b}}^{\prime}}}_S^{\ast^{\prime }}{\widehat{\mathbf{b}}}_S^{\ast }=1 \); however, it can be shown that the normalization process only affects the estimated LPSI selection response because in that case, \( {\widehat{R}}_S=74.91 \) is divided by \( {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \). For example, for this data set result, \( {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S=15.76 \); then, the estimated LPSI selection response using \( {{\widehat{\mathbf{b}}}_S}^{\ast }={\widehat{\mathbf{b}}}_S/{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \) is \( {\widehat{R}}_S=\frac{74.91}{15.74}=4.75 \), whereas the rest of the estimated LPSI parameters are the same. When \( 0<{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S<1 \) and \( 1<{\widehat{R}}_S \), the values of \( {\widehat{R}}_S \) increase, but when \( 1<{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \), the values of \( {\widehat{R}}_S \) decrease, as in the example.

The product \( {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \) does not affect \( {\widehat{\rho}}_S \) because it is invariant to scale change. Also, \( {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \) does not affect \( {\widehat{h}}_S^2 \) and \( {\widehat{\mathbf{E}}}_S \) because \( {\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{b}}}_S \) appears in the numerator and denominator of both estimated parameters.

In the ESIM, the sign and proportion of the expected genetic gain values for traits RL, SM, and EW should be in accordance with the breeder’s interest. For example, if the breeder’s interest is that the expected genetic gain per trait for RL should be positive and negative for SM, the sign and proportion of the values of the first eigenvector should be modified using a linear combination of the estimated first eigenvector \( {\widehat{\mathbf{b}}}_{E_1} \), i.e., \( \widehat{\boldsymbol{\upbeta}}=\mathbf{F}{\widehat{\mathbf{b}}}_{E_1} \), to achieve expected genetic gain per trait values in RL and SM according to the breeder’s interest.

The information needed to obtain the estimated ESIM parameters are matrices \( \widehat{\mathbf{T}}={\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1102& -0.0405& -0.0280\\ {}-0.0390& 0.0864& -0.0184\\ {}-0.1833& 0.0517& 0.4115\end{array}\right] \) and \( \widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }=\left[\begin{array}{ccc}0.0146& -0.0073& -0.0338\\ {}-0.0073& 0.0093& 0.0041\\ {}-0.0338& 0.0041& 0.2056\end{array}\right] \). We need to find the eigenvalues and eigenvectors of equation \( \left(\widehat{\mathbf{T}}{\widehat{\mathbf{T}}}^{\prime }-{\widehat{\mu}}_j\mathbf{I}\right){\widehat{\mathbf{b}}}_{E_j}=\mathbf{0} \), where \( {\widehat{\mu}}_j={\widehat{\lambda}}_{E_j}^4 \), to obtain matrices V1 and L1/2, which form matrix \( \widehat{\mathbf{T}}={\mathbf{V}}_1{\mathbf{L}}^{1/2}{\mathbf{V}}_2^{\prime } \).

Matrix V1 is equal to \( {\mathbf{V}}_1=\left[\begin{array}{ccc}-0.1701& 0.6818& 0.7115\\ {}0.0259& -0.7187& 0.6948\\ {}0.9851& 0.1366& 0.1046\end{array}\right] \), whereas the diagonal elements of matrix L are 0.2115, 0.0155, and 0.0025, that is, matrix \( {\mathbf{L}}^{1/2}=\left[\begin{array}{ccc}0.4599& 0& 0\\ {}0& 0.1244& 0\\ {}0& 0& 0.0498\end{array}\right] \). Thus, \( {\widehat{\mu}}_1={\widehat{\lambda}}_{E_1}^4=0.2115 \), \( {\widehat{\lambda}}_{E_1}^2=0.4599 \), and the estimated ESIM accuracy was \( {\widehat{\lambda}}_{E_1}=0.6782 \). The estimated ESIM eigenvector of coefficients is the first column of matrix V1, i.e., \( {\widehat{\mathbf{b}}}_{E_1}^{\prime }=\left[-0.1701\kern0.5em 0.0259\kern0.5em 0.9851\right] \), and the estimated ESIM index can be constructed as \( {\widehat{I}}_E=-0.1701\mathrm{RL}+0.0259\mathrm{SM}+0.9851\mathrm{EW} \).

The estimated ESIM selection response and expected genetic gain per trait were \( {\widehat{R}}_E=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}=9.54 \) and \( {\widehat{\mathbf{E}}}_E^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_{E_1}^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{E_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{E_1}}}=\left[-3.10\kern0.5em 1.61\kern0.5em 3.18\right] \) respectively. Because the estimated LPSI selection response was \( {\widehat{R}}_S=\frac{74.91}{15.74}=4.75 \), the estimated ESIM selection response was higher than the estimated LPSI response. In addition, the estimated LPSI expected genetic gain per trait was \( {\widehat{\mathbf{E}}}_S^{\prime }=\left[2.70\kern0.5em -2.20\kern0.5em 0.84\right] \). Now, suppose that the breeder’s interest is to increase RL and decrease SM; then, \( {\widehat{\mathbf{E}}}_S^{\prime } \) is a good result but \( {\widehat{\mathbf{E}}}_E^{\prime } \) is wrong.

We can change the sign and proportion of \( {\widehat{\mathbf{E}}}_E^{\prime } \) by transforming \( {\widehat{\mathbf{b}}}_{E_1} \) into \( \widehat{\boldsymbol{\upbeta}}=\mathbf{F}{\widehat{\mathbf{b}}}_{E_1} \) using a convenient matrix F such as \( \mathbf{F}=\left[\begin{array}{ccc}-9& 0& 0\\ {}0& 1& 0\\ {}0& 0& 1\end{array}\right] \). In such a case \( {\widehat{\boldsymbol{\upbeta}}}^{\prime }={\widehat{\mathbf{b}}}_{E_1}^{\prime}\mathbf{F}=\left[1.531\kern0.5em 0.026\kern0.5em 0.981\right] \), \( {\widehat{R}}_E=1.755\sqrt{{\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}=42.44 \), and \( {\widehat{\mathbf{E}}}_E^{\prime }=1.755\frac{\widehat{{\boldsymbol{\upbeta}}^{\prime }}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\boldsymbol{\upbeta}}^{\prime }}\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}}=\left[2.990\kern0.5em -1.85\kern0.5em 0.205\right] \). However, vector \( {\widehat{\boldsymbol{\upbeta}}}^{\prime } \) was not normalized. To normalize \( {\widehat{\boldsymbol{\upbeta}}}^{\prime } \) we need to divide it by \( {\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\boldsymbol{\upbeta}}=3.314 \), but \( {\widehat{\boldsymbol{\upbeta}}}^{\prime}\widehat{\boldsymbol{\upbeta}} \) should only affect \( {\widehat{R}}_E=42.44 \), which should be divided by 3.314, that is, \( {\widehat{R}}_E=\frac{42.44}{3.314}=12.806 \). According to the theory of similar matrices (Harville 1997), the estimated maximized ESIM accuracy, \( {\widehat{\lambda}}_{E_1}=0.6782 \), should not be affected by matrix F.

We can compare ESIM efficiency versus LPSI efficiency to predict the net genetic merit using the ratio of the estimated ESIM accuracy \( {\widehat{\lambda}}_{E_1}=0.6782 \) to LPSI accuracy \( {\widehat{\rho}}_S=0.362 \), i.e., \( \frac{{\widehat{\lambda}}_{E_1}}{{\widehat{\rho}}_S}=\frac{0.6782}{0.362}=1.873 \), or in percentage terms, \( {\widehat{p}}_E=100\left(1.873-1\right)=87.3 \) (see Chap. 5, Eq. 5.17). According to the latter result, the ESIM is a better predictor of the net genetic merit and its efficiency is 87.3% higher than that of the LPSI for this data set.

Now, we compare ESIM efficiency versus LPSI efficiency using the data set described in Sect. 2.8.1 of Chap. 2. From this data set, we ran five phenotypic selection cycles, each with four traits (T1, T2, T3, and T4), 500 genotypes, and four replicates for each genotype. The economic weights for T1, T2, T3, and T4 were 1, −1, 1, and 1 respectively. In this case, matrix F is an identity matrix of size 4 × 4 for all five selection cycles.

Table 7.1 presents the estimated LPSI, the restricted LPSI (RLPSI), and the predetermined proportional gain LPSI (PPG-LPSI) selection response (the latter two for one, two, and three restrictions) for five simulated selection cycles when their vectors of coefficients are normalized. Table 7.1 also presents the estimated ESIM, the RESIM and the PPG-ESIM selection response for one, two, and three restrictions for five simulated selection cycles. The selection intensity was 10% (kI = 1.755) for all five selection cycles. In this subsection, we compare only LPSI results versus ESIM results. The estimated LPSI selection response when the vector of coefficients was not normalized was described in Chap. 2 (Table 2.4). The averages of the estimated LPSI and ESIM selection responses were 4.70 and 6.31 respectively.

Table 7.1 Estimated linear phenotypic selection index (LPSI), restricted null LPSI (RLPSI), and predetermined proportional gains LPSI (PPG-LPSI) selection responses when their vectors of coefficients are normalized; estimated eigen selection index method (ESIM), restricted null ESIM (RESIM), and predetermined proportional gain ESIM (PPG-ESIM) selection responses for one, two, and three restrictions for five simulated selection cycles

Table 7.2 presents the estimated ESIM expected genetic gain per trait, accuracy (\( {\widehat{\rho}}_E \)), and the values \( {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) \), where \( {\widehat{\lambda}}_E={\widehat{\rho}}_E/{\widehat{\rho}}_S \) is the ratio of \( {\widehat{\rho}}_E \) to the estimated LPSI accuracy (\( {\widehat{\rho}}_S \)), expressed as percentages. Table 7.2 also presents the accuracy of the PPG-ESIM and the estimated ratio (\( {\widehat{p}}_{PE} \)) of the estimated PPG-ESIM accuracy to the estimated PPG-LPSI accuracy, expressed as percentages, for one, two, and three predetermined restrictions for five simulated selection cycles. In this subsection, we use only the estimated ESIM expected genetic gain per trait and \( {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) \) to compare ESIM efficiency versus LPSI efficiency.

Table 7.2 Estimated eigen selection index method (ESIM) expected genetic gain per trait, accuracy (\( {\widehat{\rho}}_E \)), and ratio of \( {\widehat{\rho}}_E \) to the estimated LPSI (data not presented) accuracy (\( {\widehat{\rho}}_S \)), expressed in percentage terms, \( {\widehat{\rho}}_E=100\left({\widehat{\lambda}}_E-1\right) \) (where \( {\widehat{\lambda}}_E={\widehat{\rho}}_E/{\widehat{\rho}}_S \))

The estimated LPSI expected genetic gains per trait were presented in Chap. 2, Table 2.4. According to the results shown in Table 2.4, the averages of the estimated LPSI expected genetic gain per trait T1, T2, T3, and T4 for five simulated selection cycles were 7.26, −3.52, 2.78, and 1.58, whereas according to the results of Table 7.2, the averages of the estimated ESIM expected genetic gains per trait were 5.67, −2.67, 1.81, and 2.9 respectively. This means that the estimated LPSI expected genetic gain for traits T1, T2, and T3 was higher than the estimated ESIM expected genetic gain for those traits.

The average of the \( {\widehat{p}}_E=100\left({\widehat{\lambda}}_E-1\right) \) values was 9.76 for all five selection cycles (Table 7.2). The latter result is not in accordance with the LPSI and ESIM expected genetic gain per trait; however, note that the \( {\widehat{p}}_E \) values are associated with the estimated LPSI and ESIM selection responses (Table 7.1), not with the expected genetic gain per trait, because \( {\widehat{\lambda}}_E=\frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\approx \frac{{\widehat{R}}_E}{{\widehat{R}}_S} \), where \( {\widehat{R}}_E \) and \( {\widehat{R}}_S \) are the estimated ESIM and LPSI selection responses respectively. Thus, the \( {\widehat{p}}_E \)values indicate that the efficiency of the ESIM and that of the LPSI were very similar because the former was only 9.76% higher than the latter for this data set.

The equality \( \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}=\frac{{\widehat{R}}_E}{{\widehat{R}}_S} \) is true only when the denominators of both estimated correlations are the same, as in the linear selection indices described in Chaps. 26. Note that \( {\widehat{\rho}}_S=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}} \) and \( {\widehat{\rho}}_E=\frac{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_E}}{\sqrt{{\mathbf{w}}_E^{\prime}\widehat{\mathbf{C}}{\mathbf{w}}_E}} \), whereas \( {\widehat{R}}_S=\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_S{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_S} \) and \( {\widehat{R}}_E=\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_E{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_E} \); this means that if \( \sqrt{{\mathbf{w}}_E^{\prime}\widehat{\mathbf{C}}{\mathbf{w}}_E}\ne \sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}} \), \( \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\ne \frac{{\widehat{R}}_E}{{\widehat{R}}_S} \). For the Akbar et al. (1984) data, \( {\widehat{R}}_E=9.54 \) and \( {\widehat{R}}_S=4.75 \), then \( \frac{{\widehat{R}}_E}{{\widehat{R}}_S}=2.0 \) but \( \frac{{\widehat{\lambda}}_{E_1}}{{\widehat{\rho}}_S}=1.873 \); that is, \( \frac{{\widehat{\rho}}_E}{{\widehat{\rho}}_S}\approx \frac{{\widehat{R}}_E}{{\widehat{R}}_S} \), where “≈” indicates an approximation.

Figure 7.1 presents the frequency distribution of 500 estimated ESIM values for cycle 2 (Fig. 7.1a) and cycle 5 (Fig. 7.1b), obtained from one selection cycle for 500 genotypes and four traits simulated in one environment. Figure 7.1a, b indicates that the frequency distribution of the estimated ESIM values approaches normal distribution.

Fig. 7.1
figure 1

Frequency distribution of 500 estimated eigen selection index method (ESIM) values for (a) cycle 2 and (b) cycle 5, obtained from one selection cycle for 500 genotypes and four traits simulated in one environment

7.2 The Linear Phenotypic Restricted Eigen Selection Index Method

Similar to the RLPSI (see Chap. 2), the objective of the RESIM is to fix r of t (r < t) traits by predicting only the genetic gains of (t − r) of them. Let H = wg be the net genetic merit and I = by the ESIM index. In Chap. 2, we showed that Cov(I, g) = Cb is the covariance between the breeding value vector (g) and I = by. Thus, to fix r of t traits, we need r covariances between the linear combinations of g (Ug) and I = by to be zero, i.e., Cov(I, Ug) = UCb = 0, where U′ is a matrix with 1s and 0s (1 indicates that the trait is restricted and 0 that the trait has no restrictions). In the RESIM, it is possible to solve this problem by maximizing \( {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} \) with respect to vectors b and w under the restrictions UCb = 0, bb = 1, wCw = 1, and bPb = 1, where wCw is the variance of H = wg and bPb is the variance of I = by. Also, the RESIM problem can be solved by maximizing \( \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} \) (Eq. 7.12) with respect to vectors b only under the restrictions UCb = 0 and bb = 1, as we did to obtain Eq. (7.13). Both approaches give the same result, but it is easier to work with the second approach than with the first one.

7.2.1 The RESIM Parameters

To obtain the RESIM vector of coefficients that maximizes the RESIM selection response and the expected genetic gain per trait, we need to maximize the function

$$ f\left(\mathbf{b},{\mathbf{v}}^{\prime}\right)=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}-{\mathbf{v}}^{\prime }{\mathbf{U}}^{\prime}\mathbf{Cb} $$
(7.28a)

with respect to b and v′, where \( {\mathbf{v}}^{\prime }=\left[{v}_1\kern0.5em {v}_2\kern0.5em \cdots \kern0.5em {v}_{r-1}\right] \) is a vector of Lagrange multipliers. The derivatives of Eq. (7.28a) with respect to b and v′ can be written as

$$ 2{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{1/2}\mathbf{Cb}-{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{-1/2}\left({\mathbf{b}}^{\prime}\mathbf{Cb}\right)\mathbf{Pb}-\mathbf{CUv}=\mathbf{0} $$
(7.28b)

and

$$ {\mathbf{U}}^{\prime}\mathbf{Cb}=\mathbf{0}, $$
(7.29)

respectively, where Eq. (7.29) denotes the restriction imposed for maximizing Eq. (7.28a). Using algebraic methods on Eq. (7.28b) similar to those used to obtain Eqs. (7.10) and (7.13), we get

$$ \left({\mathbf{KP}}^{-1}\mathbf{C}-{h}_{I_R}^2{\mathbf{I}}_t\right){\mathbf{b}}_R=\mathbf{0}, $$
(7.30)

where K = [It − QR], It is an identity matrix of size t × t, QR = P−1CU(UCP−1CU)−1UC, and \( {h}_{I_R}^2=\frac{{\mathbf{b}}_R^{\prime }{\mathbf{Cb}}_R}{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R} \) is the maximized RESIM heritability obtained under the restriction UCb = 0; \( {h}_{I_R}^2 \) is also the square of the maximized correlation between the net genetic merit and \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \), that is, \( {h}_{I_R}^2={\lambda}_R^2 \). This means that Eq. (7.30) can be written as

$$ \left({\mathbf{KP}}^{-1}\mathbf{C}-{\lambda}_R^2{\mathbf{I}}_t\right){\mathbf{b}}_R=\mathbf{0}. $$
(7.31)

Thus, the optimized RESIM index is \( I={\mathbf{b}}_R^{\prime}\mathbf{y} \). The only difference between Eqs. (7.31) and (7.13) is matrix K. Equation (7.31) was obtained by Cerón-Rojas et al. (2008) by maximizing \( {\rho}_{HI}^2 \) (Eq. 7.1) with respect to vectors b and w under the restriction UCb = 0, bb = 1, wCw = 1 and bPb = 1 in a similar manner to the canonical correlation theory. The RESIM expected genetic gain per trait uses the first eigenvector (bR) of matrix KP−1C, whereas the RESIM selection response uses bR and the first eigenvalue (\( {\lambda}_R^2 \)) of matrix KP−1C. When U′ is a null matrix, bR = bE (the vector of the ESIM coefficients); thus, the RESIM is more general than the ESIM and includes the ESIM as a particular case.

In the RESIM context, vector w can be obtained (Cerón-Rojas et al. 2008) as

$$ {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[{\lambda}_R{\mathbf{Pb}}_R+\boldsymbol{\Psi} \mathbf{v}\right], $$
(7.32)

where λR and bR are the square roots of the first eigenvalue (\( {\lambda}_R^2 \)) and the first eigenvector of matrix KP−1C respectively; Ψ = CU and \( \mathbf{v}={\lambda}_R^{-1}{\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_R \). Let \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) be the net genetic merit in the RESIM context; then, because the correlation between \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \) and \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) is not affected by scale change, λR and \( {\lambda}_R^{-1} \) can be considered proportional constants and then Ψv can be written as \( \boldsymbol{\Psi} \mathbf{v}=\boldsymbol{\Psi} {\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_R={\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R \), where \( {\mathbf{Q}}_R^{\prime } \) is the transpose of matrix QR described in Eq. (7.30). Thus, another way of writing Eq. (7.32) is

$$ {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[\mathbf{P}+{\mathbf{Q}}_R^{\prime}\mathbf{C}\right]{\mathbf{b}}_R. $$
(7.33)

By Eq. (7.33) and the restriction bΨ = 0, the covariance between \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \) and \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) (\( {\sigma}_{H_R{I}_R} \)) can be written as

$$ {\sigma}_{H_R{I}_R}={\mathbf{w}}_R^{\prime }{\mathbf{Cb}}_R={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R+{\mathbf{b}}_R^{\prime }{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R, $$
(7.34)

where \( {\mathbf{b}}_R^{\prime }{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R=0 \) according to the restriction bΨ = 0. Equation (7.34) indicates that the covariance between IR and HR (\( {\sigma}_{H_R{I}_R} \)) is equal to the variance of IR (\( {\sigma}_{I_R}^2={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R \)).

The maximized correlation between IR and HR (or RESIM accuracy) can be written as

$$ {\rho}_{H_R{I}_R}=\frac{\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}}{\sqrt{{\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R}}, $$
(7.35)

where \( {\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R={\sigma}_{H_R}^2 \) is the variance of HR, \( {\mathbf{w}}_R={\mathbf{C}}^{-1}\left[\mathbf{P}+{\mathbf{Q}}_R^{\prime}\mathbf{C}\right]{\mathbf{b}}_R \), \( {\mathbf{Q}}_R^{\prime }=\boldsymbol{\Psi} {\left({\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\boldsymbol{\Psi} \right)}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1} \), and Ψ = CU. When U′ is a null matrix, \( {\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R={\mathbf{b}}_E^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_E={\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E \), the variance of HE, and \( {\sigma}_{I_R}^2={\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R={\mathbf{b}}_E^{\prime }{\mathbf{Pb}}_E={\sigma}_{I_E}^2 \), the variance of IE. Hereafter, to simplify the notation, we write Eq. (7.35) as ρR or λR.

The maximized selection response (RR) and expected genetic gain per trait (ER) of the RESIM can be written as

$$ {R}_R={k}_I\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R} $$
(7.36)

and

$$ {\mathbf{E}}_R={k}_I\frac{{\mathbf{Cb}}_R}{\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}}, $$
(7.37)

respectively, where \( \sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{Pb}}_R}={\sigma}_{I_R} \) is the standard deviation of the variance of \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \). If vector bR is transformed as βR = FbR, where matrix F was defined earlier, vector bR should be changed by βR in Eqs. (7.36) and (7.37), and in \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \).

Equation (7.36) can also be written as \( {R}_R={k}_I{\sigma}_{H_R}{\lambda}_R \), where \( {\sigma}_{H_R}=\sqrt{{\mathbf{b}}_R^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_R+{\mathbf{b}}_R^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_R^{\prime }{\mathbf{Cb}}_R} \) is the standard deviation of the variance of HR, and \( {\lambda}_R={\rho}_{H_R{I}_R} \) is the first canonical correlation between \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) and \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \). When \( {\sigma}_{H_R}=1 \), λR is the covariance between \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) and \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \), and then Eq. (7.36) can be written as RR = kIλR. This last result was presented by Cerón-Rojas et al. (2008) in their original paper.

The ratio of the index accuracies and the VPE are also valid in the RESIM context. In Eq. (7.34) we showed that the covariance between \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \) and \( {H}_R={\mathbf{w}}_R^{\prime}\mathbf{g} \) (\( {\sigma}_{H_R{I}_R} \)) is equal to the variance of \( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \) (\( {\sigma}_{I_R}^2 \)). This means that the VPE of the RESIM can be written as

$$ E\left[{\left({H}_R-{I}_R\right)}^2\right]={\sigma}_{H_R}^2+{\sigma}_{I_R}^2-2{\sigma}_{H_R{I}_R}={\sigma}_{H_R}^2-{\sigma}_{I_R}^2=\left(1-{\rho}_R^2\right){\sigma}_{H_R}^2. $$
(7.38)

Statistical properties associated with the ESIM and described in Sect. 7.1.2 are also valid for the RESIM.

7.2.2 Estimating the RESIM Parameters

We can estimate the RESIM parameters in a similar manner to the ESIM parameters in Sect. 7.1.4. With matrices \( \widehat{\mathbf{C}} \) and \( \widehat{\mathbf{P}} \), we constructed matrix \( {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) and equation

$$ \left({\widehat{\mathbf{S}}}_R{\widehat{\mathbf{S}}}_R^{\prime }-{\widehat{\mu}}_{Rj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{R_j}=\mathbf{0}, $$
(7.39)

where \( {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 \), j= 1, 2, …, t. The estimated RESIM index (\( {I}_R={\mathbf{b}}_R^{\prime}\mathbf{y} \)) is \( {\widehat{I}}_R={\widehat{{\mathbf{b}}^{\prime}}}_{R_1}\mathbf{y} \) and the estimator of the maximized RESIM selection response and its expected genetic gain per trait can be denoted as \( {\widehat{R}}_R={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}} \) and \( {\widehat{\mathbf{E}}}_R={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{R_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}} \) respectively, whereas the estimator of the maximized RESIM accuracy is \( {\widehat{\lambda}}_{R_1} \).

7.2.3 Numerical Examples

We compare the RLPSI results with those of the RESIM using the Akbar et al. (1984) data described in Sect. 7.1.5. We restrict the trait RL (number of eggs) in both indices. In Chap. 3, Sect. 3.1.3, we indicated how to construct matrix U′ and, in Sect. 3.1.4 of the same chapter, we described how to obtain matrix \( \widehat{\mathbf{K}}=\left[{\mathbf{I}}_t-\widehat{\mathbf{Q}}\right] \) for one and two restrictions. Matrix \( \widehat{\mathbf{K}} \) is the same for the RLPSI and the RESIM. Thus, in this subsection we omit the steps needed to construct matrices U′ and \( \widehat{\mathbf{K}} \).

First, we estimate the RLPSI parameters. Assume a selection intensity of 10% (kI = 1.755) and a vector of economic weights \( {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] \). The estimated RLPSI vector of coefficients for one restriction was \( {\widehat{\mathbf{b}}}^{\prime }=\left[0.29\kern0.5em -0.84\kern0.5em 5.78\right] \), and the estimated selection response, expected genetic gain per trait, accuracy, and heritability of the RLPSI were \( \widehat{R}=1.755\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=53.01 \), \( {\widehat{\mathbf{E}}}^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}^{\prime}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}=\left[0\kern0.5em -0.71\kern0.5em 2.96\right] \), \( \widehat{\rho}=\frac{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.26 \), and \( {\widehat{h}}^2=\frac{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{C}}\widehat{\mathbf{b}}}{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=0.33 \) respectively. In this case, \( \widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{b}}=34.25 \); then, the estimated RLPSI selection response using the normalized RLPSI vector of coefficients was \( \widehat{R}=\frac{53.01}{34.25}=1.55 \), and the rest of the estimated RLPSI parameters were the same.

In the RESIM, matrix F was an identity matrix of size 3 × 3; that is, we did not use matrix F to transform the RESIM vector of coefficients. In Sect. 7.1.5 we obtained matrix \( {\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1102& -0.0405& -0.0280\\ {}-0.0390& 0.0864& -0.0184\\ {}-0.1833& 0.0517& 0.4115\end{array}\right] \), and we have indicated that matrix \( \widehat{\mathbf{K}} \) is the same for the RLPSI and the RESIM. In the RESIM, we need matrix \( {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) to solve equation \( \left({\widehat{\mathbf{S}}}_R{\widehat{{\mathbf{S}}^{\prime}}}_R-{\widehat{\mu}}_{Rj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{R_j}=\mathbf{0} \), where \( {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 \), whence we shall obtain the eigenvalues and eigenvectors that form matrices \( {\mathbf{L}}_R^{1/2} \), VR1, and \( {\widehat{\mathbf{S}}}_R={\mathbf{V}}_{R1}{\mathbf{L}}_R^{1/2}{\mathbf{V}}_{R2}^{\prime } \).

For one null restriction, matrix \( {\widehat{\mathbf{S}}}_R={\widehat{\mathbf{K}}\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0& 0.0285& 0.0232\\ {}0& 0.0620& -0.0365\\ {}0& -0.0630& 0.3263\end{array}\right] \). This means that \( {\widehat{\mathbf{S}}}_R \) reflects the trait restrictions imposed on the covariance between the RESIM and the vector of genotypic values; thus, if r traits are restricted, r columns of \( {\widehat{\mathbf{S}}}_R \) are equal to zero. Matrix \( {\widehat{\mathbf{S}}}_R{\widehat{{\mathbf{S}}^{\prime}}}_R=\left[\begin{array}{ccc}0.0013& 0.0009& 0.0058\\ {}0.0009& 0.0052& -0.0158\\ {}0.0058& -0.0158& 0.1104\end{array}\right] \) and \( {\mathbf{V}}_{R_1}=\left[\begin{array}{ccc}0.0500& 0.5216& -0.8517\\ {}-0.1446& 0.8476& 0.5106\\ {}0.9882& 0.0976& 0.1178\end{array}\right] \), whereas the \( {\widehat{\mu}}_{Rj}={\widehat{\lambda}}_{R_j}^4 \) values were 0.1130, 0.0039, and 0.0, whence \( {\mathbf{L}}_R^{1/2}=\left[\begin{array}{ccc}0.3362& 0& 0\\ {}0& 0.0626& 0\\ {}0& 0& 0.0\end{array}\right] \). Thus, \( {\widehat{\mu}}_{R_1}={\widehat{\lambda}}_{R_1}^4=0.1130 \), \( {\widehat{\lambda}}_{R_1}^2=0.3362 \), and the estimated RESIM accuracy was \( {\widehat{\lambda}}_{E_1}=0.5798 \). The estimated RESIM eigenvector, index, the selection response, and expected genetic gain per trait were \( {\widehat{\mathbf{b}}}_{R_1}^{\prime }=\left[0.0500\kern0.5em -0.1446\kern0.5em 0.9882\right] \), \( {\widehat{I}}_R=0.0500\mathrm{RL}-0.1446\mathrm{SM}+0.9882\mathrm{EW} \), \( {\widehat{R}}_R=1.755\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}=9.06 \), and \( {\widehat{\mathbf{E}}}_R^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}_{R_1}^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{R_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{R_1}}}=\left[0\kern0.5em -0.72\kern0.5em 2.96\right] \) respectively.

The estimated RLPSI selection response was \( \widehat{R}=\frac{53.01}{34.25}=1.55 \); thus, the estimated RESIM selection response was higher than the estimated RLPSI response. In addition, the estimated RLPSI expected genetic gain per trait was \( {\widehat{\mathbf{E}}}^{\prime }=\left[0\kern0.5em -0.71\kern0.5em 2.96\right] \), which is the same as the estimated RESIM expected genetic gain per trait.

We can compare RESIM efficiency versus RLPSI efficiency to predict the net genetic merit using the ratio of the estimated RESIM accuracy \( {\widehat{\lambda}}_{E_1}=0.5798 \) to the RLPSI accuracy \( \widehat{\rho}=0.26 \), i.e., \( \frac{{\widehat{\lambda}}_{R_1}}{{\widehat{\rho}}_S}=\frac{0.5798}{0.26}=2.23 \), or in percentage terms, \( {\widehat{p}}_E=100\left(2.23-1\right)=123 \) (see Chap. 5, Eq. 5.17). That is, the RESIM is a better predictor of the net genetic merit and its efficiency was 123% higher than the RLPSI efficiency for this data set.

Now, we compare RESIM efficiency versus RLPSI efficiency using the simulated data set described in Sect. 2.8.1 of Chap. 2 for five phenotypic selection cycles, each with four traits (T1, T2, T3, and T4), 500 genotypes, and four replicates for each genotype. The economic weights for T1, T2, T3, and T4 were 1, −1, 1, and 1 respectively. For this data set, matrix F was equal to an identity matrix of size 4 × 4 for all five selection cycles.

The first and second parts of columns 3, 4, and 5 of Table 7.1 present the estimated RLPSI and RESIM selection responses respectively for one, two, and three null restrictions for five simulated selection cycles, where the selection intensity was 10% (kI = 1.755) for all five selection cycles. The averages of the estimated RLPSI selection response for each null restriction were 4.43, 4.30, and 4.92, whereas the averages of the estimated RESIM selection response were 4.54, 4.42, and 4.38 respectively. These results indicate that the estimated RLPSI selection response was greater than the estimated RESIM selection response only for three null restrictions.

The first part of Table 7.3 presents the estimated RESIM expected genetic gain per trait for one, two, and three restrictions for five simulated selection cycles. The estimated RLPSI expected genetic gains per trait for one, two, and three restrictions are given in Chap. 3 (Table 3.3). According to the results shown in Table 3.3 (Chap. 3), the averages of the estimated RLPSI expected genetic gains per trait for five simulated selection cycles were −2.52, 2.25, and 2.26 for one restriction; 2.84 and 2.65 for two restrictions; and 3.90 for three restrictions. According to the results shown in Table 7.3, the averages of the estimated RESIM expected genetic gains per trait for five simulated selection cycles were −0.43, −0.75, and 3.90 for one restriction; −0.59 and 3.89 for two restrictions; and 3.90 for three restrictions. This means that the RESIM and RLPSI were the same only for three restrictions, whereas for one and two restrictions, the average of the estimated RESIM expected genetic gains per trait was higher than that of the estimated RLPSI expected genetic gains per trait only for trait 4.

Table 7.3 Estimated RESIM and PPG-ESIM expected genetic gain per trait for one, two, and three restrictions for five simulated selection cycles

Figure 7.2 presents the estimated accuracy of the RLPSI and the RESIM for one, two, and three null restrictions for five simulated selection cycles. In all five selection cycles, the estimated RESIM accuracy was greater than the RLPSI accuracy. This means that the RESIM is a better predictor of the net genetic merit than the RLPSI. Additional results associated with the frequency distribution of the estimated RESIM values are presented in Fig. 7.3. Figure 7.3a presents the frequency distribution of the estimated RESIM values with one null restriction for cycle 2, whereas Fig. 7.3b presents the frequency distribution of the estimated RESIM values with two null restrictions for cycle 5; both figures indicate that the estimated RESIM values approach normal distribution.

Fig. 7.2
figure 2

Estimated correlation values between the restricted linear phenotypic selection index (RLPSI) and the net genetic merit (H = wg); estimated correlation values between the restricted eigen selection index method (RESIM) and H for one, two and three null restrictions for four traits and 500 genotypes in one environment simulated for five selection cycles

Fig. 7.3
figure 3

Frequency distribution of 500 estimated RESIM values for (a) cycle 2 and (b) cycle 5, obtained from one selection cycle for 500 genotypes and four traits simulated in one environment

Finally, in Chap. 10 we present the results of comparing the ESIM with the LPSI and the RESIM with the RLPSI for many selection cycles. Such results are similar to those obtained in this chapter.

7.3 The Linear Phenotypic Predetermined Proportional Gain Eigen Selection Index Method

In a similar manner to the PPG-LPSI (see Chap. 3), in the PPG-ESIM the breeder pre-sets optimal levels (predetermined proportional gains) on certain traits before the selection is carried out. Let \( {\mathbf{d}}^{\prime }=\left[{d}_1\kern0.5em {d}_2\kern0.5em \cdots \kern0.5em {d}_r\right] \) be the vector of the PPGs (predetermined proportional gains) imposed by the breeder on r traits and assume that μq is the population mean of the qth trait before selection. The objective of the PPG-ESIM is to change μq to μq + dq, where dq is a predetermined change in μq (in the RESIM, dq = 0, q = 1, 2, ⋯, r, where r is the number of PPGs). That is, the PPG-ESIM attempts to make some traits change their expected genetic gain values based on a predetermined level, whereas the rest of the traits remain without restrictions.

The simplest way to solve the foregoing problem is by maximizing the PPG-ESIM heritability under the restriction DUCb = 0, where \( {\mathbf{D}}^{\prime }=\left[\begin{array}{ccccc}{d}_r& 0& \cdots & 0& -{d}_1\\ {}0& {d}_r& \cdots & 0& -{d}_2\\ {}\vdots & \vdots & \ddots & \vdots & \vdots \\ {}0& 0& \cdots & {d}_r& -{d}_{r-1}\end{array}\right] \) (see Chap. 3 for details) is a matrix (r − 1) × r, r is the number of PPGs, dq (q = 1, 2…, r) is the qth element of vector d, U′ is the RLPSI matrix of restrictions of 1s and 0s, and C is the covariance matrix of genotypic values. Matrix D′ is a Mallard (1972) matrix of PPGs used to impose predetermined restrictions.

The Mallard (1972) matrix of predetermined restrictions can be written as M = DΨ′, where Ψ′ = UC and U′ is the Kempthorne and Nordskog (1959) matrix of restrictions of 1s and 0s (1 indicates that the trait is restricted, i.e., dq = 0, and 0 that the trait has no restrictions).

To find the PPG-ESIM vector of coefficients that maximizes the PPG-ESIM selection response and expected genetic gain per trait, we can maximize \( {\rho}_{HI}^2=\frac{{\left({\mathbf{w}}^{\prime}\mathbf{Cb}\right)}^2}{\left({\mathbf{w}}^{\prime}\mathbf{Cw}\right)\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)} \) with respect to vectors b and w under the restrictions Mb = 0, bb = 1, wCw = 1, and bPb = 1, where wCw is the variance of H = wg and bPb is the variance of I = by, as did Cerón-Rojas et al. (2016) according to the canonical correlation theory, or we can solve this problem by maximizing \( \frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}} \) (Eq. 7.12) only with respect to vectors b under the restriction Mb = 0 and bb = 1, as we did to obtain the RESIM vector of coefficients. Both approaches give the same result, but we use the latter approach because it is easier to work with.

7.3.1 The PPG-ESIM Parameters

To obtain the PPG-ESIM vector of coefficients, we need to maximize the function

$$ f\left(\mathbf{b},{\mathbf{v}}^{\prime}\right)=\frac{{\mathbf{b}}^{\prime}\mathbf{Cb}}{\sqrt{{\mathbf{b}}^{\prime}\mathbf{Pb}}}-{\mathbf{v}}^{\prime }{\mathbf{M}}^{\prime}\mathbf{b} $$
(7.40)

with respect to vectors b and v′, where \( {\mathbf{v}}^{\prime }=\left[{v}_1\kern0.5em {v}_2\kern0.5em \cdots \kern0.5em {v}_{r-1}\right] \) is a vector of Lagrange multipliers. The derivatives of Eq. (7.40) with respect to b and v′ were:

$$ 2{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{1/2}\mathbf{Cb}-{\left({\mathbf{b}}^{\prime}\mathbf{Pb}\right)}^{-1/2}\left({\mathbf{b}}^{\prime}\mathbf{Cb}\right)\mathbf{Pb}-\mathbf{Mv}=\mathbf{0} $$
(7.41)

and

$$ {\mathbf{M}}^{\prime}\mathbf{b}=\mathbf{0}, $$
(7.42)

respectively, where Eq. (7.42) denotes the restriction imposed for maximizing Eq. (7.40). By using algebraic methods on Eq. (7.41) similar to those used to obtain Eq. (7.10) we get

$$ \left({\mathbf{K}}_P{\mathbf{P}}^{-1}\mathbf{C}-{\lambda}_P^2{\mathbf{I}}_t\right){\mathbf{b}}_P=\mathbf{0}, $$
(7.43)

where KP = [It − QP], QP = P−1ΨD(DΨP−1ΨD)−1DΨ′, Ψ′ = UC, It is an identity matrix t × t, \( {\lambda}_P^2={h}_{I_P}^2 \), and bP are the first eigenvalue and the first eigenvector of matrix KPP−1C respectively. Note that \( {h}_{I_P}^2 \) is PPG-ESIM heritability and λP is the maximum correlation between \( {I}_{\mathrm{P}}={\mathbf{b}}_P^{\prime}\mathbf{y} \) and H = wg. When D′ = U′, bP = bR (the vector of coefficients of the RESIM), and when U′ is a null matrix, bP = bE (the vector of coefficients of the ESIM). That is, the PPG-ESIM is more general than the RESIM and the ESIM and includes the latter two indices as particular cases. Matrices KP = [It − QP] and QP = P−1ΨD(DΨP−1ΨD)−1DΨ′ are the same as those obtained in the PPG-LPSI (see Chap. 3). Also, vector bP can be transformed as βP = FbP; matrix F was defined earlier.

Let SP = ΨP−1Ψ; then, under the assumption Dd = 0, it is possible to show that \( \mathbf{D}{\left({\mathbf{D}}^{\prime }{\mathbf{S}}_P\mathbf{D}\right)}^{-1}{\mathbf{D}}^{\prime }={\mathbf{S}}_P^{-1}-{\mathbf{S}}_P^{-1}\mathbf{d}{\left({\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1}\mathbf{d}\right)}^{-1}{\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1} \) (see Chap. 3), whence by substituting \( {\mathbf{S}}_P^{-1}-{\mathbf{S}}_P^{-1}\mathbf{d}{\left({\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1}\mathbf{d}\right)}^{-1}{\mathbf{d}}^{\prime }{\mathbf{S}}_P^{-1} \) for D(DSPD)−1D′ in matrix QP = P−1ΨD(DΨP−1ΨD)−1DΨ′, matrix KPP−1C can be written as

$$ {\mathbf{K}}_P{\mathbf{P}}^{-1}\mathbf{C}=\left[{\mathbf{I}}_t-{\mathbf{P}}^{-1}{\boldsymbol{\Psi} \mathbf{S}}^{-1}{\boldsymbol{\Psi}}^{\prime}\right]{\mathbf{P}}^{-1}\mathbf{C}+{\mathbf{A}}_P, $$
(7.44)

where Ψ′ = UC, AP = δα, δ = P−1Ψ(ΨP−1Ψ)−1d, and \( {\boldsymbol{\upalpha}}^{\prime }=\frac{{\mathbf{d}}^{\prime }{\mathbf{S}}^{-1}{\boldsymbol{\Psi}}^{\prime }{\mathbf{P}}^{-1}\mathbf{C}}{{\mathbf{d}}^{\prime }{\mathbf{S}}^{-1}\mathbf{d}} \). When AP is a null matrix, KPP−1C = KP−1C (matrix of the RESIM), and if U′ is a null matrix, KPP−1C = P−1C (matrix of the ESIM), this means that Eq. (7.44) is a mathematical equivalent form of matrix KPP−1C and that Eq. (7.44) does not require matrix D′. The easiest way to obtain bP and λP is to use matrix [It − P−1ΨS−1Ψ′]P−1C + AP in Eq. (7.43) instead of matrix KPP−1C.

In the PPG-ESIM context, vector w can be obtained as

$$ {\mathbf{w}}_P={\mathbf{C}}^{-1}\left[{\lambda}_P{\mathbf{Pb}}_P+{\mathbf{Mv}}_P\right], $$
(7.45)

whence H = wg can be written as \( {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} \). In Eq. (7.45), λP is the maximum correlation between \( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \) and \( {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} \), bP is the first eigenvector of matrix KPP−1C, \( {\mathbf{v}}_P={\lambda}_P^{-1}{\left({\mathbf{M}}^{\prime }{\mathbf{P}}^{-1}\mathbf{M}\right)}^{-1}{\mathbf{M}}^{\prime }{\mathbf{P}}^{-1}{\mathbf{Cb}}_P \), M = DΨ′, and Ψ′ = UC. In a similar manner to the RESIM context, we can assume that λP and \( {\lambda}_P^{-1} \) are proportionality constants and it can be shown that the covariance between \( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \) and \( {H}_P={\mathbf{w}}_P^{\prime}\mathbf{g} \) (\( {\sigma}_{H_P{I}_P} \)) is equal to the variance of \( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \) (\( {\sigma}_{I_P}^2={\mathbf{b}}_p^{\prime }{\mathbf{Pb}}_P \)), that is, \( {\sigma}_{H_P{I}_P}={\mathbf{w}}_P^{\prime }{\mathbf{Cb}}_P={\mathbf{b}}_p^{\prime }{\mathbf{Pb}}_P \).

The accuracy of the PPG-ESIM can also be written as

$$ {\rho}_{H_P{I}_P}=\frac{\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P}}{\sqrt{{\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P}}, $$
(7.46)

where \( {\sigma}_{H_P}^2={\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_P+{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_P^{\prime }{\mathbf{Cb}}_P \) is the variance of HP. When D′ = U′, \( {\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{w}}_R^{\prime }{\mathbf{Cw}}_R \) (the variance of HR), and when U′ is a null matrix, \( {\mathbf{w}}_P^{\prime }{\mathbf{Cw}}_P={\mathbf{w}}_E^{\prime }{\mathbf{Cw}}_E \) (the variance of HE). Hereafter, to simplify the notation, we write Eq. (7.46) as ρP or λP.

Let βP = FbP be the PPG-ESIM transformed vector of coefficients by matrix F. By Eqs. (7.1) and (7.46), the maximized selection response (RP) and expected genetic gain per trait (EP) of the PPG-ESIM can be written as

$$ {R}_P={k}_I\sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P} $$
(7.47)

and

$$ {\mathbf{E}}_P={k}_I\frac{{\mathbf{C}\boldsymbol{\upbeta}}_P}{\sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P}}, $$
(7.48)

respectively, where \( \sqrt{{\boldsymbol{\upbeta}}_P^{\prime }{\mathbf{P}\boldsymbol{\upbeta}}_P}={\sigma}_{I_P} \) is the standard deviation of the variance of \( {I}_P={\boldsymbol{\upbeta}}_P^{\prime}\mathbf{y} \). Equations (7.47) and (7.48) do not require economic weights. When F is an identity matrix, βP = bP, \( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \), \( {R}_P={k}_I\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P} \), and \( {\mathbf{E}}_P={k}_I\frac{{\mathbf{Cb}}_P}{\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{Pb}}_P}} \).

Equation (7.47) can also be written as \( {R}_P={k}_I{\sigma}_{H_P}{\lambda}_P \), where \( {\sigma}_{H_P}=\sqrt{{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Pb}}_P+{\mathbf{b}}_P^{\prime }{\mathbf{PC}}^{-1}{\mathbf{Q}}_P^{\prime }{\mathbf{Cb}}_P} \) is the standard deviation of the variance of HP, and λP is the canonical correlation between HP and \( {I}_P={\boldsymbol{\upbeta}}_P^{\prime}\mathbf{y} \). When \( {\sigma}_{H_P}=1 \), Eq. (7.47) can be written as RP = kIλP, where λP is the covariance between \( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \) and \( H={\mathbf{w}}_P^{\prime}\mathbf{g} \).

The prediction efficiency of the PPG-ESIM can be obtained in a similar manner to the ESIM and RESIM. The accuracy of the PPG-ESIM (Eq. 7.46) can be used to construct the ratio of index accuracies. The PPG-ESIM mean square error or the VPE can be obtained as

$$ E\left[{\left({H}_P-{I}_P\right)}^2\right]={\sigma}_{H_P}^2+{\sigma}_{I_P}^2-2{\sigma}_{H_P{I}_P}={\sigma}_{H_P}^2-{\sigma}_{I_P}^2=\left(1-{\rho}_P^2\right){\sigma}_{H_P}^2. $$
(7.49)

Additional properties associated with the ESIM are also valid for the PPG-ESIM.

7.3.2 Estimating PPG-ESIM Parameters

The procedure used to estimate PPG-ESIM parameters is the same as that described for RESIM. Let \( \widehat{\mathbf{C}} \) and \( \widehat{\mathbf{P}} \) be the estimated matrices of C and P. In the PPG-ESIM context, we use matrix \( \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) to obtain the estimated eigenvalues and eigenvectors of equation

$$ \left(\widehat{\mathbf{S}}-{\widehat{\lambda}}_{Pj}^2{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{Pj}=\mathbf{0}, $$
(7.50)

j = 1, 2, ⋯, t, where t is the number of traits in the PPG-ESIM index, \( {\widehat{\mathbf{K}}}_P=\left[{\mathbf{I}}_t-{\widehat{\mathbf{Q}}}_P\right] \), It is an identity matrix of size t × t and \( {\widehat{\mathbf{Q}}}_P={\widehat{\mathbf{P}}}^{-1}\widehat{\boldsymbol{\Psi}}\mathbf{D}{\left({\mathbf{D}}^{\prime}\widehat{{\boldsymbol{\Psi}}^{\prime }}{\widehat{\mathbf{P}}}^{-1}\widehat{\boldsymbol{\Psi}}\mathbf{D}\right)}^{-1}{\mathbf{D}}^{\prime}\widehat{{\boldsymbol{\Psi}}^{\prime }} \). As \( \widehat{\mathbf{S}} \) is an asymmetric matrix, the values of \( {\widehat{\mathbf{b}}}_{Pj} \) and \( {\widehat{\lambda}}_{Pj}^2 \) should be obtained using SVD (singular value decomposition).

According to SVD, we need to solve equation

$$ \left(\widehat{\mathbf{S}}{\widehat{\mathbf{S}}}^{\prime }-{\widehat{\mu}}_{Pj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{P_j}=\mathbf{0}, $$
(7.51)

where \( {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 \) (j= 1, 2, …, t). By Eq. (7.51), the estimated PPG-ESIM index (\( {I}_P={\mathbf{b}}_P^{\prime}\mathbf{y} \)) is \( {\widehat{I}}_P={\widehat{{\mathbf{b}}^{\prime}}}_{P_1}\mathbf{y} \). The estimator of the maximized PPG-ESIM selection response, and its expected genetic gain per trait, can be denoted as \( {\widehat{R}}_P={k}_I\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{P_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{P_1}} \) and \( {\widehat{\mathbf{E}}}_P={k}_I\frac{{\widehat{\mathbf{C}}\widehat{\mathbf{b}}}_{P_1}}{\sqrt{{\widehat{{\mathbf{b}}^{\prime}}}_{P_1}{\widehat{\mathbf{P}}\widehat{\mathbf{b}}}_{P_1}}} \) respectively, whereas the estimator of the maximized accuracy of the PPG-ESIM is \( {\widehat{\lambda}}_{P_1} \).

7.3.3 Numerical Examples

We compare the results of the PPG-LPSI and the PPG-ESIM using the Akbar et al. (1984) data described earlier. We restrict traits RL and SM, on both indices using the PPG vector \( {\mathbf{d}}^{\prime }=\left[3\kern0.5em -1\right] \). In Chap. 3, Sect. 3.1.4, we indicated how to construct matrix U′ and, in Sect. 3.2.4 of the same chapter, we described how to obtain matrix \( {\widehat{\mathbf{K}}}_P \) for one and two restrictions. Matrix \( {\widehat{\mathbf{K}}}_P \) is the same for the PPG-LPSI and the PPG-ESIM. Thus, we omit the steps for constructing matrices U′ and \( {\widehat{\mathbf{K}}}_P \).

Assume a selection intensity of 10% (kI = 1.755) and that the vector of economic weights is \( {\mathbf{w}}^{\prime }=\left[19.54\kern0.5em -3.56\kern0.5em 17.01\right] \). The estimated PPG-LPSI vector of coefficients for two predetermined restrictions was \( {\widehat{\mathbf{b}}}^{\prime }=\left[1.70\kern0.5em 1.04\kern0.5em 2.93\right] \), and its estimated selection response, expected genetic gain per trait, accuracy, and heritability were \( \widehat{R}=1.755\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=49.02 \), \( {\widehat{\mathbf{E}}}^{\prime }=1.755\frac{{\widehat{\mathbf{b}}}^{\prime}\widehat{\mathbf{C}}}{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}=\left[1.25\kern0.5em -0.42\kern0.5em 1.36\right] \), \( \widehat{\rho}=\frac{\sqrt{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}}{\sqrt{{\mathbf{w}}^{\prime}\widehat{\mathbf{C}}\mathbf{w}}}=0.24 \), and \( {\widehat{h}}^2=\frac{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{C}}\widehat{\mathbf{b}}}{\widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{P}}\widehat{\mathbf{b}}}=0.12 \) respectively. In this case, \( \widehat{{\mathbf{b}}^{\prime }}\widehat{\mathbf{b}}=12.57 \); then, the estimated PPG-LPSI selection response using the normalized PPG-LPSI vector of coefficients was \( \widehat{R}=\frac{49.02}{12.57}=3.90 \), whereas the rest of the estimated PPG-LPSI parameters were the same.

In the PPG-ESIM, we need matrix \( \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}} \) to obtain the eigenvalues and eigenvectors of \( \left(\widehat{\mathbf{S}}{\widehat{\mathbf{S}}}^{\prime }-{\widehat{\mu}}_{Pj}{\mathbf{I}}_t\right){\widehat{\mathbf{b}}}_{P_j}=\mathbf{0} \) that make up matrices \( {\mathbf{L}}_P^{1/2} \), \( {\mathbf{V}}_{P_1} \), and \( \widehat{\mathbf{S}}={\mathbf{V}}_{P_1}{\mathbf{L}}_P^{1/2}{\mathbf{V}}_{P_2}^{\prime } \), where \( {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 \). It can be shown that \( \widehat{\mathbf{S}}={\widehat{\mathbf{K}}}_P{\widehat{\mathbf{P}}}^{-1}\widehat{\mathbf{C}}=\left[\begin{array}{ccc}0.1047& -0.0349& -0.0279\\ {}0.0678& -0.0226& -0.0213\\ {}-0.1970& 0.0657& 0.4119\end{array}\right] \), \( \widehat{\mathbf{S}}\widehat{{\mathbf{S}}^{\prime }}=\left[\begin{array}{ccc}0.0130& 0.0085& -0.0344\\ {}0.0085& 0.0056& -0.0236\\ {}-0.0344& -0.0236& 0.2118\end{array}\right] \), and \( {\mathbf{V}}_{P_1}=\left[\begin{array}{ccc}-0.1663& 0.8292& 0.5336\\ {}-0.1138& 0.5214& -0.8457\\ {}0.9795& 0.2014& -0.0076\end{array}\right] \), whereas the \( {\widehat{\mu}}_{Pj}={\widehat{\lambda}}_{P_j}^4 \)values were 0.2214, 0.0099, and 0.0, whence \( {\mathbf{L}}_P^{1/2}=\left[\begin{array}{ccc}0.4705& 0& 0\\ {}0& 0.0997& 0\\ {}0& 0& 0.0\end{array}\right] \). Thus, \( {\widehat{\mu}}_{P_1}={\widehat{\lambda}}_{P_1}^4=0.2214 \), \( {\widehat{\lambda}}_{P_1}^2=0.4705 \), and the estimated maximized PPG-ESIM accuracy was \( {\widehat{\lambda}}_{P_1}=0.6859 \).

We transformed the first eigenvector \( {\widehat{\mathbf{b}}}_{p_1}^{\prime }=\left[-0.1663\kern0.5em -0.1138\kern0.5em 0.9795\right] \) using matrix \( \mathbf{F}=\left[\begin{array}{ccc}-9& 0& 0\\ {}0& 1& 0\\ {}0& 0& 1\end{array}\right] \) to obtain vector \( {\widehat{\boldsymbol{\upbeta}}}_P={\widehat{\mathbf{b}}}_{p_1}^{\prime}\mathbf{F}=\left[1.4968\kern0.5em -0.1138\kern0.5em 0.9795\right] \) and \( {\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\boldsymbol{\upbeta}}}_P=3.21 \), whence the estimates of the index, the selection response, and expected genetic gain per trait of the PPG-ESIM were \( {\widehat{I}}_P=1.4968\mathrm{RL}-0.1138\mathrm{SM}+0.9795\mathrm{EW} \), \( {\widehat{R}}_P=\frac{1.755\sqrt{{\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}_P}}{{\widehat{\boldsymbol{\upbeta}}}_P^{\prime }{\widehat{\boldsymbol{\upbeta}}}_P}=\frac{43.01}{3.21}=13.39 \), and \( {\widehat{\mathbf{E}}}_P^{\prime }=1.755\frac{{\widehat{\boldsymbol{\upbeta}}}_p^{\prime}\widehat{\mathbf{C}}}{\sqrt{{\widehat{\boldsymbol{\upbeta}}}_p^{\prime }{\widehat{\mathbf{P}}\widehat{\boldsymbol{\upbeta}}}_p}}=\left[3.05\kern0.5em -1.96\kern0.5em 0.19\right] \) respectively. The estimated PPG-LPSI selection response was \( \widehat{R}=\frac{49.02}{12.57}=3.90 \), which means that the estimated PPG-ESIM selection response was greater than the estimated PPG-LPSI response.

We compared PPG-ESIM efficiency versus LPSI efficiency to predict the net genetic merit using the ratio of the estimated PPG-ESIM accuracy (\( {\widehat{\lambda}}_{P_1}=0.6859 \)) to PPG-LPSI accuracy (\( \widehat{\rho}=0.24 \)), i.e., \( \frac{{\widehat{\lambda}}_{P_1}}{\widehat{\rho}}=\frac{0.6859}{0.24}=2.858 \) or, in percentage terms, \( {\widehat{p}}_P=100\left(2.858-1\right)=185.80 \). Then, the PPG-ESIM was a better predictor of the net genetic merit and its efficiency was 185.80% higher than that of the PPG-LPSI for this data set.

Now, we compare PPG-ESIM efficiency versus PPG-LPSI efficiency using the data set described in Sect. 2.8.1 of Chap. 2 for five phenotypic selection cycles, each with four traits (T1, T2, T3, and T4), 500 genotypes, and four replicates for each genotype. The economic weights for T1, T2, T3, and T4 were 1, −1, 1, and 1 respectively. For this data set, matrix F was an identity matrix of size 4 × 4 for all five selection cycles.

The first and second parts of columns 6, 7, and 8 in Table 7.1 present the estimated PPG-LPSI and PPG-ESIM selection responses for one, two, and three predetermined restrictions for five simulated selection cycles. The selection intensity was 10% (kI = 1.755) and the vectors of PPG for each predetermined restriction were \( {\mathbf{d}}_1^{\prime }=7 \), \( {\mathbf{d}}_2^{\prime }=\left[7\kern0.5em -3\right] \), and \( {\mathbf{d}}_3^{\prime }=\left[7\kern0.5em -3\kern0.5em 5\right] \) respectively, for all five selection cycles. The estimated PPG-LPSI selection response when the vector of coefficients was not normalized was presented in Chap. 3 (Table 3.5). The averages of the estimated PPG-LPSI selection response for each predetermined restriction were 4.70, 4.91, and 3.14, whereas the averages of the estimated PPG-ESIM selection response were 6.31, 6.28, and 6.75 respectively. These results indicate that the estimated PPG-ESIM selection response was greater than the estimated PPG-LPSI selection response for all predetermined restrictions.

The second part of Table 7.2 presents the estimated PPG-ESIM accuracy (\( {\widehat{\rho}}_P \)) and the ratio of \( {\widehat{\rho}}_P \) to the estimated PPG-LPSI accuracy (\( \widehat{\rho} \)), expressed in percentage terms, \( {\widehat{p}}_P=100\left({\widehat{\lambda}}_P-1\right) \), where \( {\widehat{\lambda}}_P={\widehat{\rho}}_P/\widehat{\rho} \), for one, two, and three predetermined restrictions for five simulated selection cycles. The estimated PPG-LPSI accuracies were presented in Chap. 3 (Table 3.6). The average estimated PPG-ESIM efficiency for each restriction was 9.76%, 11.71%, and 29.03% greater than the PPG-LPSI efficiency for this data set in all five selection cycles.

The second part of Table 7.3 presents the estimated PPG-ESIM expected genetic gain per trait for one, two, and three predetermined restrictions for five simulated selection cycles. The estimated PPG-LPSI expected genetic gains per trait for one, two, and three predetermined restrictions were presented in Chap. 3, Table 3.5, where it can be seen that the averages of the estimated PPG-LPSI expected genetic gains per trait for five simulated selection cycles were 6.85, −3.25, 2.62 and 1.48 for one restriction; 6.93, −2.97, 2.65 and 1.45 for two restrictions; and 5.20, −2.23, 3.72 and 1.43 for three restrictions, whereas for the same set of restrictions, the averages of the estimated PPG-ESIM expected genetic gain per trait were 5.67, −2.67, 1.81, and 2.97 for one restriction; 5.89, −2.52, 2.04, and 2.83 for two restrictions; and 5.71, −2.45, 4.08, and 0.82 for three restrictions (Table 7.3). Because the vectors of predetermined proportional gains for each predetermined restriction were \( {\mathbf{d}}_1^{\prime }=7 \), \( {\mathbf{d}}_2^{\prime }=\left[7\kern0.5em -3\right] \), and \( {\mathbf{d}}_3^{\prime }=\left[7\kern0.5em -3\kern0.5em 5\right] \), the averages of the estimated PPG-LPSI expected genetic gains per trait were closer than those of the estimated PPG-ESIM expected genetic gains per trait for one and two predetermined restrictions, whereas for three restrictions, the results of both selection indices were similar.