Fungible weights are sets of alternative, suboptimal weights that may be used to examine parameter sensitivity in multiple linear regression models by way of a minor decrement in \( R^2 \). Each set of weights yields the same value of \( R^2 \), as well as the same correlation between the ordinary least squares (OLS) and alternative predicted values (Waller, 2008). The degree of discrepancy between sets of fungible weights and the OLS weights is independent of sample size, and their behavior is determined by factors other than those that determine the confidence interval for regression weights (Jones, 2013; Pek, Chalmers, & Monette, 2016). Under some conditions, even very small decrements in \( R^2 \) can be associated with substantially different regression weights for the predictor variables. In those cases, the weights are considered sensitive and provide a poor basis for scientific conclusions (Green, 1977), and it is therefore prudent to ensure that the weights obtained in a study are not too sensitive.

The uncertainty of parameter estimates we focus on here is not the uncertainty that stems from sampling variation, but uncertainty stemming from possible model inaccuracies, which are rarely known in practice. Although the two types of uncertainty are related (Pek et al., 2016), tests for one cannot serve as tests for the other. Methods of examining parameter sensitivity can be understood as relaxing the degree of certainty about the model’s accuracy (rather than relaxing the degree of certainty about precision, as with standard errors and confidence intervals) and as an acknowledgment that all models are wrong and almost necessarily biased (Box, 1976; Edwards, 2013), due to, for instance, missing variables, incorrect error terms, missing (interaction) terms, or ignored measurement error (Jaccard & Wan, 1995). Although the sources of bias may cancel each other out to some degree, in general it can be expected that more inaccurate models will also be more biased. Here we will make use of predictor-specific fungible weight intervals (FIs) to indicate the sensitivity of parameter estimates to possible model inaccuracy, in analogy to how confidence intervals (CIs) reflect the imprecision stemming from sampling variation. The two sources of uncertainty are different, and as will be shown, the two types of intervals are different, as well.

Before continuing, we wish to make clear that weights that yield a lower value of \( R^2 \) are not necessarily less accurate. On the contrary, the accurate weights will yield a lower value of \( R^2 \) if the model is incorrect. The estimated parameters in a linear regression model are optimal weights given the data and the model, but these estimates are unlikely to reflect the true effects associated with each variable if the model is not completely accurate (i.e., if it is not the “true model”). Examining the weights associated with a decrement in \( R^2 \) obviously does not guarantee recovery of the true effects, but it nonetheless provides an opportunity to investigate the potential consequences of an inaccurate model on the regression effect estimates, without being limited to any specific type of model inaccuracy.

Suppose that model A is an accurate model and model B is an inaccurate model that omits one or more predictors. If one were to use the true parameter values from model A in model B, the result would be a value of \( R^2 \) lower than that obtained with OLS estimation, and the accurate, unbiased parameters from model A would seem inferior to the OLS estimates. For example, suppose that the true model A includes a set of four predictor variables and a criterion variable with all rs = .4, so that all βs = .182 and \( R^2 \) = .291. If a predictor is excluded in the inaccurate model B, then the OLS estimates for the remaining three predictors would all be βs = .222 and \( R^2 \) = .267. However, if the true regression weights from model A were used in model B, then \( R^2 \) = .218, which is clearly inferior to the .267 obtained with OLS. Similarly, when the true model A is a model with five predictor variables and all rs = .4, then \( R^2 \) = .308 and all βs = .154. However, if only three of the five predictors are used in model B, the OLS estimates are again βs = .222 and \( R^2 \) = .267, whereas the correct values (from model A) used in model B yield an even larger reduction, to \( R^2 \) = .185. Counterintuitively, then, optimal weights are not necessarily correct weights. Whereas these examples are calculated from given true weights and known violations in an incorrect model, in practice the calculation of the sensitivity of weights is a form of reverse calculation that does not guarantee recovery of the true weights unless the specific violations are known (and we do not claim here that fungible weights may be used to recover the true model, as there has been insufficient research to make such a claim). Instead, consideration of alternative weights provides a general indication of how much the obtained parameters may reflect bias due to model violations of any form.
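
The arithmetic of these examples can be checked directly. The following is a minimal R sketch of our own (not code from the appendices); it credits a weight vector w with variance explained w′rxy, which equals the usual \( R^2 \) for OLS weights and corresponds to the definition of \( {R}_a^2 \) given in the next section, because \( {\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{b}={\boldsymbol{r}}_{\boldsymbol{xy}} \) for the OLS weights b:

```r
# Variance explained credited to a weight vector w (equals a'Rxx b, since Rxx b = rxy).
r2_for <- function(w, rxy) sum(w * rxy)

p <- 4; rho <- .4                   # model A: four predictors, all rs = .4
Rxx <- matrix(rho, p, p); diag(Rxx) <- 1
rxy <- rep(rho, p)
b_A <- solve(Rxx, rxy)              # all betas = .182
r2_for(b_A, rxy)                    # R^2 = .291

RxxB <- Rxx[1:3, 1:3]               # model B: one predictor dropped
rxyB <- rxy[1:3]
b_B <- solve(RxxB, rxyB)            # all betas = .222
r2_for(b_B, rxyB)                   # R^2 = .267
r2_for(b_A[1:3], rxyB)              # R^2 = .218: the true weights appear inferior

# Rerunning with p <- 5 gives the five-predictor variant: betas = .154,
# R^2 = .308, and R^2 = .185 for the true weights used in model B.
```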

Fungible regression weights

Fungible regression weights are alternative weights whose predicted criterion values have a prespecified correlation with the OLS predicted values while yielding an identical, suboptimal value of \( R^2 \)—thus, the term fungible weights. We denote the OLS vector of weights as b, and the vector of alternative weights as a. Similarly, we denote the OLS value of \( R^2 \) as \( {R}_b^2 \) and the fungible value as \( {R}_a^2 \), the predicted values from the OLS weights as \( {\widehat{y}}_b \), the predicted values from the fungible weights as \( {\widehat{y}}_a \), and the prespecified correlation between the two as \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \). For two predictor variables, there are exactly two sets of alternative weights that satisfy the constraints, but for three or more predictors, there are an infinite number of alternative weights that do so (fungible weights are not defined for a single predictor). The full mathematical derivation of fungible weights may be found in Waller (2008), and a means to identify the minimally and maximally discrepant weight sets (i.e., the fungible extrema) may be found in Waller and Jones (2009). Fungible weights for logistic regression may be found in Jones and Waller (2016), and Lee, MacCallum, and Browne (2018) developed fungible parameters for structural equation modeling. Alternative weights that yield the same value of \( {R}_a^2 \) but do not satisfy the prespecified correlation are known as exchangeable weights (Pek et al., 2016).

Although we wish to minimize the use of equations throughout the present article in order to reach a broader audience, a brief explanation of the geometry of fungible weights will be useful for the discussion of our results. Geometrically, fungible weight sets with three or more predictors lie at the intersection of a p – 1 dimensional (hyper)plane and a p-dimensional (hyper)ellipsoid. The intersection is a p – 1 dimensional ellipse or (hyper)ellipsoid. With two predictors, the weight sets are the two points at which a line intersects an ellipse. The intersection is elliptical because each weight is a tightly constrained function of the others, through the prespecified correlation \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) (Waller & Jones, 2009). The (hyper)plane is characterized by the set of weight vectors that satisfy the constraint that they yield the same value of \( {R}_a^2 \)—specifically,

$$ {R}_a^2={\boldsymbol{a}}^{\prime }{\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{b} $$

and the (hyper)ellipsoid is characterized by the set of weight vectors that satisfy

$$ {R}_a^2={\boldsymbol{a}}^{\prime }{\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{a} $$

where Rxx is the predictor correlation matrix (Jones & Waller, 2016; Waller & Jones, 2009). Fungible weights are those that satisfy both equations.
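
To make the two constraints concrete, the following minimal R sketch (our own illustration, not Waller’s published function) draws weight vectors satisfying both equations. Working in the metric of \( {\boldsymbol{R}}_{\boldsymbol{xx}}^{1/2} \), the constraints together imply \( {R}_a^2={r}_{{\widehat{y}}_a{\widehat{y}}_b}^2{R}_b^2 \) and place the transformed fungible vectors on a cone around the transformed OLS vector:

```r
# Sketch: sample vectors a with a'Rxx b = R_a^2 (plane) and a'Rxx a = R_a^2 (ellipsoid).
fungible_weights <- function(Rxx, rxy, ryayb = .99, sets = 1000) {
  b   <- solve(Rxx, rxy)                # OLS weights
  R2b <- sum(b * rxy)                   # = b'Rxx b
  R2a <- ryayb^2 * R2b                  # implied by the two constraints
  e   <- eigen(Rxx, symmetric = TRUE)
  G   <- e$vectors %*% diag(sqrt(e$values)) %*% t(e$vectors)  # Rxx^(1/2)
  u   <- c(G %*% b); u <- u / sqrt(sum(u^2))                  # transformed OLS direction
  a   <- matrix(NA, sets, length(b))
  for (i in seq_len(sets)) {
    z  <- rnorm(length(b))
    z  <- z - sum(z * u) * u            # random direction orthogonal to u
    z  <- z / sqrt(sum(z^2))
    at <- sqrt(R2a) * (ryayb * u + sqrt(1 - ryayb^2) * z)     # point on the cone
    a[i, ] <- solve(G, at)              # back to the original weight space
  }
  list(a = a, b = b, R2a = R2a, R2b = R2b)
}
```

Each row of a then satisfies both equations up to machine precision; with two predictors, the draws simply alternate between the two admissible weight pairs.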

We will refer to the ellipsoid as the all-possible-regressions ellipsoid, because for a given \( {R}_a^2 \) and predictor matrix, an infinite number of possible weight sets will yield the same \( {R}_a^2 \) (this is also true for \( {R}_b^2 \); Waller & Jones, 2011). Parameter sensitivity is evaluated on the basis of how tightly the sets of weights cluster around the parameters associated with optimal model fit, with tighter sets of weights providing a stronger basis for inferences from the OLS estimates. In other words, smaller ellipses indicate less sensitive parameters.

An example of fungible weights is shown in Fig. 1 with three predictor variables, with the variance explained for the OLS estimates being \( {R}_b^2 \) = .647. Given \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90, .95, and .99, the resultant alternative values are \( {R}_a^2 \) = .524, .584, and .634, respectively. The single dot represents the OLS estimates, and the ellipses represent the fungible weight sets for a given \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) value. As can be seen, there are large discrepancies between the OLS estimates and some of the fungible weights. In the case of \( {\beta}_{X_1} \), the OLS weight is .073 and is significant for N = 450, whereas the fungible weights include both positive and negative weights for all shown values of \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \). For \( {\beta}_{X_2} \), the OLS weight is .239 and is significant for samples as small as N = 50, yet for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90, the fungible weight sets also include negative weights.

Fig. 1

Fungible weights based on the case that \( {r}_{X_1{X}_2} \) = .1, \( {r}_{X_1{X}_3} \) = .2, \( {r}_{X_2{X}_3} \) = .3, \( {r}_{X_1Y} \) = .2, \( {r}_{X_2Y} \) = .4, and \( {r}_{X_3Y} \) = .6, with associated regression weights of \( {\beta}_{X_1} \) = .073, \( {\beta}_{X_2} \) = .239, and \( {\beta}_{X_3} \) = .514. The variance explained for the OLS estimates is \( {R}_b^2 \) = .647, and \( {R}_a^2 \) = .524, .584, and .634, for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90, .95, and .99, respectively
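
The OLS weights in this example follow directly from the stated correlations; a brief check in R (the weights are standardized because the inputs are correlations):

```r
# b = Rxx^{-1} rxy for the Fig. 1 correlations.
Rxx <- matrix(c(1, .1, .2,
                .1, 1, .3,
                .2, .3, 1), nrow = 3)
rxy <- c(.2, .4, .6)
round(solve(Rxx, rxy), 3)   # .073 .239 .514
```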

Although it is easy to see that the sizes of the fungible weight ellipses differ for each predictor in this example, it is not immediately apparent what factors other than the arbitrary value of \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) might contribute to the fungible ellipses. Several factors that affect fungible weights are detailed in Jones (2013). We distinguish these factors as being related to the predictor correlation matrix, to the predictor–criterion correlation vector, or to both. Factors related to the predictor correlation matrix may be understood as a matter of multicollinearity. Specifically, the eigenvalues of Rxx determine the shape of the ellipsoid, and the orientation of the ellipsoid is determined by the eigenvectors. As multicollinearity increases, the first eigenvalue will increase and the last eigenvalue will decrease, approaching zero. Roughly equal eigenvalues yield a roughly (hyper)spherical ellipsoid, and the fungible weight (hyper)ellipses are then more circular. In contrast, the more discrepant the eigenvalues are, the thinner the ellipsoid becomes in at least one dimension (e.g., cigar or pancake shaped). The axes of the ellipsoid are calculated as follows:

$$ {l}_i=2\sqrt{\frac{R_a^2}{\lambda_i}} $$

where \( {l}_i \) denotes the ith axis length, and \( {\lambda}_i \) denotes the ith ordered eigenvalue of the predictor matrix (Waller & Jones, 2011). Because the eigenvalues drive these effects, measures of multicollinearity may help explain the fungible weight intervals without necessarily playing a direct role. The determinant of a matrix is equal to the product of its eigenvalues, so as the discrepancy between the first and last eigenvalues increases, the determinant decreases, because the last eigenvalue approaches zero. Similarly, the ratio of the first to the last eigenvalue is known as the condition number, another measure of multicollinearity. Additionally, the familiar variance inflation factor (VIF) tends to increase as the discrepancy between the eigenvalues increases.
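
All of these quantities are inexpensive to compute from the predictor correlation matrix; a brief sketch (the \( {R}_a^2 \) value here is an arbitrary placeholder):

```r
# Eigenvalue-based diagnostics for a predictor correlation matrix.
Rxx <- matrix(c(1, .1, .2,
                .1, 1, .3,
                .2, .3, 1), nrow = 3)
lambda <- eigen(Rxx, symmetric = TRUE, only.values = TRUE)$values
R2a <- .25                     # placeholder R_a^2
2 * sqrt(R2a / lambda)         # axis lengths l_i
prod(lambda) - det(Rxx)        # ~0: determinant = product of eigenvalues
lambda[1] / lambda[3]          # condition number
diag(solve(Rxx))               # VIFs (for standardized predictors)
```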

Factors related to the predictor–criterion correlation vector may be understood as being due to the \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) constraint for the predicted values. The orientation of the eigenvectors of Rxx with respect to the vector of standardized regression coefficients (i.e., the correlation between the two) plays a role because, as we previously mentioned, the eigenvectors determine the orientation of the ellipsoid, and the intersecting plane is defined by the set of weight vectors that satisfy \( {R}_a^2={\boldsymbol{a}}^{\prime }{\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{b} \). As a result, where the plane intersects the ellipsoid is determined by both b and Rxx, with different intersection points being associated with different curvatures and thicknesses of the ellipsoid (Jones, 2013). Weight vectors more closely related to the eigenvectors will be more closely related to the variance of the predictors. Additionally, the angle of the intersecting plane is determined by the predictor–criterion correlations, because the correlation vector rxy is orthogonal to the (hyper)plane by design (Waller & Jones, 2009), so the correlations can be expected to predict fungible weight behavior.

Finally, larger values of \( {R}_b^2 \) lead to larger ellipsoids composed of all weights that yield the same value of \( {R}_a^2 \), given the same value of \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \). To understand this, consider that just as there are an infinite number of weight vectors that satisfy the equation for the (hyper)ellipsoid, \( {R}_a^2={\boldsymbol{a}}^{\prime }{\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{a} \), there are also an infinite number of weight vectors that satisfy \( {R}_b^2={\boldsymbol{b}}^{\prime }{\boldsymbol{R}}_{\boldsymbol{xx}}\boldsymbol{b} \) (Waller & Jones, 2011). As a result, as \( {R}_b^2 \) increases, so too must the absolute value of some components of either Rxx or b. The number of predictors also affects the fungible ellipses indirectly, because the admissible predictor correlations are constrained by the number of predictors (Jones, 2013), but we do not consider this matter further here.

Here, rather than the intersection ellipse, we focus our investigation on the range of each weight separately. We will refer to this range as the fungible interval, a sort of “validity interval” analogous to the reliability interval provided by a confidence interval. We use the range per predictor for a few reasons. In addition to the ease of interpreting the difference between two values, the familiar confidence interval is likewise a range of weight values per predictor. There is also heavy bimodality in the distribution of the fungible weights, with peaks near the boundaries (Waller, 2008), so the range implies relatively little loss of information. Finally, we focus on the range rather than the fungible extrema (Waller & Jones, 2009) because the range is computationally far simpler. Fungible extrema are the two weight sets that either minimize or maximize the cosine (equivalently, the correlation) between the fungible and OLS weights. Though there is overlap between the minimally and maximally discrepant weight sets and the minimum and maximum weights for each predictor, the weight sets that include either the minimum or the maximum value of a weight are not necessarily fungible extrema, because, given three or more predictors, there are 2p² weights across the sets containing the range end points (where p denotes the number of predictors), but only 2p weights associated with the extrema.

Our purpose here, then, is to consider both how much the fungible interval can vary in size and what factors can help explain the differences, with the goal of providing a general sense of when the parameters may be sensitive to the effects of unknown model violations. Though the factors we explore are theoretically motivated, we take a relatively more applied approach in our reporting, with a preference for simple and familiar explanatory factors. We will work from correlation matrices in our studies, since they are always available to researchers, but knowledge of how the model is inaccurate is rarely so.

Fungible intervals for the two-predictor case

We begin with the two-predictor case for fungible intervals, because for two predictors there are only two weight sets. The two sets are the two end points of a line—that is, the fungible interval. In the next step we will consider the more complicated three-dimensional, three-predictor case, from which we can draw some possible generalizations to even larger predictor sets.

Method

For this study, the possible correlations between the two predictor variables and the criterion variable were – .5, – .4, – .3, – .2, – .1, 0, .1, .2, .3, .4, and .5. Because there are three correlations in a three-variable correlation matrix, this resulted in 11³ = 1,331 matrices. We follow the example of confidence intervals and use the following criterion values: \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90, .95, and .99. These criterion values result in relatively modest drops in variance explained, because the two fungible weight constraints together imply \( {R}_a^2={r}_{{\widehat{y}}_a{\widehat{y}}_b}^2{R}_b^2 \). For example, for a value of \( {R}_b^2 \) = .25, the resultant \( {R}_a^2 \) values would be .203, .226, and .245, respectively.

For each matrix (all combinations here produce valid correlation matrices, although some—e.g., all rs = – .5 and \( {R}_b^2 \) = 1—are extremely unlikely and may be computationally difficult in practice), we calculated the OLS weights and the two fungible weight pairs for each \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) value. All weights derived are standardized weights. Additionally, to provide some sense of the magnitude of the fungible intervals and provide a point of comparison, we also calculated 95% confidence intervals based on N = 100. To calculate the two sets of alternative weights, we used the R function provided in Waller (2008), with a small modification to allow estimation with two predictors.
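
The grid itself can be enumerated in a few lines. The sketch below uses the fungible_weights() function sketched earlier (not Waller’s original code) with a small number of sets per matrix; the degenerate cells in which both predictor–criterion correlations are zero (so that \( {R}_b^2 \) = 0 and \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) is undefined) are excluded up front:

```r
# Sketch of the two-predictor design: 11 values per correlation, 11^3 combinations.
vals <- round(seq(-.5, .5, by = .1), 1)
grid <- expand.grid(r12 = vals, r1y = vals, r2y = vals)
grid <- subset(grid, r1y != 0 | r2y != 0)      # drop degenerate R_b^2 = 0 cells
fi <- apply(grid, 1, function(g) {
  Rxx <- matrix(c(1, g["r12"], g["r12"], 1), nrow = 2)
  fw  <- fungible_weights(Rxx, c(g["r1y"], g["r2y"]), ryayb = .99, sets = 50)
  apply(fw$a, 2, function(w) diff(range(w)))   # fungible interval per predictor
})
```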

Results and discussion

Comparison with confidence intervals

As with confidence levels and interval size, lower \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) values result in larger intervals than do higher \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) values. These results are to be expected because, generally speaking, increasingly discrepant predictions necessitate increasingly discrepant variable weights. The minimum and maximum interval sizes for \( {\beta}_{X_1} \) and \( {\beta}_{X_2} \) were identical: For each criterion correlation value, the largest intervals were 0.453, 0.343, and 0.161, for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) values of .90, .95, and .99, respectively, and the smallest intervals were equal to zero—that is, the fungible weights were equal to the OLS weights. For comparison, the 95% confidence intervals ranged in size from .179 to .465. The magnitudes of each predictor’s fungible intervals were perfectly correlated across all three criterion values.

Figure 2 shows the end points for the two types of intervals. Since the plot for \( {\beta}_{X_2} \) is identical, we hereafter use the subscript i to denote a specific predictor, and the subscript i* to denote the other. The OLS point estimates and confidence interval end points are shown in black. The fungible interval end points are shown for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90 and .99, in dark and light gray, respectively. The end points for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90 overlap with those for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99. The end points for .95 lie between these two sets of end points, and so are not shown. Both intervals are symmetrical around a set of weights. Specifically, the confidence intervals are symmetrical about the OLS weights, whereas the fungible interval end points are symmetrical about the transformed OLS weights (not shown) that are at the center of the fungible ellipse. These transformed weights are equal to \( \frac{R_a^2}{R_b^2}b \) (Waller & Jones, 2009). Because these weights are necessarily closer to 0 than the OLS weights, so too are the fungible interval end points.
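
This symmetry is easy to verify numerically; a quick check with an arbitrary two-predictor matrix, again assuming the fungible_weights() sketch from above:

```r
# Interval midpoints coincide with the ellipse center (R_a^2 / R_b^2) b.
Rxx <- matrix(c(1, .3, .3, 1), nrow = 2)
fw  <- fungible_weights(Rxx, c(.4, .2), ryayb = .90, sets = 200)
(apply(fw$a, 2, min) + apply(fw$a, 2, max)) / 2   # midpoints of the intervals
(fw$R2a / fw$R2b) * fw$b                          # = .81 * b, the center
```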

Fig. 2

End points of the confidence and fungible intervals for two predictors, plotted against the OLS point estimates. Confidence intervals are based on N = 100. The values of \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) used here are .90 (dark gray) and .99 (light gray). Some end points for \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) = .90 are overlapped by the points for \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) = .99

Figure 3 shows the magnitude of the intervals (the difference between the lowest and highest weights), plotted in relation to the OLS values of \( {\beta}_{X_i} \). We use \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99 in the figure in order to minimize overlap between the confidence and fungible intervals; smaller criterion values simply result in increased distance between points without affecting the overall pattern. It is immediately apparent that the fungible and confidence interval magnitudes follow quite different patterns. Larger values of \( {\beta}_{X_i} \) are generally associated with tighter confidence intervals, but the fungible interval magnitudes are almost completely unrelated to the value of \( {\beta}_{X_i} \). Additionally, whereas the confidence intervals cluster in a single arcing swath, the fungible intervals cluster in six groupings. Of note is that these clusters are not equally diffuse. From bottom to top, the first cluster includes entirely zero-magnitude intervals, and each subsequent cluster is increasingly diffuse. Considering the conditions in this study, this suggests that the absolute magnitude of one of the correlations may be the primary predictor of the fungible interval size, and that it interacts with at least one other variable.

Fig. 3

Fungible interval (FI) (\( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) = .99) and 95% confidence interval (CI) magnitudes plotted against the value of \( {\beta}_{X_i} \). Confidence intervals are based on N = 100

Explanatory factors for fungible interval size

Because we are using regressions to describe regressions, using the same regression terminology (e.g., predictor and criterion, or (in)dependent variables) for both can easily become confusing. For the regressions associated with fungible weights, we use the predictor and criterion terminology to refer to the variables involved, and for the regressions to explain the fungible interval size, we use the terms explanatory and explained variable(s).

We considered the following explanatory variables, in both pairs and trios, with all interactions included: \( \left|{r}_{X_iY}\right| \), \( \left|{r}_{X_{i\ast }Y}\right| \), \( \left|{r}_{X_1{X}_2}\right| \); VIF (there is only one value with two predictors); the determinant, condition number, and eigenvalues of the predictor matrix; the OLS regression weights; and \( {R}_b^2 \). We also considered the correlations (direction cosines) between the predictor matrix eigenvectors and the OLS weight vectors, and the axes of the ellipse (what would be the all-possible-regressions ellipsoid for three predictors). The explanatory variable weights we report are for standardized explanatory variables, but the explained variable—the fungible interval—is not standardized. We take this approach because standardization of the explanatory variables eases interpretation of the coefficients by making them comparable within the same analysis, whereas standardization of the explained variable would mask differences in the widths of the fungible intervals associated with values of \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \). All weights are exact, since no sampling was involved in this study (the confidence intervals were calculated directly from the standard errors based on correlation matrices and the assumed N = 100).

In contrast to the complicated, multifactorial determination of the shape and orientation of the (hyper)ellipsoid and orientation of the intersecting (hyper)plane (Jones, 2013) for three or more predictors, we found that the range for a given predictor’s fungible interval in the two-predictor case is very simply determined. As can be seen in Table 1, the magnitude of the \( {\beta}_{X_i} \) fungible interval is almost completely explained by \( \left|{r}_{X_{i\ast }Y}\right| \). This single variable is sufficient to yield \( {R}_b^2 \) = .990. By including VIF in the regression, the result is a model with \( {R}_b^2 \) = .997; all weights in these models were positive. With the addition of the interaction term, the result is \( {R}_b^2 \) = 1, so these three terms perfectly explain the range for the two-predictor case. We also briefly note that the combination of \( \left|{r}_{X_{i\ast }Y}\right| \) and the second axis of the ellipse resulted in \( {R}_b^2 \) = .992. We will revisit this point in the three-predictor case. Figure 4 illustrates the relationship between \( \left|{r}_{X_{i\ast }Y}\right| \) and the range of \( {\beta}_{X_i} \), with the magnitude of the fungible interval increasing with values of \( \left|{r}_{X_{i\ast }Y}\right| \). The increasing spread reflects the interaction between \( \left|{r}_{X_{i\ast }Y}\right| \) and VIF.
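
This meta-regression is straightforward to reproduce. A sketch, assuming the grid and fi objects from the Method sketch above; with two standardized predictors, \( VIF=1/\left(1-{r}_{X_1{X}_2}^2\right) \):

```r
# Explain the beta_1 fungible interval by |r_X2Y| and VIF (explanatory variables
# standardized via scale(); the explained FI left unstandardized, as in the text).
d <- data.frame(FI      = fi[1, ],
                r_other = abs(grid$r2y),
                VIF     = 1 / (1 - grid$r12^2))
summary(lm(FI ~ scale(r_other) * scale(VIF), data = d))$r.squared   # ~1
```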

Table 1 Regression weights of explanatory variables for fungible intervals in the two-predictor case
Fig. 4

Magnitudes of the \( {\beta}_{X_i} \) fungible intervals in the two-predictor case, plotted against \( \left|{r}_{X_{i\ast }Y}\right| \). The increased spread of the interval magnitudes as \( \left|{r}_{X_{i\ast }Y}\right| \) increases is because of its interaction with VIF

The results presented here may be understood by considering that satisfying the \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) constraint requires that when the weight of one predictor goes up, the weights of the others go down. This is exactly the case for two predictors, and thus for the two fungible weight pairs; for more predictors, it follows from the elliptical form. The results in Fig. 4 show that the fungible interval for the weight of one predictor increases with the correlation between the other predictor and the criterion variable, and slightly more so if the VIF is high. Varying the weight of a predictor has fewer consequences for the prediction when the other variable is highly correlated with the criterion variable, and can therefore compensate for the weight changes, so there is more freedom for the weight to move. Consistent with this, when \( \left|{r}_{X_{i\ast }Y}\right| \) is equal to 0, the magnitude of the fungible interval for \( {\beta}_{X_i} \) is also equal to 0. Additionally, the intersecting plane is, by design, orthogonal to the predictor–criterion correlation vector (Waller & Jones, 2009, p. 594). In other words, the angle of the plane and the resultant intersection ellipse are entirely determined by the predictor–criterion correlations. It follows that if the correlation of one of the two predictors with the criterion is zero, then the fungible interval for the other predictor is zero, as can be seen in Fig. 1 of Waller and Jones (2009, p. 592). Our finding that the fungible interval shrinks to zero when the other correlation is zero confirms the geometric analysis in Waller and Jones (2009).

When X1 and X2 are highly correlated (high VIF), then the range is somewhat larger, because then one weight can better compensate for the other. The higher the correlation between the two predictors, the less it matters if one weight is increased at the cost of another, but this compensation is far from the primary factor determining the interval size. If X1 is highly related to Y and X2 is not related, then decreasing \( {\beta}_{X_1} \) and increasing \( {\beta}_{X_2} \) will have a large detrimental effect, because giving X2 a larger weight adds noise to the prediction, leaving almost no freedom for \( {\beta}_{X_1} \) to change while still satisfying \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \), regardless of multicollinearity. The reverse is true if X2 is highly correlated with Y but X1 is not: Decreasing the value of \( {\beta}_{X_2} \) and increasing the value of \( {\beta}_{X_1} \) will, for the most part, simply add noise. It appears, then, that if there is a good alternative predictor, it does not matter too much which one does the predictive work, and this is slightly more the case if the two predictors are highly correlated.

Fungible weights with three predictors

Method

For this study, the possible values for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) were again .90, .95, and .99. We limited ourselves to rs = – .5, – .3, – .1, 0, .1, .3, and .5 to keep the number of conditions computationally manageable. In a four-variable system there are six correlations, so in this case, with seven possible values, 7⁶ = 117,649 matrices were generated. Of these, 109,129 were valid correlation matrices, and for each we derived the OLS regression weights and confidence intervals based on N = 100, and calculated 1,000 fungible weight trios using Waller’s (2008) R function. These 1,000 sets were sufficient to recover the shape of the fungible weight ellipse as well as any trends, with only a minor loss in precision.
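
The validity screen amounts to a positive (semi)definiteness check on each candidate matrix; a sketch (the exact count of valid matrices may depend on the tolerance applied at the boundary):

```r
# Enumerate the 7^6 = 117,649 candidate matrices and flag the valid ones.
vals  <- c(-.5, -.3, -.1, 0, .1, .3, .5)
grid3 <- expand.grid(r12 = vals, r13 = vals, r23 = vals,
                     r1y = vals, r2y = vals, r3y = vals)
is_valid <- apply(grid3, 1, function(g) {
  R <- diag(4)                          # variable order: X1, X2, X3, Y
  R[upper.tri(R)] <- g[c("r12", "r13", "r23", "r1y", "r2y", "r3y")]
  R[lower.tri(R)] <- t(R)[lower.tri(R)]
  min(eigen(R, symmetric = TRUE, only.values = TRUE)$values) >= 0
})
sum(is_valid)                           # the text reports 109,129 valid matrices
```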

Results and discussion

Comparison with confidence intervals

Figure 5 shows the end points for both the confidence and fungible intervals with three predictors, plotted in relation to \( {\beta}_{X_1} \) (the plots for \( {\beta}_{X_2} \) and \( {\beta}_{X_3} \) are identical). As in the two-predictor case, the confidence intervals are symmetric about the OLS point estimates, and the fungible intervals are symmetric about the \( \frac{R_a^2}{R_b^2}b \) transformed weights. The OLS point estimates and confidence interval end points are shown in black. The fungible interval end points are those for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90 and .99 and are again shown with dark and light gray, respectively. The end points for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90 overlap with those for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99, and we again do not show the points for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .95, because they lie between the two criterion values shown. Unlike in the previous study with two predictors, here the points for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99 overlap with the 95% confidence interval end points.

Fig. 5

End points of the confidence and fungible intervals for three predictors, plotted against the OLS point estimates, shown in black. Confidence intervals are for N = 100. The values of \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) used here are .90 (dark gray) and .99 (light gray). Some interval end points associated with \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) = .90, as well as for the 95% confidence intervals, overlap with the points for \( {r}_{{\widehat{\mathrm{y}}}_{\mathrm{a}}{\widehat{\mathrm{y}}}_{\mathrm{b}}} \) = .99

Figure 6 shows the magnitude of the fungible intervals (the difference between the lowest and highest bounds) plotted in relation to \( {\beta}_{X_1} \) (the plots for \( {\beta}_{X_2} \) and \( {\beta}_{X_3} \) are again identical). We use \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99 to minimize overlap of the points. Points are shaded according to whether both values of \( \left|{r}_{X_{i\ast }Y}\right| \) are equal to 0 or at least one is equal to .1, .3, or .5, with darker points representing larger values. The minimum and maximum interval sizes for each β were identical: the largest intervals were 1.241, 0.938, and 0.441, for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) values of .90, .95, and .99, respectively, and the smallest intervals were equal to zero—that is, no variability in the weights. For comparison, the 95% confidence intervals ranged in size from .048 to .757. Fungible intervals of size zero occurred when all other predictors were uncorrelated with the criterion, and the magnitudes of a given weight’s fungible intervals were perfectly correlated across all three criterion values.

Fig. 6

Fungible interval (FI) (\( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99) and 95% confidence interval (CI) magnitudes plotted against the value of \( {\beta}_{X_i} \). Darker triangles indicate increasing values of at least one \( \left|{r}_{X_{i\ast }Y}\right| \). As can be seen, as the correlation magnitude increases, so too does the minimum magnitude for the FIs

The fungible and confidence intervals again followed different patterns, similar to those observed in the two-predictor case. Larger values of β are generally associated with tighter confidence intervals, but the fungible interval magnitudes are almost completely unrelated to the value of β. The confidence intervals again form a large cluster that slopes downward as the absolute value of β increases, and the fungible intervals cluster in four overlapping groupings that are increasingly diffuse as the other correlations increase. There are four clusters here rather than six because there were only four correlation absolute values in this study (i.e., no |.2| or |.4|).

Another result of interest from this study is that the maximum range is much larger here than in the two-predictor case: 1.241 versus 0.453 for \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .90. This difference occurs because the additional predictor affords more room for the weights to vary without strongly affecting the predicted values. The implication of this is that the more predictors there are in a regression model, the greater the potential parameter sensitivity and uncertainty regarding the correct values of individual effects. This holds true regardless of multicollinearity between the predictors.

Predictors of the fungible intervals

While the fungible interval magnitude for the two-predictor case was very simply—and all but completely—explained by the magnitude of the other predictor’s correlation with the criterion variable, \( {r}_{X_{i\ast }Y} \), an ellipse is a far more complicated shape than a line, so the other factors associated with shape and orientation (Jones, 2013) may have larger effects in this case. As a result, we again considered the following variables, in both pairs and trios, with interactions included: \( \left|{r}_{X_1Y}\right| \), \( \left|{r}_{X_2Y}\right| \), \( \left|{r}_{X_3Y}\right| \), \( {R}_b^2 \), \( {VIF}_{X_1} \), \( {VIF}_{X_2} \), \( {VIF}_{X_3} \); the determinant, condition number, and three eigenvalues of the predictor correlation matrix Rxx; the correlations between the predictor matrix eigenvectors and the OLS weights; as well as the three axes of the ellipsoid of all possible regressions given the predictor correlation matrix and a fixed \( {R}_a^2 \).
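
All of these candidate explanatory variables can be computed from \( {\boldsymbol{R}}_{\boldsymbol{xx}} \) and \( {\boldsymbol{r}}_{\boldsymbol{xy}} \) alone; a sketch for a single three-predictor case, using the Fig. 1 correlations and assuming \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99:

```r
# Candidate explanatory variables for one three-predictor correlation matrix.
Rxx <- matrix(c(1, .1, .2,
                .1, 1, .3,
                .2, .3, 1), nrow = 3)
rxy <- c(.2, .4, .6)
b   <- solve(Rxx, rxy)                       # OLS weights
e   <- eigen(Rxx, symmetric = TRUE)
R2a <- .99^2 * sum(b * rxy)                  # R_a^2 implied by r_yayb = .99
vifs     <- diag(solve(Rxx))                 # per-predictor VIFs
cond_num <- e$values[1] / e$values[3]        # condition number
axes     <- 2 * sqrt(R2a / e$values)         # axes of the all-possible-regressions ellipsoid
cosines  <- c(t(e$vectors) %*% b) / sqrt(sum(b^2))  # eigenvector-OLS direction cosines
```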

The combination that yielded the most variance explained consisted of both values of \( {r}_{X_{i\ast }Y} \) (e.g., \( \left|{r}_{X_2Y}\right| \) and \( \left|{r}_{X_3Y}\right| \) for X1) along with the third axis of the ellipsoid of all possible regressions for a given \( {R}_a^2 \), as defined above. However, this combination yielded an \( {R}_b^2 \) value that was only .013 larger than the combination of the two values of \( {r}_{X_{i\ast }Y} \) and the value of \( {VIF}_{X_i} \). The coefficient magnitudes were also similar, and the signs identical. Because the VIF is far more familiar to and easily understood by most researchers, as well as readily available in statistical software, we will discuss the model with the VIF instead of the third axis as the third explanatory variable. The results of the regression using the third axis may be found in Appendix A. The highest observed \( {R}_b^2 \) values for other combinations of three explanatory variables are also shown, along with the values for four and five explanatory variables; of note is that \( {VIF}_{X_i} \) consistently emerged as a strong explanatory variable. Our use of VIF is also, of course, in keeping with the results of the two-predictor case, easing discussion. We will, however, return to this point in the General Discussion. Table 2 shows the results for \( {\beta}_{X_1} \). Although the effects are again attributable to the values of \( \left|{r}_{X_{i\ast }Y}\right| \) in general, we explicitly reference each predictor in order to avoid confusion when discussing the interactions.

Table 2 Regression weights of explanatory variables for \( {\beta}_{X_1} \) fungible intervals in the three-predictor case

As in the previous study, the range of the fungible parameters increases with the magnitude of the correlations of the other predictors with the criterion variable. In this case, the other correlations, \( \left|{r}_{X_2Y}\right| \) and \( \left|{r}_{X_3Y}\right| \), are sufficient to yield \( {R}_b^2 \) = .778, with all weights being positive. Including their interaction yields \( {R}_b^2 \) = .839, and adding \( {VIF}_{X_1} \) results in \( {R}_b^2 \) = .898. Allowing for interactions between \( {VIF}_{X_1} \) and the two correlations results in \( {R}_b^2 \) = .910. Figure 7 displays the fungible interval for \( {\beta}_{X_1} \) as a function of \( \left|{r}_{X_2Y}\right| \) and \( \left|{r}_{X_3Y}\right| \); like Fig. 4, it shows an increase in magnitude as the absolute correlation values of the other predictors increase. Interval magnitudes are symmetric across positive and negative correlation values.

Fig. 7

Fungible interval (FI) magnitudes for \( {\beta}_{X_1} \) and \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \) = .99, plotted in relation to \( \left|{r}_{X_2Y}\right| \) and \( \left|{r}_{X_3Y}\right| \)

This pattern, with a positive effect of each correlation between the other predictors and the criterion and a negative interaction between the two correlations, means that together the two correlations have a disjunctive effect: It is sufficient for one of the two correlations to be large for the range to be large, as well. As in the two-predictor case, increasingly large correlations between the other predictors and the criterion variable allow for more variation in the weights without simply adding noise; that is, they compensate for the changes in weights, with collinear predictors providing a modest additional compensatory effect.

Function to calculate fungible interval

To facilitate the use of fungible intervals, we provide an R (R Core Team, 2018) function in Appendix B that is a wrapper for Waller’s (2008) original fungible function. The function accepts a predictor covariance matrix as input, rxx, and a vector of predictor–criterion covariances, rxy. If the input is a correlation matrix, then the reported weights will be standardized weights. Interactions must be calculated and included as separate variables in the covariance matrix. Allowing for the estimation of fungible weights with two predictors required a small modification to the original function; this modification is commented in the code. The calculation of fungible weights is otherwise unchanged (though there are some minor changes to the output provided by the function).

Two options can also be set. The first is ryayb, which reflects the desired value of \( {r}_{{\widehat{y}}_a{\widehat{y}}_b} \). This can be any value between 0 and 1, but an appropriate test of parameter sensitivity requires a relatively high value, and there is only one set of weights for the values 0 and 1. The default value for ryayb is .99. The other option is sets, which is the number of desired fungible weight sets, with each set comprising one weight per predictor. This defaults to 1,000. Smaller values will result in less precise fungible intervals. If there are only two predictors, the two weight sets will simply be repeated, and a smaller value—for example, 10—can recover the fungible intervals without excessive repeated output.

The function outputs the minimum and maximum values of the fungible weights associated with each predictor, as well as the size of the interval. The OLS weights are also provided, for ease of comparison. Additionally, the values of \( {R}_b^2 \) and \( {R}_a^2 \) are provided, as well as the size of the difference between them. An example is provided in Appendix B.
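
The Appendix B code is not reproduced here. Purely as an illustration of the interface, a rough analogue can be built on the fungible_weights() sketch given earlier; the names below mirror the description above but are hypothetical, not the actual Appendix B function:

```r
# Illustrative analogue of the wrapper (hypothetical, not the Appendix B code).
fungible_interval <- function(rxx, rxy, ryayb = .99, sets = 1000) {
  fw <- fungible_weights(rxx, rxy, ryayb, sets)
  list(intervals = data.frame(OLS = fw$b,
                              min = apply(fw$a, 2, min),
                              max = apply(fw$a, 2, max),
                              FI  = apply(fw$a, 2, max) - apply(fw$a, 2, min)),
       R2b = fw$R2b, R2a = fw$R2a, R2.drop = fw$R2b - fw$R2a)
}

# Example with the Fig. 1 correlations:
Rxx <- matrix(c(1, .1, .2, .1, 1, .3, .2, .3, 1), nrow = 3)
fungible_interval(Rxx, c(.2, .4, .6))
```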

Discussion

Our results show that the magnitude of the range of fungible weights for a given predictor, Xi, is primarily explained by the size of the correlation(s) between the other predictor variable(s) and the criterion variable, with multicollinearity contributing to the size by way of interactions with each \( \left|{r}_{X_{i\ast }Y}\right| \). Note that although multicollinearity did contribute to interval size, it did so far less than the predictor–criterion correlations, and no multicollinearity was necessary for an interval to be large. That the range is so simply explained affords a few straightforward points of discussion with high relevance for inferences made from regression analyses.

First, parameter sensitivity increases in cases when other predictors are highly correlated with the criterion variable, since they may compensate for any changes in a predictor’s weight. To what degree this will impact a researcher’s conclusions will depend, of course, on the sort of conclusions being drawn. If one wishes to draw inferences about relative predictor importance—for example, ranking—then large intervals are a concern, regardless of the value of a regression weight, particularly if the intervals overlap. However, if one is concerned only with sign and significance, then the importance of identifying sensitive weights increases as the regression weights decrease and the magnitudes of the other correlations increase. This is because the fungible interval for a given predictor is then increasingly likely to include near-zero weights, or even weights of the opposite sign. It is well known that small effects are difficult to study because tests of them tend to have lower power; a downwardly biased estimate will be even less likely to be statistically significant. Furthermore, in extreme cases, a biased estimate may be significant but have the wrong sign, as is implied by the example shown in Fig. 1.

Second, though we could not consider this in detail here, it seems that larger sets of predictors will lead to more sensitive weights, with each individual \( \left|{r}_{X_{i\ast }Y}\right| \) contributing less to the interval size. The maximum size of the fungible intervals was substantially larger in the three-predictor than in the two-predictor case (1.241 vs. 0.453), and whereas for two predictors \( \left|{r}_{X_{i\ast }Y}\right| \) was sufficient to explain 99% of the variation, for three predictors each \( \left|{r}_{X_{i\ast }Y}\right| \) explained only about 39%, for a total of 78%. A negative interaction between these two variables explained another 6%, for a total of 84%. This suggests that increasing the number of predictors has a cumulative effect on interval size, by way of increasingly smaller and nonlinear effects of each individual \( \left|{r}_{X_{i\ast }Y}\right| \). A perfect disjunctive relationship would mean that one other predictor with a high correlation would be sufficient. Because researchers generally use more than three predictors (e.g., control variables), it will be important for future research to explore the behavior of fungible intervals in relation to the number of predictors.

Third, as is to be expected, predictor multicollinearity results in less trustworthy weights. This effect was, however, surprisingly small here, and no multicollinearity need be present for the weights to be sensitive. It does appear, however, that the effects of multicollinearity may increase with the number of predictors: In the two-predictor case, VIF explained only 1% of the variance beyond \( \left|{r}_{X_{i\ast }Y}\right| \), but in the three-predictor case it explained an additional 7%. It is also worth noting that although VIF is both familiar and readily available, in the three-predictor case the third, smallest axis of the all-possible-regressions ellipsoid explained 1% more variance than VIF (VIF was nonetheless a consistently strong explanatory variable; see Appendix A). It is difficult to extrapolate to a larger number of predictors on the basis of only two sets of results, but given that there are only two weight sets for the geometrically simple two-predictor case, and an infinite number for the more complex cases of three or more predictors, the smallest axis of the (hyper)ellipsoid may be a better predictor of interval size for three or more predictors. Whether or not this is the case will require additional research, but it appears that, in practice, measures of multicollinearity are largely substitutable with respect to fungible interval size (see Appendix A).

Finally, our results are of note for mediation models (where the total effect of a predictor variable X on a criterion variable Y is decomposed into a direct effect and an indirect effect involving a mediator M; Hayes, 2013). If the correlations between X and Y and M and Y are both high, then it follows that the corresponding regression weights used for the direct and indirect effects will be sensitive. Mediation models are also unusual in that multicollinearity is to some degree desirable, as a larger correlation between X and M will increase the size of the indirect effect but also result in a larger value of VIF. Estimated mediation effects are then likely to be particularly sensitive (cf. Sobel, 2008).

A limitation of our study is that we exclusively focused on the range of fungible weights per predictor. Although this simplification allowed us to illustrate the differences relative to confidence intervals and to derive some simple and clear conclusions, a summary of sensitivity that jointly considers all predictors (e.g., the area encompassed by the ellipse) might be of greater interest in other contexts, because it would provide a global summary of parameter sensitivity for all predictors considered jointly. Additionally, since we did not consider four or more predictors here, we were unable to test whether the smallest axis of the ellipsoid is a better predictor than VIF in general, nor could we explore how much the variance explained by each \( \mid {r}_{X_{\ast i}Y}\mid \) drops as the number of predictors increases.

Conclusion

Although users of regression are aware that in some sense their models may be inaccurate and that the estimates that an analysis yields may well be biased, it is difficult to know the nature of such bias, as well as any potential effects on the trustworthiness of the effects. It follows, then, that it is difficult to identify in advance the conditions under which the estimated parameters will be sensitive due to individual violations—that is, be less trustworthy, in the sense of less valid (Green, 1977). However, our results here suggest that it is not necessary to have knowledge of specific model violations in order to identify situations in which the parameters of interest will be sensitive. The potential consequences can be assessed in terms of an informative fungible interval for different values of the prespecified correlation, without assuming a specific type of model violation. Knowledge of the specific violation would, of course, give more specific indications than is possible with a fungible interval, but such knowledge is commonly not available, and if it were, then the violation could be remedied directly.

Finally, we hope that the function we provide will ease exploring parameter sensitivity for users of regression analysis. Although the fungible intervals should not be used for null hypothesis testing of regression weights, they are still useful as indications of the confidence one may have in the weights, from the viewpoint of model validity.

Author note

This study was not preregistered and makes use of no data. However, code for the simulations and analyses is available upon request to the corresponding author.