Independent signals and noise
Low \(SN{R}_{x}\), no \({u}_{y}\) scaling
The results are summarized in the top three panels of Fig. 1 and the first three columns of Table 1. The blue curves in Fig. 1 are Normal densities using the mean and variance of the OLS estimates of \(\widehat{\beta }\) and the red curves are the same from using TLS. The solid line is the distribution of \(\widehat{\beta }\) from Eq. (4), dashed is from (5) and dotted is from (6). The true value of \(\beta\) is as indicated. The first three columns of Table 1 report, for the low \(SN{R}_{x}\) case, the bias \((Mean \widehat{\beta }-true \beta )\) and empirical one-sided 95% confidence interval widths (97.5th percentile of \(\widehat{\beta }-median \widehat{\beta })\), as well as the fraction of the \(\widehat{\beta }\) values exceeding 1.0, which measures the p-value on a test of model consistency or \(\beta =1\). The latter three columns report the same for the high \(SN{R}_{x}\) case.
Table 1 Distributions of \(\beta\) with no variance scaling on y.\(N=200\)
Note that in this case attenuation bias is multiplicative whereas OVB is additive. Since the \({x}_{i}\)’s and the noise components are uncorrelated we can estimate the univariate attenuation bias factor using the standard formula \(var\left({x}_{1}\right)/(var\left({x}_{1}\right)+var\left({u}_{1}\right))\) (Wooldridge p. 311), which in the low \(SN{R}_{x}\) case is \(1/1.25=0.80\). In the topmost panel of Fig. 1, with a true value of \(\beta =0.0\), the OLS estimators from Eqs. (4) and (5) are unbiased because the attenuation effect is multiplicative, with a slight efficiency loss in the latter case. In Eq. (6) the OLS estimator is biased upwards by 0.20 due to OVB. The application of TLS causes a dramatic increase in the variances. The estimates from Eqs. (4) and (5) are unbiased, but in Eq. (6) there is a larger upward bias (+ 0.29) than in the OLS case. Consequently, the severe loss of efficiency has no payoff in terms of relative reduction of bias.
The OLS pattern changes somewhat as the true value of \(\beta\) increases to 0.50. Now Eqs. (4) and (5) exhibit the expected attenuation bias with the OLS coefficient biased downward by 0.09 in each case. But Eq. (6) still imparts a + 0.20 omitted variable bias in the other direction, with the result that the net bias becomes + 0.11 in that case. TLS remains inefficient and is now biased as well, but in a different way than OLS. Using Eqs. (4) and (5) the distribution is still very wide and the mean is displaced upward by just under + 0.1, which in Eq. (6) is combined with the OVB to yield an overall + 0.41 bias. While the upward bias from Eqs. (5) and (6) are expected due to model error it is somewhat unexpected to observe that Eq. (4) yields a biased TLS estimator as well. However, as discussed in the introduction, other than in the univariate case the literature only provides assurances of asymptotic unbiasedness under restrictive conditions on the sample variances. In this case the conditions do not hold, but this could not have been determined in advance by a researcher. When Eq. (6) is estimated and true \(\beta =0.5\) the TLS bias rises to + 0.41. In the case of both Eqs. (5) and (6) the combination of bias and inefficiency means we would fail to reject a false null of \(\beta =1\).
When the true \(\beta =1\) the attenuation biases in the OLS Eqs. (4) and (5) become − 0.19 as expected, which is large enough to reject a true null of model consistency since \(\beta =1\) is outside the 95% confidence interval. The omitted variable bias in Eq. (6) is + 0.21 so the two biases cancel out. What appears to be an unbiased OLS coefficient in that example is in fact two mutually offsetting biases, but of course in practice one can’t guarantee such a fortuitous outcome. TLS in Eqs. (4) and (5) now exhibit a bias equal in magnitude but of opposite sign to that under OLS, and in the case of Eq. (6) the combined bias is now + 0.46, with nearly the entire distribution exceeding 1.0.
Summarizing, only when true \(\beta =0\) and there is no OVB does TLS yield an unbiased estimator, but OLS does as well in those cases, and it is more efficient. When true \(\beta >0\), OLS is biased downwards and TLS is biased upwards. When OVB is present the TLS estimator is biased to a greater magnitude than OLS. In this configuration of variances there is no case in which TLS is preferred to OLS.
High \(SN{R}_{x}\), no \({u}_{y}\) scaling
In the high \(SN{R}_{x}\) case (s = 5, bottom three panels in Fig. 1) the OLS attenuation bias coefficient is now much smaller, i.e. \(\frac{1}{1+\frac{1}{25}}\cong 0.962\) so we encounter a bias of approximately only − 0.04 when true \(\beta =1\). In other words, there is still an attenuation bias on OLS but in practical terms it is too small to matter much. When true \(\beta =0.0\) OLS is unbiased in Eqs. (4) and (5) and there is only a minor efficiency loss due to OVB. The OVB becomes slightly larger (+ 0.24) than in the low \(SN{R}_{x}\) case however, and since the attenuation bias is smaller the result is a larger upward shift in the \(\widehat{\beta }\) distributions in all three models compared to the low \(SN{R}_{x}\) case. The small attenuation bias on OLS emerges when \(\beta =0.5\) and 1.0, namely − 0.02 and − 0.04 respectively.
The TLS distributions remain inefficient with confidence interval widths at least twice the size of OLS. Also the bias magnitudes are not reduced by the increased signal strength: indeed they become slightly larger. Equations (4) and (5) still yield unbiased estimates when the true \(\beta =0\) but under higher values of \(\beta\) the biases grow by 0.02–0.04. Thus in all cases examined thus far OLS is a better option than TLS. Both estimators suffer bias problems, but as the signal-noise ratio in the explanatory variables increases OLS converges onto its true value while TLS does not. Both estimators suffer particularly large OVB in Eq. (6). It is worth noting that successfully detecting and remediating OVB would be more difficult under TLS because of the large variance estimates.
Downscaling\({u}_{y}\)
The low and high\(SN{R}_{x}\) experiments were re-run replacing \({u}_{y}\) with \({u}_{y}/2\) so the variance of the error on the dependent variable shrinks by a factor of 4, making it the same magnitude as that on \({x}_{i}\) in the low \(SN{R}_{x}\) case. Setting the noise variances equal across all variables coincides with a typical normalizing assumption in theoretical analyses of TLS estimation (e.g. Allen and Stott 2003), and in this case we will observe TLS is unbiased for Eq. (4) [although not for (5) or (6)]. Note that a researcher would not be able to measure the error variance on y independently of estimating the regression model itself so this change does not represent a methodological option, instead it represents an alternative conjecture regarding the unknown relative noise variances of the data.
The results are in Fig. 2 and Table 2. The OLS results remain qualitatively the same, with the combined effects of attenuation bias and OVB the same as before, but the variances are now smaller and the distributions are narrower. The empirical confidence interval widths are now virtually identical in the low and high \(SN{R}_{x}\) cases. The TLS variances shrink considerably compared to the unscaled \({u}_{y}\) case. In the low \(SN{R}_{x}\) case TLS in Eq. (4) now yields an unbiased estimator for all three values of \(\beta\), although it remains uniformly less efficient than OLS. A comparison between Figs. 1 and 2 shows that the efficiency loss can be very sensitive to the variance of the error term of the dependent variable, which is not directly measurable.
Table 2 Distributions of \(\beta\) with ¼ variance scaling on y. \(N=200\)
In the high \(SN{R}_{x}\) case the noise variances are no longer equal and Eq. (4) yields an upwardly-biased TLS estimate when true \(\beta >0\). Note that an improvement in the signal-noise ratio on the \({x}_{i}\)’s is associated with an increase in the TLS upward bias. Equation (5) yields a biased estimate for both OLS and TLS which worsens as true \(\beta\) gets larger. In the high \(SN{R}_{x}\) case the magnitude of the bias on TLS equals or exceeds that on OLS under all values of \(\beta\). Under Eq. (6) TLS continues to exhibit both a large upward bias and inefficiency.
Table 3 summarizes some distributional results by showing the 0.975 percentile value of the slope coefficient for Eq. (4) when true \(\beta =0\). Because the attenuation bias is multiplicative and there is no OVB in this case the distributions of estimates from both methods are correctly zero-centred so the OLS entry indicates the start of the region in which a Type I error would occur. The relative inefficiency of TLS implies that a part of its distribution overlaps the Type I region and a larger critical value is needed to reject the zero null. In the upper left cells, corresponding to no variance scaling on y and low \(SN{R}_{x}\) the OLS 2.5% upper tail starts at 0.127 whereas the TLS 2.5% upper tail begins at 0.434, which is about 3.4 times higher. In all cases the TLS upper tail cut-off point is higher than that for OLS, with the smallest gap in the lower right grouping, with the variance on \({u}_{y}\) downscaled and high \(SN{R}_{x}\).
Table 3 97.5th percentile of \(\widehat{\beta }\) when true \(\beta =0\) using Eq. (4)
Discussion of results thus far
The estimator comparisons are summarized in Table 4. The rows are grouped by true value of \(\beta\) and within each group, the rows correspond to whether the model is correctly specified (no OVB), missing an uncorrelated explanatory variable (OVB-1) or missing a correlated explanatory variable (OVB-2). The preferred estimator is the one shown and the ground for the preference is indicated in brackets. (e) indicates that one is more efficient than the other, but neither one is materially less biased. (b) denotes that one is less biased than the other but both are about equally efficient. (b,e) indicates that the preference is based on grounds both of bias and efficiency improvements.
Table 4 Preferred estimators. Term in brackets denotes basis of preference: b – reduced bias, e – increased efficiency
TLS is preferred in only 4 of 36 cases, and in one there is no clear preference. If the researcher has no information at all about which cell best describes the information set and the true model, OLS is a safer choice. If specific conclusions depend on using TLS rather than OLS the researcher needs to make the case why the conditions support using TLS. The difficulty is that the case may depend on the relative sizes of unobservable variances. As the variance of the error term on \({x}_{1}\) goes from high to low the preference moves unambiguously towards OLS. This accords with theory since the rationale for TLS is the presence of errors in the explanatory variables, and the smaller these are, the closer we get to conditions that guarantee OLS is unbiased and efficient. But even within the low \(SN{R}_{x}\) case, note that the preference moves unambiguously towards OLS as the variance on y increases (compare column 2 to column 1). This is because consistency properties of \({\widehat{\mathbf{b}}}_{TLS}\) depend on the relative variances of the dependent and independent variables. While the researcher may have reason to believe the data set belong to the low \(SN{R}_{x}\) category (for instance if the signals are from individual climate models rather than ensemble averages) there is no information within the sample itself that would tell a researcher whether the data belong to the high or low \({u}_{y}\) variance group.
Likewise the preference moves unambiguously towards OLS as the true \(\beta\) approaches zero. This raises an important point about signal detection: rejection of the zero null on \(\beta\) cannot be conditional on the decision to use TLS, since if the null were true we would not choose TLS. Signal detection can only be based on TLS results if it is confirmed using OLS. Unfortunately it is rare for researchers report results from both TLS and OLS estimation, which means readers cannot tell if published signal detection results were consistent with the estimator choice.
Finally the preference moves unambiguously towards OLS as the likelihood of OVB-2 rises. This cannot be ruled in or out merely by inspecting the estimation results. It needs to be specifically tested for and ruled out using information from outside the original information set (see, e.g. Davidson and MacKinnon 2004). However the construction of informative F tests for this purpose would be undermined by the greatly inflated variances from TLS estimation. Hence such tests would be better constructed using OLS variances.
Several authors have noted that optimal fingerprinting coefficients can be very unstable and even sensitive to the choice of which climate model is used to generate the internal variability covariance matrix (see Jones et al. 2013, 2016 for examples). The instability typically takes the form of extremely wide confidence intervals. Examining the 72 distributions in Figs. 1 and 2, if the TLS standard errors are very large but those of OLS are not, it is likely we are in a Fig. 1-type information set, in which the error variance on y is large enough that OLS would be preferred. If the OLS and TLS coefficient variances are reasonably similar we are probably looking at a case more like Fig. 2 in which TLS will be preferred unless \(SN{R}_{x}\) is high, in which case TLS overcorrects and OLS is preferred.
It is noteworthy that the biases in OLS and TLS tend to go in opposite directions. More specifically, the bias in TLS is always \(\ge 0\) in these experiments, although there was nothing in the set-up that suggested a priori this would be the outcome. The derivation in the Appendix reveals that unlike OLS the TLS slope coefficient depends on unbiased error variance estimates, which are themselves difficult to obtain. Consequently biased variances will introduce biases in the slope coefficients. This has numerous implications even if the bias does not affect the binary decision of whether a signal is detected or not. The magnitudes of signal scaling coefficients are themselves important because they are used in the computation of transient climate response (TCR) magnitudes (Jones et al. 2016) and Transient Response to Cumulative Emissions (TCRE) magnitudes (Gillett et al. 2013) which, in turn, are important for estimating future warming rates and potential carbon budgets for compliance with policy goals such as the Paris targets (Nijsse et al. 2020). A systematic upward bias in the signal coefficients would result in a downward bias on the Paris-compliant carbon budget.
Correlated signals and noise
We consider the case with \(c=-0.15\) which implies a correlation of about − 0.2 between \({w}_{1}\) and \({w}_{2}\). This parameter value is well below the range (– 0.29 to – 0.91) computed from the CMIP5 sample noted above and is therefore a conservative magnitude. It was also chosen to avoid the “weak signal” regime in DelSole et al. (2019) which refers to highly correlated signals which creates problems for constructing confidence intervals. Setting c to 0.9, for instance, causes the TLS results to become degenerate and uninformative. The chosen parameter value is small enough to belong to the “strong signal” regime, in the sense that \({w}_{1}\) and \({w}_{2}\) are still largely independent, but large enough to provide interesting contrasts with the uncorrelated case.
The results are summarized in Tables 5 and 6; Figs. 3 and 4, which follow the formats of Tables 1 and 2 and Figs. 1 and 2. Figure 3 shows the results for the unscaled \({u}_{y}\) case. The OLS confidence intervals are wider by a negligible amount but the TLS ones become considerably wider. The biases noted previously retain their respective signs and become larger. In all six panels in Fig. 3, OLS imparts a downward bias and TLS imparts an upward bias regardless of the estimation [Eqs. (4), (5) or (6)]. Interestingly OLS exhibits an additive negative bias so its distribution is centered slightly below zero even when true \(\beta =0\). Moving from the low \(SN{R}_{x}\) to the high \(SN{R}_{x}\) case we can see that the OLS bias shrinks but—as before—the TLS bias gets larger.
Table 5 Distributions of \(\beta\) with no variance scaling on y and \(c=-0.15\).\(N=200\)
Table 6 Distributions of \(\beta\) with ¼ variance scaling on y. \(N=200\)
Of particular note, when the true value of \(\beta =0\), in the low \(SN{R}_{x}\) case under Eq. (6) and in the high \(SN{R}_{x}\) case with all three estimation equations the TLS distribution is centered so far above zero that the lower 2.5% bound is itself above zero and we therefore have a Type I error, or a false signal detection. In the OLS case a false detection would occur for Eq. (6) at the 10% significance level in the low \(SN{R}_{x}\) case and at 5% in the high \(SN{R}_{x}\) case.
Figure 4 shows the results with the variance on \({u}_{y}\) scaled down so the noise variances are equal on all variables. OLS retains its downward bias in the low \(SN{R}_{x}\) case using Eqs. (4) and (5) whereas TLS is unbiased using Eq. (4). The omission of an uncorrelated explanatory variable (Eq. 5) does not change the OLS estimator but does impart an upward bias on TLS. Both OLS and TLS yield Type I errors in the Eq. (6) case, and in the high \(SN{R}_{x}\) case TLS does additionally in the Eq. (5) case.
Focusing on the case where the true \(\beta =0\), between Figs. 4 and 5 OLS exhibits a Type I error in four of the 12 cases (whenever Eq. (6) is used) and TLS exhibits a Type I error in in seven of the 12 cases. Additionally, both OLS and TLS tend to be more likely to reject a true null hypothesis in the higher \(SN{R}_{x}\) case, because the variance shrinks more quickly than the bias. Also, in the high \(SN{R}_{x}\) case the positive bias increases on TLS but not on OLS. With the exception of the low \(SN{R}_{x}\) case with \({u}_{y}\) variance scaled down, TLS exhibits an upward bias exceeding the downward bias on OLS.
Overall, as with uncorrelated signals, when the explanatory signals are negatively correlated TLS compares unfavourably to OLS in most cases. Except when there is no model misspecification (Eq. 4) and the error terms all have equal variances TLS is biased, inefficient and prone to false positives. Even if the true \(\beta >0\) the TLS coefficient magnitude is biased upward. Since there is no way for the researcher to tell which specific case is being observed OLS in general would be a safer choice but if it suspected that we are in a low \(SN{R}_{x}\) regime with no OVB it should be assumed that OLS underestimates \(\beta\) and results from using a single signal vector should be compared to those from an ensemble average of signal vectors.