Assessing the consistency of the fixed-effects estimator: a regression-based Wald test

Spierdijk, Laura

doi:10.1007/s00181-022-02298-2

Assessing the consistency of the fixed-effects estimator: a regression-based Wald test

Open access
Published: 09 September 2022

Volume 64, pages 1599–1630, (2023)
Cite this article

Download PDF

You have full access to this open access article

Empirical Economics Aims and scope Submit manuscript

Assessing the consistency of the fixed-effects estimator: a regression-based Wald test

Download PDF

Laura Spierdijk ORCID: orcid.org/0000-0002-1573-0570¹

2523 Accesses
2 Citations
Explore all metrics

Abstract

Under large-n and fixed-T panel data asymptotics, we develop a method to test a sufficient condition for the FE estimator’s consistency using a stacked regression framework. The resulting test exploits a previously unnoted relation between the fixed-effects estimator and the short- and long-differences estimators. It takes the familiar form of a panel-robust Wald test, but is also shown to be asymptotically equivalent to a GMM test. We provide a theoretical comparison between our test and two existing ones from the literature, which are shown to focus on generic strict exogeneity conditions instead of being specifically related to the FE estimator’s moment conditions. We investigate our test’s finite-sample properties in a simulation study, where we continue the comparison with the other tests. We show that our test has good finite-sample properties, especially if the estimator of the covariance matrix is based on a panel bootstrap. The practical use of our test is illustrated in two applications to existing data from the literature.

Inference in Two-Step Panel Data Models with Time-Invariant Regressors: Bootstrap Versus Analytic Estimators

Large-N and Large-T Properties of Panel Data Estimators and the Hausman Test

Robust estimation and moment selection in dynamic fixed-effects panel data models

Article Open access 11 December 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Since the early days of econometrics, the fixed-effects (or within) estimator has been widely used to estimate the linear panel regression model in the presence of individual effects correlated with the regressors (Mundlak 1961; Mundlak and Hoch 1965). Because the fixed-effects (FE) estimator exploits a single moment condition for each covariate, it is just identified. Viewed from a GMM perspective, it is therefore not possible to test the validity of these moment conditions by means of the J-test for overidentifying moment conditions.^{Footnote 1} In other situations, we may have certain suspicions that some covariates are unlikely to satisfy the moment conditions imposed by the FE estimator. If instrumental variables (IVs) are available for these covariates, we could test the FE estimator against the FE-IV estimator using a Hausman test (Hausman 1978).

To our best knowledge, consistency tests for the FE estimator that do not require IVs are rare. The present study seeks to fill the gap in the literature by developing a test to validate a sufficient condition for the consistency of the FE estimator that does not use such side information. We derive the test by exploiting a previously unnoted relation between the FE estimator and the short- and long-differences estimators. The resulting test can be performed in the familiar form of a panel-robust Wald test for certain parameter restrictions in a stacked regression framework.

Because our test turns out asymptotically equivalent to a GMM test, our approach also fits in the familiar setting of GMM estimation and specification testing. Consequently, the asymptotic properties of our Wald test are standard and well documented in the literature (Newey 1985; Cameron and Trivedi 2005; Hall 2005). The theoretical part of our study uses the link with GMM testing to draw a formal comparison between our test and the ones proposed by Wooldridge (2010) and Su et al. (2016), where the latter is an extension of the former. Both tests make use of auxiliary regressions to assess the validity of certain strict exogeneity conditions. We show that our test validates sufficient conditions for the consistency of the FE estimator, while the other two focus on more generic strict exogeneity conditions that are neither sufficient nor necessary for the consistency of the FE estimator.

We investigate our test’s finite-sample properties in a simulation study, where we continue the comparison between our Wald test and the strict exogeneity test of Su et al. (2016). We use a simulation design in which the two tests are either both consistent or both inconsistent. Our Wald test generally exhibits good finite-sample properties, especially if the estimator of the covariance matrix is based on a panel bootstrap.

The empirical behavior of our Wald test is illustrated in two empirical applications that elaborate on existing studies from the literature. Both McKinnish (2008) and Erickson and Whited (2000) apply the linear panel regression model to data sets containing an explanatory variable that is suspected to be subject to measurement error, which would render the FE estimator inconsistent. In the context of our theoretical results, these data sets provide a particularly relevant empirical case for our Wald test. For the linear panel regression model applied to the data of McKinnish (2008), our Wald test rejects the sufficient condition for the FE estimator’s consistency. Applied to the data of Erickson and Whited (2000), however, our Wald test finds no such evidence. We also run the test of Su et al. (2016) and draw the comparison with the outcomes of our test.

Because our test validates a sufficient condition for the consistency of the FE estimator, it is possible that the FE estimator is consistent even though this condition does not hold. Furthermore, we will show that there is also a possibility that the Wald test has low power in certain cases. We will provide recommendations on how to remedy such type I and type II errors using additional analysis. Hence, although our test does not require instrumental variables, it should be combined with further investigations.

Our approach connects to different strands of literature. From the time series literature, we take the idea of a test that exploits taking differences (e.g., Plosser et al., 1982, Davidson et al., 1985, Breusch and Godfrey, 1986, Thursby, 1989). We combine this idea with the insight of Griliches and Hausman (1986, p. 114) that the linear panel regression model is misspecified if short- and long-differences estimators differ significantly. The resulting stacked regression framework facilitates researchers to routinely run our Wald test. In this way, we extend the panel data literature about large-n and fixed-T specification testing, which includes but is not limited to tests for overidentifying restrictions (Hayakawa 2019), random effects vs. fixed effects and FE vs. FE-IV (Hausman 1978; Baltagi et al. 2003; Amini et al. 2012; Joshi and Wooldridge 2019), unit roots (Harris and Tzavalis 1999), selectivity bias (Verbeek and Nijman 1992; Wooldridge 1995), cross-sectional dependence (Sarafidis and Wansbeek 2012) and GMM-based test for autocorrelation in error terms (Arellano and Bond 1991).

The setup of the remainder of this study is as follows. Section 2 describes the regression framework that we propose to estimate and test the FE estimator. Section 3 introduces the test statistic and discusses its statistical properties based on the literature (asymptotic behavior), followed by a simulation study (finite-sample behavior) in Sect. 4. Both sections draw the comparison with the test proposed by Su et al. (2016). Our approach is illustrated in Sect. 5, where we provide two applications to existing data from the literature. Lastly, Sect. 6 concludes. An appendix with supplementary material is available.

2 Regression framework

We consider the situation that we are interested in estimating the static linear panel regression model with $T\ge 3$ time observations, given by

$$\begin{aligned} {{\textbf {y}}}_i=\gamma _i\,{\varvec{\iota }}_T+{{\textbf {X}}}_i\,{\varvec{\beta }}_0+{\varvec{\varepsilon }}_i \qquad [i=1,\ldots ,n], \end{aligned}$$

(1)

where ${{\textbf {y}}}_i$ ($T\times 1$) is the dependent variable, $\gamma _i$ the individual-specific intercept, ${\varvec{\iota }}_T$ ($T\times 1$) a vector of ones, ${{\textbf {X}}}_i$ ($T\times k$) the matrix of observed covariates, ${\varvec{\beta }}_0$ ($k\times 1$) the unknown coefficient vector and ${\varvec{\varepsilon }}_i$ ($T\times 1$) the error term.

2.1 FE and differences estimators

Let ${\textbf {D}}_j$ be the $(T-j)\times T$ matrix that takes differences over time span $j=1,\ldots ,T-1$ and write $\varvec{\Delta }_j={\textbf {D}}_j'{\textbf {D}}_j$. We define ${\widehat{{\varvec{\beta }}}}_j$ as the OLS estimator of ${\varvec{\beta }}_0$ in (1) after taking differences over time span j, yielding

$$\begin{aligned} {\widehat{{\varvec{\beta }}}}_{j}=\left( \sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {y}}}_{i}. \end{aligned}$$

(2)

We denote the centering matrix of order T by ${\textbf {A}}_T={\textbf {I}}_T-{\varvec{\iota }}_T{\varvec{\iota }}_T'/T$. This matrix is symmetric, idempotent of rank $T-1$ and orthogonal to ${\varvec{\iota }}_T$. We observe that $\varvec{\Delta }_j$ has the jth pseudo-diagonal equal to $-1$, with all other pseudo-diagonals are zero. Moreover, $\sum _j\varvec{\Delta }_j$ has diagonal elements equal to $T-1$ since all rows add to zero. As a result, ${\textbf {A}}_T=(\varvec{\Delta }_1+\ldots +\varvec{\Delta }_{T-1})/T$.

We use the relation between ${\textbf {A}}_T$ and the $\varvec{\Delta }_j$s to rewrite the FE estimator of ${\varvec{\beta }}_0$ in terms of the differences estimators ${\widehat{{\varvec{\beta }}}}_{j}$, resulting in

$$\begin{aligned} {\widehat{\varvec{\beta }}}_{{\tiny {\hbox {FE}}}}=\sum _{j=1}^{T-1}{\textbf {W}}_j\left( \sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {y}}}_{i}=\sum _{j=1}^{T-1}{\textbf {W}}_j\,{\widehat{\varvec{\beta }}}_{j}, \end{aligned}$$

(3)

where

$$\begin{aligned} {\textbf {W}}_j= & {} \left( \sum _{i=1}^{n}\sum _{t=1}^{T-1}{{\textbf {X}}}_{i}'\varvec{\Delta }_t{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {X}}}_{i} \end{aligned}$$

(4)

$$\begin{aligned} \sum _{j=1}^{T-1}{\textbf {W}}_j= & {} {\textbf {I}}_k, \quad {\textbf {W}}_j \quad \text{ symmetric } \text{ and } \text{ positive } \text{ definite }. \end{aligned}$$

(5)

The complete derivation of this result is given in Appendix A. This leads to Result 1.

Result 1

The FE estimator is the weighted matrix average of differences estimators, i.e.,

$$\begin{aligned} {\widehat{\varvec{\beta }}}_{{\tiny {\hbox {FE}}}}=\sum _{j=1}^{T-1}{\textbf {W}}_j\,{\widehat{\varvec{\beta }}}_{j}, \end{aligned}$$

(6)

with ${\textbf {W}}_j$ as in (4).

Now let $\varvec{\beta }_j\equiv \hbox {plim}_{n\rightarrow \infty }{\widehat{\varvec{\beta }}}_j$ and assume that standard regularity conditions for large-n and fixed-T panel data hold. We note that

(7)

and observe that also these weight matrices sum to the identity matrix. Consequently, from (6) it follows that $\hbox {plim}_{n\rightarrow \infty }{\widehat{\varvec{\beta }}}_{{\tiny {\hbox {FE}}}}=\varvec{\beta }_0$ under $H_{0}: \varvec{\beta }_{1}=\ldots =\varvec{\beta }_{T-1}=\varvec{\beta }_0$. Stated differently, a sufficient condition for the consistency of the FE estimator is that all of the differences estimators are consistent. This leads to Corollary 1.

Corollary 1

Assume that the large-n and fixed-T panel data regularity conditions for GMM estimators as listed in Su et al. (2016) hold. Then the FE estimator is consistent if each of the differences estimators is consistent; i.e., if $\hbox {plim}_{n\rightarrow \infty }{\widehat{\varvec{\beta }}}_j=\varvec{\beta }_0$ ($j=1,\ldots ,T-1$), then $\hbox {plim}_{n\rightarrow \infty }{\widehat{\varvec{\beta }}}_{{\tiny {\hbox {FE}}}}=\varvec{\beta }_0$.

We note that $H_0$ is not a necessary condition for the consistency of the FE estimator. Under $H_1: \varvec{\beta }_{j}\ne \varvec{\beta }_{j+1}$ (at least one $j=1,\ldots ,T-2$), the FE estimator can still be consistent. This result follows directly from Result 1, but we will come back to it in Sect. 3.4.4.

2.2 Motivating examples

To illustrate the link between the (in-)consistency of the FE- and differences estimators, Table 1 provides four motivating examples. We consider the linear panel regression model with (i) classical measurement error, (ii) non-classical measurement error, (iii) omitted variables and (iv) simultaneity. The precise model specifications are described in the first column of Table 1 and given in more detail in Section B of the appendix with supplementary material. To ensure stationarity, we assume that all autoregressive parameters fall in the interval $(-1,1)$. In each of the four cases, the FE estimator and differences estimators are inconsistent for non-trivial parameter values. The second and third column in Table 1 report the inconsistencies.^{Footnote 2}

Table 1 Motivating examples: inconsistency of difference and FE estimators

Full size table

2.3 Stacked regression

In order to use Corollary 1 for the construction of a statistical test for a sufficient condition for the FE estimator’s consistency, it turns out useful to estimate the $\varvec{\beta }_j$s jointly. Let

$$\begin{aligned} {{\textbf {y}}}_{i1}= & {} {\textbf {D}}_1 {{\textbf {y}}}_i=(y_{i2}-y_{i1},y_{i3}-y_{i2},\ldots \ldots ,y_{iT}-y_{i,T-1})'\\ {{\textbf {y}}}_{i2}= & {} {\textbf {D}}_2 {{\textbf {y}}}_i=(y_{i3}-y_{i1},y_{i4}-y_{i2},\ldots ,y_{iT}-y_{i,T-2})'\\&\vdots&\\ {{\textbf {y}}}_{i,T-1}= & {} {\textbf {D}}_{T-1}{{\textbf {y}}}_i=y_{iT}-y_{i1}, \end{aligned}$$

and define ${{\textbf {X}}}_{i1},\ldots ,{{\textbf {X}}}_{i,T-1}$ and ${\varvec{\varepsilon }}_{i1},\ldots ,{\varvec{\varepsilon }}_{i,T-1}$ analogously. Next, let

$$\begin{aligned} {\widetilde{{\textbf {y}}}}_i=\left( \begin{array}{c}{{\textbf {y}}}_{i1}\\ {{\textbf {y}}}_{i2}\\ \vdots \\ {{\textbf {y}}}_{i,T-1} \end{array} \right) ,\qquad {\widetilde{{{\textbf {X}}}}}_i=\left( \begin{array}{cccc}{{\textbf {X}}}_{i1}&{}0&{}\ldots &{}0\\ 0&{}{{\textbf {X}}}_{i2}&{}\ldots &{}0\\ \vdots &{}\vdots &{}\ddots &{}\vdots \\ 0&{}0&{}\ldots &{}{{\textbf {X}}}_{i,T-1} \end{array}\right) , {\widetilde{{\varvec{\varepsilon }}}}_i=\left( \begin{array}{c}{\varvec{\varepsilon }}_{i1}\\ {\varvec{\varepsilon }}_{i2}\\ \vdots \\ {\varvec{\varepsilon }}_{i,T-1}\end{array}\right) . \end{aligned}$$

We then write

$$\begin{aligned} {\widetilde{{\textbf {y}}}}_i= {\widetilde{{\textbf {X}}}}_i\left( \begin{array}{c} \varvec{\beta }_1\\ \varvec{\beta }_2\\ \vdots \\ \varvec{\beta }_{T-1} \end{array}\right) +{\widetilde{{\varvec{\varepsilon }}}}_{i}. \end{aligned}$$

(8)

The stacked regression model in (8) allows us to estimate the $\varvec{\beta }_j$s jointly by means of OLS. We observe that the FE estimator arises as the constrained OLS estimator of ${\varvec{\beta }}_{{\tiny {\hbox {D}}}}=({\varvec{\beta }}_{1}', {\varvec{\beta }}_{2}', \ldots , {\varvec{\beta }}_{T-1}')'$ in (8), under the parameter restriction $\varvec{\beta }_{1}=\ldots =\varvec{\beta }_{T-1}$.

3 Test procedure

Corollary 1 states that, if each differences estimator is consistent, also the FE estimator must be consistent. This observation is the starting point of our Wald test. In brief, we first estimate ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}$ in (8) using (unconstrained) OLS. Subsequently, we use a Wald test to test $H_0$ against $H_1$.

3.1 Wald test

To calculate the Wald test statistic, we need a cluster-robust estimator of the asymptotic covariance matrix in addition to ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}$. This estimator is given by

$$\begin{aligned} \widehat{\varvec{\Omega }}_{{\tiny {\hbox {D}}}}=\left( \sum _{i=1}^{n}{\widetilde{{\textbf {X}}}}_{i}'{\widetilde{{\textbf {X}}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{\widetilde{{\textbf {X}}}}_{i}' {\widehat{\varvec{u}}}_{i}{\widehat{\varvec{u}}}_{i}' {\widetilde{{\textbf {X}}}}_{i} \left( \sum _{i=1}^{n}{\widetilde{{\textbf {X}}}}_{i}'{\widetilde{{\textbf {X}}}}_{i}\right) ^{-1}, \end{aligned}$$

(9)

where $\widehat{{{\textbf {u}}}}_{i}={\widetilde{{\textbf {y}}}}_{i}-{\widetilde{{\textbf {X}}}}_{i}\,{\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}$. The Wald test statistic for the parameter restrictions $\varvec{\beta }_{j}=\varvec{\beta }_{j+1}$ ($j=1,\ldots ,T-2$) is given by

$$\begin{aligned} q_{{\tiny {\hbox {W}}}}={\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}'\,{\textbf {R}}'\left( \sum _{i=1}^{n}{\textbf {R}}\,{\widehat{\varvec{\Omega }}}_{{\tiny {\hbox {D}}}}\,{\textbf {R}}'\right) ^{-1}{\textbf {R}}\, {\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}, \end{aligned}$$

(10)

where ${\textbf {R}}={\textbf {B}}\otimes {\textbf {I}}_k$ and ${\textbf {B}}$ is the $k(T-2)\times (T-1)$ matrix taking first differences, given by

$$\begin{aligned} {\textbf {B}}=\left( \begin{array}{ccccc} -1 &{} 1 &{} 0 &{} \ldots &{} 0 \\ 0 &{} -1 &{} 1 &{} \ldots &{} 0 \\ \vdots &{} \vdots &{} \ddots &{}\ddots &{} \vdots \\ 0 &{} 0 &{} \ldots &{} -1 &{} 1 \\ \end{array} \right) . \end{aligned}$$

(11)

Under $H_0$, the asymptotic distribution of the Wald test statistic is Chi-square with $k(T-2)$ degrees of freedom, while under fixed alternatives the test statistic converges in probability to infinity (Cameron and Trivedi 2005, Section 7.6.2). We therefore reject $H_0$ if $q_{\hbox {{W}}}$ exceeds the $(1-\alpha )\%$ critical value of the Chi-square distribution with $k(T-2)$ degrees of freedom, with $\alpha $ the chosen significance level. The usual asymptotic properties of the Wald test hold under standard large-n and fixed-T panel data regularity conditions, as summarized in the following result.

Result 2

Under the regularity assumptions as listed in Su et al. (2016), the asymptotic distribution of the Wald test statistic under $H_0$ is Chi-square with $k(T-2)$ degrees of freedom. The Wald test statistic converges in probability to infinity under $H_1$. The Wald test has nominal asymptotic size under $H_0$ and unit asymptotic power under $H_{1}$.

If $H_0$ is rejected, the pattern in the ${\widehat{\varvec{\beta }}}_j$s can help to assess the economic relevance of the rejection. This becomes particularly relevant if n is large, since large samples incur the risk of detecting economically minor violations of the null (Griliches and Hausman 1986, p. 110). If all ${\widehat{\varvec{\beta }}}_j$s are close in value to the FE estimator, the economic importance of the rejection is considered limited. An informal visualization of the Wald test is obtained by plotting each element of ${\widehat{\varvec{\beta }}}_j$ as a function of j, with the value of the FE estimator of each covariate’s coefficient added as a horizontal line. We will refer to these plots as the ‘differences curves.’ These curves will be illustrated in the section with empirical applications.

3.2 Relation to GMM tests

To analyze the properties of our Wald test in more detail, it turns out useful to draw the parallel with overidentifying tests in a GMM framework. From Newey and West (1987) and Newey and McFadden (1994) and the linearity of the moment conditions, we infer that $q_{\hbox {W}}$ is numerically identical to a GMM test statistic for a stacked regression model. This is formalized in Result 3.

Result 3

Under the regularity assumptions as listed in Su et al. (2016), the Wald test statistic $q_{\tiny {\hbox {W}}}$ is numerically identical to the overidentifying test statistic based on the two-step GMM estimator ${\widehat{\varvec{\beta }}}_{\tiny {\hbox {GMM}}}$ of $\varvec{\beta }_0$ in the stacked regression model

$$\begin{aligned} {\widetilde{{\textbf {y}}}}_i= \left( \begin{array}{c}{{\textbf {X}}}_{i1}\\ {{\textbf {X}}}_{i2}\\ \vdots \\ {{\textbf {X}}}_{i,T-1} \end{array}\right) \varvec{\beta }_0+{\widetilde{{\varvec{\varepsilon }}}}_{i}, \end{aligned}$$

(12)

using the instrument matrix ${\widetilde{\varvec{Z}}}_i={\widetilde{{\textbf {X}}}}_i$ provided that both test statistics use the same estimator for the covariance matrix, with the requirement that this estimator is consistent under $H_0$.

Result 3 requires both test statistics to use the same consistent estimator for the covariance matrix. In practice, the GMM test statistic uses ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {GMM}}}}$ to obtain a panel-robust estimator of the covariance matrix, while the Wald test statistic uses ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}$ to do so. Because both estimators of the covariance matrix are consistent under the null, this difference in covariance matrices does not matter for the asymptotic properties of the test statistics (Cameron and Trivedi 2005, ). We thus conclude that our Wald test statistic (with the panel-robust estimator of the covariance matrix based on ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {D}}}}$) is asymptotically equivalent to the GMM test statistic (with the panel-robust estimator of the covariance matrix based on ${\widehat{\varvec{\beta }}}_{{\tiny {\hbox {GMM}}}}$). The two tests have the same asymptotic power and size, under both the null and any (fixed or local) alternative hypothesis (Newey and West 1987; Newey and McFadden 1994).^{Footnote 3}

Corollary 2

Under the regularity assumptions as listed in Su et al. (2016), the Wald test statistic $q_{tiny{\hbox {W}}}$ is asymptotically equivalent to the overidentifying J-statistic corresponding to the two-step estimator of $\varvec{\beta }_0$ in (12) with instruments ${\widetilde{\varvec{Z}}}_i$. The two tests have the same asymptotic power and size, under both the null and any (fixed or local) alternative hypothesis.

With ${\widetilde{\varvec{Z}}}_i$ as the instrument matrix in the equivalent GMM test, we thus see that the overidentifying moment conditions are the $k(T-1)$ moment conditions imposed by the differences estimators. Hence, our test boils down to a GMM test for the overidentifying moment conditions

(13)

These moment conditions arise by ‘unfolding’ the moment condition imposed by the FE estimator

(14)

Corollary 1 already established the link between (the probability limits of) ${\widehat{\varvec{\beta }}}_{FE}$ and the ${\widehat{\varvec{\beta }}}_{j}$s. By means of (13) and (14), we have now also shown how ‘unfolding’ connects the moment conditions of the FE and differences estimators.

3.3 Trivial power

Result 2 makes clear that the power of the Wald test arises from the differences in the $\varvec{\beta }_{j}$s for different values of j under $H_{1}$. In certain cases where the FE estimator is inconsistent, such differences may not exist though. Despite the FE estimator’s inconsistency, we will then find $\varvec{\beta }_{1}=\varvec{\beta }_2=\ldots =\varvec{\beta }_{T-1}\not = \varvec{\beta }_0$. Consequently, the asymptotic rejection rate of the Wald test will be equal to the chosen significance level, yielding ‘trivial’ asymptotic power. As shown by Newey (1985) and (Hall 2005, Ch. 5), the issue of trivial power is inherent with overidentifying tests. These authors also provide a more technical discussion of the region where GMM tests have trivial power.

The practical implication of the existence of a parameter region with trivial asymptotic power is that our Wald test may have low empirical power in certain situations. In the motivating examples of omitted variables and measurement error shown in Table 1, trivial power arises for $\rho =\delta $. This can be inferred from the expressions for the $\beta _j$s in the table, which do not vary with j for $\rho =\delta $. A test to assess whether two panel variables have the same degree of persistence could therefore prove useful in this scenario. In practice, however, misspecification is likely to be much more complex than for the motivating examples of Table 1. Consequently, we typically do not know when trivial power will arise and to what extent it is related to the persistence in the observed variables. It is therefore hard to think of a statistical test that could be used to recognize a case of trivial power. In fact, to our best knowledge, no remedy against trivial power exists other than cautiously interpreting the outcomes of GMM tests (Parente and Santos Silva 2012). It therefore remains important to look for other evidence against the FE estimator if the test does not reject the null hypothesis, such as coefficient signs and magnitudes that are implausible from an economic perspective.

3.4 Comparison with existing tests

As mentioned in Introduction, tests for the consistency of the FE estimator that do not require IVs are rare. The two tests that come closest are the ones of (Wooldridge 2010, p. 324-325) and Su et al. (2016). These approaches also take the linear panel regression model in (1) as the starting point.

3.4.1 Wald test of Wooldridge (2010)

The test of Wooldridge (2010) is based on the auxiliary OLS regression

(15)

after taking the within transformation. With ${\textbf {I}}_{T-1}$ the identity matrix of order $T-1$, the matrices and ${{\textbf {F}}_{1}}$ are defined as the $(T-1)\times T$ block matrices

(16)

with ${\textbf {0}}_{T-1}$ a $(T-1)$-dimensional column vector of zeros.

The term ${{\textbf {F}}_{1}}{{\textbf {X}}}_{i}\,\varvec{\zeta }_{1}$ in (15) ensures that the regression model contains the one-period ahead lead values of the covariates as regressors. Because this leads to the loss of the last time period, we need the matrix to ensure that the other vectors also contain the right time observations. In more familiar notation, we would write (15) as $y_{it}=\gamma _{i}+{{\textbf {x}}}_{it}'\,{\varvec{\beta }}_0+{{\textbf {x}}}_{i,t+1}'\varvec{\zeta }_{1}+\varepsilon _{it}$. The reason that we use the above alternative notation is to facilitate the comparison with our own ‘differences’ approach, as will become clear below.

The test takes the form of a standard panel-robust Wald test for the null hypothesis ${\bar{H}}_0:\varvec{\zeta }_1=0$ against the alternative hypothesis ${\bar{H}}_1:\varvec{\zeta }_1\ne 0$. It is motivated by the fact that, under strict exogeneity, the FE estimator of $\varvec{\zeta }_1$ will have a zero probability limit, while the FE estimator of $\varvec{\beta }_0$ will converge in probability to $\varvec{\beta }_0$. Under the usual regularity conditions for panel data, the resulting test has nominal asymptotic size under ${\bar{H}}_0$ and unit asymptotic power under ${\bar{H}}_1$.

3.4.2 Sup-Wald test of Wooldridge (2010)

The extension proposed by Su et al. (2016) is based on the idea that Wooldridge’s approach of adding one-period ahead lead values to the regression model is rather arbitrary. They overcome this by allowing for a wider range of leads and lags. More specifically, Su et al. (2016) consider auxiliary regressions of the form

(17)

where and ${\textbf {F}}_{\ell }$ are defined in analogy with (16).

Su et al. (2016) estimate the regression equation in (17) for all $\ell \in {\mathcal {S}}_T$ by means of OLS after applying the within transformation.^{Footnote 4} Subsequently, they test the null hypothesis ${\tilde{H}}_0: \varvec{\zeta }_\ell ={\textbf {0}}$ for all $\ell \in {\mathcal {S}}_T$ against the alternative hypothesis ${\tilde{H}}_1: \varvec{\zeta }_\ell \ne {\textbf {0}}$ for some $\ell \in {\mathcal {S}}_T$ using a sup-Wald test. The sup-Wald test statistic is obtained as follows. For each individual null hypothesis ${\tilde{H}}_0^{\ell }: \varvec{\zeta }_{\ell }={\textbf {0}}$, they calculate the corresponding individual Wald test statistic. Subsequently, the supremum is taken over the individual Wald test statistics, yielding the sup-Wald test statistic. The underlying idea is that the supremum of a range of test statistics will behave more like the most powerful among them. The critical values of the sup-Wald test statistic are determined by means of a panel bootstrap. Under standard panel data regularity conditions, the resulting sup-Wald test has nominal asymptotic size under ${\tilde{H}}_0$ and unit asymptotic power under ${\tilde{H}}_1$.

3.4.3 GMM framework

To facilitate the comparison with our own approach, it is convenient to draw the parallel with GMM overidentying tests one more time. From Newey and West (1987) and Newey and McFadden (1994), we infer that the Wald test of Wooldridge (2010) is asymptotically equivalent to a GMM test based on the two-step GMM estimator of ${\varvec{\beta }}_0$ in

(18)

with instruments and ${{\textbf {F}}_{1}}{{\textbf {X}}}_{i}$, after applying the within transformation to both the regression equation and the instruments; i.e., after pre-multiplying both with the centering matrix of order $T-1$. Hence, Wooldridge’s approach tests 2k moment conditions, namely

(19)

where

(20)

We apply Newey and West (1987) and Newey and McFadden (1994) one more time to infer that each individual Wald test of ${\tilde{H}}_0^{\ell }$ is equivalent to a GMM test of the 2k overidentifying moment conditions

(21)

where ${\textbf {A}}^{*}_{T-|\,\ell \,|}$ and ${\textbf {A}}^{\dagger }_{T-|\,\ell \,|}$ are defined in analogy with (20). For fixed $\ell $, the first moment condition in (21) comes close to the moment condition of the within estimator—being — but applies to the subsample that excludes $\ell $ years from the full sample. The second moment condition is similar to the first, but is formulated in terms of lead or lags of the regressors. The moment conditions in (21) are necessary—but not sufficient—for strict exogeneity.

We thus see that the Wald test of Wooldridge (2010) and the individual Wald tests constituting the sup-Wald test of Su et al. (2016) reduce to overidentifying tests, like our own Wald test. They both tests conditions that are necessary for strict exogeneity. Because strict exogeneity is a sufficient condition for the consistency of the FE estimator, the conditions validated by the sup-Wald test are neither necessary nor sufficient for the consistency of the FE estimator.

3.4.4 Comparison Wald and sup-Wald tests

Similarities Because the Wald test of Wooldridge (2010) and the individual Wald tests constituting the sup-Wald test of Su et al. (2016) also reduce to overidentifying tests, like our own Wald test, they will also have a parameter region with trivial power (Newey 1985; Hall 2005). This property will be illustrated in detail in Sect. 4.

Differences The main difference among the three tests is that each of them tests a different null hypothesis, which we have labeled as $H_0$, ${\bar{H}}_0$ and ${\tilde{H}}_0$, respectively. Because the three tests involve different parameter restrictions in different auxiliary regressions, it is not straightforward how to compare them. The translation of each test to a GMM framework has turned out to facilitate the comparison of the three null hypotheses and has made clear that each of the three tests focuses on different moment conditions.

According to Result 3, our Wald test looks at the moment conditions of the differences estimators, which arise as the ‘unfolded’ moment condition of the FE estimator. The underlying relation between the (probability limits of the) FE and differences estimators is specified in Corollary 1 and holds under both $H_0$ and $H_1$.

The test of Wooldridge (2010) considers the moment conditions in (19), while the sup-Wald test focuses on those in (21). Both tests focus on generic strict exogeneity conditions instead of being specifically related to the FE estimator’s orthogonality conditions. For fixed $\ell $, the FE estimator of $\varvec{\zeta }_\ell $ in (17) has a zero probability limit under strict exogeneity, while the FE estimator of ${\varvec{\beta }}_0$ then converges in probability to ${\varvec{\beta }}_0$ (Su et al. 2016). In other cases, however, it is not known how the inconsistency of the FE estimator of ${\varvec{\beta }}_0$ (our parameter of interest) relates to the probability limit of the FE estimator of $\varvec{\zeta }_\ell $ (the auxiliary parameter to run the test).

To formalize the above considerations, let ${\varvec{\beta }}_{\tiny {\hbox {FE}}}=\hbox {plim}_{n\rightarrow \infty }{\widehat{{\varvec{\beta }}}}_{{\tiny {\hbox {FE}}}}$ and define $H_0^{\tiny {\hbox {FE}}}: {\varvec{\beta }}_{\tiny {\hbox {FE}}}={\varvec{\beta }}_0$ and $H_1^{\tiny {\hbox {FE}}}: {\varvec{\beta }}_{\tiny {\hbox {FE}}}\not ={\varvec{\beta }}_0$. We will now view the Wald and sup-Wald tests as tests of $H_0^{\tiny {\hbox {FE}}}$ instead of $H_0$ and ${\tilde{H}}_0$, which means that we will derive expressions for the rejection probabilities under $H_0^{\tiny {\hbox {FE}}}$ and $H_1^{\tiny {\hbox {FE}}}$. This will allow us to compare both tests’ type I and type II errors if viewed as tests of $H_0^{\tiny {\hbox {FE}}}$.

We start with the Wald test and assume that $H_0^{{\tiny {\hbox {FE}}}}$ is true. In this scenario, there are two cases: (i) $H_0$ is true or (ii) $H_1$ holds true. In case (i), the rejection probability under $H_0^{{\tiny {\hbox {FE}}}}$ is also a rejection probability under $H_0$. The asymptotic value of this probability equals the nominal size of the Wald test according to Result 2. Case (ii) can occur because consistency of the differences estimators is a sufficient but not a necessary assumption for the consistency of the FE estimator. In case (ii), the rejection probability has a unit asymptotic value, unless there is trivial power. In the latter case, the asymptotic value of the rejection probability is equal to the nominal size of the Wald test. In sum, the asymptotic size of the Wald test is at least nominal if we view the test as a test of $H_0^{{\tiny {\hbox {FE}}}}$.

At this point, we would ideally provide a specific example of case (ii), where some of the differences estimators are inconsistent while the FE estimator is still consistent. However, it is not straightforward to illustrate the Wald test’s oversizedness in this way, because the process of constructing such an example quickly goes beyond what is still analytically tractable.^{Footnote 5} Fortunately, we can characterize the conditions under which case (ii) will occur in general terms using the GMM analogy. If (14) holds true but (13) does not, then the FE estimator is consistent while some of the differences estimators are inconsistent.

Next we assume that $H_1^{{\tiny {\hbox {FE}}}}$ is true. Because $H_0$ is a sufficient condition for the consistency of the FE estimator, $H_1$ must also hold true. Hence, the rejection probability under $H_1^{{\tiny {\hbox {FE}}}}$ is also a rejection probability under $H_1$, with a unit asymptotic value according to Result 2. In sum, the Wald test viewed as a test of $H_0^{{\tiny {\hbox {FE}}}}$ has unit asymptotic power under $H_1^{{\tiny {\hbox {FE}}}}$, unless we encounter a scenario of trivial power.

For the sup-Wald test, the derivation of the rejection probability under $H_0^{{\tiny {\hbox {FE}}}}$ is similar as above and leads to the insight that the asymptotic value of this probability is larger than or equal to the nominal size of the sup-Wald test. Under $H_1^{{\tiny {\hbox {FE}}}}$, the situation is more complex than above, since either ${\tilde{H}}_0$ or ${\tilde{H}}_1$ may hold true. The reason for this is that the sup-Wald test validates conditions that are neither necessary nor sufficient for the consistency of the FE estimator. As a result, there are two possibilities for the asymptotic power of the sup-Wald test if seen as a test of $H_0^{{\tiny {\hbox {FE}}}}$. In case (i), it has a unit value under $H_1^{{\tiny {\hbox {FE}}}}$, which occurs if ${\tilde{H}}_1$ holds true and no trivial power arises. In case (ii), it has a value equal to the nominal size of the sup-Wald test. This occurs if ${\tilde{H}}_0$ holds true, or if ${\tilde{H}}_1$ holds true in combination with trivial power. Also at this point it is hard to construct analytically tractable examples that illustrate both cases.

In sum, if we view the Wald and sup-Wald tests as tests of the consistency of the FE estimator, we conclude that both of them are asymptotically oversized. Furthermore, the Wald test will have unit asymptotic power, unless we encounter a case of trivial power that is inherent with GMM testing. The sup-Wald test has one additional possibility for trivial asymptotic power, which arises because the specific moment conditions validated by this test are neither sufficient nor necessary for the consistency of the FE estimator.

4 Simulation study

Our simulation study focus on the motivating examples listed in Table 1, for which the three tests’ finite-sample performance is an empirical matter. Because the sup-Wald test of Su et al. (2016) generally turns out superior to the test of Wooldridge (2010), we confine the comparison to our Wald test and the sup-Wald test. For a detailed simulation study into the finite-sample differences between the tests of Wooldridge (2010) and Su et al. (2016), we refer to the latter study.

4.1 Simulation setup

We start with some explanation about the design of our simulation study. As said, our simulations focus on the motivating examples in Table 1. With exception of two cases of trivial power, both our Wald test and the sup-Wald test turn out consistent in each of these examples. The consistency of our Wald test follows directly from the results in Table 1.

To determine whether the sup-Wald test is consistent in the motivating examples, we have to assess whether ${\tilde{H}}_{1}$ holds true.^{Footnote 6} However, for fixed $\ell $, it already turns out infeasible to obtain an analytical expression for the probability limit of the FE estimator of $\varvec{\zeta }_\ell $ under ${\tilde{H}}_{1}^{\ell }: \varvec{\zeta }_\ell =0$. Calculations quickly grow complex because the constituent regressions of the sup-Wald test already contain two covariates in the simplest possible case. We therefore take a more practical view to determine whether the sup-Wald test is consistent in each of our motivating examples. Whenever our simulations yield unit empirical power for sufficiently large values of n, we view this as a reliable indicator that the sup-Wald test is consistent.

We set $\rho =\delta $ in two of our simulation experiments. Both our Wald test and the sup-Wald test turn out to have trivial power in these cases. For our Wald test, the trivial power follows immediately from the results in Table 1, which show that the $\beta _j$s do not vary with j. For the sup-Wald test, the inconsistency follows from the low empirical power that persists across large values of n in our simulations.

The more technical details of our simulation design are as follows. We run simulations for three of the motivating examples considered in Table 1: classical measurement error (‘ME’), omitted variables (‘OV’) and simultaneity (‘S’). Throughout, we run 10,000 simulation runs for each simulation experiment to obtain the empirical rejection rates for our Wald test. We consider values $n=100$, $n=500$ and $n=1000$, each with $T=5$ and $T=10$. We set the significance level for each test equal to 5%. For classical measurement error and omitted variables, we simulate two scenarios in terms of the persistence parameters: $\delta =0.3$ and $\rho =0.9$ and $\delta =0.3$ and $\rho =0.6$. For simultaneity, we consider the cases $\rho =0.6$ and $\rho =0.9$. For each simulation experiment, we report the empirical size and power for both tests. The reported empirical power has not been size adjusted.

4.2 Empirical power and size

The right-hand side of Table 3 reports the empirical power for our Wald test, with the full set of parameters for each of the three models as specified in the table notes. These notes also report the probability limits of the underlying models’ $R^2$, as well as the reliability and noise-to-signal ratios for the models with classical measurement error. The simulation results show that the Wald test’s finite-sample power can turn out relatively low for smaller values of n and T. This is especially the case if the distance between $\rho $ and $\delta $ is relatively small.

Table 2 Simulation results: empirical size

Full size table

Table 3 Simulation results: empirical power

Full size table

We also run simulations for these three models in the absence of any measurement error, omitted variables or simultaneity, yielding the empirical size of the Wald test. The results in the right-hand side of Table 2 point out that the size of the Wald test is substantially above nominal for $n=100$ and $T=10$. In the other cases, the rejection rates are fairly close to nominal. For the large-n and fixed-T asymptotics to apply, we must have $n>>T$. For $n=100$ and $T=10$, the ratio n/T might be too small. We therefore try to improve the finite-sample results by resorting to a bootstrap-based test statistic. Indeed the above nominal rejection rates for $n=100$ and $T=10$ can be circumvented by using a panel wild bootstrap to estimate the covariance matrix. The bootstrap-based covariance matrix then replaces the formula-based covariance matrix in the Wald test statistic.^{Footnote 7} Table 2 reports the bootstrap-based rejection rates in parentheses, which are close to nominal. The use of the bootstrap only turned out necessary for $n=100$ and $T=10$, but from a robustness perspective one may consider using the bootstrap regardless of the panel dimensions.

We consider the same simulation experiments for the sup-Wald strict exogeneity test of Su et al. (2016) and report the outcomes in the left-hand sides of Tables 2 and 3. Also this test’s empirical size is close to nominal. In terms of empirical power, the simulation results make clear that there are two situations: one in which the sup-Wald test’s empirical power is relatively low in comparison to our Wald test and one in which both tests perform similarly. Especially for $n=100$, our Wald test tends to have relatively high empirical power. For the ME model with $\rho =0.6$, this pattern also persists for larger values of n. However, there is one exception and that is the model with an omitted variable. Also here we identify the above two situations, but with the roles of the two tests reversed: one in which the sup-Wald test’s empirical power is relatively high in comparison to our Wald test and one in which both tests perform similarly. We explain the good performance of the sup-Wald test in the presence of an omitted variable from the fact that the sup-Wald test’s auxiliary regressions are close to the true panel regression model, because the observed covariate’s lags and leads are correlated with the omitted variable. Hence, the sup-Wald test performs very well in this specific case of misspecification. As we will see below, however, this result does not hold true for all parameter values.

4.3 Additional simulations

We consider five additional sets of simulations. First, we simulate the three motivating examples (measurement error, omitted variable and simultaneity) with n and T equal to the values that we will later encounter in our empirical applications. This gives rise to simulations (i) and (ii). In (i), we take $n=51$ and $T=20$, leading to a relatively small value of n/T instead of $n>>T$. In (ii), we set $n=737$ and $T=4$. In (iii), we extend the basic ME model with $n=51$ and $n=100$ as to include an additional error-free regressor. In (iv) and (v), we reconsider the basic ME and omitted variable model, respectively, with a parameter setting that results in trivial power for our Wald test according to Table 1 ($\delta =\rho =0.6$). Table 4 reports the empirical power and size for both tests in each of the five cases. For (i) and (iii), the table reports bootstrap-based rejection rates for our Wald test.

Table 4 Additional simulation results: empirical size and power

Full size table

In (i), we observe that the sup-Wald test has higher empirical power in the omitted variables case, while the two tests perform comparably in terms of empirical power in the presence of simultaneity. In case of ME, our Wald test’s empirical power is slightly better. In (ii), the sup-Wald only outperforms in terms of empirical power in the omitted variable case. In the other two cases, our test outperforms. In (iii), the sup-Wald test outperforms in terms of empirical power. In (iv) and (v), we observe that both tests have trivial power. For our test, this outcome was expected on the basis of Table 1. Apparently, also the sup-Wald test needs a difference in persistence between the unobserved regressor and the measurement error in (iv) and between the observed and omitted regressor in (v) in order to have non-trivial power.^{Footnote 8}

4.4 Cautionary remark

Because it is infeasible to do analytical calculations for the sup-Wald test and the constituent individual Wald tests, we do not provide an example that illustrates that the sup-Wald test indeed test a different null hypothesis than our Wald test. For the same reason, our simulation study considers misspecification that renders the two tests both consistent or both inconsistent. Although these simulations prove insightful, we emphasize that in more general cases of misspecification the two tests will typically test different null hypotheses. The null hypothesis of our Wald test is a sufficient condition for the FE estimator’s consistency, while the null hypothesis of the sup-Wald test is neither sufficient nor necessary for the consistency of the FE estimator. The finding our simulation study that our Wald test does not always have higher empirical power than the sup-Wald test must be viewed in this context.

5 Empirical applications

This section considers two existing panel data sets from the literature, which each contain an explanatory variable that was suspected to be subject to measurement error in the studies that introduced these data. In the context of our theoretical results, these data sets provide a particularly relevant empirical case for our test. Our goal is to investigate the consistency of the FE estimator using our Wald test. For the sake of comparison, we will also report results for the sup-Wald test.

5.1 Birth rates and welfare

Economic theory suggests that a government transfer program that reduces the cost of supporting a child should lead to a rise in birth rates. As pointed out by McKinnish (2008), childbearing is a commitment to current and future consumption. We may therefore expect fertility decisions to be relatively unresponsive to transitory fluctuations in welfare benefits. This would imply that welfare benefits are erroneous relative to the conceptual variable of interest, even though these benefits are generally reported without error in administrative records. As explained by Griliches and Hausman (1986), this kind of ‘conceptual’ measurement error is isomorphic to the errors-in-variables model with measurement error that is less persistent than the unobserved regressor and would render the FE estimator inconsistent due to endogeneity of the observed regressor.

Table 5 Estimation results

Full size table

Table 6 Test outcomes

Full size table

McKinnish (2008) aims to provide an empirical investigation of the presence of such conceptual measurement error in welfare benefits. She uses a panel data set consisting of US state-level birth rates by white women in the age group 20–24.5 years and AFDC benefit levels for a family of four with no additional income. The panel data set with $n=51$ and $T=20$ covers the 1973–1992 period. The data set also contains a measure of the earnings per capita in each state. Both welfare benefits and earnings per capita are deflated and expressed in prices of the base year 1982–84.

We consider the linear panel regression model specified as

$$\begin{aligned} \log (y_{it})=\alpha _i + \delta _t+\beta _w \log (w_{it})+\beta _{e}\log (e_{it})+\varepsilon _{it}, \end{aligned}$$

(22)

where $y_{it}$ denotes the birth rate in state i in year t, $\alpha _i$ a state fixed effect, $\delta _t$ a year fixed effect, $w_{it}$ the welfare benefit (i.e., the allegedly error-ridden regressor), and $e_{it}$ the earnings per capita.

McKinnish (2008) estimates the linear panel regression model in (22) using data that is differenced over a time span of $j=1,3,5,7$ years. We denote the resulting coefficient estimates of the welfare benefit by ${\widehat{\beta }}_{w,j}$. McKinnish (2008) compares the ${\widehat{\beta }}_{w,j}$s for different values of j. In this way, she proceeds in a similar fashion as Goolsbee (2000). McKinnish (2008) establishes a monotonically increasing pattern in the ${\widehat{\beta }}_{w,j}$s, which she contributes to the presence of conceptual measurement error.

We estimate the linear panel regression model in (22) using data that is differenced over a time span of $j=1,\ldots ,10$ years. The estimation results are summarized in the upper panel of Table 5. This table also reports the results based on the FE estimator. At the 5% significance level, our bootstrap-based Wald test rejects the null hypothesis $H_{0}: \beta _{e,j}=\beta _{e,j+1}; \beta _{w,j}=\beta _{w,j+1}$ for $j=1,\ldots ,9$ at the 5% level; see Table 6. Although we do not have a case of large n here, we show the differences curves in Fig. 1a and b for the welfare benefit and the earnings variables anyhow, for the sake of illustration. The differences curves confirm the economic relevance of the rejection.^{Footnote 9}

In sum, our test outcome substantiates the doubts of McKinnish (2008) about the consistency of the FE estimator of (22). As explained in Sect. 3.4.4, if the Wald test rejects it is still possible that the FE is consistent. Because of the additional evidence against the FE estimator’s consistency provided by McKinnish (2008), we consider that possibility unlikely here. Although our test results are consistent with the presence of conceptual measurement error in the welfare benefit variable, the source of the inconsistency—measurement error or something else—remains an open question. For example, the data used by McKinnish (2008) are aggregated across different cohorts and states that may respond differently to changes in welfare over time, which may also render the FE estimator inconsistent.

We note that the sup-Wald test rejects ${\tilde{H}}_0$ at the 5% level; see Table 6. As explained in Sect. 3.4.4, this test focuses on generic strict exogeneity conditions instead of being specifically related to the FE estimator’s orthogonality conditions. We therefore view the rejection as a sign that other moment conditions related to strict exogeneity do not hold either. We refer to McKinnish (2008) for additional estimations that exploit less stringent moment conditions.

5.2 Investments and Tobin’s q

Erickson and Whited (2000) analyze the impact of Tobin’s q on the investment rate, with Tobin’s q the ratio of the market valuation of a firm’s capital stock to its replacement value. The theoretical motivation for studying this relation is the standard model of a perfectly competitive firm. This model is based on the maximization of net shareholder wealth, in the presence of convex adjustment costs following changes in the capital stock (e.g., Blundell et al. 1992). According to this model, Tobin’s q has a positive effect on the investment rate. An empirical complication is the measurement error problem associated with Tobin’s q. This problem arises due to the difference between marginal q, the conceptual variable of interest, and measured q as defined above. Erickson and Whited (2000) discuss the possible sources of measurement error in measured q and propose an estimator that controls for such error by exploring higher-order moments. Their empirical analysis is based on a Compustat firm-level panel data set for the 1992–1995 period, with $n=737$ and $T=4$.

We consider the linear panel regression model given by

$$\begin{aligned} y_{it}=\alpha _{i}+\delta _t+\beta _q q_{it}+\beta _{c}c_{it}+\beta _{fc}c_{it}f_{i}+\varepsilon _{it}, \end{aligned}$$

(23)

where $y_{it}$ denotes the ratio of investments to the replacement value of the capital stock for firm i in year t, $\alpha _i$ a firm fixed effect, $\delta _t$ a year fixed effect, $q_{it}$ the proxy of marginal Tobin’s q, $c_{it}$ cash flow divided by the replacement value of the capital stock, and $f_{i}$ a 0–1 variable indicating whether a firm is financially constrained or not. The indicator variable $f_{i}$ is constructed on the basis of a firm’s lack of bond rating and does not vary over time; its own marginal effect is therefore contained in the fixed effect $\alpha _i$.

In the presence of measurement error in q, the FE estimator of (23) will typically be inconsistent due to a lack of strict exogeneity of the proxy of marginal q. We estimate the linear panel regression model in (23) after differencing over a time span of $j=1,2,3$ years.^{Footnote 10} Detailed estimation results are given in the lower panel of Table 5. This table also reports the estimation results based on the FE estimator. At the 5% significance level, our Wald test fails to reject the null hypothesis $H_{0}: \beta _{q,j}=\beta _{q,j+1}, \beta _{c,j}=\beta _{c,j+1}; \beta _{fc,j}=\beta _{fc,j+1}$ for $j=1,2$; see again Table 6.

We note that the sup-Wald test rejects ${\tilde{H}}_0$ at the 5% level, as shown in Table 6. As noted in Sect. 3.4.4, it remains unclear what the rejection of ${\tilde{H}}_0$ means for the inconsistency of the FE estimator.

We conclude that our Wald test finds no evidence against the consistency of the FE estimator. As mentioned in Sect. 3.3, we should remain aware of the possibility that the test may have low power in certain cases. Low power could also arise from limited data variability due to taking differences, yielding coefficient estimates with relatively large standard errors. In such a scenario, our test could fail to reject in the presence of misspecification. This explanation does not seem very likely in the present case, though. The strong significance of the estimated coefficients in the lower panel of Table 5 suggests that the time-differenced data still contain a sufficient amount of variation. Another possibility is that the inconsistencies in the differences estimators do not depend on j.

Given these considerations and the rejection by the sup-Wald test, it remains important to look for other evidence against the FE estimator’ consistency, such as coefficient signs that are unlikely from an economic perspective. Here, we find the coefficient signs that we would expect on the basis of economic theory: Tobin’s q and the cash flow variable both have a positive effect on the expected investment rate, which is smaller if firms are financially constraint. In sum, also these additional investigations do not find evidence against the consistency of the FE estimator.

6 Conclusion

The FE estimator is widely used to estimate the linear panel regression model. Under large-n and fixed-T panel data asymptotics, we have developed a test to validate a sufficient condition for the FE estimator’s consistency using a stacked regression framework. Our test takes the familiar form of a panel-robust Wald test. Because our test is asymptotically equivalent to a specific GMM test for overidentifying restrictions, our approach also fits in the familiar setting of GMM estimation and specification testing.

We have shown that our Wald test will generally test a different null hypothesis than the strict exogeneity test of Wooldridge (2010) and the extension proposed by Su et al. (2016). Our Wald test is specifically tailored for testing a sufficient condition for the FE estimator’s consistency. The other two tests, by contrast, consider more generic strict exogeneity conditions that are neither sufficient nor necessary for the FE estimator’s consistency.

The Wald test’s finite-sample properties have been investigated in a simulation study, where we continued the comparison with the strict exogeneity test of Su et al. (2016). Our Wald test has been shown to possess good finite-sample properties, especially if the estimator of the covariance matrix is based on a panel bootstrap. We have also illustrated the test in two applications to existing studies from the literature.

If our tests rejects, it is still possible that the FE estimator is consistent. Further testing, as discussed below, would be needed to exclude this possibility. If our test does not reject, there is no evidence against the FE estimator’s consistency. Although this is the most favorable outcome, researchers should still be aware of the possibility that the test may have low power in certain cases. It therefore remains important to look for other evidence against the FE estimator, such as coefficient signs and magnitudes that are unlikely from an economic perspective. Researchers should also recognize the possibility that low pow could arise from limited data variability due to taking differences, yielding coefficient estimates with relatively large standard errors. Hence, although our test does not require IVs, it should be used in combination with additional analysis.

As usual, finding a well-specified model remains to a large extent a case-by-case puzzle without guaranteed success, depending on, e.g., prior information and the availability of valid and strong instruments. As a general guideline, it is nevertheless useful to consider the existing literature on the selection of moment conditions. Here a distinction is made between (i) separating valid moment conditions from invalid ones and (ii) the elimination of redundant conditions; i.e., conditions that do not contribute to a reduction in the GMM estimator’s variance (Okui 2009). Various consistent selection procedures have been proposed, including methods that add a penalty term to the usual J-statistic for overidentification (Andrews 1999). Ideally, a fully integrated selection procedure for moment conditions should include our Wald test’s moment conditions, as well as the more generic exogeneity conditions underlying the sup-Wald test of Su et al. (2016). We leave the development of such an integrated approach as a topic for future research.

A final topic for further research relates to the panel dimensions. The asymptotic distribution of our test statistic has been derived under large-n and fixed-T asymptotics, making the test suitable when $n\gg T$. This was the format in the classical panel data literature, but there has been increasing attention to panel data where n and T are of a different relative size, requiring different asymptotics. A first step would be to investigate the asymptotic behavior of our Wald test statistic for n fixed and $T\rightarrow \infty $, or for $n\rightarrow \infty $ and $T\rightarrow \infty $ jointly in some way.

Data availability

Available from the author upon request

Notes

The J-test is also known as the Hansen–Sargan, GMM or overidentifying test.
The inconsistencies of the FE estimator are denoted in matrix notation to save space. According to Corollary 1, they equal the weighted average of the inconsistencies of the differences estimators. This result gives some intuition to the matrix expressions.
Based on Newey and West (1987) and Newey and McFadden (1994), we infer that the Wald test statistic is also identical to three other well-known test statistics to test $H_{0}$: the LM test statistic, the distance-difference test statistic (where the distance-difference test is the GMM equivalent of the likelihood-ratio test) and the minimum Chi-square test statistic. Again the equality only holds if the same consistent estimator for the covariance matrix is used for these test statistics.
Su et al. (2016) also discuss two varieties of their main test: one in which (17) is estimated using the first differences estimator and one in which a subset of ${\mathcal {S}}_T$ is considered in ${\tilde{H}}_0$. For the sake of brevity, our discussion here is confined to the more efficient FE estimator that exploits the full set ${\mathcal {S}}_T$.
We note that the differences and FE estimators are all inconsistent in the motivating examples of Table 1.
This follows from the result proved in Su et al. (2016), showing that the sup-Wald test has unit asymptotic power under ${\tilde{H}}_{1}$.
Another possibility is to use the panel wild bootstrap to obtain bootstrap-based critical values for the Wald test statistic. Because this approach turned out to yield poorer empirical power and size, we do not consider it in our analysis.
We note that the power comparison between the two tests may not always be entirely fair, due to small differences in the tests’ empirical sizes. Because the simulations for the sup-Wald test are computationally very intensive, it was not feasible to size adjust the test as discussed by, e.g., Lloyd (2005).
The difference curve for the earnings variable is highly non-monotonic. Unreported examples in the setting of Table 1 confirm that non-monotonic patterns are indeed possible for certain model configurations, such as non-classical ME model with a nonzero correlation between the measurement and regression errors.
This data set of Erickson and Whited (2000) is available at http://toni.marginalq.com/publications.html.

References

Amini S, Delgado M, Henderson D, Parmeter C (2012) Fixed vs random: the Hausman test four decades later. In: Baltagi B, Carter Hill R, Newey W, White H (eds) Essays in Honor of Jerry Hausman (Advances in Econometrics), vol 29. Emerald Group Publishing Limited, Bingley, pp 479–513
Chapter Google Scholar
Andrews D (1999) Consistent moment selection procedures for generalized method of moments estimation. Econometrica 67:543–563
Article Google Scholar
Arellano M, Bond S (1991) Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations. Rev Econ Stud 58:277–297
Article Google Scholar
Baltagi B, Bresson G, Pirotte A (2003) Fixed effects, random effects or Hausman-Taylor? A pretest estimator. Econ Lett 79:361–369
Article Google Scholar
Blundell R, Bond S, Devereux M, Schiantarelli F (1992) Investment and Tobin’s $Q$: evidence from company panel data. J Econom 51:233–257
Article Google Scholar
Breusch T, Godfrey L (1986) Data transformation tests. Econ J 96:47–58
Article Google Scholar
Cameron A, Trivedi P (2005) Microeconometrics methods and applications. Cambridge University Press, New York
Book Google Scholar
Davidson R, Godfrey L, MacKinnon J (1985) A simplified version of the differencing test. Int Econ Rev 26:639–647
Article Google Scholar
Erickson T, Whited T (2000) Measurement error and the relationship between investment and $q$. J Polit Econ 108:1027–1057
Article Google Scholar
Goolsbee A (2000) The importance of measurement error in the cost of capital. Natl Tax J 53:215–228
Article Google Scholar
Griliches Z, Hausman J (1986) Errors in variables in panel data. J Econom 31:93–118
Article Google Scholar
Hall A (2005) Generalized method of moments. Oxford University Press, Oxford
Google Scholar
Harris R, Tzavalis E (1999) Inference for unit roots in dynamic panels where the time dimension is fixed. J Econom 91:201–226
Article Google Scholar
Hausman J (1978) Specification tests in econometrics. Econometrica 46:1251–1271
Article Google Scholar
Hayakawa K (2019) Alternative over-identifying restriction test in the GMM estimation of panel data models. Econom Stat 10:71–95
Google Scholar
Joshi R, Wooldridge J (2019) Correlated random effects models with endogenous explanatory variables and unbalanced panels. Ann Econ Stat 134:243–268
Article Google Scholar
Lloyd C (2005) Estimating test power adjusted for size. J Stat Comput Simul 75(11):921–933
Article Google Scholar
McKinnish T (2008) Panel data models and transitory fluctuations in the explanatory variable. In: Fomby T, Hill R, Millimet D, Smith J, Vytlacil E (eds) Advances in Econometrics. Vol. 21: Modelling and Evaluating Treatment Effects in Econometrics. pp 335–358
Mundlak Y (1961) Empirical production function free of management bias. J Farm Econ 43:44–56
Article Google Scholar
Mundlak Y, Hoch I (1965) Consequences of alternative specifications in estimation of Cobb-Douglas production functions. Econometrica 33:814–828
Article Google Scholar
Newey W (1985) Generalized method of moments specification testing. J Econom 29:229–256
Article Google Scholar
Newey W, McFadden D (1994) Large sample estimation and hypthesis testing. In: Griliches Z, Intriligator M (eds) Handbook of Econometrics, vol 4. North Holland
Newey W, West K (1987) Hypothesis testing with efficient method of moments estimation. Int Econ Rev 28:777–787
Article Google Scholar
Okui R (2009) The optimal choice of moments in dynamic panel data models. J Econom 151:1–16
Article Google Scholar
Parente P, Santos Silva J (2012) A cautionary note on tests of overidentifying restrictions. Econ Lett 115:314–317
Article Google Scholar
Plosser C, Schwert G, White H (1982) Differencing as a test of specification. Int Econ Rev 23:535–552
Article Google Scholar
Sarafidis V, Wansbeek T (2012) Cross-sectional dependence in panel data analysis. Economet Rev 31:483–531
Article Google Scholar
Su L, Zhang Y, Wei J (2016) A practical test for strict exogeneity in linear panel data models with fixed effects. Econ Lett 147:27–31
Article Google Scholar
Thursby J (1989) A comparison of several specification error tests for a general alternative. Int Econ Rev 30:217–230
Article Google Scholar
Verbeek M, Nijman T (1992) Testing for selectivity bias in panel data models. Int Econ Rev 33:681–703
Article Google Scholar
Wooldridge J (2010) Econometric analysis of cross section and panel data, 2nd edn. MIT Press, Cambridge
Google Scholar
Wooldridge JM (1995) Selection corrections for panel data models under conditional mean independence assumptions. J Econom 68:115–132
Article Google Scholar

Download references

Acknowledgements

Laura Spierdijk would like to thank Terra McKinnish for sharing her data. She is also grateful to Tom Wansbeek and Erik Meijer for providing useful comments and suggestions.

Funding

Laura Spierdijk gratefully acknowledges financial support by a Vidi grant (452.11.007) in the ‘Vernieuwingsimpuls’ program of the Netherlands Organization for Scientific Research (NWO). Her work was also supported by the Netherlands Institute for Advanced Study in the Humanities and Social Sciences (NIAS-KNAW). The usual disclaimer applies.

Author information

Authors and Affiliations

Section Financial Engineering, Department of High-tech Business and Entrepreneurship, Faculty of Behavioural, Management and Social Sciences, University of Twente, Enschede, The Netherlands
Laura Spierdijk

Authors

Laura Spierdijk
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laura Spierdijk.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix

A. FE estimator is weighted average of difference estimators

The full derivation of (3) writes as

$$\begin{aligned} {\widehat{{\varvec{\beta }}}}_{{\tiny {\hbox {FE}}}}= & {} \left( \sum _{i=1}^{n}{{\textbf {X}}}_{i}'{\textbf {A}}_T{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'{\textbf {A}}_T{{\textbf {y}}}_{i}=\left( \sum _{i=1}^{n}\sum _{t=1}^{T-1}{{\textbf {X}}}_{i}'\varvec{\Delta }_t{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}\sum _{j=1}^{T-1}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {y}}}_{i}\nonumber \\= & {} \sum _{j=1}^{T-1}\left( \sum _{i=1}^{n}\sum _{t=1}^{T-1}{{\textbf {X}}}_{i}'\varvec{\Delta }_t{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {y}}}_{i}=\sum _{j=1}^{T-1}{\textbf {W}}_j\left( \sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {y}}}_{i}\nonumber \\= & {} \sum _{j=1}^{T-1}{\textbf {W}}_j\,{\widehat{{\varvec{\beta }}}}_{j}, \end{aligned}$$

(A.1)

where

$$\begin{aligned} {\textbf {W}}_j= & {} \left( \sum _{i=1}^{n}\sum _{t=1}^{T-1}{{\textbf {X}}}_{i}'\varvec{\Delta }_t{{\textbf {X}}}_{i}\right) ^{-1}\sum _{i=1}^{n}{{\textbf {X}}}_{i}'\varvec{\Delta }_j{{\textbf {X}}}_{i}\end{aligned}$$

(A.2)

$$\begin{aligned} \sum _{j=1}^{T-1}{\textbf {W}}_j= & {} {\textbf {I}}_k, \quad {\textbf {W}}_j\quad \text{ symmetric } \text{ and } \text{ positive } \text{ definite }. \end{aligned}$$

(A.3)

B. Motivating examples: calculations

This appendix makes use of a few elementary properties of stationary AR(1) processes, which we summarize here for completeness. Assume that $x_{it}$ and $z_{it}$ are generated by stationary AR(1) processes, such that

$$\begin{aligned} x_{it}= & {} \rho x_{i,t-1}+\theta _{it} \qquad [0<\rho <1]\end{aligned}$$

(B.1)

$$\begin{aligned} z_{it}= & {} \delta z_{i,t-1}+\eta _{it} \qquad [0<\delta <1]. \end{aligned}$$

(B.2)

We assume that , and for all i and t. We also assume that ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\eta _{is})=0$ for $m\ne n$, ${\mathbb {C}}\hbox {ov}\,(\theta _{is},\eta _{it})=0$ for $s\not =t$, and ${\mathbb {C}}\hbox {ov}\,(\theta _{it},\eta _{it})=\sigma _{\theta \eta }$. Lastly, we assume that ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\varepsilon _{is})={\mathbb {C}}\hbox {ov}\,(\eta _{mt},\varepsilon _{is})=0$ for all m, i, s, t.

For $k\ge 1$, we can write

$$\begin{aligned} x_{it}=\rho ^{k}x_{i,t-k}+\sum _{\ell =0}^{k-1}\rho ^{\ell }\theta _{i,t-\ell }, \qquad z_{it}=\delta ^{k}z_{i,t-k}+\sum _{\ell =0}^{k-1}\delta ^{\ell }\eta _{i,t-\ell }. \end{aligned}$$

(B.3)

By letting $k\rightarrow \infty $, we find

$$\begin{aligned} x_{it}=\sum _{\ell =0}^{\infty }\rho ^{\ell }\theta _{i,t-\ell },\qquad z_{it}=\sum _{\ell =0}^{\infty }\delta ^{\ell }\eta _{i,t-\ell }. \end{aligned}$$

(B.4)

Using these alternative formulations for $x_{it}$ and $z_{it}$, we find for $j\ge 0$,

(B.5)

(B.6)

We also have

$$\begin{aligned} {\mathbb {C}}\hbox {ov}\,(x_{it},z_{i,t-j})= & {} \delta ^{-j}\sum _{\ell =j}^{\infty }(\delta \rho )^{\ell }{\mathbb {C}}\hbox {ov}\,(\theta _{i,t-\ell },\eta _{i,t-\ell })=\delta ^{-j}\sigma _{\theta \eta }\nonumber \\&\times \sum _{\ell =j}^{\infty }(\delta \rho )^{\ell }=\rho ^{j}\sigma _{\theta \eta }/(1-\delta \rho ). \end{aligned}$$

(B.7)

Similarly, we find

$$\begin{aligned} {\mathbb {C}}\hbox {ov}\,(x_{i,t-j},z_{it})= & {} \delta ^{j}\sigma _{\theta \eta }/(1-\delta \rho ). \end{aligned}$$

(B.8)

1.1 B.1. (Non-)Classical measurement error

We start with the errors-in-variables model and allow for non-classical measurement error, with classical measurement as a special case. We will derive the inconsistency in both cases.

Model Consider the linear panel regression model with measurement error, given by

$$\begin{aligned} y_{it}= & {} \alpha _{i}+\beta \xi _{it}+\varepsilon _{it}\end{aligned}$$

(B.9)

$$\begin{aligned} x_{it}= & {} \xi _{it}+ v_{it}, \end{aligned}$$

(B.10)

where $n=1,\ldots ,n$ and $t=1,\ldots ,T$. We assume that $(\varepsilon _{it})$ is i.i.d. with and for all i and t. Regarding $(\xi _{it})$ and $(v_{it})$, we assume that they are generated by stationary AR(1) processes, such that

$$\begin{aligned} \xi _{it}= & {} \rho \xi _{i,t-1}+\theta _{it} \qquad [0<\rho <1]\end{aligned}$$

(B.11)

$$\begin{aligned} v_{it}= & {} \delta v_{i,t-1}+\eta _{it} \qquad [0<\delta <1]. \end{aligned}$$

(B.12)

We assume that , and for all i and t. Furthermore, we assume that ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\eta _{is})=0$ for $m\ne n$, ${\mathbb {C}}\hbox {ov}\,(\theta _{is},\eta _{it})=0$ for $s\not =t$, ${\mathbb {C}}\hbox {ov}\,(\theta _{it},\eta _{it})=\sigma _{\theta \eta }$ and ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\varepsilon _{is})=0$ for all m, i, s, t. Lastly, we assume that ${\mathbb {C}}\hbox {ov}\,(\varepsilon _{mt},\eta _{is})=0$ for $m\ne i$, ${\mathbb {C}}\hbox {ov}\,(\varepsilon _{is},\eta _{it})=0$ for all s, t. If $\sigma _{\theta \eta }\not =0$, we have a form of non-classical measurement error.

Inconsistency We first show that the FE estimator will usually be inconsistent. Let ${{\textbf {X}}}_i=(x_{i1},\ldots ,x_{iT})'$ and ${{\textbf {y}}}_i=(y_{i1},\ldots ,y_{iT})'$. With ${\textbf {A}}$ the $T\times T $ centering matrix we obtain

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{\tiny {{\hbox {FE}}}}= & {} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } \dfrac{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {y}}}_i}{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {X}}}_i}=\dfrac{\hbox {tr}[{\textbf {A}}(\varvec{\Sigma }_{\xi }+\varvec{\Sigma }_{\xi v})]}{\hbox {tr}[{\textbf {A}}(\varvec{\Sigma }_{\xi }+\varvec{\Sigma }_{v})]}\beta , \end{aligned}$$

(B.13)

where $\varvec{\Sigma }_v$ contains the covariances ${\mathbb {C}}\hbox {ov}\,(v_{ns},v_{nt})$ and $\varvec{\Sigma }_{\xi v}$ the covariances ${\mathbb {C}}\hbox {ov}\,(\xi _{ns},v_{nt})$. This yields the inconsistency

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{{\tiny {\hbox {FE}}}}-\beta= & {} \dfrac{\hbox {tr}[{\textbf {A}}(\varvec{\Sigma }_{\xi v}-\varvec{\Sigma }_{v})]}{\hbox {tr}[{\textbf {A}}(\varvec{\Sigma }_{\xi }+\varvec{\Sigma }_{v})]}\beta , \end{aligned}$$

(B.14)

The inconsistency will typically be nonzero if at least .

We now turn to the estimators ${\widehat{\beta }}_j$ that are obtained after taking differences over time span j. It holds that

(B.15)

Under the given assumptions, the numerator in (B.15) reduces to

(B.16)

Furthermore, the denominator can be written as

(B.17)

The inconsistency thus boils down to

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{j}-\beta= & {} \frac{-\beta [\sigma ^{2}_{\eta }(1-\delta ^j)/(1-\delta ^2)+\sigma _{\theta \eta }(1-(\delta ^j + \rho ^{j})/2)/(1-\delta \rho )]}{\sigma ^{2}_{\theta }(1-\rho ^j)/(1-\rho ^2)+\sigma ^{2}_{\eta }(1-\delta ^j)/(1-\delta ^2)+2\sigma _{\theta \eta }(1-(\delta ^j + \rho ^{j})/2)/(1-\delta \rho )}\nonumber \\= & {} \frac{-\beta [\sigma ^{2}_{v}(1-\delta ^j)+\sigma _{\xi v}(1-(\delta ^j + \rho ^{j})/2)]}{\sigma ^{2}_{\xi }(1-\rho ^j)+\sigma ^{2}_{v}(1-\delta ^j)+2\sigma _{\xi v}(1-(\delta ^j + \rho ^{j})/2)}. \end{aligned}$$

(B.18)

Because

$$\begin{aligned} (1-\delta ^{j})(1-\rho ^{j+1})>(1-\delta ^{j+1})(1-\rho ^{j}) \end{aligned}$$

(B.19)

if and only if $\delta <\rho $, it is readily seen that the inconsistency’s magnitude decreases with j if and only if $\delta <\rho $. For $\delta >\rho $, the magnitude of the inconsistency is increasing and for $\delta =\rho $ the inconsistency does not depend on j. For both classical and non-classical measurement error, the inconsistency does not vanish for larger values of j.

1.2 B.2. Omitted variables

The second source of endogeneity that we consider is an omitted variable.

Model Consider the linear panel regression model with two regressors, given by

$$\begin{aligned} y_{it}=\alpha _{i}+\beta x_{it}+\gamma z_{it}+\varepsilon _{it}, \end{aligned}$$

(B.20)

where $i=1,\ldots ,n$ and $t=1,\ldots ,T$. We assume that $(\varepsilon _{it})$ is i.i.d. with and for all i and t. Regarding the explanatory variables, we assume that $x_{it}$ and $z_{it}$ are generated by stationary AR(1) processes, such that

$$\begin{aligned} x_{it}= & {} \rho x_{i,t-1}+\theta _{it} \qquad [0<\rho <1]\end{aligned}$$

(B.21)

$$\begin{aligned} z_{it}= & {} \delta z_{i,t-1}+\eta _{it} \qquad [0<\delta <1]. \end{aligned}$$

(B.22)

We assume that , and for all i and t. Furthermore, we assume that ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\eta _{is})=0$ for $m\ne n$, ${\mathbb {C}}\hbox {ov}\,(\theta _{is},\eta _{it})=0$ for $s\not =t$, and ${\mathbb {C}}\hbox {ov}\,(\theta _{it},\eta _{it})=\sigma _{\theta \eta }$. Lastly, we assume that ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\varepsilon _{is})={\mathbb {C}}\hbox {ov}\,(\eta _{mt},\varepsilon _{is})=0$ for all m, n, s, t.

We estimate the omitted variable regression

$$\begin{aligned} y_{it}=\alpha _{i}+\beta x_{it}+\varepsilon _{it}, \end{aligned}$$

(B.23)

and are interested in the probability limit of ${\widehat{\beta }}_{j}$, the estimator of $\beta $ based on the model after taking differences over time span j.

Inconsistency We first show that the FE estimator for $\beta $ will usually be inconsistent. Using similar matrix notation as before, we obtain

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty }{\widehat{\beta }}_{{\tiny {\hbox {FE}}}}= & {} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } \dfrac{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {y}}}_i}{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {X}}}_i}=\beta +\dfrac{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_{zx})}{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_x)}\gamma , \end{aligned}$$

(B.24)

yielding the inconsistency

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{{\tiny {\hbox {FE}}}}-\beta= & {} \dfrac{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_{zx})}{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_x)}\gamma . \end{aligned}$$

(B.25)

The inconsistency will be nonzero for $\gamma \ne 0$ and $\varvec{\Sigma }_{zx}\ne {\textbf {0}}$.

We now turn to the estimators ${\widehat{\beta }}_j$ that are obtained after taking differences over time span j. It holds that

(B.26)

Under the given assumptions, the numerator reduces to

(B.27)

For the denominator, we find

(B.28)

The probability limit then becomes

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty }{\widehat{\beta }}_{j}= & {} \frac{\beta \sigma _{\theta }^2(1-\rho ^{j})/(1-\rho ^2)+\gamma \sigma _{\theta \eta }(1-(\delta ^j + \rho ^{j})/2)/(1-\delta \rho )}{\sigma _{\theta }^2(1-\rho ^{j})/(1-\rho ^2)}\nonumber \\= & {} \beta +\frac{\gamma \sigma _{\theta \eta }[1-(\delta ^j + \rho ^{j})/2]/(1-\delta \rho )}{\sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)}. \end{aligned}$$

(B.29)

The inconsistency thus boils down to

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty }{\widehat{\beta }}_{j}-\beta= & {} \frac{\gamma \sigma _{\theta \eta }[1-(\delta ^j + \rho ^{j})/2]/(1-\delta \rho )}{\sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)}=\frac{\gamma \sigma _{xz}[1-(\delta ^j + \rho ^{j})/2]}{\sigma ^{2}_{x}(1-\rho ^{j})}.\nonumber \\ \end{aligned}$$

(B.30)

As a sanity check on the above expression, we notice that the inconsistency is zero for $\sigma _{\theta \eta }=0$. The inconsistency should be zero in this particular case, because $\sigma _{\theta \eta }=0$ implies that $x_{it}$ and $z_{it}$ are uncorrelated.

Because

$$\begin{aligned} (1-\rho ^{j+1})(1-(\delta ^{j}+\rho ^{j})/2)>(1-\rho ^{j})(1-(\delta ^{j+1}+\rho ^{j+1})/2) \end{aligned}$$

(B.31)

if and only if $\delta <\rho $, is readily seen that $\hbox {plim}_{n\rightarrow \infty }|{\widehat{\beta }}_{j}-\beta |>\hbox {plim}_{n\rightarrow \infty }|{\widehat{\beta }}_{j+1}-\beta |$ if and only if $\delta <\rho $. The inconsistency’s magnitude is increasing for $\delta >\rho $ and for $\delta =\rho $ the inconsistency does not depend on j. We note that the inconsistency does not vanish for larger values of j.

1.3 B.3. Simultaneity

The third source of endogeneity that we consider is simultaneity.

Model We consider the simultaneous equations model given by the structural equations

$$\begin{aligned} y_{it}= & {} \beta _{i}+\beta x_{it}+\varepsilon _{it}\end{aligned}$$

(B.32)

$$\begin{aligned} x_{it}= & {} \alpha _{i}+\alpha y_{it}+u_{it}. \end{aligned}$$

(B.33)

We assume that $(\varepsilon _{it})$ is i.i.d. with and , independent of $(u_{it})$. Here $(u_{it})$ is a stationary AR(1) process defined by

$$\begin{aligned} u_{it}= & {} \rho u_{i,t-1}+\theta _{it}\qquad [0<\rho <1], \end{aligned}$$

(B.34)

with , and ${\mathbb {C}}\hbox {ov}\,(\theta _{mt},\varepsilon _{is})={\textbf {0}}$ for all m, i, t, s.

Solving the two equations yields the reduced forms

$$\begin{aligned} y_{it}= & {} \frac{\beta _{i}+\beta \alpha _{i}}{1-\alpha \beta } +\frac{\beta u_{it}+\varepsilon _{it}}{1-\alpha \beta }\end{aligned}$$

(B.35)

$$\begin{aligned} x_{it}= & {} \frac{\alpha _{i}+\alpha \beta _{i}}{1-\alpha \beta }+\frac{u_{it}+\alpha \varepsilon _{it}}{1-\alpha \beta }. \end{aligned}$$

(B.36)

We estimate (B.32) in jth differences, thereby ignoring (B.33). We are interested in the probability limit of ${\widehat{\beta }}_{j}$, the estimator of $\beta $ based on the model in jth differences. We want to know how the inconsistency depends on j.

Inconsistency We first show that the FE estimator for $\beta $ will usually be inconsistent. Using similar matrix notation as before, we obtain

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{{\tiny {\hbox {FE}}}}= & {} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } \dfrac{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {y}}}_i}{\sum _{i=1}^{n}{{\textbf {x}}}_i'{\textbf {A}}{{\textbf {X}}}_i}=\dfrac{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_{u})\beta +\hbox {tr}({\textbf {A}}\varvec{\Sigma }_\varepsilon )\alpha }{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_u)+\hbox {tr}({\textbf {A}}\varvec{\Sigma }_{\varepsilon })\alpha ^2}, \end{aligned}$$

(B.37)

yielding the inconsistency

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty } {\widehat{\beta }}_{{\tiny {\hbox {FE}}}}-\beta= & {} \dfrac{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_\varepsilon )\alpha (1-\alpha \beta )}{\hbox {tr}({\textbf {A}}\varvec{\Sigma }_u)+\hbox {tr}({\textbf {A}}\varvec{\Sigma }_{\varepsilon })\alpha ^2}. \end{aligned}$$

(B.38)

The inconsistency will be nonzero if $\alpha \ne 0$ and $\alpha \not =1/\beta $.

We now turn to the estimators ${\widehat{\beta }}_j$ that are obtained after taking differences over time span j. The probability limit of the resulting estimator for $\beta $ equals

(B.39)

Under the given assumption, this reduces to

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty }{\widehat{\beta }}_{j}= & {} \frac{2[\alpha \sigma ^{2}_{\varepsilon }+\beta \sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)]}{2[\alpha ^2\sigma ^{2}_{\varepsilon }+\sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)]} =\frac{\alpha \sigma ^{2}_{\varepsilon }+\beta \sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)}{\alpha ^2\sigma ^{2}_ {\varepsilon }+\sigma ^{2}_{\theta }(1-\rho ^{j})/(1-\rho ^2)}\\&=\frac{\alpha \sigma ^{2}_{\varepsilon }+\beta \sigma ^{2}_{u}(1-\rho ^{j})}{\alpha ^2\sigma ^{2}_{\varepsilon } +\sigma ^{2}_{u}(1-\rho ^{j})}. \end{aligned}$$

This gives the inconsistency

$$\begin{aligned} \mathop {\hbox {plim}}\limits _{n\rightarrow \infty }{\widehat{\beta }}_{j}-\beta =\frac{\alpha \sigma ^{2}_{\varepsilon } (1-\alpha \beta )}{\alpha ^2\sigma ^{2}_{\varepsilon }+\sigma ^{2}_{u}(1-\rho ^{j})}. \end{aligned}$$

(B.40)

The inconsistency is positive for $\alpha (1-\alpha \beta )>0$ and negative for $\alpha (1-\alpha \beta )<0$. Its magnitude decreases with j for $0<\rho <1$. We note that the inconsistency does not vanish for larger values of j.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Spierdijk, L. Assessing the consistency of the fixed-effects estimator: a regression-based Wald test. Empir Econ 64, 1599–1630 (2023). https://doi.org/10.1007/s00181-022-02298-2

Download citation

Received: 09 June 2021
Accepted: 17 August 2022
Published: 09 September 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00181-022-02298-2

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Assessing the consistency of the fixed-effects estimator: a regression-based Wald test

Abstract

Similar content being viewed by others

Inference in Two-Step Panel Data Models with Time-Invariant Regressors: Bootstrap Versus Analytic Estimators

Large-N and Large-T Properties of Panel Data Estimators and the Hausman Test

Robust estimation and moment selection in dynamic fixed-effects panel data models

1 Introduction

2 Regression framework

2.1 FE and differences estimators

Result 1

Corollary 1

2.2 Motivating examples

2.3 Stacked regression

3 Test procedure

3.1 Wald test

Result 2

3.2 Relation to GMM tests

Result 3

Corollary 2

3.3 Trivial power

3.4 Comparison with existing tests

3.4.1 Wald test of Wooldridge (2010)

3.4.2 Sup-Wald test of Wooldridge (2010)

3.4.3 GMM framework

3.4.4 Comparison Wald and sup-Wald tests

4 Simulation study

4.1 Simulation setup

4.2 Empirical power and size

4.3 Additional simulations

4.4 Cautionary remark

5 Empirical applications

5.1 Birth rates and welfare

5.2 Investments and Tobin’s q

6 Conclusion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Appendices

Appendix

A. FE estimator is weighted average of difference estimators

B. Motivating examples: calculations

1.1 B.1. (Non-)Classical measurement error

1.2 B.2. Omitted variables

1.3 B.3. Simultaneity

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation