Abstract
In the mean regression context, this study considers several frequently encountered heteroscedastic error models in which the regression mean and variance functions are specified up to certain parameters. An important point we note through a series of analyses is that different assumptions on the standardized regression errors yield quite different efficiency bounds for the corresponding estimators. Consequently, all aspects of the assumptions need to be specifically taken into account in constructing the corresponding efficient estimators. This study clarifies the relation between the regression error assumptions and their respective efficiency bounds under the general regression framework with heteroscedastic errors. Our simulation results support these findings, and we carry out a real data analysis using the proposed methods, in which the Cobb–Douglas cost model serves as the regression mean.
Acknowledgements
Mijeong Kim was supported by a Ewha Womans University Research Grant of 2015 and a National Research Foundation of Korea (NRF) grant funded by the Korean Government (NRF-2017R1C1B5015186). Yanyuan Ma was supported by National Science Foundation DMS-1608540.
Appendix
1.1 Proof of Proposition 1
First, note that the construction of t ensures the properties \(E(t\mid \mathbf{X})=\mathbf{0}\) and \(E(\epsilon t\mid \mathbf{X})=\mathbf{0}\), which are crucial for the following proof.
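For intuition, one concrete construction with these two moment properties — a hypothetical choice for illustration only, since the actual definition of t is given in the main text — is \(t=\epsilon ^2-1-E(\epsilon ^3\mid \mathbf{X})\epsilon \), which satisfies \(E(t\mid \mathbf{X})=0\) and \(E(\epsilon t\mid \mathbf{X})=0\) whenever \(E(\epsilon \mid \mathbf{X})=0\) and \(E(\epsilon ^2\mid \mathbf{X})=1\). A minimal Monte Carlo sketch (conditioning on \(\mathbf{X}\) suppressed):

```python
import numpy as np

rng = np.random.default_rng(0)

# Skewed error, standardized in-sample so that mean(eps) = 0 and var(eps) = 1.
z = rng.chisquare(4, size=100_000)
eps = (z - z.mean()) / z.std()

s3 = np.mean(eps**3)        # sample analogue of E(eps^3)
t = eps**2 - 1 - s3 * eps   # hypothetical construction of t

# Both moments vanish (up to floating-point error).
print(np.mean(t))           # ~ 0 : E(t) = 0
print(np.mean(eps * t))     # ~ 0 : E(eps * t) = 0
```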
Following the semiparametric consideration in (1), we estimate the parameters \({\varvec{\theta }}\), while the nuisance parameter is the distribution \(f_{\epsilon \mid \mathbf{X}}(\epsilon ,\mathbf{x})\), which is subject to the constraints \(E(\epsilon |\mathbf{X})=0\) and \(E(\epsilon ^2|\mathbf{X})=1\). Note that we do not assume any parametric distribution model on \(\epsilon \). Thus, (1) is a semiparametric model. We first write out the joint distribution of \((\epsilon ,\mathbf{X})\) as
$$\begin{aligned} f_{\epsilon ,\mathbf{X}}(\epsilon ,\mathbf{x})=\eta _1(\mathbf{x})\,\eta _2(\epsilon ,\mathbf{x}), \end{aligned}$$
where \(\eta _1(\cdot )\) is the marginal density of \(\mathbf{X}\), \(\eta _2(\cdot ,\cdot )\) is the conditional density of \(\epsilon \) given \(\mathbf{X}\), and together they form the infinite-dimensional nuisance parameter functions. Since \(\eta _1(\cdot )\) and \(\eta _2(\cdot ,\cdot )\) are probability density functions, we have
$$\begin{aligned} \int \eta _1(\mathbf{x})\,\mathrm{d}\mathbf{x}=1, \qquad \int \eta _2(\epsilon ,\mathbf{x})\,\mathrm{d}\epsilon =1, \end{aligned}$$
together with the constraints \(\int \epsilon \,\eta _2(\epsilon ,\mathbf{x})\,\mathrm{d}\epsilon =0\) and \(\int \epsilon ^2\eta _2(\epsilon ,\mathbf{x})\,\mathrm{d}\epsilon =1\).
In the Hilbert space \(\mathcal {H}\) of all mean-zero, finite-variance functions, the nuisance tangent space of a semiparametric model is the mean-square closure of the nuisance tangent spaces of all parametric submodels. A parametric submodel is a parametric model that is contained in the original semiparametric model and contains the true density \(f(\epsilon ,\mathbf{X};{\varvec{\theta }}_0,{\varvec{\gamma }}_0)\). The nuisance tangent space of a parametric model \(f(\epsilon ,\mathbf{X};{\varvec{\theta }},{\varvec{\gamma }})\) is the linear space spanned by the nuisance score vector \(S_{{\varvec{\gamma }}}=\partial \mathrm{log}f(\epsilon ,\mathbf{X};{\varvec{\theta }}_0,{\varvec{\gamma }})/\partial {\varvec{\gamma }}|_{{\varvec{\gamma }}_0}\). Based on these general concepts, we first derive the nuisance tangent space \(\Lambda \). From Condition (2), we have
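These standard definitions can be restated compactly as follows (the dimension symbols \(d\) and \(r\) below are illustrative and not taken from the main text):

```latex
$$\begin{aligned}
\Lambda _{{\varvec{\gamma }}}
=\left\{ \mathbf{B}S_{{\varvec{\gamma }}}:\mathbf{B}\in \mathbb {R}^{d\times r}\right\} ,
\qquad
\Lambda =\overline{\bigcup \Lambda _{{\varvec{\gamma }}}},
\end{aligned}$$
```

where \(d=\mathrm{dim}({\varvec{\theta }})\), \(r=\mathrm{dim}({\varvec{\gamma }})\), the union runs over all parametric submodels, and the closure is taken in mean square.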
From (15) and (16), it follows that
In the above, we use
Combining \(\Lambda _1\) and \(\Lambda _2\), the nuisance tangent space \(\Lambda \) can be written as
$$\begin{aligned} \Lambda =\Lambda _1\oplus \Lambda _2 =\big \{\mathbf{a}(\mathbf{X})+\mathbf{b}(\epsilon ,\mathbf{X}): E\{\mathbf{a}(\mathbf{X})\}=\mathbf{0},\ E\{\mathbf{b}(\epsilon ,\mathbf{X})\mid \mathbf{X}\}=\mathbf{0},\ E\{\epsilon \mathbf{b}(\epsilon ,\mathbf{X})\mid \mathbf{X}\}=\mathbf{0},\ E\{\epsilon ^2\mathbf{b}(\epsilon ,\mathbf{X})\mid \mathbf{X}\}=\mathbf{0}\big \}. \end{aligned}$$
In order to find \(\Lambda ^\perp \), we let \(\mathbf{K}=\{\mathbf{g}(\mathbf{x},\epsilon ):\mathbf{g}(\mathbf{x},\epsilon )=\mathbf{g}_1(\mathbf{x})\epsilon +\mathbf{g}_2(\mathbf{x})t\}\) and claim that \(\Lambda ^\perp =\mathbf{K}\), which we establish by showing that \(\mathbf{K}\subset \Lambda ^\perp \) and \(\Lambda ^\perp \subset \mathbf{K}\).
For arbitrary \(\mathbf{h}(\mathbf{x},\epsilon )\in \Lambda \) and \(\mathbf{g}(\mathbf{x},\epsilon )=\mathbf{g}_1(\mathbf{x})\epsilon +\mathbf{g}_2(\mathbf{x})t\in \mathbf{K}\),
$$\begin{aligned} E\{\mathbf{g}^\mathrm{T}(\mathbf{X},\epsilon )\mathbf{h}(\mathbf{X},\epsilon )\} =E\left[ \mathbf{g}_1^\mathrm{T}(\mathbf{X})E\{\epsilon \mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}\}\right] +E\left[ \mathbf{g}_2^\mathrm{T}(\mathbf{X})E\{t\mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}\}\right] =0, \end{aligned}$$
since \(E\{\epsilon \mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}\}=\mathbf{0}\) and \(E\{t\mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}\}=\mathbf{0}\) for every element of \(\Lambda \).
Hence, \(\mathbf{g}(\mathbf{x},\epsilon )=\mathbf{g}_1(\mathbf{x})\epsilon +\mathbf{g}_2(\mathbf{x})t\in \Lambda ^\perp \). Thus, \(\mathbf{K}\subset \Lambda ^\perp \).
Now, we will show \(\Lambda ^\perp \subset \mathbf{K}\).
For arbitrary \(\mathbf{h}(\mathbf{x},\epsilon )\in \Lambda ^\perp \), we can decompose \(\mathbf{h}(\mathbf{x},\epsilon )\) into \( \mathbf{h}(\mathbf{x},\epsilon )=\mathbf{r}_1(\mathbf{x},\epsilon )+\mathbf{r}_2(\mathbf{x},\epsilon ), \) where
$$\begin{aligned} \mathbf{r}_2(\mathbf{x},\epsilon )=E\{\epsilon \mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}=\mathbf{x}\}\epsilon +\frac{E\{t\mathbf{h}(\mathbf{X},\epsilon )\mid \mathbf{X}=\mathbf{x}\}}{E(t^2\mid \mathbf{X}=\mathbf{x})}t, \qquad \mathbf{r}_1(\mathbf{x},\epsilon )=\mathbf{h}(\mathbf{x},\epsilon )-\mathbf{r}_2(\mathbf{x},\epsilon ). \end{aligned}$$
It is obvious that \(\mathbf{r}_2(\mathbf{x},\epsilon )\in \mathbf{K}\). Since \(\mathbf{r}_2(\mathbf{x},\epsilon )\in \mathbf{K}\subset \Lambda ^\perp \) and \(\mathbf{h}(\mathbf{x},\epsilon )\in \Lambda ^\perp \), we have \(\mathbf{r}_1(\mathbf{x},\epsilon )=\mathbf{h}(\mathbf{x},\epsilon )-\mathbf{r}_2(\mathbf{x},\epsilon )\in \Lambda ^\perp \). For \(\mathbf{r}_1\), we can easily verify that
$$\begin{aligned} E\{\epsilon \mathbf{r}_1(\mathbf{X},\epsilon )\mid \mathbf{X}\}=\mathbf{0} \quad \text {and}\quad E\{t\mathbf{r}_1(\mathbf{X},\epsilon )\mid \mathbf{X}\}=\mathbf{0}. \end{aligned}$$
This indicates that \(\mathbf{r}_1(\mathbf{x},\epsilon )\in \Lambda \). Since \(\mathbf{r}_1(\mathbf{x},\epsilon )\) is also an element of \(\Lambda ^\perp \), it follows that \(\mathbf{r}_1(\mathbf{x},\epsilon )=\mathbf{0}\), and we obtain \(\mathbf{h}(\mathbf{x},\epsilon )=\mathbf{r}_2(\mathbf{x},\epsilon )\in \mathbf{K}\). Since \(\mathbf{h}(\mathbf{x},\epsilon )\) was an arbitrary element of \(\Lambda ^\perp \), we conclude that \(\Lambda ^\perp \subset \mathbf{K}\).
Consequently, we have \(\Lambda ^\perp =\mathbf{K}=\{\mathbf{g}_1(\mathbf{x})\epsilon +\mathbf{g}_2(\mathbf{x})t\}\).
\(\square \)
1.2 Proof of Theorem 1
In model (1), we obtain the score functions of \({\varvec{\theta }}=({\varvec{\alpha }}^\mathrm{T},{\varvec{\beta }}^\mathrm{T})^\mathrm{T}\) as
By projecting the above score vectors onto \(\Lambda ^\perp \), we can find the efficient score vectors of \({\varvec{\alpha }}\) and \({\varvec{\beta }}\).
First, \({\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}\) and \({\varvec{S}}_{\mathrm{eff},{\varvec{\beta }}}\) are both of the form \(\mathbf{g}_1(\mathbf{x})\epsilon +\mathbf{g}_2(\mathbf{x})t\). Thus, \({\varvec{S}}{_\mathrm{eff}}\in \Lambda ^\perp \).
Now, we verify that \({\varvec{S}}_{{\varvec{\theta }}}-{\varvec{S}}{_\mathrm{eff}}\in \Lambda \). \({\varvec{S}}_{{\varvec{\theta }}}-{\varvec{S}}{_\mathrm{eff}}\) is given by
We verify \({\varvec{S}}_{{\varvec{\theta }}}-{\varvec{S}}{_\mathrm{eff}}\in \Lambda \) by showing the following.
We can check the details as follows.
The above results imply that \(E({\varvec{S}}_{{\varvec{\theta }}}-{\varvec{S}}{_\mathrm{eff}})=\mathbf{0}\).
The above equations verify that \({\varvec{S}}_{{\varvec{\theta }}}-{\varvec{S}}{_\mathrm{eff}}\in \Lambda \).
Now, we compute the efficient information matrix \(E({\varvec{S}}_\mathrm{eff}{\varvec{S}}^\mathrm{T}_\mathrm{eff})\). First, we calculate \({\varvec{S}}_\mathrm{eff}{\varvec{S}}^\mathrm{T}_\mathrm{eff}\).
where
Since \(E({\varvec{S}}_\mathrm{eff}{\varvec{S}}^\mathrm{T}_\mathrm{eff})=E\{E({\varvec{S}}{_\mathrm{eff}}{\varvec{S}}{_\mathrm{eff}}^\mathrm{T}|\mathbf{X})\}\), we have each block inside \(E({\varvec{S}}_\mathrm{eff}{\varvec{S}}^\mathrm{T}_\mathrm{eff})\) as follows.
1. \(E({\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}{\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}^\mathrm{T})\) is equivalent to
$$\begin{aligned} E\left( E\left[ \left\{ \epsilon - \frac{E(\epsilon ^3|\mathbf{X})}{E(t^2|\mathbf{X})}t \right\} ^2|\mathbf{X}\right] e^{-2\sigma (\mathbf{X},{\varvec{\beta }})}\mathbf{m}'_{{\varvec{\alpha }}}(\mathbf{X},{\varvec{\alpha }}){\mathbf{m}'}^\mathrm{T}_{{\varvec{\alpha }}}(\mathbf{X},{\varvec{\alpha }})\right) . \end{aligned}$$
The conditional expectation \(E\left[ \left\{ \epsilon - \frac{E(\epsilon ^3|\mathbf{X})}{E(t^2|\mathbf{X})}t \right\} ^2|\mathbf{X}\right] \) in this expression can be rewritten as
$$\begin{aligned} E\left[ \left\{ \epsilon - \frac{E(\epsilon ^3|\mathbf{X})}{E(t^2|\mathbf{X})}t \right\} ^2|\mathbf{X}\right] =1+\frac{\{E(\epsilon ^3|\mathbf{X})\}^2}{E(t^2|\mathbf{X})}. \end{aligned}$$
Thus, \(E({\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}{\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}^\mathrm{T})\) becomes
$$\begin{aligned} E\left( \left[ 1+\frac{\{E(\epsilon ^3|\mathbf{X})\}^2}{E(t^2|\mathbf{X})}\right] e^{-2\sigma (\mathbf{X},{\varvec{\beta }})}\mathbf{m}'_{{\varvec{\alpha }}}(\mathbf{X},{\varvec{\alpha }}){\mathbf{m}'}^\mathrm{T}_{{\varvec{\alpha }}}(\mathbf{X},{\varvec{\alpha }})\right) . \end{aligned}$$
2. \(E({\varvec{S}}_{\mathrm{eff},{\varvec{\alpha }}}{\varvec{S}}_{\mathrm{eff},{\varvec{\beta }}}^\mathrm{T})\) can be rewritten as
$$\begin{aligned}&E\left[ E\left\{ \epsilon t- \frac{E(\epsilon ^3|\mathbf{X})}{E(t^2|\mathbf{X})}t^2 |\mathbf{X}\right\} \frac{2}{E(t^2|\mathbf{X})}e^{-\sigma (\mathbf{X},{\varvec{\beta }})} \mathbf{m}'_{{\varvec{\alpha }}} (\mathbf{X},{\varvec{\alpha }}){{\varvec{\sigma }}'}^\mathrm{T}_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }}) \right] \\&\quad =E\left\{ -\frac{2E(\epsilon ^3|\mathbf{X})}{E(t^2|\mathbf{X})}e^{-\sigma (\mathbf{X},{\varvec{\beta }})} \mathbf{m}'_{{\varvec{\alpha }}} (\mathbf{X},{\varvec{\alpha }}){{\varvec{\sigma }}'}^\mathrm{T}_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }}) \right\} . \end{aligned}$$
3. \(E({\varvec{S}}_{\mathrm{eff},{\varvec{\beta }}}{\varvec{S}}_{\mathrm{eff},{\varvec{\beta }}}^\mathrm{T})\) is equivalent to
$$\begin{aligned} E\left( \frac{4E(t^2|\mathbf{X})}{\{E(t^2|\mathbf{X})\}^2}{\varvec{\sigma }}'_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }}){{\varvec{\sigma }}'}^\mathrm{T}_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }})\right) =E\left\{ \frac{4}{E(t^2|\mathbf{X})}{\varvec{\sigma }}'_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }}){{\varvec{\sigma }}'}^\mathrm{T}_{\varvec{\beta }}(\mathbf{X},{\varvec{\beta }})\right\} . \end{aligned}$$
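As a side check, the moment identities in items 1 and 2 can be verified numerically. The sketch below drops the conditioning on \(\mathbf{X}\) (one fixed design point) and uses the hypothetical construction \(t=\epsilon ^2-1-E(\epsilon ^3)\epsilon \) (an assumption for illustration; the actual t is defined in the main text), which satisfies \(E(\epsilon t)=0\), so that both displayed identities follow by direct expansion:

```python
import numpy as np

rng = np.random.default_rng(1)

# Skewed error, standardized in-sample: mean(eps) = 0 and var(eps) = 1 exactly.
z = rng.gamma(2.0, size=200_000)
eps = (z - z.mean()) / z.std()

s3 = np.mean(eps**3)        # E(eps^3)
t = eps**2 - 1 - s3 * eps   # hypothetical t: E(t) = 0, E(eps*t) = 0
Et2 = np.mean(t**2)         # E(t^2)
c = s3 / Et2                # coefficient E(eps^3)/E(t^2)

# Item 1: E[(eps - c*t)^2] = 1 + {E(eps^3)}^2 / E(t^2)
lhs1 = np.mean((eps - c * t) ** 2)
rhs1 = 1 + s3**2 / Et2

# Item 2: E(eps*t - c*t^2) = -E(eps^3)
lhs2 = np.mean(eps * t - c * t**2)
rhs2 = -s3

print(lhs1, rhs1)  # the two values agree
print(lhs2, rhs2)  # the two values agree
```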
Theorem 1 is an immediate consequence of the above calculations. \(\square \)
Kim, M., Ma, Y. Semiparametric efficient estimators in heteroscedastic error models. Ann Inst Stat Math 71, 1–28 (2019). https://doi.org/10.1007/s10463-017-0622-0