1 Introduction

Throughout this paper, the symbols $A'$, $\mu(A)$, $A^{+}$, $A^{-}$, $\operatorname{rk}(A)$, and $\operatorname{tr}(A)$ stand for the transpose, range, Moore-Penrose inverse, generalized inverse, rank, and trace of a matrix $A$, respectively.

Consider the following Gauss-Markov model:

$$y=X\beta+\varepsilon,\qquad E(\varepsilon)=0,\qquad \operatorname{Cov}(\varepsilon)=\sigma^{2}I_{n},$$
(1.1)

where $y$ is an $n\times 1$ observable random vector, $X$ is an $n\times p$ known design matrix with $\operatorname{rk}(X)=p$, $\varepsilon$ is an $n\times 1$ random error vector, and $\beta$ and $\sigma^{2}$ are unknown parameters.

Since $\operatorname{rk}(X)=p$, $\beta$ in model (1.1) is estimable, i.e., there exists a matrix $A$ such that $E(AY)=\beta$. The classical estimator of the regression coefficient is the least squares estimator $\hat{\beta}=(X'X)^{-1}X'y$, which is the value of $d$ that minimizes the following expression:

$$(y-Xd)'(y-Xd).$$
(1.2)

It is also the best linear unbiased estimator (BLUE) of $\beta$, and the minimized expression (1.2) measures the goodness of fit of the model. For any estimator of $\beta$, its precision is widely used to judge whether it is good or not; that is, under the quadratic loss function

$$(d-\beta)'(d-\beta),$$
(1.3)

we select the estimator that minimizes the risk. [1] combined the two criteria above and proposed the concept of balanced loss. The balanced loss function is defined as

$$L_{0}\bigl(d(y),\beta,\sigma^{2}\bigr)=w(y-Xd)'(y-Xd)+(1-w)(d-\beta)'S(d-\beta),$$
(1.4)

where $0\le w\le 1$ and $S$ is a known positive definite matrix. The balanced loss function takes both the precision of the estimator and the goodness of fit of the model into account; compared with the criteria in (1.2) and (1.3), it measures an estimator more comprehensively.
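As a small numerical illustration (not part of the original derivation), the following Python sketch computes the least squares estimator (1.2) and evaluates the balanced loss (1.4) at it; the simulated data, the weight $w=0.5$, and the choice $S=I_{p}$ are illustrative assumptions.

```python
import numpy as np

# Minimal sketch: least squares estimator and the balanced loss (1.4).
# The simulated data, w = 0.5 and S = I_p are illustrative assumptions.
rng = np.random.default_rng(0)
n, p = 20, 3
X = rng.standard_normal((n, p))          # known design matrix with rk(X) = p
beta = np.array([1.0, -2.0, 0.5])        # "true" coefficient, for illustration
sigma = 0.3
y = X @ beta + sigma * rng.standard_normal(n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)       # (X'X)^{-1} X'y

def balanced_loss(d, w=0.5, S=np.eye(p)):
    """Balanced loss (1.4): w * goodness-of-fit term + (1 - w) * precision term."""
    fit = (y - X @ d) @ (y - X @ d)                # (y - Xd)'(y - Xd)
    precision = (d - beta) @ S @ (d - beta)        # (d - beta)'S(d - beta)
    return w * fit + (1 - w) * precision

print(beta_hat, balanced_loss(beta_hat))
```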

Much work has been done on parameter estimation under the balanced loss function: [2-5] studied the risk functions of some specific estimators, [6-8] considered applications of the balanced loss function, and [9-13] investigated the performance of estimators under the balanced loss function.

In model (1.1) the errors are homoscedastic and uncorrelated. In many real problems this condition is not satisfied, and model (1.1) is then generalized to the following one:

$$y=X\beta+\varepsilon,\qquad E(\varepsilon)=0,\qquad \operatorname{Cov}(\varepsilon)=\sigma^{2}V.$$
(1.5)

For model (1.5), the BLUE of $\beta$ is $\hat{\beta}_{R}=(X'D^{+}X)^{-1}X'D^{+}y$. According to Rao's unified theory of least squares, it is the value of $d$ that minimizes the following expression:

$$(y-Xd)'D^{+}(y-Xd),$$

where $D=V+XX'$. One can show that when $V$ is nonsingular, $\hat{\beta}_{R}=(X'V^{-1}X)^{-1}X'V^{-1}y$, the generalized least squares estimator. Since the goodness of fit in model (1.5) is measured by the weighted residual $(y-Xd)'D^{+}(y-Xd)$ rather than by (1.2), the balanced loss function (1.4) cannot be applied directly to this model. Based on the idea of balanced loss in [1], we propose the general balanced loss

$$L\bigl(d(y),\beta,\sigma^{2}\bigr)=w(y-Xd)'D^{+}(y-Xd)+(1-w)(d-\beta)'S(d-\beta),$$
(1.6)

where $0\le w\le 1$ and $S$ is a known positive definite matrix.
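The next sketch illustrates $\hat{\beta}_{R}$ numerically and checks its reduction to the generalized least squares estimator when $V$ is nonsingular; the randomly generated $X$, $V$, and $y$ are illustrative assumptions.

```python
import numpy as np

# Sketch of the BLUE under model (1.5) via Rao's unified theory, with a check
# that it coincides with the GLS estimator when V is nonsingular.
rng = np.random.default_rng(1)
n, p = 15, 2
X = rng.standard_normal((n, p))
M = rng.standard_normal((n, n))
V = M @ M.T + np.eye(n)                  # nonsingular covariance, for the check
y = rng.standard_normal(n)               # illustrative response

D = V + X @ X.T                          # D = V + XX'
Dp = np.linalg.pinv(D)                   # Moore-Penrose inverse D^+

beta_R = np.linalg.solve(X.T @ Dp @ X, X.T @ Dp @ y)      # (X'D^+X)^{-1}X'D^+y
Vi = np.linalg.inv(V)
beta_GLS = np.linalg.solve(X.T @ Vi @ X, X.T @ Vi @ y)    # (X'V^{-1}X)^{-1}X'V^{-1}y
print(np.allclose(beta_R, beta_GLS))     # True: the two estimators coincide
```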

In many cases we have prior information in model (1.5); for example, the parameters may be constrained to some subset, such as an inequality or an ellipsoidal constraint. In this paper, considering model (1.5) with the balanced loss (1.6), we investigate the admissibility of linear estimators of the regression coefficient in the linear model with an inequality constraint. The inequality constraint we discuss is

$$T=\bigl\{(\beta,\sigma^{2})\mid\beta\in C=\{\beta: r'\beta\ge 0\},\ \sigma^{2}>0\bigr\},$$
(1.7)

where $r$ is a known $p\times 1$ vector. If $r=0$, the constraint always holds, so this model includes the unconstrained case.

Definition 1.1 Suppose $d_{1}(Y)$ and $d_{2}(Y)$ are two estimators of $K\beta$. If for any $(\beta,\sigma^{2})\in T$ we have

$$R\bigl(d_{1},\beta,\sigma^{2}\bigr)\le R\bigl(d_{2},\beta,\sigma^{2}\bigr)$$

and there exists $(\beta,\sigma^{2})\in T$ such that $R(d_{2},\beta,\sigma^{2})>R(d_{1},\beta,\sigma^{2})$, where the risk function is $R(d,\beta,\sigma^{2})=E\,L(d,\beta,\sigma^{2})$, then $d_{1}(Y)$ is said to be better than $d_{2}(Y)$. If no estimator in a class $\Xi$ is better than $d(Y)$ when $(\beta,\sigma^{2})$ ranges over $T$, then $d(Y)$ is called an admissible estimator of $K\beta$ in the class $\Xi$, and we denote this by $d(Y)\overset{\Xi}{\sim}K\beta\,[T]$.

We use the following notations in this paper.

$$H_{L}=\{AY: A\text{ is a }p\times n\text{ matrix}\},\qquad L=\{AY+a: A\text{ is a }p\times n\text{ matrix},\ a\in R^{p}\},$$

where $H_{L}$ is the class of homogeneous linear estimators and $L$ is the class of inhomogeneous linear estimators.

Admissibility is one of the most basic rationality requirements of classical statistical decision theory. When the parameters are unconstrained, comprehensive results have been obtained; for instance, [14-17], among others, studied admissibility in univariate linear models. As [18, 19] pointed out, when the parameters are constrained, the least squares estimator may not be admissible, so it is worthwhile to study the admissibility of linear estimators in linear models with constraints. For the Gauss-Markov model with constraints, [20] developed admissible estimators, and other researchers have contributed to this line of study: [21-24] studied admissibility in the linear model with an ellipsoidal constraint, while [25-28] studied the admissibility of linear estimators of the parameters in univariate and multivariate linear models with an inequality constraint under quadratic and matrix loss, respectively. However, under the balanced loss, the model with an inequality constraint has not been considered.

2 Admissibility in the class of homogeneous linear estimators

In this section, we study the admissibility in the class of homogeneous linear estimators. Let the quadratic loss in model (1.5) be

$$\bigl(d(Y)-g(\beta)\bigr)'\bigl(d(Y)-g(\beta)\bigr),$$
(2.1)

where $d(Y)$ is an estimator of $g(\beta)=K\beta$, with $K$ a known matrix.

Lemma 2.1 Consider model (1.5) with the loss function (2.1). Then $AY\overset{H_{L}}{\sim}K\beta\,[T]$ if and only if

(1) $AV=AX(X'D^{+}X)^{-}X'D^{+}V$;

(2) $AXWX'A'\le AXWK'$;

(3) $\operatorname{rk}\bigl[(AX-K)WX'\bigr]=\operatorname{rk}(AX-K)$,

where $D=V+XX'$ and $W=(X'D^{+}X)^{-}-I_{p}$.

Proof The proof can be obtained from Theorem 2.1 in [26] and Theorem 2.1 in [16]. □

Lemma 2.2 Under model (1.5) with the loss function (1.6), suppose $AY\in H_{L}$ is an estimator of $\beta$. Then

$$R\bigl(AY,\beta,\sigma^{2}\bigr)\ge R\bigl(AP_{X}Y,\beta,\sigma^{2}\bigr),$$
(2.2)

and the equality holds if and only if

$$AV=AP_{X}V,$$
(2.3)

where $P_{X}=X(X'D^{+}X)^{-1}X'D^{+}$.

Proof Since

$$\begin{aligned}
R\bigl(AY,\beta,\sigma^{2}\bigr)&=E\bigl\{w(Y-XAY)'D^{+}(Y-XAY)+(1-w)(AY-\beta)'S(AY-\beta)\bigr\}\\
&=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+\beta'(AX-I_{p})'B(AX-I_{p})\beta,
\end{aligned}$$
(2.4)

where $B=wX'D^{+}X+(1-w)S>0$. Noticing that $VD^{+}X=(D-XX')D^{+}X=X(I_{p}-X'D^{+}X)$ (since $DD^{+}X=X$), we have $P_{X}VD^{+}X=VD^{+}X$.

Therefore,

$$\begin{aligned}
R\bigl(AY,\beta,\sigma^{2}\bigr)-R\bigl(AP_{X}Y,\beta,\sigma^{2}\bigr)&=\sigma^{2}\operatorname{tr}\bigl[(AVA'-AP_{X}VP_{X}'A')B\bigr]\\
&=\sigma^{2}\operatorname{tr}\bigl[A(I_{n}-P_{X})V(I_{n}-P_{X})'A'B\bigr]\ge 0,
\end{aligned}$$
(2.5)

and equality holds if and only if $AV=AP_{X}V$. □
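The identities used in this proof can be checked numerically. The following sketch verifies $VD^{+}X=X(I_{p}-X'D^{+}X)$, $P_{X}VD^{+}X=VD^{+}X$, and also $P_{X}V=XWX'$ (used later in the proof of Lemma 2.4) for a randomly generated singular $V$; all inputs are illustrative.

```python
import numpy as np

# Numerical check of the identities behind Lemma 2.2, with a singular V.
rng = np.random.default_rng(2)
n, p = 8, 3
X = rng.standard_normal((n, p))
F = rng.standard_normal((n, 4))
V = F @ F.T                                        # nonnegative definite, rank 4 < n

D = V + X @ X.T                                    # D = V + XX'
Dp = np.linalg.pinv(D)                             # D^+
PX = X @ np.linalg.inv(X.T @ Dp @ X) @ X.T @ Dp    # P_X = X(X'D^+X)^{-1}X'D^+
W = np.linalg.inv(X.T @ Dp @ X) - np.eye(p)        # W = (X'D^+X)^{-1} - I_p

print(np.allclose(V @ Dp @ X, X @ (np.eye(p) - X.T @ Dp @ X)))  # V D^+ X = X(I_p - X'D^+X)
print(np.allclose(PX @ V @ Dp @ X, V @ Dp @ X))                 # P_X V D^+ X = V D^+ X
print(np.allclose(PX @ V, X @ W @ X.T))                         # P_X V = X W X'
```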

Remark 2.1 This lemma indicates that the class of estimators $\{AP_{X}Y: A\text{ is a }p\times n\text{ matrix}\}$ is a complete class within $H_{L}$; that is, for any estimator $\delta$ not in this class, there exists an estimator $\delta^{*}$ in $\{AP_{X}Y: A\text{ is a }p\times n\text{ matrix}\}$ such that $\delta^{*}$ is better than $\delta$.

Consider the following linear model:

$$Z=(X'D^{+}X)\beta+\varepsilon,\qquad E(\varepsilon)=0,\qquad \operatorname{Cov}(\varepsilon)=\sigma^{2}(X'D^{+}VD^{+}X).$$
(2.6)

Let $C=(1-w)B^{-1}S$; clearly, $C\beta$ is estimable in model (2.6). We take the loss function

$$L_{B}\bigl(d(Z),C\beta,\sigma^{2}\bigr)=\bigl(d(Z)-C\beta\bigr)'B\bigl(d(Z)-C\beta\bigr).$$
(2.7)
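Model (2.6) only specifies the first two moments of $Z$. One concrete statistic with exactly these moments under model (1.5) is $Z=X'D^{+}Y$; this identification is an illustrative assumption of the sketch below, which checks the moments empirically by simulation.

```python
import numpy as np

# Simulation sketch: Z = X'D^+ Y has mean (X'D^+X)beta and covariance
# sigma^2 X'D^+ V D^+ X under model (1.5).  Z = X'D^+ Y is an assumed choice.
rng = np.random.default_rng(3)
n, p = 6, 2
X = rng.standard_normal((n, p))
F = rng.standard_normal((n, 3))
V = F @ F.T                                 # singular covariance factor
D = V + X @ X.T
Dp = np.linalg.pinv(D)
beta, sigma = np.array([1.0, -0.5]), 0.7

reps = 200_000
eps = sigma * rng.standard_normal((reps, 3)) @ F.T   # rows have Cov = sigma^2 V
Y = X @ beta + eps                                   # draws from model (1.5)
Z = Y @ (Dp @ X)                                     # rows are Z' = Y'D^+X

print(np.abs(Z.mean(axis=0) - X.T @ Dp @ X @ beta).max())            # small (Monte Carlo error)
print(np.abs(np.cov(Z.T) - sigma**2 * X.T @ Dp @ V @ Dp @ X).max())  # small (Monte Carlo error)
```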

Lemma 2.3 Under model (2.6) with the loss function (2.7), suppose $A_{1}Y, AY\in H_{L}$. Then $R(A_{1}Y,\beta,\sigma^{2})\le R(AY,\beta,\sigma^{2})$ for any $(\beta,\sigma^{2})\in T$ if and only if

(1) $\operatorname{tr}(A_{1}XWX'A_{1}'B)-2w\operatorname{tr}(A_{1}VD^{+}X)\le\operatorname{tr}(AXWX'A'B)-2w\operatorname{tr}(AVD^{+}X)$;
(2.8)
(2) $\beta'(A_{1}X-I_{p})'B(A_{1}X-I_{p})\beta\le\beta'(AX-I_{p})'B(AX-I_{p})\beta$.
(2.9)

Proof The lemma can easily be verified from (2.4). □

Lemma 2.4 Consider model (1.5) with the loss function (1.6) and suppose $AY\in H_{L}$. Then $AP_{X}Y\overset{H_{L}}{\sim}\beta$ if and only if $\tilde{A}Z\overset{H_{L}}{\sim}C\beta$ in model (2.6) with the loss function (2.7), where $\tilde{A}=AX(X'D^{+}X)^{-1}-wB^{-1}$.

Proof Since $P_{X}V=P_{X}VP_{X}'=XWX'$, we have

$$\begin{aligned}
R\bigl(AP_{X}Y,\beta,\sigma^{2}\bigr)&=EL\bigl(AP_{X}Y,\beta,\sigma^{2}\bigr)\\
&=E\bigl\{w(Y-XAP_{X}Y)'D^{+}(Y-XAP_{X}Y)+(1-w)(AP_{X}Y-\beta)'S(AP_{X}Y-\beta)\bigr\}\\
&=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AXWX'A'B)\bigr]+\beta'(AX-I_{p})'B(AX-I_{p})\beta.
\end{aligned}$$
(2.10)

Notice that

$$\begin{aligned}
\tilde{A}(X'D^{+}VD^{+}X)\tilde{A}'&=AXWX'A'-2wAVD^{+}XB^{-1}+w^{2}B^{-1}X'D^{+}VD^{+}XB^{-1},\\
\tilde{A}(X'D^{+}X)-C&=AX-I_{p}.
\end{aligned}$$

Therefore,

$$\begin{aligned}
EL_{B}\bigl(\tilde{A}Z,C\beta,\sigma^{2}\bigr)&=E\bigl(\tilde{A}Z-C\beta\bigr)'B\bigl(\tilde{A}Z-C\beta\bigr)\\
&=\sigma^{2}\operatorname{tr}\bigl(\tilde{A}X'D^{+}VD^{+}X\tilde{A}'B\bigr)+\beta'\bigl[\tilde{A}(X'D^{+}X)-C\bigr]'B\bigl[\tilde{A}(X'D^{+}X)-C\bigr]\beta\\
&=\sigma^{2}\bigl[w^{2}\operatorname{tr}(B^{-1}X'D^{+}VD^{+}X)-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AXWX'A'B)\bigr]\\
&\quad+\beta'(AX-I_{p})'B(AX-I_{p})\beta.
\end{aligned}$$
(2.11)

Equations (2.10) and (2.11) together with Lemma 2.3 indicate that if there exists an estimator $A_{1}P_{X}Y$ of $\beta$ that is better than $AP_{X}Y$, then $\tilde{A}_{1}Z$, the corresponding estimator of $C\beta$ with $\tilde{A}_{1}=A_{1}X(X'D^{+}X)^{-1}-wB^{-1}$, is better than $\tilde{A}Z$, and conversely. □
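The two identities displayed in the proof above can be verified numerically; the randomly generated $X$, $V$, $S$, $A$, and $w$ in the following sketch are illustrative, and the trace identity is checked in the form in which it enters (2.11).

```python
import numpy as np

# Check of the identities in the proof of Lemma 2.4 for random inputs.
rng = np.random.default_rng(4)
n, p, w = 7, 3, 0.4
X = rng.standard_normal((n, p))
F = rng.standard_normal((n, 4)); V = F @ F.T            # singular V >= 0
S0 = rng.standard_normal((p, p)); S = S0 @ S0.T + np.eye(p)
A = rng.standard_normal((p, n))

D = V + X @ X.T
Dp = np.linalg.pinv(D)
G = X.T @ Dp @ X                       # X'D^+X
W = np.linalg.inv(G) - np.eye(p)
B = w * G + (1 - w) * S
Bi = np.linalg.inv(B)
Vt = X.T @ Dp @ V @ Dp @ X             # X'D^+VD^+X
At = A @ X @ np.linalg.inv(G) - w * Bi # A-tilde
C = (1 - w) * Bi @ S

# A-tilde (X'D^+X) - C = AX - I_p
print(np.allclose(At @ G - C, A @ X - np.eye(p)))
# tr(A-tilde Vt A-tilde' B) = tr(AXWX'A'B) - 2w tr(AVD^+X) + w^2 tr(B^{-1} Vt)
lhs = np.trace(At @ Vt @ At.T @ B)
rhs = (np.trace(A @ X @ W @ X.T @ A.T @ B)
       - 2 * w * np.trace(A @ V @ Dp @ X)
       + w**2 * np.trace(Bi @ Vt))
print(np.isclose(lhs, rhs))
```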

Lemma 2.5 Consider model (2.6). Then $AZ\overset{H_{L}}{\sim}C\beta$ under the loss function (2.7) if and only if $AZ\overset{H_{L}}{\sim}C\beta$ under the quadratic loss (2.1).

Proof The proof is straightforward. We omit the details. □

Theorem 2.1 Consider model (2.6) with the loss function (2.7). Then $AY\overset{H_{L}}{\sim}S\beta\,(T)$ if and only if $AY\overset{H_{L}}{\sim}S\beta$.

Proof The necessity is trivial, so we only need to prove the sufficiency. Suppose there exists $A_{1}Y$ that is better than $AY$ for $(\beta,\sigma^{2})\in T$. By Lemma 2.3, (2.8) and (2.9) hold for any $\beta\in C$. Notice that (2.9) still holds if $\beta$ is replaced by $-\beta$; in other words, (2.9) holds for any $\beta\in\tilde{C}=\{\beta:-\beta\in C\}$. Since $C\cup\tilde{C}=R^{p}$, (2.8) and (2.9) hold for any $(\beta,\sigma^{2})$, which contradicts $AY\overset{H_{L}}{\sim}S\beta$. □

Theorem 2.2 Consider model (1.5) with the loss function (1.6). Then $AY\overset{H_{L}}{\sim}\beta\,(T)$ if and only if

(1) $AV=AP_{X}V$;

(2) $\bar{A}XWX'\bar{A}'\le(1-w)\bar{A}XWSB^{-1}$;

(3) $\operatorname{rk}\bigl[(AX-I_{p})W\bigr]=\operatorname{rk}(AX-I_{p})$,

where $\bar{A}=A-wB^{-1}X'D^{+}$.

Proof By Lemma 2.2, (1) holds. Further, $AY\overset{H_{L}}{\sim}\beta\,(T)$ is equivalent to $AP_{X}Y\overset{H_{L}}{\sim}\beta$. By Lemma 2.4, in model (1.5) with the loss (1.6), $AY\overset{H_{L}}{\sim}\beta\,(T)$ holds if and only if $\tilde{A}Z\overset{H_{L}}{\sim}C\beta$ in model (2.6) with the loss (2.7), where $\tilde{A}=AX(X'D^{+}X)^{-1}-wB^{-1}$. By Lemma 2.5, this is in turn equivalent to $\tilde{A}Z\overset{H_{L}}{\sim}C\beta$ in model (2.6) with the loss (2.1). Therefore, when condition (1) is satisfied, according to Lemma 2.1 and simple computations, $AY\overset{H_{L}}{\sim}\beta\,(T)$ holds if and only if (2) and (3) are satisfied. □

Remark 2.2 The following example indicates that the conditions in the above theorem can be satisfied.

Consider the following example. Take $X=S=I_{2}$ and $V=\begin{pmatrix}1&0\\0&0\end{pmatrix}$; then $D=\begin{pmatrix}2&0\\0&1\end{pmatrix}$ and $W=\begin{pmatrix}1&0\\0&0\end{pmatrix}$. Also let $w=0.5$; then the loss function (1.6) becomes

$$L\bigl(d(y),\beta,\sigma^{2}\bigr)=\frac{1}{2}\bigl[(y-d)'D^{+}(y-d)+(d-\beta)'(d-\beta)\bigr].$$

For the diagonal matrix $A=\begin{pmatrix}a&0\\0&b\end{pmatrix}$, we consider the admissibility of $Ay$. Condition (1) of Theorem 2.2 is automatically satisfied, condition (3) implies $b=1$, and condition (2) implies $\frac{1}{3}\le a\le 1$. Thus, $Ay$ is an admissible estimator of $\beta$ if and only if $b=1$ and $\frac{1}{3}\le a\le 1$.
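The admissible region stated in this example can be reproduced numerically by checking conditions (2) and (3) of Theorem 2.2 directly; since all matrices involved here are diagonal, the matrix inequality in condition (2) can be tested through eigenvalues. The grid values used below are illustrative.

```python
import numpy as np

# Numerical check of Remark 2.2: X = S = I_2, V = diag(1, 0), w = 0.5, A = diag(a, b).
# Condition (1) of Theorem 2.2 holds automatically here, since P_X = I_2.
w = 0.5
X = S = np.eye(2)
V = np.diag([1.0, 0.0])
D = V + X @ X.T                               # diag(2, 1)
Dp = np.linalg.pinv(D)
W = np.linalg.inv(X.T @ Dp @ X) - np.eye(2)   # diag(1, 0)
B = w * X.T @ Dp @ X + (1 - w) * S            # diag(3/4, 1)

def admissible(a, b, tol=1e-9):
    A = np.diag([a, b])
    Abar = A - w * np.linalg.inv(B) @ X.T @ Dp
    # condition (2): Abar X W X' Abar' <= (1 - w) Abar X W S B^{-1}
    M = (1 - w) * Abar @ X @ W @ S @ np.linalg.inv(B) - Abar @ X @ W @ X.T @ Abar.T
    cond2 = np.all(np.linalg.eigvalsh((M + M.T) / 2) >= -tol)
    # condition (3): rk[(AX - I_2)W] = rk(AX - I_2)
    cond3 = (np.linalg.matrix_rank((A @ X - np.eye(2)) @ W)
             == np.linalg.matrix_rank(A @ X - np.eye(2)))
    return cond2 and cond3

print(admissible(0.5, 1.0))   # True:  1/3 <= a <= 1 and b = 1
print(admissible(0.2, 1.0))   # False: a < 1/3
print(admissible(0.5, 0.9))   # False: b != 1
```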

3 Admissibility in the class of inhomogeneous linear estimators

In this section, we study the admissibility in the class of inhomogeneous linear estimators.

Lemma 3.1 Let $C$ be a cone in $R^{p}$. For any vector $b$ and real number $d$,

$$\beta'b+d\le 0,\quad\forall\beta\in C$$
(3.1)

if and only if $b\in C^{*}$ and $d\le 0$, where $C^{*}=\{\alpha:\alpha'\beta\le 0,\ \forall\beta\in C\}$ is the dual cone of $C$.

Proof This lemma can be found in [26]. □

Theorem 3.1 Consider model (1.5) with the loss function (1.6). If $AY+a\overset{L}{\sim}\beta\,(T)$, then

(1) $a\in\mu(AX-I_{p})$;

(2) $\alpha'(AX-I_{p})^{+}a\ge 0$ for all $\alpha\in\mu\bigl((AX-I_{p})'\bigr)\cap C^{*}$;

(3) $AY\overset{H_{L}}{\sim}\beta\,(T)$.

Proof (1) Let $P$ be the orthogonal projection matrix onto $\mu\bigl(B^{\frac{1}{2}}(AX-I_{p})\bigr)$ and take $b=B^{-\frac{1}{2}}PB^{\frac{1}{2}}a$; then $b\in\mu(AX-I_{p})$. Since

$$R\bigl(AY+a,\beta,\sigma^{2}\bigr)=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+\bigl[(AX-I_{p})\beta+a\bigr]'B\bigl[(AX-I_{p})\beta+a\bigr].$$
(3.2)

Therefore,

$$\begin{aligned}
R\bigl(AY+a,\beta,\sigma^{2}\bigr)-R\bigl(AY+b,\beta,\sigma^{2}\bigr)&=\bigl[(AX-I_{p})\beta+a\bigr]'B\bigl[(AX-I_{p})\beta+a\bigr]-\bigl[(AX-I_{p})\beta+b\bigr]'B\bigl[(AX-I_{p})\beta+b\bigr]\\
&=a'Ba-b'Bb=a'B^{\frac{1}{2}}(I-P'P)B^{\frac{1}{2}}a\ge 0,
\end{aligned}$$
(3.3)

and equality holds if and only if $B^{\frac{1}{2}}a=PB^{\frac{1}{2}}a$, i.e., $a=B^{-\frac{1}{2}}PB^{\frac{1}{2}}a=b$. This means that if $a\notin\mu(AX-I_{p})$, then $AY+b$ is better than $AY+a$, which is a contradiction.

(2) Assume there exists $\alpha\in\mu\bigl((AX-I_{p})'\bigr)\cap C^{*}$ such that $\alpha'(AX-I_{p})^{+}a<0$. Then there exists $\alpha_{0}$ such that $\alpha=(AX-I_{p})'\alpha_{0}$. Take $b=a+\lambda B^{-1}\alpha_{0}$, where $\lambda>0$. For any $(\beta,\sigma^{2})\in T$, we have

$$R\bigl(AY+b,\beta,\sigma^{2}\bigr)-R\bigl(AY+a,\beta,\sigma^{2}\bigr)=2\lambda\alpha'\beta+2\lambda\alpha'(AX-I_{p})^{+}a+\lambda^{2}\alpha_{0}'B^{-1}\alpha_{0}.$$

According to Lemma 3.1, for any sufficiently small $\lambda>0$ and any $(\beta,\sigma^{2})\in T$,

$$R\bigl(AY+b,\beta,\sigma^{2}\bigr)-R\bigl(AY+a,\beta,\sigma^{2}\bigr)\le 0.$$

Hence $AY+b$ is better than $AY+a$, which contradicts $AY+a\overset{L}{\sim}\beta\,(T)$.

(3) By (1), there exists $a_{0}$ such that $a=(AX-I_{p})a_{0}$. Suppose $A_{1}Y$ is as good as $AY$; then, for any $(\beta,\sigma^{2})\in T$,

$$R\bigl(A_{1}Y,\beta,\sigma^{2}\bigr)\le R\bigl(AY,\beta,\sigma^{2}\bigr).$$

By Lemma 2.3, (2.8) and (2.9) hold. Notice that (2.9) still holds for any $\beta\in\tilde{C}=\{\beta:-\beta\in C\}$ and that $C\cup\tilde{C}=R^{p}$; therefore (2.9) is equivalent to

$$(A_{1}X-I_{p})'B(A_{1}X-I_{p})\le(AX-I_{p})'B(AX-I_{p}).$$
(3.4)

From (2.8) and (3.4) we obtain, for any $(\beta,\sigma^{2})\in T$,

$$\begin{aligned}
&\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(A_{1}VD^{+}X)+\operatorname{tr}(A_{1}VA_{1}'B)\bigr]+(\beta+a_{0})'(A_{1}X-I_{p})'B(A_{1}X-I_{p})(\beta+a_{0})\\
&\quad\le\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+(\beta+a_{0})'(AX-I_{p})'B(AX-I_{p})(\beta+a_{0}).
\end{aligned}$$
(3.5)

That is,

$$R\bigl(A_{1}Y+(A_{1}X-I_{p})a_{0},\beta,\sigma^{2}\bigr)\le R\bigl(AY+a,\beta,\sigma^{2}\bigr).$$
(3.6)

Since $AY+a\overset{L}{\sim}\beta\,(T)$, equality must hold in (3.6), and hence equality holds in (3.5). Notice that for $(\beta,\sigma^{2})\in T$ and any $\lambda>0$ we have $(\lambda\beta,\sigma^{2})\in T$. Therefore,

$$\begin{aligned}
R\bigl(A_{1}Y,\beta,\sigma^{2}\bigr)&=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(A_{1}VD^{+}X)+\operatorname{tr}(A_{1}VA_{1}'B)\bigr]+\beta'(A_{1}X-I_{p})'B(A_{1}X-I_{p})\beta\\
&=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+\beta'(AX-I_{p})'B(AX-I_{p})\beta\\
&=R\bigl(AY,\beta,\sigma^{2}\bigr).
\end{aligned}$$

This implies that no estimator in $H_{L}$ is better than $AY$. Thus, $AY\overset{H_{L}}{\sim}\beta\,(T)$. □

In fact, the converse part of Theorem 3.1 is also true. We present this in the following theorem.

Theorem 3.2 Consider model (1.5) with the loss function (1.6). Then $AY+a\overset{L}{\sim}\beta\,(T)$ holds if and only if

(1) $a\in\mu(AX-I_{p})$;

(2) $\alpha'(AX-I_{p})^{+}a\ge 0$ for all $\alpha\in\mu\bigl((AX-I_{p})'\bigr)\cap C^{*}$;

(3) $AY\overset{H_{L}}{\sim}\beta\,(T)$.

Proof By the proof of (1) in Theorem 3.1, it suffices to prove that there do not exist a $p\times n$ matrix $A_{1}$ and $b\in R^{p}$ such that $A_{1}Y+(A_{1}X-I_{p})b$ is better than $AY+(AX-I_{p})a_{0}$, where $(AX-I_{p})a_{0}=a$.

Suppose $A_{1}Y+(A_{1}X-I_{p})b$ is as good as $AY+(AX-I_{p})a_{0}$; then, for any $(\beta,\sigma^{2})\in T$,

$$R\bigl(A_{1}Y+(A_{1}X-I_{p})b,\beta,\sigma^{2}\bigr)\le R\bigl(AY+(AX-I_{p})a_{0},\beta,\sigma^{2}\bigr).$$

Hence,

$$\begin{aligned}
&\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(A_{1}VD^{+}X)+\operatorname{tr}(A_{1}VA_{1}'B)\bigr]+(\beta+b)'(A_{1}X-I_{p})'B(A_{1}X-I_{p})(\beta+b)\\
&\quad\le\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+(\beta+a_{0})'(AX-I_{p})'B(AX-I_{p})(\beta+a_{0}).
\end{aligned}$$
(3.7)

Notice that $(\beta,k\sigma^{2})\in T$ for any $k>0$. Plugging this into (3.7) and letting $k$ go to $\infty$ and to $0$, respectively, we have

$$\operatorname{tr}(A_{1}XWX'A_{1}'B)-2w\operatorname{tr}(A_{1}VD^{+}X)\le\operatorname{tr}(AXWX'A'B)-2w\operatorname{tr}(AVD^{+}X)$$

and

$$(\beta+b)'(A_{1}X-I_{p})'B(A_{1}X-I_{p})(\beta+b)\le(\beta+a_{0})'(AX-I_{p})'B(AX-I_{p})(\beta+a_{0}).$$
(3.8)

Similarly, replacing $\beta$ by $\lambda\beta$ in (3.8) and letting $\lambda$ go to $\infty$, we have

$$\beta'(A_{1}X-I_{p})'B(A_{1}X-I_{p})\beta\le\beta'(AX-I_{p})'B(AX-I_{p})\beta.$$

Therefore, $R(A_{1}Y,\beta,\sigma^{2})\le R(AY,\beta,\sigma^{2})$. Since $AY\overset{H_{L}}{\sim}\beta\,(T)$, we get

$$\begin{aligned}
&\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(A_{1}VD^{+}X)+\operatorname{tr}(A_{1}VA_{1}'B)\bigr]+\beta'(A_{1}X-I_{p})'B(A_{1}X-I_{p})\beta\\
&\quad=\sigma^{2}\bigl[w\operatorname{tr}(VD^{+})-2w\operatorname{tr}(AVD^{+}X)+\operatorname{tr}(AVA'B)\bigr]+\beta'(AX-I_{p})'B(AX-I_{p})\beta.
\end{aligned}$$

Using the same technique, for any $(\beta,\sigma^{2})\in T$ we have

$$\operatorname{tr}(A_{1}XWX'A_{1}'B)-2w\operatorname{tr}(A_{1}VD^{+}X)=\operatorname{tr}(AXWX'A'B)-2w\operatorname{tr}(AVD^{+}X),$$
(3.9)
$$(A_{1}X-I_{p})'B(A_{1}X-I_{p})=(AX-I_{p})'B(AX-I_{p}).$$
(3.10)

From (3.7), (3.9), and (3.10) we get, for any $(\beta,\sigma^{2})\in T$,

$$2\beta'(AX-I_{p})'B(AX-I_{p})b+b'(AX-I_{p})'B(AX-I_{p})b\le 2\beta'(AX-I_{p})'B(AX-I_{p})a_{0}+a_{0}'(AX-I_{p})'B(AX-I_{p})a_{0}.$$

Hence,

$$2\beta'(AX-I_{p})'B(AX-I_{p})\bigl[b-(AX-I_{p})^{+}a\bigr]+b'(AX-I_{p})'B(AX-I_{p})b-a'Ba\le 0.$$

From Lemma 3.1,

$$b'(AX-I_{p})'B(AX-I_{p})b-a'Ba\le 0,$$
(3.11)
$$(AX-I_{p})'B(AX-I_{p})\bigl[b-(AX-I_{p})^{+}a\bigr]\in C^{*}.$$
(3.12)

This together with the condition (2) implies that

$$\bigl[b-(AX-I_{p})^{+}a\bigr]'(AX-I_{p})'B(AX-I_{p})(AX-I_{p})^{+}a=b'(AX-I_{p})'Ba-a'Ba\ge 0.$$

Hence,

$$b'(AX-I_{p})'Ba\ge a'Ba.$$
(3.13)

From (3.11) and (3.13) we have $[(AX-I_{p})b-a]'B[(AX-I_{p})b-a]\le 0$. Thus $[(AX-I_{p})b-a]'B[(AX-I_{p})b-a]=0$, and since $B>0$,

$$B(AX-I_{p})b=Ba=B(AX-I_{p})a_{0}.$$
(3.14)

Plugging (3.9), (3.10), and (3.14) into (3.7), we find that equality holds in (3.7). This means there does not exist an estimator that is better than $AY+a$. Therefore, $AY+a\overset{L}{\sim}\beta\,(T)$ holds. □

We summarize Theorem 2.2 and Theorem 3.2 in the following theorem.

Theorem 3.3 Consider model (1.5) with the loss function (1.6). Then $AY+a\overset{L}{\sim}\beta\,(T)$ holds if and only if

(1) $a\in\mu(AX-I_{p})$;

(2) $\alpha'(AX-I_{p})^{+}a\ge 0$ for all $\alpha\in\mu\bigl((AX-I_{p})'\bigr)\cap C^{*}$;

(3) $AV=AP_{X}V$;

(4) $\bar{A}XWX'\bar{A}'\le(1-w)\bar{A}XWSB^{-1}$;

(5) $\operatorname{rk}\bigl[(AX-I_{p})W\bigr]=\operatorname{rk}(AX-I_{p})$,

where $\bar{A}=A-wB^{-1}X'D^{+}$ as in Theorem 2.2.

Conclusion

In this paper, under a generalized balanced loss function, we have studied the admissibility of linear estimators of the regression coefficient in the general Gauss-Markov model with an inequality constraint. Necessary and sufficient conditions for a linear estimator of the regression coefficient to be admissible are obtained in the classes of homogeneous and inhomogeneous linear estimators, respectively.