1 Introduction

Longitudinal data (Diggle et al. [1]) are characterized by repeated observations over time on the same set of individuals. They are common in medical and epidemiological studies; examples are easily found in clinical trials and follow-up studies for monitoring disease progression. Interest often focuses on evaluating the effects of time and covariates on the outcome variables. Let $t_{ij}$ be the time of the $j$th measurement of the $i$th subject, and let $x_{ij}\in\mathbb{R}^{p}$ and $y_{ij}$ be the $i$th subject's observed covariate and outcome at time $t_{ij}$, respectively. We assume that the full dataset $\{(x_{ij}, y_{ij}, t_{ij}),\ i=1,\ldots,n,\ j=1,\ldots,m_i\}$, where $n$ is the number of subjects and $m_i$ is the number of repeated measurements of the $i$th subject, is observed and can be modeled by the following partially linear model

$$y_{ij} = x_{ij}^{T}\beta + g(t_{ij}) + e_{ij},$$
(1.1)

where $\beta$ is a $p\times 1$ vector of unknown parameters, $g(\cdot)$ is an unknown smooth function, and the $e_{ij}$ are random errors with $E(e_{ij})=0$. We assume without loss of generality that the $t_{ij}$ are all scaled into the interval $I=[0,1]$. Although the observations, and therefore the $e_{ij}$, from different subjects are independent, they can be dependent within each subject.

Partially linear models keep the flexibility of nonparametric models while maintaining the explanatory power of parametric models (Fan and Li [2]). Many authors have studied models of the form (1.1) under additional assumptions or restrictions. If the nonparametric component $g(\cdot)$ is known or absent, the models become general linear models with repeated measurements, which have been studied under Gaussian errors in a large body of literature; some of this work has been integrated into PROC MIXED of the SAS System for estimation and inference in such models. If $g(\cdot)$ is unknown but there are no repeated measurements, that is, $m_1=\cdots=m_n=1$, the models (1.1) reduce to non-longitudinal partially linear regression models, which were first introduced by Engle et al. [3] to study the effect of weather on electricity demand, and further studied by Heckman [4], Speckman [5] and Robinson [6], among others. A survey of the estimation and application of these models can be found in the monograph of Härdle et al. [7]. When the random errors of the models (1.1) are independent replicates of a zero-mean stationary Gaussian process, Zeger and Diggle [8] obtained estimators of the unknown quantities and analyzed the time trend of CD4 cell counts among HIV seroconverters; Moyeed and Diggle [9] gave the rate of convergence of such estimators; and Zhang et al. [10] proposed the maximum penalized Gaussian likelihood estimator. Introducing the counting process technique into the estimation scheme, Fan and Li [2] established asymptotic normality and the rate of convergence of the resulting estimators. Under the models (1.1) for panel data with a one-way error structure, You and Zhou [11] and You et al. [12] developed weighted semiparametric least squares estimators and derived their asymptotic properties. In practice, a great deal of data in econometrics, engineering and the natural sciences occur in the form of time series whose observations exhibit evident dependence. Recently, non-longitudinal partially linear regression models with complex error structures have attracted increasing attention from statisticians; see, for example, Schick [13] for AR(1) errors, Gao and Anh [14] for long-memory errors, Sun et al. [15] for MA(∞) errors, Baek and Liang [16] and Zhou et al. [17] for negatively associated (NA) errors, and Li and Liu [18], Chen and Cui [19] and Liang and Jing [20] for martingale difference errors, among others.

For longitudinal data, an inherent characteristic is the dependence among observations within the same subject. Some authors have studied the asymptotic behavior of estimators in semiparametric models without modeling the within-subject dependence, under the assumption that the $m_i$ are all bounded; see, for example, He et al. [21], Xue and Zhu [22] and the references therein. Li et al. [23] and Bai et al. [24] showed that ignoring the data dependence within each subject causes a loss of efficiency in statistical inference on the parameters of interest. Hu et al. [25] and Wang et al. [26] took within-subject correlations into consideration in analyzing longitudinal data and obtained some asymptotic results under the assumption that $\max_{1\le i\le n} m_i$ is bounded for all $n$. Chi and Reinsel [27] considered linear models for longitudinal data that contain both individual random effects and within-individual errors following an autoregressive AR(1) time series process, and gave estimation procedures, but they did not investigate the asymptotic properties of the estimators. In fact, the observed responses within the same subject are correlated and may be represented by a sequence of responses $\{y_{ij}, j\ge 1\}$ for the $i$th individual with an intrinsic dependence structure, such as a mixing condition. For example, in hydrology, many measurements may be represented by a sequence of responses $\{y_{ij}, j\ge 1\}$ for the $i$th year at times $t_{ij}$, where $t_{ij}$ represents the time elapsed from the beginning of the $i$th year, and $\{e_{ij}, j\ge 1\}$ are the deviations from the mean $\{x_{ij}^{T}\beta + g(t_{ij}), j\ge 1\}$. It is not reasonable to assume that $E(e_{ij_1}e_{ij_2})=0$ for $j_1\ne j_2$. In practice, $\{e_{ij}, j\ge 1\}$ may have a weakly dependent error structure, such as a mixing-dependent structure. In this paper, we consider the estimation problems for the models (1.1) with φ-mixing and ρ-mixing error structures, respectively, to capture the dependence among observations within the same subject, and we are mainly devoted to the strong consistency of the estimators.

Let $\{X_m, m\ge 1\}$ be a sequence of random variables defined on a probability space $(\Omega, \mathcal{F}, P)$, let $\mathcal{F}_k^l = \sigma(X_i, k\le i\le l)$ be the σ-algebra generated by $X_k, \ldots, X_l$, and let $L_2(\mathcal{F}_k^l)$ denote the set of all $\mathcal{F}_k^l$-measurable random variables with finite second moments.

A sequence of random variables $\{X_m, m\ge 1\}$ is called φ-mixing if

$$\varphi(m) = \sup_{k\ge 1,\ A\in\mathcal{F}_1^{k},\ P(A)\ne 0,\ B\in\mathcal{F}_{k+m}^{\infty}} \left|P(B\mid A) - P(B)\right| \to 0, \quad \text{as } m\to\infty.$$

A sequence of random variables $\{X_m, m\ge 1\}$ is called ρ-mixing if the maximal correlation coefficient

$$\rho(m) = \sup_{k\ge 1,\ X\in L_2(\mathcal{F}_1^{k}),\ Y\in L_2(\mathcal{F}_{k+m}^{\infty})} \frac{|\operatorname{cov}(X,Y)|}{\sqrt{\operatorname{Var}(X)\operatorname{Var}(Y)}} \to 0, \quad \text{as } m\to\infty.$$

The concept of a mixing sequence is central in many areas of economics, finance and other sciences. A mixing time series can be viewed as a sequence of random variables for which the past and distant future are asymptotically independent. Limit theorems for φ-mixing and ρ-mixing random variables have been studied by many authors; see, for example, Shao [28], Peligrad [29], Utev [30], Kiesel [31], Chen et al. [32] and Zhou [33] for φ-mixing, and Peligrad [34], Peligrad and Shao [35, 36], Shao [37] and Bradley [38] for ρ-mixing. Further limit theorems can be found in the monograph of Lin and Lu [39]. Recently, mixing-dependent error structures have also been used in nonparametric and semiparametric regression models; see, for instance, Roussas [40], Truong [41], Fraiman and Iribarren [42], Roussas and Tran [43], Masry and Fan [44], Aneiros and Quintela [45], and Fan and Yao [46].
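To make these definitions concrete, the following minimal numerical sketch (ours, not taken from the cited works) simulates a stationary Gaussian AR(1) sequence, a classical example of a ρ-mixing sequence whose dependence on the past dies out geometrically, and shows that its sample lag-$m$ autocorrelations track that geometric decay.

```python
# A minimal numerical sketch (an illustration, not from the paper): the
# stationary Gaussian AR(1) sequence X_j = a*X_{j-1} + eps_j with |a| < 1 is a
# classical example of a rho-mixing sequence; its sample lag-m autocorrelation
# tracks a**m, mirroring the decay of the mixing coefficients.
import numpy as np

rng = np.random.default_rng(0)
a, N = 0.5, 200_000
x = np.empty(N)
x[0] = rng.standard_normal()
for j in range(1, N):
    x[j] = a * x[j - 1] + rng.standard_normal()

for m in (1, 2, 5, 10):
    r = np.corrcoef(x[:-m], x[m:])[0, 1]
    print(f"lag {m:2d}: sample corr = {r:+.4f}, a**m = {a**m:+.4f}")
```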

The rest of this paper is organized as follows. In Section 2, we present the least squares estimator (LSE) $\hat\beta_n$ of $\beta$ based on a nonparametric estimator of $g(\cdot)$ under the mixing-dependent error structure and state the main results. Section 3 is devoted to sketches of several technical lemmas and corollaries. The proofs of the main results are given in Section 4. A simulation study is reported in Section 5, and we close with concluding remarks in the last section.

2 Estimators and main results

For models (1.1), if $\beta$ is known to be the true parameter, then since $Ee_{ij}=0$, we have

$$g(t_{ij}) = E\left(y_{ij} - x_{ij}^{T}\beta\right), \qquad 1\le i\le n,\ 1\le j\le m_i.$$

Hence, a natural nonparametric estimator of g(·) given β is

$$g_n^{*}(t,\beta) = \sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)\left(y_{ij} - x_{ij}^{T}\beta\right),$$
(2.1)

where $W_{nij}(t) = W_{nij}(t; t_{11}, t_{12}, \ldots, t_{nm_n})$ is a weight function defined on $I$. To estimate $\beta$, we minimize

$$SS(\beta) = \sum_{i=1}^{n}\sum_{j=1}^{m_i}\left[y_{ij} - x_{ij}^{T}\beta - g_n^{*}(t_{ij},\beta)\right]^{2}.$$

The minimizer of $SS(\beta)$ is

$$\hat\beta_n = \left(\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\tilde x_{ij}^{T}\right)^{-1}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\tilde y_{ij},$$
(2.2)

where $\tilde x_{ij} = x_{ij} - \sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})x_{kl}$ and $\tilde y_{ij} = y_{ij} - \sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})y_{kl}$.

So, a plug-in estimator of the nonparametric component g(·) is given by

$$\hat g_n(t) = \sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)\left(y_{ij} - x_{ij}^{T}\hat\beta_n\right).$$
(2.3)
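To make the estimators operational, the following minimal sketch (ours, not code from the paper) computes $\hat\beta_n$ in (2.2) and the plug-in estimator (2.3) from stacked data; the function name `estimate` and the layout of the weight matrix `W` (rows indexed by the evaluation observation $(i,j)$, columns by $(k,l)$) are our own conventions.

```python
# A minimal sketch of (2.2)-(2.3), assuming the N(n) x N(n) weight matrix W
# (e.g. the Nadaraya-Watson weights of Remark 2.3) is precomputed.
import numpy as np

def estimate(X: np.ndarray, y: np.ndarray, W: np.ndarray):
    X_tilde = X - W @ X   # tilde-x_{ij} = x_{ij} - sum_{k,l} W_{nkl}(t_{ij}) x_{kl}
    y_tilde = y - W @ y   # tilde-y_{ij}, the same partial-residual form
    # Least squares estimator (2.2): regress tilde-y on tilde-x.
    beta_hat, *_ = np.linalg.lstsq(X_tilde, y_tilde, rcond=None)
    # Plug-in estimator (2.3) of g, evaluated at the observed time points.
    g_hat = W @ (y - X @ beta_hat)
    return beta_hat, g_hat
```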

In this paper, let $\{e_{ij}, 1\le j\le m_i\}$ be φ-mixing or ρ-mixing with $Ee_{ij}=0$ for each $i$ $(1\le i\le n)$, and let $\{e_i, 1\le i\le n\}$ be mutually independent, where $e_i = (e_{i1},\ldots,e_{im_i})^{T}$. For each $i$, let $\varphi_i(\cdot)$ and $\rho_i(\cdot)$ denote the mixing coefficients of the $i$th φ-mixing and ρ-mixing sequence, respectively. Define $S_n^{2} = \sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\tilde x_{ij}^{T}$ and $\tilde g(t) = g(t) - \sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t)g(t_{kl})$, let $I(\cdot)$ denote the indicator function and $\|\cdot\|$ the Euclidean norm, and let $\lfloor z\rfloor$ denote the integer part of $z$, so that $\lfloor z\rfloor \le z < \lfloor z\rfloor + 1$. In the sequel, $C$ and $C_1$ denote positive constants whose values may vary at each occurrence.

To obtain our main results, we list some assumptions:

A1 (i) $\{e_{ij}, 1\le j\le m_i\}$ are φ-mixing with $Ee_{ij}=0$ for each $i$;

  (ii) $\{e_{ij}, 1\le j\le m_i\}$ are ρ-mixing with $Ee_{ij}=0$ for each $i$.

A2 (i) $\max_{1\le i\le n} m_i = o(n^{\delta})$ for some $0<\delta<\frac{r-2}{2r}$ and $r>2$;

  (ii) $\lim_{n\to\infty}\frac{1}{N(n)}S_n^{2} = \Sigma$, where $\Sigma$ is a positive definite matrix and $N(n) = \sum_{i=1}^{n} m_i$;

  (iii) $g(\cdot)$ satisfies a first-order Lipschitz condition on $[0,1]$.

A3 For $n$ large enough, the probability weight functions $W_{nij}(\cdot)$ satisfy

  (i) $\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t) = 1$ for each $t\in[0,1]$;

  (ii) $\sup_{0\le t\le 1}\max_{1\le i\le n,\,1\le j\le m_i} W_{nij}(t) = O\left(n^{-\frac12}\right)$;

  (iii) $\sup_{0\le t\le 1}\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)\,I(|t_{ij}-t|>\varepsilon) = o(1)$ for any $\varepsilon>0$;

  (iv) $\max_{1\le k\le n,\,1\le l\le m_k}\left\|\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t_{kl})x_{ij}\right\| = O(1)$;

  (v) $\sup_{0\le t\le 1}\left\|\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)x_{ij}\right\| = O(1)$;

  (vi) $\max_{1\le i\le n,\,1\le j\le m_i}\left|W_{nij}(s)-W_{nij}(t)\right| \le C|s-t|$ uniformly for $s,t\in[0,1]$.

Remark 2.1 To obtain asymptotic properties of estimators of the models (1.1), many authors have assumed that $\{m_i, 1\le i\le n\}$ is bounded. Under the weaker condition A2(i), we obtain the strong consistency of estimators of the models (1.1) with mixing-dependent structure; the case of $\{m_i, 1\le i\le n\}$ being a bounded sequence is a special case of A2(i).

Remark 2.2 Assumption A2(ii) implies that

$$\frac{1}{N(n)}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\|\tilde x_{ij}\| = O(1) \quad\text{and}\quad \max_{1\le i\le n,\,1\le j\le m_i}\|\tilde x_{ij}\| = o\left(N(n)^{\frac12}\right).$$

Remark 2.3 There do exist weights satisfying assumption A3. For example, under some regularity conditions, the following Nadaraya-Watson kernel weight satisfies assumption A3:

$$W_{nij}(t) = K\left(\frac{t-t_{ij}}{h_n}\right)\left[\sum_{k=1}^{n}\sum_{l=1}^{m_k} K\left(\frac{t-t_{kl}}{h_n}\right)\right]^{-1},$$

where $K(\cdot)$ is a kernel function and $h_n$ is a bandwidth parameter. Assumption A3 has also been used by Härdle et al. [7], Baek and Liang [16], Liang and Jing [20] and Chen and You [47].
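As a concrete instance, the sketch below (our illustration; the helper names and the fixed bandwidth are assumptions) builds these Nadaraya-Watson weights with the Epanechnikov kernel used in Section 5 and numerically checks A3(i) and the spirit of A3(ii) on a toy design.

```python
# A hedged sketch of the Nadaraya-Watson weights above.
import numpy as np

def epanechnikov(u: np.ndarray) -> np.ndarray:
    return 0.75 * (1.0 - u**2) * (np.abs(u) <= 1.0)

def nw_weights(t_eval: np.ndarray, t_obs: np.ndarray, h: float) -> np.ndarray:
    """Rows: evaluation points t; columns: stacked observation times t_{kl}."""
    K = epanechnikov((t_eval[:, None] - t_obs[None, :]) / h)
    # h must be large enough that every row has positive kernel mass.
    return K / K.sum(axis=1, keepdims=True)

t_obs = np.linspace(0.0, 1.0, 600)       # stands in for the stacked t_{ij}
W = nw_weights(t_obs, t_obs, h=0.1)
print(np.allclose(W.sum(axis=1), 1.0))   # A3(i): each row sums to one
print(W.max())                           # the largest weight is small, cf. A3(ii)
```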

Theorem 2.1 Suppose that either A1(i) or A1(ii) holds, and that A2 and A3(i)-(iii) hold. If

$$\max_{1\le i\le n,\,1\le j\le m_i} E\left(|e_{ij}|^{p}\right) \le C, \quad a.s.$$
(2.4)

for p > 3, then

$$\hat\beta_n \to \beta, \quad a.s.$$
(2.5)

Theorem 2.2 Suppose that either A1(i) or A1(ii) holds, and that A2, A3(i)-(iv) and (2.4) hold. For any $t\in[0,1]$, we have

$$\hat g_n(t) \to g(t), \quad a.s.$$
(2.6)

Theorem 2.3 Suppose that either A1(i) or A1(ii) holds, and that A2, A3(i)-(iii), A3(v)-(vi) and (2.4) hold. We have

$$\sup_{0\le t\le 1}\left|\hat g_n(t) - g(t)\right| = o(1), \quad a.s.$$
(2.7)

3 Several technical lemmas and corollaries

In order to prove the main results, we first introduce some lemmas and corollaries. Let $S_j = \sum_{l=1}^{j} X_l$ for $j\ge 1$, and $S_k(i) = \sum_{j=k+1}^{k+i} X_j$ for $i\ge 1$ and $k\ge 0$.

Lemma 3.1. (Shao [28]) Let $\{X_m, m\ge 1\}$ be a φ-mixing sequence.

(1) If $EX_i = 0$, then

$$E S_k^{2}(i) \le 8000\, i \exp\left\{6\sum_{j=1}^{\lfloor\log i\rfloor}\varphi^{\frac12}(2^{j})\right\}\max_{k+1\le j\le k+i} EX_j^{2}.$$

(2) Suppose that there exists an array $\{c_{km}\}$ of positive numbers such that $\max_{1\le i\le m} ES_k^{2}(i) \le c_{km}$ for every $k\ge 0$, $m\ge 1$. Then, for any $q\ge 2$, there exists a positive constant $C = C(q, \varphi(\cdot))$ such that

$$E\max_{1\le i\le m}\left|S_k(i)\right|^{q} \le C\left(c_{km}^{\frac q2} + E\max_{k<i\le k+m}|X_i|^{q}\right).$$

Lemma 3.2. (Shao [37]) Let $\{X_m, m\ge 1\}$ be a ρ-mixing sequence with $EX_i = 0$. Then, for any $q\ge 2$, there exists a positive constant $C = C(q, \rho(\cdot))$ such that

$$E\max_{1\le j\le m}|S_j|^{q} \le C\left(m^{\frac q2}\exp\left\{C\sum_{j=1}^{\lfloor\log m\rfloor}\rho(2^{j})\right\}\max_{1\le j\le m}\left(E|X_j|^{2}\right)^{\frac q2} + m\exp\left\{C\sum_{j=1}^{\lfloor\log m\rfloor}\rho^{\frac 2q}(2^{j})\right\}\max_{1\le j\le m}E|X_j|^{q}\right).$$

Lemma 3.3. Suppose that A1(i) or A1(ii) holds. Let $\alpha>1$, $0<r<\alpha$, and

$$e_{ij}' = e_{ij}\, I\left(|e_{ij}| \le \varepsilon i^{\frac1r} m_i\right),$$
(3.1)
$$e_{ij}'' = e_{ij} - e_{ij}' = e_{ij}\, I\left(e_{ij} > \varepsilon i^{\frac1r} m_i\right) + e_{ij}\, I\left(e_{ij} < -\varepsilon i^{\frac1r} m_i\right)$$
(3.2)

for any ε > 0. If

$$\max_{1\le i\le n}\max_{1\le j\le m_i} E\left(|e_{ij}|^{\alpha}\right) \le C, \quad a.s.,$$
(3.3)

we have

$$\sum_{i=1}^{\infty}\sum_{j=1}^{m_i}|e_{ij}''| < \infty, \quad a.s.$$

Proof Note that $|e_{ij}''| = |e_{ij}|\, I\left(|e_{ij}| > \varepsilon i^{\frac1r} m_i\right)$. Let $\xi_i = \sum_{j=1}^{m_i}|e_{ij}|$, $\xi_i' = \sum_{j=1}^{m_i}|e_{ij}|\, I\left(\sum_{j=1}^{m_i}|e_{ij}| \le \varepsilon i^{\frac1r} m_i\right)$, $\xi_i'' = \xi_i - \xi_i' = \sum_{j=1}^{m_i}|e_{ij}|\, I\left(\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right)$, and $|\xi_i''|_d = |\xi_i''|\, I(|\xi_i''| \le d)$ for fixed $d>0$. First, we prove

$$\sum_{i=1}^{\infty}|\xi_i''| < \infty, \quad a.s.$$
(3.4)

Note that

$$\{|\xi_i''| > d\} = \left\{\sum_{j=1}^{m_i}|e_{ij}|\, I\left(\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right) > d\right\} = \left\{\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right\}$$
(3.5)

for $i$ large enough. By Markov's inequality, the $C_r$-inequality and (3.3), we have

$$\sum_{i=1}^{\infty} P\left(|\xi_i''| \ge d\right) \le C\sum_{i=1}^{\infty} P\left(\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right) \le C\sum_{i=1}^{\infty} i^{-\frac{\alpha}{r}} m_i^{-\alpha} E\left(\sum_{j=1}^{m_i}|e_{ij}|\right)^{\alpha} \le C\sum_{i=1}^{\infty} i^{-\frac{\alpha}{r}} m_i^{-1}\sum_{j=1}^{m_i}E|e_{ij}|^{\alpha} \le C\lim_{n\to\infty}\sum_{i=1}^{n} i^{-\frac{\alpha}{r}}\max_{1\le i\le n}\max_{1\le j\le m_i}E|e_{ij}|^{\alpha} \le C\sum_{i=1}^{\infty} i^{-\frac{\alpha}{r}} < \infty.$$
(3.6)

From (3.5), $\{|\xi_i''| \le d\} = \left\{\sum_{j=1}^{m_i}|e_{ij}| \le \varepsilon i^{\frac1r} m_i\right\}$ for $i$ large enough. One gets

$$E\left(|\xi_i''|_d\right) = E\left(|\xi_i''|\, I(|\xi_i''| \le d)\right) = E\left[\sum_{j=1}^{m_i}|e_{ij}|\, I\left(\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right) I\left(\sum_{j=1}^{m_i}|e_{ij}| \le \varepsilon i^{\frac1r} m_i\right)\right] = 0$$

and

$$\operatorname{Var}\left(|\xi_i''|_d\right) \le E\left(|\xi_i''|_d^{2}\right) = E\left(|\xi_i''|^{2}\, I(|\xi_i''| \le d)\right) \le d\, E\left(|\xi_i''|\, I(|\xi_i''| \le d)\right) = 0$$

for i large enough. Therefore,

$$\sum_{i=1}^{\infty} E\left(|\xi_i''|_d\right) < \infty, \qquad \sum_{i=1}^{\infty}\operatorname{Var}\left(|\xi_i''|_d\right) < \infty.$$
(3.7)

Since $\{\xi_i'', i\ge 1\}$ is a sequence of independent random variables, (3.4) follows from (3.6) and (3.7) by the three-series theorem. Then,

$$\sum_{i=1}^{\infty}\sum_{j=1}^{m_i}|e_{ij}''| = \sum_{i=1}^{\infty}\sum_{j=1}^{m_i}|e_{ij}|\, I\left(|e_{ij}| > \varepsilon i^{\frac1r} m_i\right) \le \sum_{i=1}^{\infty}\sum_{j=1}^{m_i}|e_{ij}|\, I\left(\sum_{j=1}^{m_i}|e_{ij}| > \varepsilon i^{\frac1r} m_i\right) = \sum_{i=1}^{\infty}|\xi_i''| < \infty, \quad a.s.$$

Thus, we complete the proof of Lemma 3.3.

Lemma 3.4. Let $\{e_{ij}, 1\le j\le m_i\}$ be φ-mixing with $Ee_{ij}=0$ for each $i$ $(1\le i\le n)$. Assume that $\{a_{nij}(\cdot), 1\le i\le n, 1\le j\le m_i\}$ is a function array defined on $[0,1]$ satisfying $\sum_{i=1}^{n}\sum_{j=1}^{m_i}|a_{nij}(t)| = O(1)$ and $\max_{1\le i\le n,\,1\le j\le m_i}|a_{nij}(t)| = O\left(n^{-\frac12}\right)$ for any $t\in[0,1]$, and that A2(i) and (2.4) hold. Then, for any $t\in[0,1]$, we have

$$\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij} = o(1), \quad a.s.$$
(3.8)

Proof Based on (3.1) and (3.2), we denote $\zeta_{nij} = e_{ij}' - E(e_{ij}')$ and $\eta_{nij} = e_{ij}'' - E(e_{ij}'')$, and take $r$ satisfying $2 < r < p-1$. Since $e_{ij} = \zeta_{nij} + \eta_{nij}$, we have

$$\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij} = \sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij} + \sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij}'' - \sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)E(e_{ij}'') =: A_{1n} + A_{2n} - A_{3n}.$$
(3.9)

First, we prove

$$A_{1n} \to 0, \quad a.s.$$
(3.10)

Denoting $\tilde\zeta_{ni} = \sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij}$, we know that $\{\tilde\zeta_{ni}, 1\le i\le n\}$ is a sequence of independent random variables with $E\tilde\zeta_{ni} = 0$. By Markov's inequality and Rosenthal's inequality, for any $\varepsilon>0$ and $q\ge 2$, one gets

$$P\left(\left|\sum_{i=1}^{n}\tilde\zeta_{ni}\right| > \varepsilon\right) \le \varepsilon^{-q} E\left|\sum_{i=1}^{n}\tilde\zeta_{ni}\right|^{q} \le C\left[\sum_{i=1}^{n} E|\tilde\zeta_{ni}|^{q} + \left(\sum_{i=1}^{n} E\tilde\zeta_{ni}^{2}\right)^{\frac q2}\right] =: A_{11n} + A_{12n}.$$
(3.11)

Note that $\varphi_i(m)\to 0$ as $m\to\infty$; hence $\sum_{k=1}^{\lfloor\log m_i\rfloor}\varphi_i^{\frac12}(2^{k}) = o(\log m_i)$. Further, $\exp\left\{\lambda\sum_{k=1}^{\lfloor\log m_i\rfloor}\varphi_i^{\frac12}(2^{k})\right\} = o(m_i^{\tau})$ for any $\lambda>0$ and $\tau>0$.

For $A_{11n}$, by Lemma 3.1, A2(i) and (2.4), and taking $q > p$, we have

$$A_{11n} = C\sum_{i=1}^{n} E\left|\sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij}\right|^{q} \le C\sum_{i=1}^{n}\left\{\left[m_i\exp\left\{6\sum_{k=1}^{\lfloor\log m_i\rfloor}\varphi_i^{\frac12}(2^{k})\right\}\max_{1\le k\le m_i}E|a_{nik}(t)\zeta_{nik}|^{2}\right]^{\frac q2} + \sum_{j=1}^{m_i}E|a_{nij}(t)\zeta_{nij}|^{q}\right\} \le C\sum_{i=1}^{n}\left[\left(m_i^{1+\tau} n^{-1}\right)^{\frac q2} + \sum_{j=1}^{m_i} n^{-\frac q2} E\left(|\zeta_{nij}|^{p}\,|\zeta_{nij}|^{q-p}\right)\right] \le C n^{-\frac q2}\sum_{i=1}^{n} m_i^{(\tau+1)\frac q2} + C n^{-\frac q2}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\left(i^{\frac1r} m_i\right)^{q-p} \le C n^{-\left(\frac q2 - (\tau+1)\frac{\delta q}{2} - 1\right)} + C n^{-\left(\frac q2 - \frac qr + \frac pr - (q-p+1)\delta - 1\right)}.$$

Take $q > \max\left\{\frac{2r(2+\delta)}{r-2r\delta-2},\ \frac{4}{1-\delta},\ p\right\}$. We have $\frac q2 - \frac{\delta q}{2} > 2$ and $\frac q2 - \frac qr + \frac pr - (q-p+1)\delta > 2$.

Next, take $\tau>0$ small enough such that $\frac q2 - (\tau+1)\frac{\delta q}{2} > 2$. Thus, we have

$$\sum_{n=1}^{\infty} A_{11n} < \infty.$$
(3.12)

For $A_{12n}$, by Lemma 3.1 and (2.4), we have

$$A_{12n} = C\left(\sum_{i=1}^{n} E\left|\sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij}\right|^{2}\right)^{\frac q2} \le C\left(\sum_{i=1}^{n} m_i\exp\left\{6\sum_{k=1}^{\lfloor\log m_i\rfloor}\varphi_i^{\frac12}(2^{k})\right\}\max_{1\le j\le m_i}E|a_{nij}(t)\zeta_{nij}|^{2}\right)^{\frac q2} \le C\left(\sum_{i=1}^{n} m_i^{\tau+1}\sum_{j=1}^{m_i}E|a_{nij}(t)\zeta_{nij}|^{2}\right)^{\frac q2} \le C n^{-\left(\frac q4 - (\tau+1)\frac{\delta q}{2}\right)}.$$

Note that $\delta < \frac{r-2}{2r} < \frac12$. Taking $q > \frac{4}{1-2\delta}$, we have $\frac q4 - \frac{\delta q}{2} > 1$. Next, take $\tau>0$ small enough such that $\frac q4 - (\tau+1)\frac{\delta q}{2} > 1$. Thus, we have

$$\sum_{n=1}^{\infty} A_{12n} < \infty.$$
(3.13)

Combining (3.11)-(3.13), we obtain (3.10).

By Lemma 3.3 and $\max_{1\le i\le n,\,1\le j\le m_i}|a_{nij}(t)| = O\left(n^{-\frac12}\right)$ for any $t\in[0,1]$, we have

$$|A_{2n}| \le \max_{1\le i\le n,\,1\le j\le m_i}|a_{nij}(t)|\sum_{i=1}^{n}\sum_{j=1}^{m_i}|e_{ij}''| = O\left(n^{-\frac12}\right), \quad a.s.$$
(3.14)

Note that $\frac{p-1}{r} > 1$ and $\delta > 0$. From (2.4), we have

$$|A_{3n}| = \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)E(e_{ij}'')\right| \le n^{-\frac12}\sum_{i=1}^{n}\sum_{j=1}^{m_i} E\left[|e_{ij}|\, I\left(|e_{ij}| > \varepsilon i^{\frac1r} m_i\right)\right] = n^{-\frac12}\sum_{i=1}^{n}\sum_{j=1}^{m_i} E\left[|e_{ij}|^{p}\,|e_{ij}|^{1-p}\, I\left(|e_{ij}| > \varepsilon i^{\frac1r} m_i\right)\right] \le C n^{-\frac12}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\left(i^{\frac1r} m_i\right)^{1-p} \le C n^{-\frac12}\sum_{i=1}^{n} i^{-\frac{p-1}{r}} m_i^{2-p} \le C n^{-\left((p-2)\delta + \frac12\right)} = o(1).$$
(3.15)

From (3.9), (3.10), (3.14) and (3.15), we have (3.8).

Corollary 3.1. In Lemma 3.4, if $\{e_{ij}, 1\le j\le m_i\}$ are ρ-mixing with $Ee_{ij}=0$ for each $i$ $(1\le i\le n)$, then (3.8) holds.

Proof From the proof of Lemma 3.4, it is enough to prove that $\sum_{n=1}^{\infty} A_{11n} < \infty$ and $\sum_{n=1}^{\infty} A_{12n} < \infty$.

Note that $\rho_i(m)\to 0$ as $m\to\infty$; hence $\sum_{k=1}^{\lfloor\log m_i\rfloor}\rho_i^{\frac2q}(2^{k}) = o(\log m_i)$. Further, $\exp\left\{\lambda\sum_{k=1}^{\lfloor\log m_i\rfloor}\rho_i^{\frac2q}(2^{k})\right\} = o(m_i^{\tau})$ for any $\lambda>0$ and $\tau>0$.

For $A_{11n}$, by Lemma 3.2 and (2.4), and taking $q > p$, we get

$$A_{11n} = C\sum_{i=1}^{n} E\left|\sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij}\right|^{q} \le C\sum_{i=1}^{n}\left[m_i^{\frac q2}\exp\left\{C_1\sum_{k=1}^{\lfloor\log m_i\rfloor}\rho_i(2^{k})\right\}\max_{1\le k\le m_i}\left(E|a_{nik}(t)\zeta_{nik}|^{2}\right)^{\frac q2} + m_i\exp\left\{C_1\sum_{k=1}^{\lfloor\log m_i\rfloor}\rho_i^{\frac2q}(2^{k})\right\}\max_{1\le k\le m_i}E|a_{nik}(t)\zeta_{nik}|^{q}\right] \le C\sum_{i=1}^{n}\left[m_i^{\tau+\frac q2} n^{-\frac q2} + m_i^{\tau+1} n^{-\frac q2}\left(i^{\frac1r} m_i\right)^{q-p}\right] \le C n^{-\left(\frac q2 - \left(\tau+\frac q2\right)\delta - 1\right)} + C n^{-\left(\frac q2 - \frac qr + \frac pr - (q-p+\tau+1)\delta - 1\right)}.$$

Take $q > \max\left\{\frac{2r(2+\delta)}{r-2r\delta-2},\ \frac{4}{1-\delta},\ p\right\}$. We have $\frac q2 - \frac{q\delta}{2} > 2$ and $\frac q2 - \frac qr + \frac pr - (q-p+1)\delta > 2$.

Next, take $\tau>0$ small enough such that $\frac q2 - \left(\tau+\frac q2\right)\delta > 2$ and $\frac q2 - \frac qr + \frac pr - (q-p+\tau+1)\delta > 2$. Thus, $\sum_{n=1}^{\infty} A_{11n} < \infty$.

For $A_{12n}$, by Lemma 3.2 and (2.4), we have

$$A_{12n} = C\left(\sum_{i=1}^{n} E\left|\sum_{j=1}^{m_i} a_{nij}(t)\zeta_{nij}\right|^{2}\right)^{\frac q2} \le C\left(\sum_{i=1}^{n} m_i\exp\left\{C_1\sum_{k=1}^{\lfloor\log m_i\rfloor}\rho_i(2^{k})\right\}\max_{1\le j\le m_i}E|a_{nij}(t)\zeta_{nij}|^{2}\right)^{\frac q2} \le C\left(\sum_{i=1}^{n} m_i^{\tau+1}\sum_{j=1}^{m_i}E|a_{nij}(t)\zeta_{nij}|^{2}\right)^{\frac q2} \le C n^{-\left(\frac q4 - (\tau+1)\frac{\delta q}{2}\right)}.$$

Note that $\delta < \frac12$ from A2(i). Taking $q > \frac{4}{1-2\delta}$, we have $\frac q4 - \frac{\delta q}{2} > 1$. Next, take $\tau>0$ small enough such that $\frac q4 - (\tau+1)\frac{\delta q}{2} > 1$. Thus, $\sum_{n=1}^{\infty} A_{12n} < \infty$.

So, we complete the proof of Corollary 3.1.

Remark 3.1 If the function array $\{a_{nij}(t), 1\le i\le n, 1\le j\le m_i\}$ is replaced with a real constant array $\{a_{nij}, 1\le i\le n, 1\le j\le m_i\}$, the results of Lemma 3.4 and Corollary 3.1 obviously still hold.

Lemma 3.5. Let $\{e_{ij}, 1\le j\le m_i\}$ be φ-mixing with $Ee_{ij}=0$ for each $i$ $(1\le i\le n)$. Assume that $\{a_{nij}(\cdot), 1\le i\le n, 1\le j\le m_i\}$ is a function array defined on $[0,1]$ satisfying $\sum_{i=1}^{n}\sum_{j=1}^{m_i}|a_{nij}(t)| = O(1)$ and $\max_{1\le i\le n,\,1\le j\le m_i}|a_{nij}(t)| = O\left(n^{-\frac12}\right)$ uniformly for $t\in[0,1]$, and $\max_{1\le i\le n,\,1\le j\le m_i}|a_{nij}(s) - a_{nij}(t)| \le C|s-t|$ uniformly for $s,t\in[0,1]$, where $C$ is a constant. If A2(i) and (2.4) hold, then

$$\sup_{0\le t\le 1}\left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij}\right| = o(1), \quad a.s.$$
(3.16)

Proof Based on (3.1) and (3.2), we denote $\zeta_{nij} = e_{ij}' - E e_{ij}'$ and take $r$ satisfying $2 < r < p-1$. By the finite covering theorem, $[0,1]$ is covered by $O\left(n^{2+\frac1r}\right)$ neighborhoods $D_n$ with centers $s_n$ and radius $n^{-(2+\frac1r)}$, and for each $t\in[0,1]$ there exists some neighborhood $D_n(s_n(t))$ with center $s_n(t)$ and radius $n^{-(2+\frac1r)}$ such that $t\in D_n(s_n(t))$. Since $E(e_{ij})=0$, we have

$$\left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij}\right| \le \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)e_{ij}''\right| + \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i}\left(a_{nij}(t)-a_{nij}(s_n(t))\right)e_{ij}'\right| + \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(s_n(t))\zeta_{nij}\right| + \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i}\left(a_{nij}(t)-a_{nij}(s_n(t))\right)E(e_{ij}')\right| + \left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(t)E(e_{ij}')\right| =: B_{1n}(t) + B_{2n}(t) + B_{3n}(t) + B_{4n}(t) + B_{5n}(t).$$

Denote $\sup\max_{t,i,j} = \sup_{0\le t\le 1}\max_{1\le i\le n,\,1\le j\le m_i}$. By Lemma 3.3 and the proof of (3.15), noting that $\delta < \frac12$, we have

$$\sup_{0\le t\le 1} B_{1n}(t) \le \sup\max_{t,i,j}|a_{nij}(t)|\sum_{i=1}^{n}\sum_{j=1}^{m_i}|e_{ij}''| = O\left(n^{-\frac12}\right), \quad a.s.,$$
$$\sup_{0\le t\le 1} B_{2n}(t) \le \sup\max_{t,i,j}\left|a_{nij}(t)-a_{nij}(s_n(t))\right|\sum_{i=1}^{n}\sum_{j=1}^{m_i}|e_{ij}'| \le C n^{-\left(2+\frac1r\right)} n^{2\delta} n^{1+\frac1r} = o(1),$$
$$\sup_{0\le t\le 1} B_{4n}(t) \le \sup\max_{t,i,j}\left|a_{nij}(t)-a_{nij}(s_n(t))\right|\sum_{i=1}^{n}\sum_{j=1}^{m_i} E|e_{ij}'| = o(1),$$
$$\sup_{0\le t\le 1} B_{5n}(t) \le \sup\max_{t,i,j}|a_{nij}(t)|\sum_{i=1}^{n}\sum_{j=1}^{m_i} E\left[|e_{ij}|\, I\left(|e_{ij}| > \varepsilon i^{\frac1r} m_i\right)\right] = o(1).$$

Now, it is enough to show that $\sup_{0\le t\le 1} B_{3n}(t) = o(1)$, a.s.

From (3.11) and the bounds on $A_{11n}$ and $A_{12n}$, for given $t\in[0,1]$ and $u\in D_n(s_n(t))$, we have

$$P\left(\left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(u)\zeta_{nij}\right| > \varepsilon\right) \le C\left[n^{-\left(\frac q2 - \frac qr + \frac pr - (q-p+1)\delta - 1\right)} + n^{-\left(\frac q2 - (\tau+1)\frac{\delta q}{2} - 1\right)} + n^{-\left(\frac q4 - (\tau+1)\frac{\delta q}{2}\right)}\right].$$

Then, we obtain

$$P\left(\sup_{0\le t\le 1} B_{3n}(t) > \varepsilon\right) \le P\left(\sup_{0\le t\le 1}\sup_{u\in D_n(s_n(t))}\left|\sum_{i=1}^{n}\sum_{j=1}^{m_i} a_{nij}(u)\zeta_{nij}\right| > \varepsilon\right) \le C n^{2+\frac1r}\left[n^{-\left(\frac q2 - \frac qr + \frac pr - (q-p+1)\delta - 1\right)} + n^{-\left(\frac q2 - (\tau+1)\frac{\delta q}{2} - 1\right)} + n^{-\left(\frac q4 - (\tau+1)\frac{\delta q}{2}\right)}\right] \le C\left[n^{-\left(\frac q2 - \frac qr - \delta q - \delta - 4\right)} + n^{-\left(\frac q2 - (\tau+1)\frac{\delta q}{2} - 4\right)} + n^{-\left(\frac q4 - (\tau+1)\frac{\delta q}{2} - 3\right)}\right].$$

Take $q > \max\left\{\frac{2r(5+\delta)}{r-2r\delta-2},\ \frac{16}{1-2\delta},\ p\right\}$. We have $\frac q2 - \frac qr - \delta q - \delta > 5$, $\frac q2 - \frac{\delta q}{2} > 5$ and $\frac q4 - \frac{\delta q}{2} > 4$. Next, take $\tau>0$ small enough such that $\frac q2 - (\tau+1)\frac{\delta q}{2} > 5$ and $\frac q4 - (\tau+1)\frac{\delta q}{2} > 4$. Thus, we have $\sum_{n=1}^{\infty} P\left(\sup_{0\le t\le 1} B_{3n}(t) > \varepsilon\right) < \infty$, and hence, by the Borel-Cantelli lemma, $\sup_{0\le t\le 1} B_{3n}(t) = o(1)$, a.s. Therefore, (3.16) holds.

Corollary 3.2. In Lemma 3.5, if $\{e_{ij}, 1\le j\le m_i\}$ are ρ-mixing with $Ee_{ij}=0$ for each $i$ $(1\le i\le n)$, then (3.16) holds.

Proof By Corollary 3.1, with arguments similar to the proof of Lemma 3.5, we have (3.16).

4 Proofs of theorems

Proof of Theorem 2.1 From (1.1) and (2.2), we have

$$\begin{aligned}
\hat\beta_n - \beta &= \left(\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\tilde x_{ij}^{T}\right)^{-1}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\left(\tilde y_{ij}-\tilde x_{ij}^{T}\beta\right) \\
&= S_n^{-2}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\left[\left(y_{ij}-x_{ij}^{T}\beta\right)-\sum_{k=1}^{n}\sum_{l=1}^{m_k}W_{nkl}(t_{ij})\left(y_{kl}-x_{kl}^{T}\beta\right)\right] \\
&= S_n^{-2}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\left[\left(g(t_{ij})+e_{ij}\right)-\sum_{k=1}^{n}\sum_{l=1}^{m_k}W_{nkl}(t_{ij})\left(g(t_{kl})+e_{kl}\right)\right] \\
&= S_n^{-2}\sum_{i=1}^{n}\sum_{j=1}^{m_i}\tilde x_{ij}\left[e_{ij}-\sum_{k=1}^{n}\sum_{l=1}^{m_k}W_{nkl}(t_{ij})e_{kl}+\tilde g(t_{ij})\right] \\
&= \left(\frac{S_n^{2}}{N(n)}\right)^{-1}\left[\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\tilde x_{ij}}{N(n)}e_{ij}-\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\tilde x_{ij}}{N(n)}\sum_{k=1}^{n}\sum_{l=1}^{m_k}W_{nkl}(t_{ij})e_{kl}+\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\tilde x_{ij}}{N(n)}\tilde g(t_{ij})\right] \\
&=: D_{1n}+D_{2n}+D_{3n}.
\end{aligned}$$
(4.1)

From A2(ii), $\left\|\left(\frac{S_n^{2}}{N(n)}\right)^{-1}\right\| = O(1)$. By Remark 2.2, we have

$$\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\|\tilde x_{ij}\|}{N(n)} = O(1) \quad\text{and}\quad \max_{1\le i\le n,\,1\le j\le m_i}\frac{\|\tilde x_{ij}\|}{N(n)} = o\left(n^{-\frac12}\right).$$
(4.2)

According to (4.2) and Remark 3.1, we have

$$\|D_{1n}\| \le C\left\|\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\tilde x_{ij}}{N(n)}\, e_{ij}\right\| = o(1), \quad a.s.$$
(4.3)

By A3(i)-(ii), (4.2), and Lemma 3.4 or Corollary 3.1, we have

$$\|D_{2n}\| \le C\max_{1\le i\le n,\,1\le j\le m_i}\left|\sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})e_{kl}\right|\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\|\tilde x_{ij}\|}{N(n)} = o(1), \quad a.s.$$
(4.4)

From A2(iii) and A3(iii), we obtain

$$\max_{1\le i\le n,\,1\le j\le m_i}\left|\tilde g(t_{ij})\right| = \max_{1\le i\le n,\,1\le j\le m_i}\left|g(t_{ij}) - \sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})g(t_{kl})\right| \le \max_{1\le i\le n,\,1\le j\le m_i}\left|\sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})\left(g(t_{ij})-g(t_{kl})\right)I\left(|t_{ij}-t_{kl}|>\varepsilon\right)\right| + \max_{1\le i\le n,\,1\le j\le m_i}\left|\sum_{k=1}^{n}\sum_{l=1}^{m_k} W_{nkl}(t_{ij})\left(g(t_{ij})-g(t_{kl})\right)I\left(|t_{ij}-t_{kl}|\le\varepsilon\right)\right| = o(1).$$
(4.5)

Together with (4.2), one gets

$$\|D_{3n}\| \le C\max_{1\le i\le n,\,1\le j\le m_i}\left|\tilde g(t_{ij})\right|\sum_{i=1}^{n}\sum_{j=1}^{m_i}\frac{\|\tilde x_{ij}\|}{N(n)} = o(1).$$
(4.6)

By (4.1), (4.3), (4.4) and (4.6), (2.5) holds.

Proof of Theorem 2.2 From (1.1) and (2.3), we have

$$\begin{aligned}
\hat g_n(t)-g(t) &= \sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)\left(y_{ij}-x_{ij}^{T}\hat\beta_n\right)-g(t) \\
&= \sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)\left[\left(y_{ij}-x_{ij}^{T}\hat\beta_n\right)-\left(y_{ij}-x_{ij}^{T}\beta\right)\right]+\sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)\left(y_{ij}-x_{ij}^{T}\beta\right)-g(t) \\
&= \sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)x_{ij}^{T}\left(\beta-\hat\beta_n\right)+\sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)\left(g(t_{ij})+e_{ij}\right)-g(t) \\
&= \sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)x_{ij}^{T}\left(\beta-\hat\beta_n\right)+\sum_{i=1}^{n}\sum_{j=1}^{m_i}W_{nij}(t)e_{ij}-\tilde g(t) \\
&=: E_{1n}+E_{2n}+E_{3n}.
\end{aligned}$$
(4.7)

By A3(iv) and (2.5), one gets

$$|E_{1n}| \le \left\|\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)x_{ij}\right\|\left\|\beta - \hat\beta_n\right\| = o(1), \quad a.s.$$
(4.8)

By Lemma 3.4 or Corollary 3.1, $E_{2n} = o(1)$, a.s.; with arguments similar to (4.5), we have $E_{3n} = o(1)$. Therefore, together with (4.7) and (4.8), (2.6) holds.

Proof of Theorem 2.3 Here, we still use (4.7), but the $E_{in}$ in (4.7) are replaced by $E_{in}(t)$ for $i = 1, 2, 3$. By A3(v) and (2.5), we get

$$\sup_{0\le t\le 1}|E_{1n}(t)| \le \sup_{0\le t\le 1}\left\|\sum_{i=1}^{n}\sum_{j=1}^{m_i} W_{nij}(t)x_{ij}\right\|\left\|\beta - \hat\beta_n\right\| = o(1), \quad a.s.$$

By Lemma 3.5 or Corollary 3.2, $\sup_{0\le t\le 1}|E_{2n}(t)| = o(1)$, a.s.; similar to the arguments in (4.5), we have $\sup_{0\le t\le 1}|E_{3n}(t)| = o(1)$. Hence, (2.7) is proved.

5 Simulation study

To evaluate the finite-sample performance of the least squares estimator $\hat\beta_n$ and the nonparametric estimator $\hat g_n(t)$, we take two forms for the function $g(\cdot)$:

$$\text{I. } g(t) = \exp(3t); \qquad \text{II. } g(t) = \cos\left(\tfrac{3\pi}{2}\, t\right),$$

consider the case where $p = 1$ and $m_i = m = 12$, and take the design points $t_{ij} = ((i-1)m + j)/(nm)$, $x_{ij} \sim N(1,1)$, and the errors $e_{ij} = 0.2\, e_{i,j-1} + \epsilon_{ij}$, where the $\epsilon_{ij}$ are i.i.d. $N(0,1)$ random variables and $e_{i,0} \sim N(0,1)$ for each $i$.

The kernel function is taken to be the Epanechnikov kernel $K(t) = \frac34(1-t^{2})I(|t|\le 1)$, and the weight function is the Nadaraya-Watson kernel weight $W_{nij}(t) = K\left(\frac{t-t_{ij}}{h_n}\right)\left[\sum_{k=1}^{n}\sum_{l=1}^{m_k} K\left(\frac{t-t_{kl}}{h_n}\right)\right]^{-1}$. The bandwidth $h_n$ is selected by a "leave-one-subject-out" cross-validation method. In the simulations, we draw $B = 1000$ random samples of sizes $n = 150, 200, 300$ and $500$ with $\beta = 2$. We obtain the estimators $\hat\beta_n$ and $\hat g_n(t)$ from (2.2) and (2.3), respectively. Let $\hat\beta_n^{(b)}$ be the $b$th least squares estimate of $\beta$ for sample size $n$. Summary statistics for $\hat\beta_n$ are computed as

$$\bar\beta_n = \frac1B\sum_{b=1}^{B}\hat\beta_n^{(b)}, \qquad \widehat{\mathrm{SD}}(\hat\beta_n) = \left[\frac{1}{B-1}\sum_{b=1}^{B}\left(\hat\beta_n^{(b)} - \bar\beta_n\right)^{2}\right]^{\frac12}, \qquad \widehat{\mathrm{Bias}}(\hat\beta_n) = \bar\beta_n - \beta, \qquad \widehat{\mathrm{MSE}}(\hat\beta_n) = \frac{1}{B-1}\sum_{b=1}^{B}\left(\hat\beta_n^{(b)} - \beta\right)^{2},$$

which are listed in Table 1.
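The following condensed sketch (ours; it uses a fixed bandwidth and $B = 200$ replications rather than the paper's leave-one-subject-out bandwidth and $B = 1000$, purely to keep the example short) reproduces the structure of the Case I experiment.

```python
# A condensed Monte Carlo sketch of Case I under the stated design.
import numpy as np

rng = np.random.default_rng(1)
n, m, beta, B, h = 150, 12, 2.0, 200, 0.05
t = (np.arange(n * m) + 1.0) / (n * m)     # t_{ij} = ((i-1)m + j)/(nm), stacked
g = np.exp(3.0 * t)                        # Case I

u = (t[:, None] - t[None, :]) / h
K = 0.75 * np.clip(1.0 - u**2, 0.0, None)  # Epanechnikov kernel (zero for |u| > 1)
W = K / K.sum(axis=1, keepdims=True)       # Nadaraya-Watson weights

est = np.empty(B)
for b in range(B):
    x = rng.normal(1.0, 1.0, n * m)
    e = np.empty((n, m))
    e_prev = rng.standard_normal(n)        # e_{i,0} ~ N(0,1)
    for j in range(m):
        e_prev = 0.2 * e_prev + rng.standard_normal(n)
        e[:, j] = e_prev                   # e_{ij} = 0.2 e_{i,j-1} + eps_{ij}
    y = x * beta + g + e.ravel()
    xt, yt = x - W @ x, y - W @ y          # partial residuals
    est[b] = (xt @ yt) / (xt @ xt)         # (2.2) with p = 1
print(f"bias = {est.mean() - beta:+.4f}, SD = {est.std(ddof=1):.4f}, "
      f"MSE = {np.sum((est - beta)**2) / (B - 1):.4f}")
```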

Table 1 The estimates of β and some indices of their accuracy for different sample sizes n and nonparametric functions g(·)

In addition, to assess the estimator of the nonparametric component $g(\cdot)$, we study the square root of the mean squared error (RMSE) based on 1000 repetitions. Let $\hat g_n^{(b)}(t)$ denote the $b$th estimate of $g(t)$ for sample size $n$, and let $\bar{\hat g}_n(t) = \frac1B\sum_{b=1}^{B}\hat g_n^{(b)}(t)$ be the average estimate of $g(t)$. We compute

$$\mathrm{RMSE}_n = \left[\frac1M\sum_{s=1}^{M}\left(\bar{\hat g}_n(t_s) - g(t_s)\right)^{2}\right]^{\frac12},$$

and

$$\mathrm{RMSE}_n^{(b)} = \left[\frac1M\sum_{s=1}^{M}\left(\hat g_n^{(b)}(t_s) - g(t_s)\right)^{2}\right]^{\frac12}, \qquad b = 1, 2, \ldots, B,$$

where $\{t_s, s = 1, \ldots, M\}$ is a sequence of regular grid points on $[0,1]$. Figures 1 and 2 provide the average estimates of the nonparametric function $g(\cdot)$ together with the $\mathrm{RMSE}_n$ values for Cases I and II, respectively. The boxplots of the $\mathrm{RMSE}_n^{(b)}$ $(b = 1, 2, \ldots, B)$ values for Cases I and II are presented in Figure 3.
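A small helper in the same spirit (our illustration; `g_hat` is an assumed $B\times M$ array of the replicated estimates on the grid and `g_true` the corresponding values $g(t_s)$) computes both summaries:

```python
import numpy as np

def rmse_summaries(g_hat: np.ndarray, g_true: np.ndarray):
    g_bar = g_hat.mean(axis=0)                                # average estimate over B runs
    rmse_n = np.sqrt(np.mean((g_bar - g_true) ** 2))          # RMSE_n
    rmse_b = np.sqrt(np.mean((g_hat - g_true) ** 2, axis=1))  # RMSE_n^{(b)}, for the boxplots
    return rmse_n, rmse_b
```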

Figure 1 Estimators of the nonparametric component $g(\cdot)$ for Case I: $\hat g_n(\cdot)$ (dashed curve) and $g(\cdot)$ (solid curve).

Figure 2 Estimators of the nonparametric component $g(\cdot)$ for Case II: $\hat g_n(\cdot)$ (dashed curve) and $g(\cdot)$ (solid curve).

Figure 3 Boxplots of the $\mathrm{RMSE}_n^{(b)}$ $(b = 1, 2, \ldots, B)$ values for the estimators of $g(\cdot)$.

From Table 1, we see that (i) $|\widehat{\mathrm{Bias}}(\hat\beta_n)|$, $\widehat{\mathrm{SD}}(\hat\beta_n)$ and $\widehat{\mathrm{MSE}}(\hat\beta_n)$ decrease as the sample size $n$ increases; and (ii) the larger the sample size $n$, the closer $\bar\beta_n$ is to the true value 2. From Figures 1, 2 and 3, we observe that the biases of the estimators of the nonparametric component $g(\cdot)$ decrease as the sample size $n$ increases. These results show that, for semiparametric partially linear regression models for longitudinal data with a mixing error structure, the least squares estimator of the parametric component $\beta$ and the estimator of the nonparametric component $g(\cdot)$ both perform well.

6 Concluding remarks

An inherent characteristic of longitudinal data is the dependence among observations within the same subject. To capture this dependence, we have considered the estimation problems of partially linear models for longitudinal data with φ-mixing and ρ-mixing error structures, respectively. The strong consistency of the least squares estimator $\hat\beta_n$ of the parametric component $\beta$ is established. In addition, the strong consistency and uniform consistency of the estimator $\hat g_n(\cdot)$ of the nonparametric function $g(\cdot)$ are obtained under mild conditions.

In this paper, we only consider the case where $(x_{ij}^{T}, t_{ij})$ are known and nonrandom design points, as in Baek and Liang [16] and Liang and Jing [20]. In the monograph of Härdle et al. [7], both the fixed design and the random design are considered for non-longitudinal partially linear regression models. Our results can be extended to the case where $(x_{ij}^{T}, t_{ij})$ is random; interested readers may pursue this. In addition, we consider partially linear models for longitudinal data only with φ-mixing and ρ-mixing errors; analogous results for other mixing-dependent structures, such as α-mixing, φ*-mixing and ρ*-mixing, can be obtained by the same arguments. At present, we have not established the asymptotic normality of the estimators, since some details need further discussion; we will devote future work to the asymptotic normality of $\hat\beta_n$ and $\hat g_n(\cdot)$.