1 Introduction

First, we will recall the definition of negatively associated (NA) random variables.

Definition 1.1 A finite family { X 1 ,, X n } is said to be negatively associated (NA) if for any disjoint subsets A,B{1,2,,n}, and any real coordinatewise nondecreasing functions f on R A , g on R B ,

Cov ( f ( X k , k A ) , g ( X k , k B ) ) 0.

A sequence of random variables { X n } n 1 is said to be negatively associated (NA) if for every n2, X 1 , X 2 ,, X n are NA.

The concept of an NA sequence was introduced by Joag-Dev and Proschan [1]. There are many good results of NA random variables. For example, Matula [2] obtained the three series theorem, Su et al. [3] gave the moment inequality, Shao [4] investigated the maximal inequality, Yuan et al. [5] studied the central limit theorem, Yang [6] and Sung [7] investigated the exponential inequality, etc.

In this article, we investigate the Berry-Esséen bound of sample quantiles for NA random variables and obtain the rate O( n 1 / 6 logn). Our result extends the corresponding one of Yang et al. [8] obtaining O( n 1 / 9 ). Let us give some details of the p th quantile.

Let { X n } n 1 be a sequence of random variables defined on a fixed probability space (Ω,F,P) with a common marginal distribution function F(x)=P( X 1 x), where F is a distribution function (continuous from the right, as usual). For 0<p<1, the p th quantile of F is defined as

ξ p =inf { x : F ( x ) p }

and is alternately denoted by F 1 (p). The function F 1 (t), 0<t<1, is called the inverse function of F. With a sample X 1 , X 2 ,, X n , n1, let F n represent the empirical distribution function based on X 1 , X 2 ,, X n , which is defined as F n (x)= 1 n i = 1 n I( X i x), xR, where I(A) denotes the indicator function of a set A and ℝ is the real line. Let 0<p<1, we define

F n 1 (p)=inf { x : F n ( x ) p }

as the p th quantile of sample.

Throughout the paper, C, C 1 , C 2 ,,d denote some positive constants not depending on n, which may be different in various places. x denotes the largest integer not exceeding x and second-order stationary means that

( X 1 , X 1 + k ) = d ( X i , X i + k ),i1,k1.

For 0<p<1, denote ξ p = F 1 (p), ξ p , n = F n 1 (p) and Φ(t) is the distribution function of a standard normal variable. Yang et al. [[8], Theorem 1.1] presented the Berry-Esséen bound of sample quantiles for an NA sequence as follows.

Theorem 1.1 Let 0<p<1 and { X n } n 1 be a second-order stationary NA sequence with common marginal distribution function F and E X n =0 for n=1,2, . Assume that in a neighborhood of ξ p , F possesses a positive continuous density f and a bounded second derivative F . Suppose that there exists an ε 0 >0 such that for x[ ξ p ε 0 , ξ p + ε 0 ],

j = 2 j|Cov [ I ( X 1 x ) , I ( X j x ) ] |<
(1.1)

and

Var [ I ( X 1 ξ p ) ] +2 j = 2 Cov [ I ( X 1 ξ p ) , I ( X j ξ p ) ] = σ 2 ( ξ p )>0.
(1.2)

Then

sup < t < |P ( n 1 / 2 ( ξ p , n ξ p ) σ ( ξ p ) / f ( ξ p ) t ) Φ(t)|=O ( n 1 / 9 ) ,n.
(1.3)

For the work on Berry-Esséen bounds of sample quantiles, one can refer to Reiss [9] or Chapter 2 of Serfling [10]. Cai and Roussas [11] studied the smooth estimate of quantiles under an association sample, Rio [12] obtained the Berry-Esséen bounds of sample quantiles under a φ-mixing sequence, Lahiri and Sun [13] and Yang et al. [14] investigated the Berry-Esséen bounds of sample quantiles under an α-mixing sequence, etc. For more work on Berry-Esséen bounds, we can refer to Chapter 3 of Hall and Heyde [15], Chapter 5 of Petrov [16], Gao et al. [17], Chapter 5 of Härdle et al. [18], and to the references therein too.

Moreover, value-at-risk (VaR) is a popular measure of the market risk associated with an asset or a portfolio of assets. It has been chosen by the Basel Committee on Banking Supervision as a benchmark risk measure and has been used by financial institutions for asset management and minimization of risk. Let { X t } t = 1 n be the market value of an asset over n periods of a time unit, and let Y t =log( X t / X t 1 ) be the log-returns. Suppose { Y t } t = 1 n is a strictly stationary dependent process with marginal distribution function F. Given a positive value p close to zero, the 1p level VaR is

v p =inf { x : F ( x ) p } ,

which specifies the smallest amount of loss such that the probability of the loss in market value being large than v p is less than p. So, the study of VaR is a specific application of the p th quantile. For more details, one can refer to Chen and Tang [19] and the references therein.

In this paper, by the exponential inequality and properties of NA random variables, we go on studying the Berry-Esséen bound of sample quantiles for an NA sequence and get a better rate of normal approximation. For the details, see Theorem 2.1 in Section 2. Some preliminaries and the proof of Theorem 2.1 are presented in Section 3.

2 Main result

Theorem 2.1 Let 0<p<1 and { X n } n 1 be a second-order stationary NA sequence with common marginal distribution function F. Assume that in a neighborhood of ξ p , F possesses a positive continuous density f and a bounded second derivative F . Let n 0 be some positive integer. Suppose that there exists an ε 0 >0 such that for x[ ξ p ε 0 , ξ p + ε 0 ]

|Cov [ I ( X 1 x ) , I ( X j x ) ] |C j 5 / 2 ,j n 0
(2.1)

and condition (1.2) holds. Then

sup < t < |P ( n 1 / 2 ( ξ p , n ξ p ) σ ( ξ p ) / f ( ξ p ) t ) Φ(t)|=O ( n 1 / 6 log n ) ,n.
(2.2)

Remark 2.1 Obviously, the condition (2.1) of Theorem 2.1 is relatively stronger than (1.1) of Theorem 1.1, but the normal approximation rate O( n 1 / 6 logn) in (2.2) is better than O( n 1 / 9 ) in (1.3). So our result Theorem 2.1 extends Theorem 1.1 of Yang et al. [8]. It is pointed out that the condition of mean zero in Theorem 1.1 should be removed. In fact, the process of estimating (3.9) on page 12 of Yang et al. [8], was used the Lemma 2.2 of Yang et al. [8], which requires the condition of mean zero, but Z i in (3.9) of Yang et al. [8], defined by Z i =I[ X i ξ p +tA n 1 / 2 ]EI[ X i ξ p +tA n 1 / 2 ], satisfies the condition of mean zero. Thus, the mean zero condition of Theorem 1.1 of Yang et al. [8] is not needed. It coincides with the independent case, which does not need the mean zero condition. For the details, one can see Serfling [[10], Theorem C, p.81] or Theorem A of Yang et al. [8].

3 Some preliminaries and the proof of Theorem 2.1

First, we give some preliminaries, which will be used to prove our Theorem 2.1.

Lemma 3.1 [[6], Lemma 3.5]

Let { X n } n 1 be a NA sequence with E X n =0, | X n |b, a.s. n=1,2, . Denote Δ n = i = 1 n E X i 2 . Then for ε>0,

P ( | i = 1 n X i | > ε ) 2exp { ε 2 2 ( 2 Δ n + b ε ) } .

Lemma 3.2 Let { X n } n 1 be a stationary NA sequence with E X n =0 and | X n |d<, n=1,2, . Assume that there exists a β3/2 such that

j = b n |Cov( X 1 , X j )|=O ( b n β )
(3.1)

for all 0< b n as n and

lim inf n n 1 Var ( i = 1 n X i ) = σ 1 2 >0.
(3.2)

Then

sup < t < |P ( i = 1 n X i Var ( i = 1 n X i ) t ) Φ(t)|=O ( n 1 / 6 log n ) ,n.
(3.3)

Proof By taking the same notation as that in the proof of Lemma 2.1 of Yang et al. [8], we partition the set {1,2,,n} into 2 k n +1 subsets with large block of size μ= μ n and small block of size ν= ν n . Let

μ n = n 2 / 3 , ν n = n 1 / 3 ,k= k n = n μ n + ν n = n 1 / 3

and Z n , i = X i / Var ( i = 1 n X i ) . Define η j , ξ j , ζ k as follows:

η j = i = j ( μ + ν ) + 1 j ( μ + ν ) + μ Z n , i , 0 j k 1 , ξ j = i = j ( μ + ν ) + μ + 1 ( j + 1 ) ( μ + ν ) Z n , i , 0 j k 1 , ζ k = i = k ( μ + ν ) + 1 n Z n , i .

Denote

S n := i = 1 n X i Var ( i = 1 n X i ) = j = 0 k 1 η j + j = 0 k 1 ξ j + ζ k := S n + S n ′′ + S n ′′′ .

By Lemma A.3 in Yang et al. [8] with a=2 ε n =2M n 1 / 6 logn we have

sup < t < | P ( S n t ) Φ ( t ) | = sup < t < | P ( S n + S n ′′ + S n ′′′ t ) Φ ( t ) | sup < t < | P ( S n t ) Φ ( t ) | + 2 ε n 2 π + P ( | S n ′′ | > ε n ) + P ( | S n ′′′ | > ε n ) ,
(3.4)

where M is a positive constant.

Combining the definition of NA with the definition of ξ j , j=0,1,,k1, we can easily prove that { ξ 0 , ξ 1 ,, ξ k 1 } is NA. Together the condition (3.2) with (2.8) of Yang et al. [8], it has E ( S n ′′ ) 2 C 1 n 1 / 3 . On the other hand, it can be seen that | ξ j | C 2 n 1 / 6 , j=0,1,,k1. Thus, we take M large enough and apply Lemma 3.1, and we obtain for n large enough

P ( | S n ′′ | > ε n ) 2 exp { M 2 n 1 / 3 log 2 n 2 ( 2 C 1 n 1 / 3 + C 2 M n 1 / 6 n 1 / 6 log n ) } = 2 exp { M 2 log 2 n 2 ( 2 C 1 + C 2 M log n ) } C 3 n 1 .
(3.5)

Meanwhile, by (2.9) of Yang et al. [8], it follows E ( S n ′′′ ) 2 C 4 n 1 / 3 . Since | Z n , i | C 5 n 1 / 2 , by Lemma 3.1 again, one has for n large enough

P ( | S n ′′′ | > ε n ) 2 exp { M 2 n 1 / 3 log 2 n 2 ( 2 C 4 n 1 / 3 + C 5 M n 1 / 2 n 1 / 6 log n ) } 2 exp { M 2 log 2 n 2 ( 2 C 4 + C 5 M ) } C 6 n 1 .
(3.6)

Similar to the proof of (2.18) in Yang et al. [8], by (3.1), it can be seen that

| ϕ ( t ) ψ ( t ) | = | E exp ( i t j = 0 k 1 η j ) j = 0 k 1 E exp ( i t η j ) | 4 t 2 0 i < j k 1 l 1 = 1 μ n l 2 = 1 μ n | Cov ( Z n , λ i + l 1 , Z n , λ j + l 2 ) | C 1 t 2 n 1 i < j n j i ν n | Cov ( X i , X j ) | C 2 t 2 j ν n | Cov ( X 1 , X j ) | C 3 t 2 ν n β C 4 t 2 n β / 3 .

Combining the above inequality with T= n 2 β 1 12 , β3/2, we obtain

D 1 n = T T | ϕ ( t ) ψ ( t ) t |dtC n β / 3 T 2 =O ( n 1 / 6 ) .

On the other hand, we take T= n 2 β 1 12 , β3/2, in (2.23) of Yang et al. [8] and have D 2 n =O( n 1 / 6 ).

Consequently, by the proof of (2.26) of Yang et al. [8], it is easy to check that

sup < t < |P ( S n t ) Φ(t)|=O ( n 1 / 6 ) .
(3.7)

Finally, by (3.4)-(3.7), (3.3) holds. □

Lemma 3.3 Let { X n } n 1 be a stationary NA sequence with E X n =0 and | X n |d<, n=1,2, . Assume that there exists an n 0 such that

|Cov( X 1 , X j )|C j 5 / 2 ,j n 0
(3.8)

and

Var( X 1 )+2 j = 2 Cov( X 1 , X j )= σ 0 2 >0.

Then

sup < t < |P ( i = 1 n X i n σ 0 t ) Φ(t)|=O ( n 1 / 6 log n ) .
(3.9)

Proof By the condition (3.8), it is checked that

j = b n |Cov( X 1 , X j )|C j = b n j 5 / 2 =O ( b n 3 / 2 ) ,

providing b n as n. So by (3.8), the condition (3.1) of Lemma 3.2 holds. Combining Lemma 3.2 with the proof of Lemma 2.2 of Yang et al. [8], we have (3.9) finally. □

Proof of Theorem 2.1 By taking the same notation as that in the proof of Theorem 1.1 in Yang et al. [8], one checks the proof of (3.9) in Yang et al. [8] and obtains by Lemma 3.3

sup | t | L n | G n ( t ) Φ ( t ) | sup | t | L n | P [ i = 1 n Z i n σ ( n , t ) < c n t ] Φ ( c n t ) | + sup | t | L n | Φ ( t ) Φ ( c n t ) | C 1 ( σ 2 ( ξ p ) ) n 1 6 log n + sup | t | L n | Φ ( t ) Φ ( c n t ) | ,

where C 1 ( σ 2 ( ξ p )) is a positive constant depending only on σ 2 ( ξ p ). Therefore, (2.2) follows by the same steps as those in the proof of Theorem 1.1 of Yang et al. [8]. □