1 Introduction

Let $\{X_n, n \ge 1\}$ be a sequence of random variables defined on a fixed probability space $(\Omega, \mathcal{F}, P)$. Exponential inequalities for the partial sums $\sum_{i=1}^{n}(X_i - EX_i)$ play an important role in various proofs of limit theorems. In particular, they provide a measure of the convergence rate for the strong law of large numbers. Several versions are available in the literature for independent random variables, under uniform boundedness or some, quite relaxed, control on their moments. While the independent case is classical, the treatment of dependent variables is more recent.

First, we recall the definitions of some dependence structures.

Definition 1.1. A finite collection of random variables $X_1, X_2, \ldots, X_n$ is said to be negatively associated (NA) if for every pair of disjoint subsets $A_1, A_2$ of $\{1, 2, \ldots, n\}$,

$$\mathrm{Cov}\{f(X_i : i \in A_1),\, g(X_j : j \in A_2)\} \le 0,$$
(1.1)

whenever $f$ and $g$ are coordinatewise nondecreasing (or coordinatewise nonincreasing) and the covariance exists. An infinite sequence of random variables $\{X_n, n \ge 1\}$ is NA if every finite subcollection is NA.

Definition 1.2. A finite collection of random variables $X_1, X_2, \ldots, X_n$ is said to be negatively upper orthant dependent (NUOD) if for all real numbers $x_1, x_2, \ldots, x_n$,

$$P(X_i > x_i,\; i = 1, 2, \ldots, n) \le \prod_{i=1}^{n} P(X_i > x_i),$$
(1.2)

and negatively lower orthant dependent (NLOD) if for all real numbers $x_1, x_2, \ldots, x_n$,

$$P(X_i \le x_i,\; i = 1, 2, \ldots, n) \le \prod_{i=1}^{n} P(X_i \le x_i).$$
(1.3)

A finite collection of random variables $X_1, X_2, \ldots, X_n$ is said to be negatively orthant dependent (NOD) if it is both NUOD and NLOD. An infinite sequence $\{X_n, n \ge 1\}$ is said to be NOD if every finite subcollection is NOD.

The concept of NA random variables was introduced by Alam and Saxena [1] and carefully studied by Joag-Dev and Proschan [2], who pointed out that a number of well-known multivariate distributions possess the negative association property, such as the multinomial, convolutions of unlike multinomials, the multivariate hypergeometric, the Dirichlet, the permutation distribution, negatively correlated normal distributions, random sampling without replacement, and the joint distribution of ranks. The notion of NOD random variables was introduced by Lehmann [3] and developed in Joag-Dev and Proschan [2]. Obviously, independent random variables are NOD. Joag-Dev and Proschan [2] pointed out that NA random variables are NOD, but that neither NUOD nor NLOD implies NA. They also presented an example in which $X = (X_1, X_2, X_3, X_4)$ is NOD but not NA. Hence, NOD is strictly weaker than NA.

Recently, Giuliano et al. [4] introduced the following notion of acceptability.

Definition 1.3. We say that a finite collection of random variables $X_1, X_2, \ldots, X_n$ is acceptable if for any real $\lambda$,

$$E \exp\left(\lambda \sum_{i=1}^{n} X_i\right) \le \prod_{i=1}^{n} E \exp(\lambda X_i).$$
(1.4)

An infinite sequence of random variables $\{X_n, n \ge 1\}$ is acceptable if every finite subcollection is acceptable.

Since inequality (1.4) is required to hold for all real $\lambda$, Sung et al. [5] weakened the condition on $\lambda$ and gave the following definition of acceptability.

Definition 1.4. We say that a finite collection of random variables $X_1, X_2, \ldots, X_n$ is acceptable if there exists $\delta > 0$ such that for any $\lambda \in (-\delta, \delta)$,

$$E \exp\left(\lambda \sum_{i=1}^{n} X_i\right) \le \prod_{i=1}^{n} E \exp(\lambda X_i).$$
(1.5)

An infinite sequence of random variables $\{X_n, n \ge 1\}$ is acceptable if every finite subcollection is acceptable.

We point out that Definition 1.3 of acceptability is the one used in the current article. As mentioned in Giuliano et al. [4], a sequence of NOD random variables with a finite Laplace transform (that is, a finite moment generating function near zero), and hence a sequence of NA random variables with finite Laplace transform too, provides an example of acceptable random variables. For instance, Xing et al. [6] consider a strictly stationary NA sequence of random variables; by the observation above, such a sequence with finite Laplace transform is acceptable.

Another interesting example of a sequence $\{Z_n, n \ge 1\}$ of acceptable random variables can be constructed in the following way. Feller [[7], Problem III.1] (cf. also Romano and Siegel [[8], Section 4.30]) provides an example of two random variables $X$ and $Y$ such that the density of their sum is the convolution of their densities, yet they are not independent. It is easy to see that $X$ and $Y$ are not negatively dependent either. Since they are bounded, their Laplace transforms $E\exp(\lambda X)$ and $E\exp(\lambda Y)$ are finite for any $\lambda$. Next, since the density of their sum is the convolution of their densities, we have

$$E \exp(\lambda(X + Y)) = E \exp(\lambda X)\, E \exp(\lambda Y).$$

The announced sequence of acceptable random variables $\{Z_n, n \ge 1\}$ can now be constructed as follows. Let $(X_k, Y_k)$, $k \ge 1$, be independent copies of the random vector $(X, Y)$. For any $n \ge 1$, set $Z_n = X_k$ if $n = 2k - 1$ and $Z_n = Y_k$ if $n = 2k$. Hence, the model of acceptable random variables that we consider in this article (Definition 1.3) is more general than the models considered in the previous literature, and studying the limiting behavior of acceptable random variables is of interest.
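To make this construction concrete, the following Monte Carlo sketch checks the displayed factorization numerically. It assumes one classical joint density with the required properties, $f(x, y) = \frac{1}{4}(1 + xy(x^2 - y^2))$ on $[-1, 1]^2$ (uniform marginals, dependent coordinates, and the density of $X + Y$ equal to the convolution of the marginals); we do not claim this is literally the example in [7] or [8], but any such pair serves the same purpose.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_xy(size):
    """Rejection-sample from f(x, y) = (1 + x*y*(x**2 - y**2)) / 4 on [-1, 1]^2.

    The marginals of f are uniform on [-1, 1] and the density of X + Y is the
    convolution of the marginals (triangular), yet X and Y are dependent.
    """
    out = np.empty((0, 2))
    fmax = 0.35  # f is bounded by (1 + 2/(3*sqrt(3))) / 4 < 0.35
    while len(out) < size:
        x, y = rng.uniform(-1, 1, size), rng.uniform(-1, 1, size)
        f = (1 + x * y * (x**2 - y**2)) / 4
        keep = rng.uniform(0, fmax, size) < f
        out = np.vstack([out, np.column_stack([x[keep], y[keep]])])
    return out[:size, 0], out[:size, 1]

x, y = sample_xy(1_000_000)
for lam in (0.5, 1.0, 2.0):
    lhs = np.exp(lam * (x + y)).mean()
    rhs = np.exp(lam * x).mean() * np.exp(lam * y).mean()
    print(f"lambda={lam}: E exp(lam(X+Y)) ~ {lhs:.4f}, product of MGFs ~ {rhs:.4f}")

# Dependence shows up in, e.g., E[X^3 Y] = 4/525 != 0 for this density:
print("E[X^3 Y] ~", (x**3 * y).mean())
```

For each $\lambda$ the two estimates agree within Monte Carlo error, while the nonzero estimate of $E[X^3 Y]$ confirms that $X$ and $Y$ are dependent.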

Recently, Sung et al. [5] established an exponential inequality for a random variable with a finite Laplace transform. Using this inequality, they obtained an exponential inequality for identically distributed acceptable random variables with finite Laplace transforms. The main purpose of this article is to establish some exponential inequalities for acceptable random variables under very mild conditions. Furthermore, we study the complete convergence of acceptable random variables using these exponential inequalities.

Throughout the article, let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables and denote $S_n = \sum_{i=1}^{n} X_i$ for each $n \ge 1$.

Remark 1.1. If $\{X_n, n \ge 1\}$ is a sequence of acceptable random variables, then $\{-X_n, n \ge 1\}$ is still a sequence of acceptable random variables. Furthermore, we have for each $n \ge 1$ and any real $\lambda$,

$$E \exp\left(\lambda \sum_{i=1}^{n} (X_i - EX_i)\right) = \exp\left(-\lambda \sum_{i=1}^{n} EX_i\right) E \exp\left(\lambda \sum_{i=1}^{n} X_i\right) \le \prod_{i=1}^{n} \exp(-\lambda EX_i) \prod_{i=1}^{n} E \exp(\lambda X_i) = \prod_{i=1}^{n} E \exp(\lambda(X_i - EX_i)).$$

Hence, $\{X_n - EX_n, n \ge 1\}$ is also a sequence of acceptable random variables.

The following lemma is useful.

Lemma 1.1. If $X$ is a random variable such that $a \le X \le b$, where $a$ and $b$ are finite real numbers, then for any real number $h$,

$$E e^{hX} \le \frac{b - EX}{b - a} e^{ha} + \frac{EX - a}{b - a} e^{hb}.$$
(1.6)

Proof. Since the exponential function $\exp(hx)$ is convex, its graph on the interval $a \le x \le b$ lies below the chord connecting its values at $x = a$ and $x = b$. Thus

$$e^{hX} \le \frac{e^{hb} - e^{ha}}{b - a}(X - a) + e^{ha} = \frac{b - X}{b - a} e^{ha} + \frac{X - a}{b - a} e^{hb}.$$

Taking expectations on both sides yields (1.6).   □
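The chord bound used in the proof is easy to check numerically; the following sketch, with arbitrary illustrative values of $a$, $b$, and $h$, verifies it on a grid.

```python
import numpy as np

a, b, h = -1.0, 2.0, 0.7  # arbitrary illustrative values
x = np.linspace(a, b, 1001)
lhs = np.exp(h * x)  # the convex function e^{hx}
chord = (b - x) / (b - a) * np.exp(h * a) + (x - a) / (b - a) * np.exp(h * b)
assert np.all(lhs <= chord + 1e-12)  # e^{hx} lies below the chord on [a, b]
print("max gap between chord and e^{hx}:", (chord - lhs).max())
```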

The rest of the article is organized as follows. In Section 2, we present some exponential inequalities for a sequence of acceptable random variables, such as a Bernstein-type inequality and a Hoeffding-type inequality. The Bernstein-type inequality for acceptable random variables generalizes and improves the corresponding results of Yang [9] for NA random variables and of Wang et al. [10] for NOD random variables. In Section 3, we study the complete convergence of acceptable random variables using the exponential inequalities established in Section 2.

2 Exponential inequalities for acceptable random variables

In this section, we present some exponential inequalities for acceptable random variables, namely a Bernstein-type inequality and a Hoeffding-type inequality.

Theorem 2.1. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables with $EX_i = 0$ and $EX_i^2 = \sigma_i^2 < \infty$ for each $i \ge 1$. Denote $B_n^2 = \sum_{i=1}^{n} \sigma_i^2$ for each $n \ge 1$. If there exists a positive number $c$ such that $|X_i| \le c B_n$ for each $1 \le i \le n$, $n \ge 1$, then for any $\varepsilon > 0$,

$$P\left(\frac{S_n}{B_n} \ge \varepsilon\right) \le \begin{cases} \exp\left\{-\dfrac{\varepsilon^2}{2}\left(1 - \dfrac{\varepsilon c}{2}\right)\right\} & \text{if } \varepsilon c \le 1, \\[1ex] \exp\left\{-\dfrac{\varepsilon}{4c}\right\} & \text{if } \varepsilon c \ge 1. \end{cases}$$
(2.1)

Proof. For fixed $n \ge 1$, take $t > 0$ such that $t c B_n \le 1$. It is easily seen that

$$E|X_i|^k \le (c B_n)^{k-2} E X_i^2, \quad k \ge 2.$$

Hence,

$$E e^{t X_i} = 1 + \sum_{k=2}^{\infty} \frac{t^k}{k!} E X_i^k \le 1 + \frac{t^2}{2} E X_i^2 \left(1 + \frac{t}{3} c B_n + \frac{t^2}{12} c^2 B_n^2 + \cdots\right) \le 1 + \frac{t^2}{2} E X_i^2 \left(1 + \frac{t}{2} c B_n\right) \le \exp\left\{\frac{t^2}{2} E X_i^2 \left(1 + \frac{t}{2} c B_n\right)\right\}.$$

By Definition 1.3 and the inequality above, we have

$$E e^{t S_n} = E \prod_{i=1}^{n} e^{t X_i} \le \prod_{i=1}^{n} E e^{t X_i} \le \exp\left\{\frac{t^2}{2} B_n^2 \left(1 + \frac{t}{2} c B_n\right)\right\},$$

which implies that

$$P\left(\frac{S_n}{B_n} \ge \varepsilon\right) \le \exp\left\{-t \varepsilon B_n + \frac{t^2}{2} B_n^2 \left(1 + \frac{t}{2} c B_n\right)\right\}.$$
(2.2)

We take $t = \varepsilon / B_n$ when $\varepsilon c \le 1$, and $t = 1/(c B_n)$ when $\varepsilon c > 1$; in both cases $t c B_n \le 1$. The desired result (2.1) then follows immediately from (2.2).   □
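For intuition, here is a small simulation of (2.1) in the simplest acceptable case, namely independent random variables, for which (1.4) holds with equality. The Uniform(-1, 1) summands and the sample sizes are illustrative choices only.

```python
import numpy as np

rng = np.random.default_rng(1)

n, reps = 50, 200_000
# X_i ~ Uniform(-1, 1): EX_i = 0, sigma_i^2 = 1/3, so B_n^2 = n/3
Bn = np.sqrt(n / 3)
c = 1 / Bn  # |X_i| <= 1 = c * B_n

x = rng.uniform(-1, 1, (reps, n))
sn = x.sum(axis=1)

for eps in (1.0, 2.0, 3.0):
    empirical = (sn / Bn >= eps).mean()
    if eps * c <= 1:
        bound = np.exp(-eps**2 / 2 * (1 - eps * c / 2))
    else:
        bound = np.exp(-eps / (4 * c))
    print(f"eps={eps}: P(S_n/B_n >= eps) ~ {empirical:.5f} <= bound {bound:.5f}")
```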

Theorem 2.2. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables with $EX_i = 0$ and $|X_i| \le b$ for each $i \ge 1$, where $b$ is a positive constant. Denote $\sigma_i^2 = E X_i^2$ and $B_n^2 = \sum_{i=1}^{n} \sigma_i^2$ for each $n \ge 1$. Then, for any $\varepsilon > 0$,

$$P(S_n \ge \varepsilon) \le \exp\left\{-\frac{\varepsilon^2}{2 B_n^2 + \frac{2}{3} b \varepsilon}\right\}$$
(2.3)

and

$$P(|S_n| \ge \varepsilon) \le 2 \exp\left\{-\frac{\varepsilon^2}{2 B_n^2 + \frac{2}{3} b \varepsilon}\right\}.$$
(2.4)

Proof. For any $t > 0$, by Taylor's expansion, $EX_i = 0$ and the inequality $1 + x \le e^x$, we get that for $i = 1, 2, \ldots, n$,

$$E \exp\{t X_i\} = 1 + \sum_{j=2}^{\infty} \frac{E(t X_i)^j}{j!} \le 1 + \sum_{j=2}^{\infty} \frac{t^j E|X_i|^j}{j!} = 1 + \frac{t^2 \sigma_i^2}{2} \sum_{j=2}^{\infty} \frac{t^{j-2} E|X_i|^j}{\frac{1}{2} \sigma_i^2\, j!} = 1 + \frac{t^2 \sigma_i^2}{2} F_i(t) \le \exp\left\{\frac{t^2 \sigma_i^2}{2} F_i(t)\right\},$$
(2.5)

where

$$F_i(t) = \sum_{j=2}^{\infty} \frac{t^{j-2} E|X_i|^j}{\frac{1}{2} \sigma_i^2\, j!}, \quad i = 1, 2, \ldots, n.$$

Denote $C = b/3$ and $M_n = \frac{b \varepsilon}{3 B_n^2} + 1$. Choose $t > 0$ such that $t C < 1$ and

$$t C \le \frac{M_n - 1}{M_n} = \frac{C \varepsilon}{C \varepsilon + B_n^2}.$$

It is easy to check that for $i = 1, 2, \ldots, n$ and $j \ge 2$,

$$E|X_i|^j \le \sigma_i^2 b^{j-2} \le \frac{1}{2} \sigma_i^2 C^{j-2} j!,$$

which implies that for $i = 1, 2, \ldots, n$,

$$F_i(t) = \sum_{j=2}^{\infty} \frac{t^{j-2} E|X_i|^j}{\frac{1}{2} \sigma_i^2\, j!} \le \sum_{j=2}^{\infty} (t C)^{j-2} = (1 - t C)^{-1} \le M_n.$$
(2.6)

By Markov's inequality, Definition 1.3, (2.5) and (2.6), we get

$$P(S_n \ge \varepsilon) \le e^{-t \varepsilon} E \exp\{t S_n\} \le e^{-t \varepsilon} \prod_{i=1}^{n} E \exp\{t X_i\} \le \exp\left\{-t \varepsilon + \frac{t^2 B_n^2}{2} M_n\right\}.$$
(2.7)

Take $t = \frac{\varepsilon}{B_n^2 M_n} = \frac{\varepsilon}{C \varepsilon + B_n^2}$. It is easily seen that $t C = \frac{C \varepsilon}{C \varepsilon + B_n^2} < 1$, so this choice is admissible. Substituting $t = \frac{\varepsilon}{B_n^2 M_n}$ into the right-hand side of (2.7), we obtain (2.3) immediately. By (2.3), we have

$$P(S_n \le -\varepsilon) = P(-S_n \ge \varepsilon) \le \exp\left\{-\frac{\varepsilon^2}{2 B_n^2 + \frac{2}{3} b \varepsilon}\right\},$$
(2.8)

since $\{-X_n, n \ge 1\}$ is still a sequence of acceptable random variables. The desired result (2.4) follows from (2.3) and (2.8) immediately.   □
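A quick simulation illustrates (2.4), again in the independent special case; the Rademacher summands (so $b = 1$ and $B_n^2 = n$) are an arbitrary illustration.

```python
import numpy as np

rng = np.random.default_rng(3)

n, reps, b = 200, 200_000, 1.0
x = rng.choice([-1.0, 1.0], size=(reps, n))  # Rademacher: EX_i = 0, |X_i| <= 1, B_n^2 = n
sn = np.abs(x.sum(axis=1))

for eps in (20.0, 40.0, 60.0):
    empirical = (sn >= eps).mean()
    bound = 2 * np.exp(-eps**2 / (2 * n + 2 / 3 * b * eps))
    print(f"eps={eps}: P(|S_n| >= eps) ~ {empirical:.5f} <= bound {bound:.5f}")
```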

Remark 2.1. By Theorem 2.2, we get that for any $t > 0$,

$$P(|S_n| \ge n t) \le 2 \exp\left\{-\frac{n^2 t^2}{2 B_n^2 + \frac{2}{3} b n t}\right\}$$

and

$$P(|S_n| \ge B_n t) \le 2 \exp\left\{-\frac{t^2}{2 + \frac{2}{3} b t / B_n}\right\}.$$

It is well known that for independent random variables the upper bound of $P(|S_n| \ge n t)$ is also $2 \exp\left\{-\frac{n^2 t^2}{2 B_n^2 + \frac{2}{3} b n t}\right\}$. So Theorem 2.2 extends the corresponding result for independent random variables without adding any extra conditions. In addition, it is easy to check that

$$\exp\left\{-\frac{\varepsilon^2}{2 B_n^2 + \frac{2}{3} b \varepsilon}\right\} < \exp\left\{-\frac{\varepsilon^2}{2(2 B_n^2 + b \varepsilon)}\right\},$$

which implies that our Theorem 2.2 generalizes and improves the corresponding results of Yang [9, Lemma 3.5] for NA random variables and of Wang et al. [10, Theorem 2.3] for NOD random variables.
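The comparison in the last display is easy to make concrete numerically; the values of $B_n^2$ and $b$ below are arbitrary illustrations.

```python
import numpy as np

Bn2, b = 10.0, 1.0  # illustrative values of B_n^2 and b
for eps in (1.0, 5.0, 10.0, 50.0):
    ours = np.exp(-eps**2 / (2 * Bn2 + 2 / 3 * b * eps))       # Theorem 2.2
    earlier = np.exp(-eps**2 / (2 * (2 * Bn2 + b * eps)))      # Yang [9] / Wang et al. [10]
    print(f"eps={eps:5.1f}: Theorem 2.2 bound {ours:.3e} < earlier bound {earlier:.3e}")
```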

In the following, we provide a Hoeffding-type inequality for acceptable random variables.

Theorem 2.3. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables. If there exist two sequences of real numbers $\{a_n, n \ge 1\}$ and $\{b_n, n \ge 1\}$ such that $a_i \le X_i \le b_i$ for each $i \ge 1$, then for any $\varepsilon > 0$ and $n \ge 1$,

$$P(S_n - E S_n \ge \varepsilon) \le \exp\left\{-\frac{2 \varepsilon^2}{\sum_{i=1}^{n} (b_i - a_i)^2}\right\},$$
(2.9)

$$P(S_n - E S_n \le -\varepsilon) \le \exp\left\{-\frac{2 \varepsilon^2}{\sum_{i=1}^{n} (b_i - a_i)^2}\right\},$$
(2.10)

and

$$P(|S_n - E S_n| \ge \varepsilon) \le 2 \exp\left\{-\frac{2 \varepsilon^2}{\sum_{i=1}^{n} (b_i - a_i)^2}\right\}.$$
(2.11)

Proof. For any $h > 0$, by Markov's inequality, we see that

$$P(S_n - E S_n \ge \varepsilon) \le E e^{h(S_n - E S_n - \varepsilon)}.$$
(2.12)

It follows from Remark 1.1 that

$$E e^{h(S_n - E S_n - \varepsilon)} = e^{-h \varepsilon} E \prod_{i=1}^{n} e^{h(X_i - E X_i)} \le e^{-h \varepsilon} \prod_{i=1}^{n} E e^{h(X_i - E X_i)}.$$
(2.13)

Denote $E X_i = \mu_i$ for each $i \ge 1$. By $a_i \le X_i \le b_i$ and Lemma 1.1, we have

$$E e^{h(X_i - E X_i)} = e^{-h \mu_i} E e^{h X_i} \le e^{-h \mu_i} \left(\frac{b_i - \mu_i}{b_i - a_i} e^{h a_i} + \frac{\mu_i - a_i}{b_i - a_i} e^{h b_i}\right) = e^{L(h_i)},$$
(2.14)

where

$$L(h_i) = -h_i p_i + \ln\left(1 - p_i + p_i e^{h_i}\right), \quad h_i = h(b_i - a_i), \quad p_i = \frac{\mu_i - a_i}{b_i - a_i}.$$

The first two derivatives of $L(h_i)$ with respect to $h_i$ are

$$L'(h_i) = -p_i + \frac{p_i}{(1 - p_i) e^{-h_i} + p_i}, \qquad L''(h_i) = \frac{p_i (1 - p_i) e^{-h_i}}{\left[(1 - p_i) e^{-h_i} + p_i\right]^2}.$$
(2.15)

The last ratio is of the form $u(1 - u)$, where $0 < u < 1$. Hence,

$$L''(h_i) = \frac{(1 - p_i) e^{-h_i}}{(1 - p_i) e^{-h_i} + p_i} \left(1 - \frac{(1 - p_i) e^{-h_i}}{(1 - p_i) e^{-h_i} + p_i}\right) \le \frac{1}{4}.$$
(2.16)

Therefore, by Taylor's expansion and (2.16), noting that $L(0) = 0$ and $L'(0) = 0$, we get

$$L(h_i) \le L(0) + L'(0) h_i + \frac{1}{8} h_i^2 = \frac{1}{8} h_i^2 = \frac{1}{8} h^2 (b_i - a_i)^2.$$
(2.17)

By (2.12), (2.13), and (2.17), we have

$$P(S_n - E S_n \ge \varepsilon) \le \exp\left\{-h \varepsilon + \frac{1}{8} h^2 \sum_{i=1}^{n} (b_i - a_i)^2\right\}.$$
(2.18)

It is easily seen that the right-hand side of (2.18) attains its minimum at $h = \frac{4 \varepsilon}{\sum_{i=1}^{n} (b_i - a_i)^2}$. Inserting this value into (2.18), we obtain (2.9) immediately. Since $\{-X_n, n \ge 1\}$ is a sequence of acceptable random variables, (2.9) implies (2.10). Therefore, (2.11) follows from (2.9) and (2.10) immediately. This completes the proof of the theorem.   □
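As before, a short simulation illustrates (2.11) in the independent special case; the Uniform(0, 1) summands (so $a_i = 0$, $b_i = 1$ and $\sum_{i=1}^{n}(b_i - a_i)^2 = n$) are an illustrative choice.

```python
import numpy as np

rng = np.random.default_rng(2)

n, reps = 100, 200_000
x = rng.uniform(0, 1, (reps, n))     # a_i = 0, b_i = 1
dev = np.abs(x.sum(axis=1) - n / 2)  # |S_n - E S_n|

for eps in (5.0, 10.0, 15.0):
    empirical = (dev >= eps).mean()
    bound = 2 * np.exp(-2 * eps**2 / n)  # Hoeffding bound (2.11)
    print(f"eps={eps}: P(|S_n - ES_n| >= eps) ~ {empirical:.5f} <= bound {bound:.5f}")
```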

3 Complete convergence for acceptable random variables

In this section, we present some complete convergence results for a sequence of acceptable random variables. The concept of complete convergence was introduced by Hsu and Robbins [11] as follows: a sequence of random variables $\{U_n, n \ge 1\}$ is said to converge completely to a constant $C$ if $\sum_{n=1}^{\infty} P(|U_n - C| > \varepsilon) < \infty$ for all $\varepsilon > 0$. In view of the Borel-Cantelli lemma, this implies that $U_n \to C$ almost surely (a.s.); the converse is true if the $\{U_n, n \ge 1\}$ are independent. Hsu and Robbins [11] proved that the sequence of arithmetic means of independent and identically distributed (i.i.d.) random variables converges completely to the expected value if the variance of the summands is finite. Erdős [12] proved the converse. The Hsu-Robbins-Erdős result is a fundamental theorem in probability theory and has been generalized and extended in several directions by many authors.

Define the space of sequences

$$H = \left\{ \{b_n\} : \sum_{n=1}^{\infty} h^{b_n} < \infty \ \text{for every} \ 0 < h < 1 \right\}.$$

The following results are based on the space of sequences $H$.
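For orientation: polynomially growing sequences such as $b_n = \sqrt{n}$ belong to $H$, whereas $b_n = \log n$ does not, since $\sum_n h^{\log n} = \sum_n n^{\ln h}$ diverges for $h > e^{-1}$. The sketch below illustrates the contrast numerically; the truncation point is arbitrary, and the computation only suggests, rather than proves, convergence.

```python
import numpy as np

n = np.arange(1, 2_000_001, dtype=float)
for h in (0.5, 0.9, 0.99):
    s_sqrt = np.sum(h ** np.sqrt(n))  # b_n = sqrt(n): series converges for every h < 1
    s_log = np.sum(h ** np.log(n))    # b_n = log(n): series diverges once h > 1/e
    print(f"h={h}: partial sum for sqrt(n) = {s_sqrt:.2f}, for log(n) = {s_log:.1f}")
```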

Theorem 3.1. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables with $EX_i = 0$ and $|X_i| \le b$ for each $i \ge 1$, where $b$ is a positive constant. Assume that $\sum_{i=1}^{n} E X_i^2 = O(b_n)$ for some $\{b_n\} \in H$. Then,

$$b_n^{-1} S_n \to 0 \quad \text{completely as } n \to \infty.$$
(3.1)

Proof. For any $\varepsilon > 0$, it follows from Theorem 2.2 that

$$\sum_{n=1}^{\infty} P\left(\frac{|S_n|}{b_n} \ge \varepsilon\right) \le 2 \sum_{n=1}^{\infty} \exp\left\{-\frac{b_n^2 \varepsilon^2}{2 \sum_{i=1}^{n} E X_i^2 + \frac{2}{3} b b_n \varepsilon}\right\} \le 2 \sum_{n=1}^{\infty} \exp\{-C b_n\} < \infty,$$

where $C$ is a positive number not depending on $n$, and the last series converges since $\{b_n\} \in H$ (take $h = e^{-C}$). This implies (3.1).   □

Theorem 3.2. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables with $|X_i| \le c < \infty$ for each $i \ge 1$, where $c$ is a positive constant. Then, for every $\{b_n\} \in H$,

$$(b_n n)^{-1/2} (S_n - E S_n) \to 0 \quad \text{completely as } n \to \infty.$$
(3.2)

Proof. For any $\varepsilon > 0$, it follows from Theorem 2.3 (with $a_i = -c$ and $b_i = c$, so that $\sum_{i=1}^{n} (b_i - a_i)^2 = 4 c^2 n$) that

$$\sum_{n=1}^{\infty} P\left(|S_n - E S_n| \ge (b_n n)^{1/2} \varepsilon\right) \le 2 \sum_{n=1}^{\infty} \exp\left\{-\frac{\varepsilon^2}{2 c^2} b_n\right\} < \infty,$$

which implies (3.2).   □

Theorem 3.3. Let $\{X_n, n \ge 1\}$ be a sequence of acceptable random variables with $EX_i = 0$ and $E X_i^2 = \sigma_i^2 < \infty$ for each $i \ge 1$. Denote $B_n^2 = \sum_{i=1}^{n} \sigma_i^2$ for each $n \ge 1$. Suppose that there exists a positive number $H$ such that

$$E|X_i|^m \le \frac{m!}{2} \sigma_i^2 H^{m-2}, \quad i = 1, 2, \ldots, n,$$
(3.3)

for each $n \ge 1$ and every integer $m \ge 2$. Then,

$$b_n^{-1} S_n \to 0 \quad \text{completely as } n \to \infty,$$
(3.4)

provided that $\{b_n^2 / B_n^2\} \in H$ and $\{b_n\} \in H$.

Proof. By (3.3), we see that

$$E e^{t X_i} = 1 + \frac{t^2}{2} \sigma_i^2 + \frac{t^3}{6} E X_i^3 + \cdots \le 1 + \frac{t^2}{2} \sigma_i^2 \left(1 + H t + H^2 t^2 + \cdots\right)$$

for $i = 1, 2, \ldots, n$, $n \ge 1$. When $t \le \frac{1}{2H}$, it follows that

$$E e^{t X_i} \le 1 + \frac{t^2 \sigma_i^2}{2} \cdot \frac{1}{1 - H t} \le 1 + t^2 \sigma_i^2 \le e^{t^2 \sigma_i^2}, \quad i = 1, 2, \ldots, n.$$
(3.5)

Therefore, by Markov's inequality, Definition 1.3 and (3.5), we get that for any $x \ge 0$ and $t \le \frac{1}{2H}$,

$$P\left(\left|\sum_{i=1}^{n} X_i\right| \ge x\right) \le P\left(\sum_{i=1}^{n} X_i \ge x\right) + P\left(\sum_{i=1}^{n} (-X_i) \ge x\right) \le e^{-t x} E \exp\left\{t \sum_{i=1}^{n} X_i\right\} + e^{-t x} E \exp\left\{t \sum_{i=1}^{n} (-X_i)\right\} \le e^{-t x} \left(\prod_{i=1}^{n} E e^{t X_i} + \prod_{i=1}^{n} E e^{-t X_i}\right) \le 2 \exp\left\{-t x + t^2 B_n^2\right\}.$$

Hence,

$$P\left(\left|\sum_{i=1}^{n} X_i\right| \ge x\right) \le 2 \min_{t \le \frac{1}{2H}} \exp\left\{-t x + t^2 B_n^2\right\}.$$

If $0 \le x \le \frac{B_n^2}{H}$, the unconstrained minimizer $t = \frac{x}{2 B_n^2}$ satisfies $t \le \frac{1}{2H}$, and

$$\min_{t \le \frac{1}{2H}} \exp\left\{-t x + t^2 B_n^2\right\} = \exp\left\{-\frac{x^2}{2 B_n^2} + \frac{x^2}{4 B_n^4} B_n^2\right\} = \exp\left\{-\frac{x^2}{4 B_n^2}\right\};$$

if $x \ge \frac{B_n^2}{H}$, the minimum is attained at the boundary $t = \frac{1}{2H}$, and

$$\min_{t \le \frac{1}{2H}} \exp\left\{-t x + t^2 B_n^2\right\} = \exp\left\{-\frac{x}{2H} + \frac{B_n^2}{4 H^2}\right\} \le \exp\left\{-\frac{x}{4H}\right\}.$$

From the statements above, we get

$$P\left(\left|\sum_{i=1}^{n} X_i\right| \ge x\right) \le \begin{cases} 2 e^{-\frac{x^2}{4 B_n^2}}, & 0 \le x \le \frac{B_n^2}{H}, \\[1ex] 2 e^{-\frac{x}{4H}}, & x \ge \frac{B_n^2}{H}, \end{cases}$$

which implies that for any $x \ge 0$,

$$P\left(\left|\sum_{i=1}^{n} X_i\right| \ge x\right) \le 2 \exp\left\{-\frac{x^2}{4 B_n^2}\right\} + 2 \exp\left\{-\frac{x}{4H}\right\}.$$

Therefore, the assumptions on $\{b_n\}$ yield that

$$\sum_{n=1}^{\infty} P\left(\frac{1}{b_n} \left|\sum_{i=1}^{n} X_i\right| \ge \varepsilon\right) \le 2 \sum_{n=1}^{\infty} \exp\left\{-\frac{b_n^2 \varepsilon^2}{4 B_n^2}\right\} + 2 \sum_{n=1}^{\infty} \exp\left\{-\frac{b_n \varepsilon}{4H}\right\} < \infty,$$

since $\{b_n^2 / B_n^2\} \in H$ and $\{b_n\} \in H$. This completes the proof of the theorem.   □
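The two-regime minimization used in the proof is easy to cross-check numerically; the sketch below compares the closed-form minimum of $\exp\{-t x + t^2 B_n^2\}$ over $0 < t \le 1/(2H)$ with a grid search, for arbitrary illustrative values of $B_n^2$ and $H$.

```python
import numpy as np

Bn2, H = 4.0, 0.5  # illustrative values of B_n^2 and H
t = np.linspace(1e-6, 1 / (2 * H), 100_000)  # admissible grid 0 < t <= 1/(2H)

for x in (1.0, Bn2 / H, 20.0):  # below, at, and above the threshold B_n^2 / H
    grid_min = np.exp(-t * x + t**2 * Bn2).min()
    if x <= Bn2 / H:
        closed = np.exp(-x**2 / (4 * Bn2))                     # minimizer t = x / (2 B_n^2)
    else:
        closed = np.exp(-x / (2 * H) + Bn2 / (4 * H**2))       # boundary t = 1 / (2H)
    print(f"x={x:5.1f}: grid search {grid_min:.6f}, closed form {closed:.6f}")
```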