The replacement of indicator functions by integrated beta kernels in the definition of the empirical tail dependence function is shown to produce a smoothed version of the latter estimator with the same asymptotic distribution but superior finite-sample performance. The link of the new estimator with the empirical beta copula enables a simple but effective resampling scheme.
A. Kiriliouk gratefully acknowledges support from the Fonds de la Recherche Scientifique (FNRS).
J. Segers gratefully acknowledges funding by contract “Projet d’Actions de Recherche Concertées” No. 12/17-045 of the “Communauté française de Belgique” and by IAP research network Grant P7/06 of the Belgian government (Belgian Science Policy).
L. Tafakori would like to thank the Australian Research Council for supporting this work through Laureate Fellowship FL130100039.
Appendix A: Proofs of Propositions 3.5 and 3.6
Proof of Proposition 3.5
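Fix \(\varepsilon \in (0, \delta]\). Since \(\nu_{n,k,\boldsymbol{x}}\) is a probability measure, we can bring the term \(B_{n,k}(\boldsymbol{x})\) inside the integral. Split the integral according to the two cases \(|\boldsymbol{y} - \boldsymbol{x}|_{\infty} \le \varepsilon\) or \(|\boldsymbol{y} - \boldsymbol{x}|_{\infty} > \varepsilon\), where \(|\boldsymbol{z}|_{\infty} = \max(|z_{1}|, \ldots, |z_{d}|)\) for \(\boldsymbol{z} \in \mathbb{R}^{d}\). For \(\boldsymbol{x} \in [0, 1]^{d}\), the absolute value in (3.3) is bounded by
In the first term in (A.1), we have \(\boldsymbol{x} \in [0, 1]^{d}\), \(\boldsymbol{y} \in [0, n/k]^{d}\), and \(|\boldsymbol{y} - \boldsymbol{x}|_{\infty} \le \varepsilon \le \delta\), whence \(\boldsymbol{y} \in [0, 1 + \delta]^{d}\). The supremum is thus bounded by the maximal increment of \(B_{n,k}\) on \([0, 1 + \delta]^{d}\) between points at a distance at most \(\varepsilon\) apart, i.e.,
By Condition 3.3, we can find for every \(\eta > 0\) a sufficiently small \(\varepsilon > 0\) such that
The first term in (A.1) can thus be made arbitrarily small with arbitrarily large probability, uniformly in \(\boldsymbol{x} \in [0, 1]^{d}\) and for sufficiently large \(n\).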
For the second term in (A.1), note first that
Indeed, since \(\ell\) is a stdf, we have \(0 \le \ell(\boldsymbol{y}) \le y_{1} + \cdots + y_{d} \le dn/k\) for \(\boldsymbol{y} \in [0, n/k]^{d}\); for the pilot estimator \(\hat{\ell}_{n,k}\), use Condition 3.2.
If \(S\) is a \(\operatorname{Bin}(n, u)\) random variable, Bennett’s inequality (van der Vaart and Wellner 1996, Proposition A.6.2) states that
where \(h(1+\eta) = \int_{0}^{\eta} \log(1+t) \,\mathrm{d}t\) for \(\eta \ge 0\). Note that \(h(1+\eta) > \frac{1}{3} \eta^{2}\) for \(\eta \in (0, 1]\). It follows that
As \(\partial \{x \, h(1 + \varepsilon/x)\} / \partial x < 0\) for \(0 < x < 1\), we have \(\inf_{x \in [0, 1]} \{x \, h(1 + \varepsilon/x)\} = h(1 + \varepsilon)\). We conclude that
Together, the supremum over \(\boldsymbol{x} \in [0, 1]^{d}\) of the second term in (A.1) is of the order
It therefore converges to zero in probability since \(\log(n) = \mathrm{o}(k)\) by assumption. □
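As a numerical sanity check of the elementary facts about \(h\) used in the proof above, the following Python sketch may be helpful; it is an illustration and not part of the original argument. It relies on the closed form \(h(1+\eta) = (1+\eta)\log(1+\eta) - \eta\), obtained by integrating \(\log(1+t)\). The Bennett-type bound \(\operatorname{P}\{S \ge nu(1+\eta)\} \le \exp\{-nu \, h(1+\eta)\}\) used in the code is an assumed standard form of the inequality, and all function names and parameter values are hypothetical.

```python
import math

def h(y):
    """h(y) = y*log(y) - y + 1, so that h(1 + eta) = int_0^eta log(1 + t) dt."""
    return y * math.log(y) - y + 1.0

def g(x, eps):
    """The map x -> x * h(1 + eps / x) whose infimum over (0, 1] is needed."""
    return x * h(1.0 + eps / x)

def binom_upper_tail(n, u, m):
    """Exact P(S >= m) for S ~ Bin(n, u), summing the probability mass function."""
    return sum(math.comb(n, i) * u**i * (1.0 - u) ** (n - i) for i in range(m, n + 1))

def bennett_bound(n, u, eta):
    """Assumed Bennett-type bound on P(S >= n*u*(1 + eta)); illustration only."""
    return math.exp(-n * u * h(1.0 + eta))

# Check h(1 + eta) > eta^2 / 3 on a grid in (0, 1].
etas = [j / 100 for j in range(1, 101)]
lower_bound_ok = all(h(1.0 + e) > e * e / 3.0 for e in etas)

# Check that x -> x * h(1 + eps / x) is decreasing on a grid in (0, 1].
eps = 0.1
xs = [j / 100 for j in range(1, 101)]
decreasing_ok = all(g(a, eps) > g(b, eps) for a, b in zip(xs, xs[1:]))

# Compare an exact binomial tail with the assumed Bennett-type bound.
n, u, m = 200, 0.1, 30
eta = m / (n * u) - 1.0
tail = binom_upper_tail(n, u, m)
bound = bennett_bound(n, u, eta)

print(lower_bound_ok, decreasing_ok, tail <= bound)
```

Because the map is decreasing, its infimum over \((0, 1]\) is attained at \(x = 1\), which is the value \(h(1 + \varepsilon)\) used in the proof.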
Remark A.1
In Proposition 3.5, the condition \(\log(n) = \mathrm{o}(k)\) was imposed to control the remainder term in (A.5). That term arose from an application of Bennett’s inequality in (A.4), producing an upper bound to the binomial tail probability the line before. We now show that the same probability also admits a lower bound that would yield the same condition on \(k\). Indeed, starting from the left-hand side of (A.3), we have
where \(m = m_{n} = \lfloor k(x_{1} + \varepsilon) + 1 \rfloor\) and \(\lfloor \cdot \rfloor\) is the floor function. Stirling’s formula says that \(n! = \sqrt{2\pi n} \, (n/e)^{n} \{1 + \mathrm{o}(1)\}\) as \(n \to \infty\) and thus, since \(k = k_{n} \to \infty\), also
But then
since \(m \ge k x_{1}\). For \(n\) sufficiently large, \(k = k_{n}\) is such that \(m_{n} \le k(x_{1} + 2\varepsilon)\) and thus \((k x_{1}/m)^{m} \ge \rho^{k}\) with \(\rho = \{x_{1}/(x_{1} + 2\varepsilon)\}^{x_{1} + 2\varepsilon}\). We find
Recall that in the proof of Proposition 3.5, we needed to control the second term in (A.1). In view of (A.2) and the above lower bound, we need a sequence \(k\) such that, for every \(\varepsilon > 0\), we have \((n/k) \rho^{k} \to 0\). Here \(0 < \rho = \rho(x_{1}, \varepsilon) < 1\) approaches 1 as \(\varepsilon \downarrow 0\), for every fixed \(x_{1} > 0\). But then \(k = k_{n}\) must be such that \(\log(n) - \log(k) + k \log(\rho) \to -\infty\) as \(n \to \infty\), and since \(\log(\rho) < 0\) can be arbitrarily close to 0 as \(\varepsilon \downarrow 0\), we still need that \(\log(n) = \mathrm{o}(k)\).
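The role of the condition \(\log(n) = \mathrm{o}(k)\) in this remark can be illustrated numerically. The Python sketch below evaluates \(\log(n) - \log(k) + k \log(\rho)\), the logarithm of \((n/k)\rho^{k}\), for two hypothetical choices of \(k\): \(k = \log(n)\), for which \(\log(n) = \mathrm{o}(k)\) fails, and \(k = (\log n)^{2}\), for which it holds. The values \(x_{1} = 0.5\) and \(\varepsilon = 0.05\), the function names, and the treatment of \(k\) as a real number rather than an integer are assumptions made for illustration only.

```python
import math

def rho(x1, eps):
    """rho = {x1 / (x1 + 2*eps)}^(x1 + 2*eps), as defined in the remark."""
    return (x1 / (x1 + 2.0 * eps)) ** (x1 + 2.0 * eps)

def log_ratio(n, k, x1, eps):
    """log of (n / k) * rho^k, i.e. log(n) - log(k) + k * log(rho)."""
    return math.log(n) - math.log(k) + k * math.log(rho(x1, eps))

x1, eps = 0.5, 0.05          # illustrative values, not taken from the paper
ns = [10**6, 10**9, 10**12]

# k = log(n): log(n) is not o(k), and the log-ratio does not tend to -infinity.
slow = [log_ratio(n, math.log(n), x1, eps) for n in ns]

# k = (log n)^2: log(n) = o(k), and the log-ratio decreases along this grid.
fast = [log_ratio(n, math.log(n) ** 2, x1, eps) for n in ns]

print(slow)
print(fast)
```

For these values the first sequence increases with \(n\) while the second decreases, in line with the conclusion of the remark.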
Proof of Proposition 3.6
Fix \(\boldsymbol{x} \in [0, M]^{d}\). For \(j \in \{1, \ldots, d\}\) such that \(x_{j} = 0\), the binomial distribution \(\operatorname{Bin}(n, (k/n) x_{j})\) is concentrated on 0. As a consequence, the integral over \(\boldsymbol{y} \in [0, n/k]^{d}\) with respect to \(\nu_{n,k,\boldsymbol{x}}\) can be restricted to the set of those \(\boldsymbol{y} \in [0, n/k]^{d}\) such that \(y_{j} = 0\) for all \(j \in \{1, \ldots, d\}\) for which \(x_{j} = 0\). Call this set \(\mathbb{D}(n,k,\boldsymbol{x})\).
For \(\boldsymbol{y} \in \mathbb{D}(n,k,\boldsymbol{x})\), the function \([0, 1] \to \mathbb{R} : t \mapsto f(\boldsymbol{x} + t(\boldsymbol{y} - \boldsymbol{x}))\) is continuous on \([0, 1]\) and is continuously differentiable on \((0, 1)\); indeed, if \(x_{j} = 0\), then the \(j\)th component of \(\boldsymbol{x} + t(\boldsymbol{y} - \boldsymbol{x})\) vanishes and thus does not depend on \(t \in [0, 1]\), while if \(x_{j} > 0\), then that component is (strictly) positive for all \(t \in [0, 1)\), so that, by assumption, \(\dot{f}_{j}(\boldsymbol{x} + t(\boldsymbol{y} - \boldsymbol{x}))\) exists and is continuous in \(t \in [0, 1)\). Writing \(J(\boldsymbol{x}) = \{j = 1, \ldots, d : x_{j} > 0\}\), we find, by the fundamental theorem of calculus,
where the last step is justified via \(\int y_{j} \,\mathrm{d}\nu_{n,k,\boldsymbol{x}}(\boldsymbol{y}) = \operatorname{\mathbb{E}}[\operatorname{Bin}(n, (k/n) x_{j})/k] = x_{j}\). Taking absolute values, we find, for \(\boldsymbol{x} \in [0, M]^{d}\),
We will find an upper bound for \(I_{n,k}(\boldsymbol{x}, j)\).
Let \(K > 0\) be such that \(| \dot{f}_{i} | \le K\) for all \(i \in \{1, \ldots, d\}\). Choose \(\delta \in (0, M]\) and \(\varepsilon \in (0, \delta/2]\). In \(I_{n,k}(\boldsymbol{x}, j)\), split the integral over \(\boldsymbol{y} \in \mathbb{D}(n,k,\boldsymbol{x})\) into two pieces, depending on whether \(|\boldsymbol{y} - \boldsymbol{x}| \le \varepsilon\) or \(|\boldsymbol{y} - \boldsymbol{x}| > \varepsilon\), where \(| \boldsymbol{z} | = (z_{1}^{2} + \cdots + z_{d}^{2})^{1/2}\) denotes the Euclidean norm of \(\boldsymbol{z} \in \mathbb{R}^{d}\).
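In \(I_{n,k}(\boldsymbol{x}, j)\), the integral over \(\boldsymbol{y} \in \mathbb{D}(n,k,\boldsymbol{x})\) for which \(|\boldsymbol{y} - \boldsymbol{x}| > \varepsilon\) is bounded by
To analyze the integral in \(I_{n,k}(\boldsymbol{x}, j)\) over those \(\boldsymbol{y} \in \mathbb{D}(n,k,\boldsymbol{x})\) for which \(|\boldsymbol{y} - \boldsymbol{x}| \le \varepsilon\), we need to distinguish between two cases: \(x_{j} < \delta\) and \(x_{j} \ge \delta\). In case \(x_{j} < \delta\), the integral is simply bounded by
In case \(x_{j} \ge \delta\), the inequality \(|\boldsymbol{y} - \boldsymbol{x}| \le \varepsilon \le \delta/2\) and the fact that \(\boldsymbol{x} \in [0, M]^{d}\) and \(\boldsymbol{y} \in [0, \infty)^{d}\) imply that \(\boldsymbol{y}\) belongs to the set
The integral in \(I_{n,k}(\boldsymbol{x}, j)\) over \(\boldsymbol{y} \in \mathbb{D}(n,k,\boldsymbol{x})\) such that \(|\boldsymbol{y} - \boldsymbol{x}| \le \varepsilon\) is bounded by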
using the Cauchy–Schwarz inequality and the first two moments of the binomial distribution.
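The moment computations behind this step can be checked directly. Assuming, as the identity for \(\int y_{j} \,\mathrm{d}\nu_{n,k,\boldsymbol{x}}(\boldsymbol{y})\) suggests, that the \(j\)th marginal of \(\nu_{n,k,\boldsymbol{x}}\) is the law of \(\operatorname{Bin}(n, (k/n) x_{j})/k\), the Python sketch below computes the mean, the variance and the mean absolute deviation exactly from the binomial probability mass function and compares the latter with the Cauchy–Schwarz bound \(\sqrt{x_{j}/k}\). The function name and the parameter values are hypothetical and chosen only for illustration.

```python
import math

def scaled_binomial_moments(n, k, x):
    """Exact mean, variance and mean absolute deviation about x of Bin(n, (k/n)*x) / k."""
    p = k * x / n
    pmf = [math.comb(n, s) * p**s * (1.0 - p) ** (n - s) for s in range(n + 1)]
    mean = sum(q * s / k for s, q in enumerate(pmf))
    var = sum(q * (s / k - x) ** 2 for s, q in enumerate(pmf))
    mad = sum(q * abs(s / k - x) for s, q in enumerate(pmf))
    return mean, var, mad

n, k, x = 500, 50, 0.8       # illustrative values, not taken from the paper
mean, var, mad = scaled_binomial_moments(n, k, x)

# First two moments of the binomial distribution: mean x, variance x * (1 - (k/n) x) / k.
var_formula = x * (1.0 - k * x / n) / k

# Cauchy-Schwarz: E|Y - x| <= sqrt(Var Y) <= sqrt(x / k).
cs_bound = math.sqrt(x / k)

print(mean, var, var_formula, mad, cs_bound)
```

The bound \(\sqrt{x_{j}/k}\) drops the factor \(1 - (k/n) x_{j}\) from the variance, which keeps the estimate free of \(n\) at the cost of a slightly larger constant.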
Assembling all the pieces, we obtain
As a consequence, for every \(\delta \in (0, M]\) and every \(\varepsilon \in (0, \delta/2]\), we have
The function \(\dot{f}_{j}\) is continuous and thus uniformly continuous on the compact set \(\mathbb{B}_{j}(M, \delta)\). As a consequence, \(\inf_{\varepsilon > 0} \omega_{j}(M, \delta, \varepsilon) = 0\). The limit superior in the previous display is thus bounded by \(2dK \sqrt{\delta}\), for all \(\delta \in (0, M]\), and must therefore be equal to zero. □
