Abstract
The aim of this paper is to establish recurrence relations satisfied by product moments and covariances of kth records arising from discrete distributions. They will be evaluated for geometric underlying distribution. Then we use these results to obtain formulas for correlation coefficients of geometric kth records. We consider all three known types of kth records: strong, ordinary, and weak.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Let \(X_{1},X_{2},\ldots \) be a sequence of independent and identically distributed (iid) random variables (rv’s) with cumulative distribution function (cdf) F. Let \(X_{1:n}\le \cdots \le X_{n:n}\) stand for the respective order statistics of a random sample \((X_{1},X_{2},\ldots ,X_{n})\). For positive integer k, we define occurrence times \((T_{n}^{(k)},\,n\ge 0)\) and values of kth records \((R_{n}^{(k)},\,n\ge 0)\) as follows:
Note that for \(k=1\) the sequences of 1st record times \((T_{n}^{(1)},\,n\ge 0)\) and 1st records \((R_{n}^{(1)},\,n\ge 0)\) reduce to the sequences of record times \((T_{n},n\ge 0)\) and record values \((R_{n},n\ge 0)\), respectively. For a deeper discussion of records and kth records we refer the reader to the monographs of Arnold et al. (1998) and Nevzorov (2001).
If the underlying distribution is continuous, then the sequence of kth records from a cdf F is distributed as the sequence of records from a cdf \(F_{1:k}=1-(1-F)^{k}\):
where \({\mathop {=}\limits ^\mathrm{d}}\) denotes equality in distribution; see Nevzorov (2001, Theorem 22.6). This fact implies that in the continuous case distributional properties of kth records can be derived in a direct manner from those of records. However, there are some results concerning kth records which do not follow from the corresponding ones for records. An example of such non-trivial results are various bounds on moments of kth records, see for example Raqab and Rychlik (2002, 2004, 2011), Klimczak and Rychlik (2004, 2005), Charalambides and Rychlik (2008), Goroncy and Rychlik (2011), Jasiński (2016), Kozyra and Rychlik (2017).
The situation becomes more complicated for discrete distributions. López-Blázquez et al. (2005) and Dembińska and López-Blázquez (2005b) showed that then (1.1) is not valid. Moreover, in the last-mentioned paper definitions of strong and weak kth records were introduced.
Definition 1
The sequences of strong kth record times \((L_{n}^{(k)},\,n\ge 0)\) and strong kth records \((S_{n}^{(k)},\,n\ge 0)\) are given by:
Definition 2
The sequences of weak kth record times \((U_{n}^{(k)},\,n\ge 0)\) and weak kth records \((W_{n}^{(k)},\,n\ge 0)\) are given by:
When \(k=1\), the definition of 1st weak records amounts to the definition of weak records \((W_{n},\,n\ge 0)\), which were introduced by Vervaat (1973) and investigated among others by Stepanov (1992, 1993), Wesołowski and Ahsanullah (2001), Stepanov et al. (2003), Dembińska and Stepanov (2006), Danielak and Dembińska (2007a, b), Ahsanullah and Aliev (2011), Karczewski and Wesołowski (2017).
Properties of weak kth records, (ordinary) kth records and strong kth records are extensively studied in the literature. We refer to Dembińska and López-Blázquez (2005a), Dembińska (2007a, b, 2008), Dembińska and Danielak (2008) and Oncel and Aliev (2016). However, many problems concerning kth records from discrete distributions are still open. The aim of this paper is to fill this gap in the literature. Precisely, we derive recurrence relations satisfied by product moments of kth records arising from discrete distributions. These results together with the corresponding ones for single moments obtained by Dembińska and Danielak (2008) allow us to establish direct formulas for covariances and correlation coefficients between \(R_{m}^{(k)}\) and \(R_{n}^{(k)}\), \(W_{m}^{(k)}\) and \(W_{n}^{(k)}\) or \(S_{m}^{(k)}\) and \(S_{n}^{(k)}\), \(0\le m<n\) in some specific cases, when the underlying distribution is geometric.
2 Basic properties of discrete kth records
In this section, we recall some results concerning the distribution of discrete kth records, which can be found in Dembińska and López-Blázquez (2005b) and Dembińska and Danielak (2008). They will be useful in Sect. 3.
From now on we assume that \(X_{1},X_{2},\ldots \) are iid rv’s with support on non-negative integers, probability mass function \(p_{j}=\mathrm{P}\,(X_{1}=j)\) and \(q_{j}=\mathrm{P}\,(X_{1}\ge j),\) \(j\ge 0\).
The following lemma describes how the distributional properties of discrete strong kth records can be derived from properties of records \((R_{n},\,n\ge 0)\).
Lemma 1
For any \(k=1,2,\ldots \) the sequence of strong kth records \((S_{n}^{(k)},\,n\ge 0)\) from a cdf F is identical in distribution to a sequence of records \((R_{n},\,n\ge 0)\) from a cdf \(F_{1:k}=1-(1-F)^{k}\):
For ordinary and weak kth records the analogs of (2.1) are not valid. To analyze their properties, it is convenient to define new rv’s \(\eta _{m}^{(k)}, \xi _{m}^{(k)}\), \(m=0,1,\ldots ,\) \(k=1,2,\ldots \) as follows:
To pass from dependent ordinary kth records and weak kth records to independent rv’s we use the lemma and representation given below.
Lemma 2
For any \(k=1,2,\ldots \) and \(m=0,1,\ldots \):
-
(a)
the rv’s \(\eta _{0}^{(k)},\eta _{1}^{(k)},\ldots \) are independent and \(\eta _{m}^{(k)}\) has a binomial distribution b\((k,\frac{p_{m}}{q_{m}})\) with
$$\begin{aligned} P(\eta _{m}^{(k)}=s)=\left( {\begin{array}{c}k\\ s\end{array}}\right) \left( \frac{p_{m}}{q_{m}}\right) ^{s}\left( 1-\frac{p_{m}}{q_{m}}\right) ^{k-s},\quad s=0,1,\ldots ,k, \end{aligned}$$ -
(b)
the rv’s \(\xi _{0}^{(k)},\xi _{1}^{(k)},\ldots \) are independent and \(\xi _{m}^{(k)}\) has a negative binomial distribution nb\((k,1-\frac{p_{m}}{q_{m}})\) with
$$\begin{aligned} P(\xi _{m}^{(k)}=s)=\left( {\begin{array}{c}s+k-1\\ s\end{array}}\right) \left( \frac{p_{m}}{q_{m}}\right) ^{s}\left( 1-\frac{p_{m}}{q_{m}}\right) ^{k},\quad s=0,1,\ldots . \end{aligned}$$
Representation 1
For \(m=1,2,\ldots ,\) \(n=0,1,\ldots ,\) and \(k=1,2,\ldots \):
Now, we are in a position to derive formulas for product moments and covariances of kth records, weak kth records and strong kth records. All these results will be evaluated for the geometric underlying distribution ge(p) with \(p_{j}=pq^{j},\) \(j=0,1,\ldots ,\) where \(q=1-p\in (0,1)\). We will use the recurrence relations for the first moment of any discrete ordinary and weak kth records established by Dembińska and Danielak (2008). They showed that for \(k=1,2,\ldots \)
with \(N_{m}^{(k)}=\eta _{0}^{(k)}+\eta _{1}^{(k)}\cdots +\eta _{m}^{(k)}\) and \(M_{m}^{(k)}=\xi _{0}^{(k)}+\xi _{1}^{(k)}\cdots +\xi _{m}^{(k)}\) respectively, where the rv’s \(\xi _{i}^{(k)}\) and \(\eta _{i}^{(k)}\) are described in Lemma 2. Moreover, in the geometric case, they obtained simple expressions for means and variances as follows
where \((f(q))^{(j)}\) stands for the jth derivative of the function f(q). In the next section, (2.4) and (2.5) will be used to get covariances and correlation coefficients for geometric kth records.
3 Product moments and covariances
To derive formulas expressing product moments of kth records arising from discrete distributions we will use a multivariate extension of Hoeffding’s (1940) lemma. Block and Fang (1988) showed that if \(Z_{1},\ldots ,Z_{d}\) are non negative rv’s then
If \(Z_{1},\ldots ,Z_{d}\) take values in the set of non-negative integers, the above expression can be written as
In particular, we obtain well-known formulas for the first two moments of a single rv \(Z_{1}\):
3.1 kth records
We begin with a general result on ordinary kth records. We use the convention that \(R_{-1}^{(k)}=0\) for \(k\ge 1\).
Theorem 1
For any \(k=1,2,\ldots ,\) and \(0\le m<n\):
Proof
Using (3.1) for \(d=2\) and Representation 1 we get
When \(m_2<m_1\), the inequality \(N_{m_2}^{(k)}=\eta _{0}^{(k)}+\ldots +\eta _{m_{2}}^{(k)}\le m<n\) implies \(\mathrm{P}\,(N_{m_1}^{(k)}=m,N_{m_{2}}^{(k)}\le n)=\mathrm{P}\,(N_{m_1}^{(k)}=m).\) Thus
By similar arguments, for \(m_{2}\ge m_{1}\), we get
Hence, the independence of rv’s \(N_{m_1}^{(k)}\) and \(N_{m_2}^{(k)}-N_{m_1}^{(k)}\) implies
Now, applying (3.3) and (3.4) to (3.2), we complete the proof. \(\square \)
Theorem 1 and formula (2.2) give the following recurrence relations for covariances between kth records.
Corollary 1
For any \(k=1,2,\ldots ,\) and \(0\le m<n\):
The geometric distribution seems to be the only one for which the recurrence relations given in Theorem 1 and Corollary 1 lead to simple expressions.
Theorem 2
Assume that the parent distribution is geometric ge(p). Then for \(k=1,2,\ldots \) and \(0\le m<n\):
and
and \(\mathrm{Var}\,(R_{0}^{(k)})\) is given by (2.5).
Proof
In the geometric case, the rv’s \(\eta _{i}^{(k)}\), \(i=0,1,\ldots \) are not only independent, but also have the same binomial distribution b(k, p). Hence, \(N_{m_1}^{(k)}\) has a binomial distribution with parameters \((m_1+1)k\) and p. With the convention \(\left( {\begin{array}{c}a\\ b\end{array}}\right) =0\) for \(a<b\), it follows that
where \(I_{\{.\}}\) is the indicator function such as \(I_{\{m=0\}}=1\) if \(m=0\) and \(I_{\{m=0\}}=0\) otherwise. Using the formulas
and
for \(m\ge 0\), after some algebra, we obtain
Further, notice that \(N_{m_2}^{(k)}-N_{m_1}^{(k)}\) has a binomial distribution with parameters \((m_{2}-m_{1})k\) and p. Consequently, by (3.7) we have
This implies
Using again (3.7), we can rewrite the last series as
Hence
Combining the above equality with (3.8) and Theorem 1, we get (3.5).
Now, (3.5) and (2.4) give immediately (3.6). Indeed, for \(m=0\) we obtain
The last equality is a consequence of (2.5). \(\square \)
Recurrence relations (3.5) and (3.6) can be further simplified for \(k=1\) and \(k=2\). An easy computation shows that if \(k=1\) then we get
Note that the above conclusion can be also derived by use of the well-known representation \(R_{n}{\mathop {=}\limits ^\mathrm{d}}\sum _{j=1}^{n+1}X_{j}+n\), where \(X_{1},X_{2},\ldots \) are original iid ge(p) rv’s; see Arnold et al. (1998).
Setting \(k=2\) and \(m=0\) in (3.5) and (3.6) yields
By the identity
after some algebra, we obtain
Remark 1
Note that (3.6) implies that for the geometric distribution we have
3.2 Weak kth records
The following theorem is analogous to Theorem 1 and may be proved in much the same way with the convention that \(W_{-1}^{(k)}=0\) for \(k\ge 1\), so its proof is omitted.
Theorem 3
For any \(k=1,2,\ldots ,\) and \(0\le m<n\):
Theorem 3 and formula (2.3) yield the following recurrence relations for covariances between weak kth records.
Corollary 2
For any \(k=1,2,\ldots ,\) and \(0\le m<n\):
The next result shows, that the conclusion of Theorem 3 and Corollary 2 can be simplified in the geometric case.
Theorem 4
Assume that the parent distribution is geometric ge(p). Then for \(k=1,2,\ldots \) and \(0\le m<n\):
and
Proof
In the geometric case, the rv \(M_{m_1}^{(k)}\) is a sum of iid rv’s \(\xi _{i}^{(k)}\), \(i=0,1,\ldots ,m_{1}\). Notice that \(M_{m_1}^{(k)}\) has a negative binomial distribution \(nb\left( k(m_{1}+1),1-p\right) \), because \(\xi _{i}\) has a negative binomial distribution \(nb(k,1-p)\). With the convention \(\left( {\begin{array}{c}a\\ b\end{array}}\right) =0\) for \(a<b\), it follows that
Using the formulas
and
for \(m\ge 0\), we get
Further, since \(M_{m_2}^{(k)}-M_{m_1}^{(k)}\) has a negative binomial distribution \(nb\left( (m_{2}-m_{1})k,1-p\right) \) and by (3.13), we obtain
Hence
where the last equality is a consequence of (3.13). Combining it with (3.14) and Theorem 3 we obtain (3.11). Moreover, analogously to the proof of (3.6), formulas (3.11) and (2.4) imply immediately (3.12). In particular, for \(m=0\) we get \(\mathrm{Cov}\,(W_{0}^{(k)},W_{n}^{(k)})=\mathrm{Var}\,(W_{0}^{(k)})\). \(\square \)
We can simplify recurrence relations (3.11) and (3.12) for \(k=1\) and \(k=2\). When \(k=1\), simple algebra yields
Note that the above expressions can be also derived by use of the well-known representation \(W_{n}{\mathop {=}\limits ^\mathrm{d}}\sum _{j=1}^{n+1}X_{j}\), where \(X_{1},X_{2},\ldots \) are original iid ge(p) rv’s; see e.g. Dembińska (2007b).
Setting \(k=2\) and \(m=0\) in (3.11) and (3.12), we obtain
Observe that for odd j:
while for even j:
Combining the above relations with (3.10) and
we get for \(j\ge 0\):
which, after some algebra, gives
Remark 2
Note that (3.12) implies that for the geometric distribution we have
3.3 Strong kth records
By Lemma 1, to derive expressions for the product moments of strong kth records from cdf F, it suffices to find the corresponding ones of records from cdf \(F_{1:k}\):
This relation enables us to obtain easily the desired formulas for geometric strong kth records. Formulas given in (3.9) with the fact that \(F_{1:k}\) is geometric \(ge(1-q^{k})\) provided that F is geometric ge(p), imply the following theorem.
Theorem 5
For any \(0\le m< n\) and \(k=1,2,\ldots \):
Moreover, it is known that the correlation coefficient between geometric records \(R_{m}\) and \(R_{n}\) is equal to \(\sqrt{(m+1)/(n+1)}\), \(m<n\) (see, for example, Arnold et al. 1998, p. 40). This immediately implies that in the geometric case we get
4 Tables
In this section, we present Tables 1, 2, 3 with numerical values of correlation coefficients between strong 2nd records \(S_{m}^{(2)}\) and \(S_{n}^{(2)}\), 2nd records \(R_{m}^{(2)}\) and \(R_{n}^{(2)}\) and weak 2nd records \(W_{m}^{(2)}\) and \(W_{n}^{(2)}\) for the geometric parent ge(p), where \(m<n\) and \(m=0,1,\ldots ,5\), \(n=1,\ldots ,7\) and \(p=0.25,\, 0.5,\, 0.75\). They provide us some insight into their magnitude of change. It is worth pointing out that in all the presented cases the correlation coefficients between 2nd records \(R_{m}^{(2)}\) and \(R_{n}^{(2)}\) as well as between weak 2nd records \(W_{m}^{(2)}\) and \(W_{n}^{(2)}\) do not exceed the correlation coefficients of 2nd strong records \(S_{m}^{(2)}\) and \(S_{n}^{(2)}\) given in (3.15), which is also the maximum correlation possible between any continuous square-integrable kth records (see, Nevzorov 1992).
References
Ahsanullah M, Aliev F (2011) A characterization of geometric distribution based on weak records. Stat Probab Lett 52:651–655
Arnold BC, Balakrishnan N, Nagaraja HN (1998) Records. Wiley, New York
Block HW, Fang Z (1988) A multivariate extension of Hoeffding’s lemma. Ann Probab 16:1803–1820
Charalambides ChA, Rychlik T (2008) Distributions and moments of record values in a sequence of maximally dependent random variables. J Stat Plann Inference 138:2253–2266
Danielak K, Dembińska A (2007a) Some characterizations of discrete distributions based on weak records. Stat Pap 48:479–489
Danielak K, Dembińska A (2007b) On characterizing discrete distributions via conditional expectations of weak record values. Metrika 66:129–138
Dembińska A (2007a) A review on characterizations of discrete distributions based on records and kth records. Commun Stat Theory Methods 36:1381–1387
Dembińska A (2007b) Characterizations of discrete distributions based on record values. In: Ahsanullah M, Kirmani S (eds) Extreme Value Distributions. Nova Science Publishers, New York, pp 63–75
Dembińska A (2008) \(k\)th records from geometric distribution. Stat Probab Lett 78:1662–1670
Dembińska A, Danielak K (2008) On moments of \(k\)-records from discrete distributions. Commun Stat Theory Methods 37:2516–2531
Dembińska A, López-Blázquez F (2005a) A characterization of geometric distribution through \(k\)th weak records. Commun Stat Theory Methods 34:2345–2351
Dembińska A, López-Blázquez F (2005b) \(k\)th records from discrete distributions. Stat Probab Lett 71:203–214
Dembińska A, Stepanov A (2006) Limit theorems for the ratio of weak records. Stat Probab Lett 76:1454–1464
Goroncy A, Rychlik T (2011) Lower bounds on the expectations of upper record values. J Stat Plann Inference 141:2726–2737
Hoeffding W (1940) Maßtabinvariante Korrelations - Theorie. Schrift Math Inst Univ Berlin 5:181–233
Jasiński K (2016) On maximum variance of kth records. Commun Stat Theory Methods 45:2392–2401
Karczewski R, Wesołowski J (2017) Linearity of regression for weak records, revisited. Statistics 51:878–887
Klimczak M, Rychlik T (2004) Maximum variance of kth records. Stat Probab Lett 69:421–430
Klimczak M, Rychlik T (2005) Recontruction of previous failure times and records. Metrika 61:277–290
Kozyra PM, Rychlik T (2017) Sharp bounds on the expectations of linear combinations of \(k\)th records expressed in the Gini mean difference units. Probab Math Stat Pol (to appear)
López-Blázquez F, Salamanca Miño B, Dembińska A (2005) A note on the distribution of \(k\)th records from discrete distributions. Stat Probab Lett 75:325–330
Nevzorov VB (1992) A characterization of exponential distributions by correlation between records. Math Meth Stat 1:49–54
Nevzorov VB (2001) Records: mathematical theory, translation of mathematical monographs, vol 194. American Mathematical Society, Providence
Oncel SY, Aliev F (2016) Characterizations of geometric and discrete pareto distributions based on the conditional distribution of \(k\)th records. J Stat Theory Appl 15:367–372
Raqab MZ, Rychlik T (2002) Sharp bounds for the mean of \(k\)th record value. Commun Stat Theory Methods 31:1927–1937
Raqab MZ, Rychlik T (2004) Sharp bounds on the expectations of second record values from symmetric populations. Statistics 38:49–57
Raqab MZ, Rychlik T (2011) Bounds for the mean residual life function of a \(k\)-out-of-\(n\) system. Metrika 74:361–380
Stepanov AV (1992) Limit theorems for weak records. Theory Probab Appl 37:570–574 (English translation)
Stepanov AV (1993) A characterization theorem for weak records. Theory Probab Appl 38:762–764 (English translation)
Stepanov AV, Balakrishnan N, Hofmann G (2003) Exact distribution and Fisher information of weak record values. Stat Probab Lett 64:69–81
Vervaat W (1973) Limit theorems for records from discrete distributions. Stoch Process Their Appl 1:317–334
Wesołowski J, Ahsanullah M (2001) Linearity of regression for non-adjacent weak records. Stat Sin 11(1):39–52
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Jasiński, K. Relations for product moments and covariances of kth records from discrete distributions. Metrika 81, 125–141 (2018). https://doi.org/10.1007/s00184-017-0637-9
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-017-0637-9