Abstract
In this paper, we introduce the newly generated Ristić–Balakrishnan–Topp–Leone–Gompertz-G family of distributions. Statistical and mathematical properties of this new family including moments, moment generating function, incomplete moments, conditional moments, probability weighted moments, distribution of the order statistics, stochastic ordering, and Rényi entropy are derived. The unknown parameters of the family are inferred using the maximum likelihood estimation technique. A Monte Carlo simulation study is performed to investigate the convergence of the maximum likelihood estimation. Three real-life data sets are used to demonstrate the flexibility and capacity of the new family of distributions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Probability distributions are utilized extensively in statistical analysis especially when modeling and predicting real-world phenomena. There is a growing demand for more flexible distributions due to the variety and complexity of big data encountered nowadays. Recent work on the extensions of the existing distributions includes the Gompertz–Topp Leone-G family of distributions [33], the Marshall–Olkin-Odd power generalized Weibull-G family of distributions [10], Topp–Leone odd Fréchet generated family of distributions [4], a new extended Rayleigh distribution [6], the odd Weibull inverse Topp–Leone distribution [5], Type I half logistic Burr X-G family of distributions [2], and the Marshall–Olkin–Weibull-H family of distributions [1].
The gamma generator proposed by [35] has been one of the most explored approaches for creating new distributions. The cumulative distribution function (cdf) of the gamma generator is provided by
with a probability density function (pdf)
Some existing distributions developed via the gamma transformation method include the gamma odd power generalized Weibull-G family of distributions by [14], a new gamma generalized Lindley-log-logistic distribution by [24], the gamma modified Weibull distribution by [11], the gamma-exponentiated Weibull distribution by [31], the gamma odd Burr III-G family of distributions by [30], the gamma Weibull-G family of distributions by [28], the Zografos–Balakrishnan-G family of distributions by [27], Zografos–Balakrishnan Burr XII distribution by [8], the Zografos–Balakrishnan Lindley distribution by [21], the Zografos–Balakrishnan odd log-logistic generalized half-normal distribution by [26], the gamma-generalized inverse Weibull distribution by [29], the Ristić and Balakrishnan Lindley–Poisson distribution by [13], and Ristić–Balakrishnan extended exponential distribution by [16].
In this paper, we develop a novel family of distributions using the gamma generator method in conjunction with the Topp–Leone-G (TL-G) family of distributions [7] and the Gompertz-G (Gom-G) family of distributions [3]. The cdf and pdf of TL-G are given by
and
respectively, for \(b>0\) and parameter vector \(\psi\). Note that \({\overline{G}}(x;\psi )=1-G(x;\psi )\) is the baseline survival function (also called reliability function), and \(g(x;\psi )=dG(x;\psi )/dx\) where \(G(x;\psi )\) is the cdf of any continuous distribution. The cdf and pdf of the Gompertz-G (Gom-G) family of distributions are
and
where \(\theta , \gamma >0\), and \(\psi\) is the baseline parameter vector.
The objectives of developing this new family of distributions are as follows: to create new statistical distributions with more shape properties than previous distributions; to extend the possible shapes of the density and hazard rate functions of the baseline distribution; to skew any symmetrical distribution; and to modulate the weight of the tails of any parental distribution.
The remaining sections are organized as follows. In Sect. 2, we introduce the new RB–TL–Gom-G distribution family, its subfamilies, the hazard rate function, and the quantile function. Some special cases of the new family of distributions are described in Sect. 3. The expansion of density functions is demonstrated in Sect. 4. Section 5 derives the mathematical and statistical properties of the RB–TL–Gom-G family of distributions. In Sect. 6, the model parameters of the RB–TL–Gom-G family of distributions are estimated using the maximum likelihood approach, and in Sect. 7, convergence properties are validated using a simulation study. The flexibility and applicability of the RB–TL–Gom-G family of distributions are exemplified by three real-world data sets in Sect. 8. Section 9 concludes with some final remarks.
In this study, all simulations, parameter estimates, and model comparisons are conducted in R (an open-source software environment maintained by R Core Team and the R Foundation for Statistical Computing (https://www.r-project.org/).
2 The New Family of Distributions
Based on the TL-G and Gom-G families of distributions, we define a new family of distributions namely the Ristić–Balakrishnan–Topp–Leone–Gompertz-G (RB–TL–Gom-G) distribution.Footnote 1 The cdf of the RB–TL–Gom-G family of distributions is denoted by
where \(\gamma (\delta , y)= \int _{0}^y t^{\delta -1} e^{-t} dt\) is the lower incomplete gamma function, and \(\varGamma (\delta )\) is the gamma function. Note that \(g(x;\psi )=dG(x;\psi )/dx\) is the pdf of any continuous baseline distribution, and \(G(x;\psi )\) is the corresponding cdf. The pdf of the RB–TL–Gom-G family of distributions is
for \(\delta , b, \theta >0\) and baseline parameter vector \(\psi .\)
2.1 Sub-families
Several sub-families of the RB–TL–Gom-G family of distributions are presented in this sub-section.
-
When \(\delta =1,\) we obtain the Topp–Leone–Gompertz-G (TL-Gom-G) family of distributions with the cdf
$$\begin{aligned} F(x; b, \theta , \psi )= \left[ 1-\exp \Big \{\frac{2}{\theta }\left( 1-[1-G(x;\psi )]^{-\theta }\right) \Big \}\right] ^b, \end{aligned}$$for \(b, \theta >0\) and parameter vector \(\psi .\)
-
Setting \(b=1\), we find a new family of distributions namely Ristić–Balakrishnan–Gompertz-G (RB-Gom-G) with the cdf
$$\begin{aligned} F(x; \delta , \theta , \psi )&=1- \frac{\gamma \left( \delta ,-\log \left[ 1- \exp \Big \{\frac{2}{\theta }\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-\theta }\right) \Big \}\right] \right) }{\varGamma (\delta )}, \end{aligned}$$for \(\delta , \theta >0\) and parameter vector \(\psi .\)
-
When \(\theta =1\), a new family of distributions is generated with the cdf
$$\begin{aligned} F(x; \delta , b, \psi )&=1- \frac{\gamma \left( \delta ,-\log \left[ 1- \exp \Big \{{2}\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-1}\right) \Big \}\right] ^b \right) }{\varGamma (\delta )}, \end{aligned}$$(7)for \(\delta , b>0\) and parameter vector \(\psi .\)
-
When \(\delta =b=1,\) it becomes the Gompertz-G (Gom-G) family of distributions with the cdf
$$\begin{aligned} F(x; \theta , \psi )= \left[ 1-\exp \Big \{\frac{2}{\theta }\left( 1-[1-G(x;\psi )]^{-\theta }\right) \Big \}\right] , \end{aligned}$$for \(\theta >0\) and parameter vector \(\psi .\)
-
When \(\delta =\theta =1,\) another new family of distributions is obtained with the cdf
$$\begin{aligned} F(x; b, \psi )= \left[ 1-\exp \Big \{{2}\left( 1-[1-G(x;\psi )]^{-1}\right) \Big \}\right] ^b, \end{aligned}$$for \(b>0\) and parameter vector \(\psi .\)
-
Let \(b=\theta =1\), then we obtain one more new family of distributions with the cdf
$$\begin{aligned} F(x; \delta , \psi )&=1- \frac{\gamma \left( \delta ,-\log \left[ 1- \exp \Big \{{2}\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-1}\right) \Big \}\right] \right) }{\varGamma (\delta )}, \end{aligned}$$for \(\delta >0\) and parameter vector \(\psi .\)
-
As \(\delta =b=\theta =1,\) we obtain a special case of the Gom-G family of distributions with the cdf
$$\begin{aligned} F(x; \psi )= \left[ 1-\exp \Big \{{2}\left( 1-[1-G(x;\psi )]^{-1}\right) \Big \}\right] , \end{aligned}$$for parameter vector \(\psi .\)
2.2 Hazard Rate and Quantile Functions
For a continuous random variable X with cdf F(x), survival function \({\bar{F}}(x)=1-F(x)\), and pdf f(x), its hazard rate function (hrf), reverse hazard function (rhf), and mean residual life function are defined as \(h_F(x) = f(x)/{\bar{F}}(x)\), \(\tau _{F}(x)=f(x)/F(x)\), and \(\displaystyle \delta _F(x)=\int _x^\infty \dfrac{{\bar{F}}(u)}{{\bar{F}}(x)} du\) respectively. It has been shown by [36] that the behaviors of \(h_F(x)\), \(\tau _{F}(x)\), and \(\delta _F(x)\) are equivalent. Here we will only present the hazard rate function, but the reverse hazard and residual life functions can be obtained in a similar way. Given the cdf (Eq. (5)) and pdf (Eq. (6)), the hrf of the RB–TL–Gom-G family of distributions is
The quantile function of the RB–TL–Gom-G family of distributions can be obtained by solving for the inverse cumulative distribution function as
for \(0\le p \le 1\), which is equivalent to
and
Thus, it is sufficient to solve
Therefore, the quantile function of the RB–TL–Gom-G family of distributions reduces to the quantile \(x_q\) of the baseline distribution with cdf \({G}(x;\psi )\) which is given by
where q is defined in Eq. (9).
3 Some Special Cases
Special cases of the RB–TL–Gom-G family of distributions are presented in this section when the baseline cdf \(G(x; \psi )\) is specified as the Burr XII, Weibull, or Uniform distributions.
3.1 Ristić–Balakrishnan–Topp–Leone–Gompertz–Burr XII (RB–TL–Gom–BXII) Distribution
Suppose the cdf and pdf of the baseline distribution are given by \(G(x;c, \lambda )=1-(1+x^{c})^{-\lambda }\) and \(g(x;c, \lambda )=\lambda cx^{c-1}(1+x^{c})^{-\lambda -1}\) for \(c, \lambda >0\) and \(x>0.\) The cdf and pdf of the new RB–TL–Gom–BXII distribution are
and
for \(\delta , b, \theta , c, \lambda >0.\) The hrf of the RB–TL–Gom–BXII distribution is given by
for \(\delta , b, \theta , c, \lambda >0.\) If \(\lambda =1,\) we obtain the RB–TL–Gom-Log-Logistic (RB–TL–Gom-LLoG) distribution. Also, if \(c=1,\) the RB–TL–Gom–BXII distribution reduces to the RB–TL–Gom–Lomax (RB–TL–Gom–Lx) distribution.
Figure 1 shows several typical configurations of the pdf and hrf of the RB–TL–Gom–BXII distribution. The pdf can take various shapes including almost symmetric, reverse-J, left-skewed, and right-skewed. Furthermore, plots of the hrf for the RB–TL–Gom–BXII distribution exhibit monotonically increasing, monotonically decreasing, bathtub, and upside-down bathtub shapes.
3.2 Ristić–Balakrishnan–Topp–Leone-Gompertz–Weibull (RB–TL–Gom–W) Distribution
If we consider the Weibull distribution with cdf and pdf given by \(G(x;\lambda )=1-\exp (- x^{\lambda })\) and \(g(x;\lambda )=\lambda x^{\lambda -1}\exp (- x^{\lambda })\) respectively, for \(\lambda >0\) and \(x>0\), as the baseline distribution, then the RB–TL–Gom–W distribution has cdf and pdf given by
and
for \(\delta , b, \theta , \lambda >0.\) The corresponding hrf is given by
for \(\delta , b, \theta , \lambda >0.\)
Figure 2 illustrates the adaptability of the RB–TL–Gom-W pdf and hrf distributions. The pdf produces a variety of forms, such as unimodal, reverse-J, left-skewed, and right-skewed. In addition, hrf plots for the RB–TL–Gom–W distribution exhibit growing, decreasing, bathtub, and inverted bathtub forms.
3.3 Ristić–Balakrishnan–Topp–Leone–Gompertz-Uniform (RB–TL–Gom-U) Distribution
If we consider the uniform distribution as the baseline distribution with cdf and pdf \(G(x; \lambda )=\frac{x}{\lambda }\) and \(g(x; \lambda )=\frac{1}{\lambda }\), for \(\lambda >0\) and \(0<\lambda <x\), then the RB–TL–Gom-U distribution has the cdf and pdf given by
and a pdf of
for \(\delta , b, \theta , \lambda >0.\) The hrf for the RB–TL–Gom-U distribution is
for \(\delta , b, \theta , \lambda >0.\)
Figure 3 presents the pdf and hrf of the RB–TL–Gom-U distribution. The pdf can take numerous shapes such as almost symmetric, J, reverse-J, left-skewed, and right-skewed. Furthermore, the hrf of the RB–TL–Gom-U distribution captures a variety of possibilities including decreasing, bathtub, and bathtub followed upside-down bathtub shapes.
4 Expansion of the Density Function
In this section, we will present a series expansion of the RB–TL–Gom-G density function. First, let \(\displaystyle y=\exp \Big \{\frac{2}{\theta }\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-\theta }\right) \Big \}\), and note that \(\displaystyle -\log (1-y)=\sum \nolimits _{i=0}^\infty \dfrac{y^{i+1}}{i+1}\). Then
Moreover, let \(a_s=(s+2)^{-1}\), then \(\displaystyle \left( \sum \nolimits _{s=0}^\infty \dfrac{y^s}{s+2} \right) ^m=\left( \sum \nolimits _{s=0}^\infty a_sy^s \right) ^m=\sum \nolimits _{s=0}^\infty b_{s,m}y^s,\) where \(\displaystyle b_{s,m}=(sa_0)^{-1}\sum \nolimits _{l=1}^s[m(l+1)-s]a_lb_{s-l,m}\), and \(b_{0,m}=a_0^m\) [17, 32]. Then Eq. (12) can be simplified as
Now, the pdf in Eq. (6) can be written as
Note that \(\displaystyle y=\exp \left( \frac{2}{\theta }\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-\theta }\right) \right)\) and
Substitute Eq. (15) into Eq. (14), and the pdf of RB–TL–Gom-G distribution can be written as
To simplify the notation, we let
be the weights. Then the RB–TL–Gom-G density function can be written as
where \({g_{_{u+1}}}(x;\psi )\) is the pdf of the exponentiated-G (Exp-G) distribution [20] with power parameter \(u+1\).
Therefore, we can obtain the statistical properties of the RB–TL–Gom-G family of distributions from the well-established properties of the Exp-G distributions.
5 Mathematical and Statistical Properties
The moments, moment generating function, incomplete and conditional moments, Reńyi entropy, order statistics, stochastic orderings, and probability weighted moments of the RB–TL–Gom-G family of distributions are presented in this section. Throughout this section X denotes a random variable with a density function \(f_{_{RB-TL-Gom-G}}(x)\) as in Eq. (6), and \(Y_{u+1}\) is a random variable with the exponentiated-G distribution with power parameter \(u+1\).
5.1 Moments and Generating Functions
The \(r^{th}\) moment of the RB–TL–Gom-G family of distributions can be derived as
where \(E(Y^r_{u+1})\) is the \(r^{th}\) moment of the Exp-G distribution with power parameter \(u+1\), and \(w_{u+1}\) is given in Eq. (17). Furthermore, the moment generating function (mgf) for \(s<1\) is
where \(M_{u+1}(s)\) is the mgf of \(Y_{u+1}\), and \(w_{u+1}\) is defined in Eq. (17).
5.2 Incomplete and Conditional Moments
Incomplete and conditional moments are widely used in lifetime models and measures of inequality such as Bonferroni and Lorenz curves. The rth incomplete moment of X can be obtained as
where \(g_{u+1}(x;\psi )\) is the pdf of Exp-G distribution with power parameter \(u+1\). By setting \(r=1\) in Eq. (22), we obtain the first incomplete moment of the RB–TL–Gom-G family of distributions. The rth conditional moments of the RB–TL–Gom-G family of distributions is given by
where \({\bar{F}}_{_{RB-TL-Gom-G}}(a; \delta ,b, \theta , \psi )=1-F_{_{RB-TL-Gom-G}}(a; \delta ,b, \theta , \psi )\) and
for parameter vector \(\psi\). The mean residual life function is given by \(E(X-a|X>a)=E(X|X>a)-a=V_F(a)-a\), where \(V_F(a)\) is referred to as the vitality function of the distribution function F. The mean deviations, Bonferroni, and Lorenz curves can be readily obtained from the conditional and incomplete moments.
5.3 Reńyi Entropy
In information theory, the Reńyi entropy [34] is a measurement that generalizes various related entropies including Hartley entropy, Shannon entropy, collision entropy, and min-entropy. Reńyi entropy plays an important role as an index of diversity in fields such as ecology and statistics. The Reńyi entropy of the RB–TL–Gom-G family of distributions is given by
where
Let \(\displaystyle y=\exp \Big \{\frac{2}{\theta }\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-\theta }\right) \Big \}\). Consider a similar expansion for the pdf, as in Eq. (13), we can obtain that
Therefore, Eq. (26) can be written as
Note that with \(\displaystyle y=\exp \left( \frac{2}{\theta }\left( 1-\left[ 1-{G}(x;\psi )\right] ^{-\theta }\right) \right)\), then
It follows that
Thus, the Reńyi entropy of the RB–TL–Gom-G family of distributions can be written as
where \(\displaystyle I_{REG}=(1-v)^{-1} \log \left[ \int _0^{\infty }\left( \left( 1+u/v\right) G^{u/v}(x;\psi ) {g}(x;\psi )\right) ^v dx \right]\) is the Reńyi entropy of exponentiated-G distribution with power parameter \(\displaystyle 1+u/v\), and the weights are
As a result, we can obtain the Reńyi entropy of the RB–TL–Gom-G family of distributions from that of the exponentiated-G distribution by Eq. (30).
5.4 Order Statistics
Order statistics have a wide range of applications such as in actuarial science, modeling auctions, optimizing production processes, and estimating parameters of distributions. Let \(X_1, X_2, \ldots , X_n\) be independent and identically distributed RB–TL–Gom-G random variables, f(x) and F(x) be the pdf and cdf of the RB–TL–Gom-G family of distributions, and denote \({\bar{F}}(x)=1-F(x)\). The pdf of the ith order statistic, \(f_{_{i:n}}(x)\), can be expressed as
Here we can write
where \(\displaystyle d_{0,i+m-1}=(w_1)^{i+m-1}\),
for \(u=1,2,\ldots\), and \(w_{_{u+1}}\) is given in Eq. (17). Then
where
and \(g_{_{u+r+1}}(x;\psi )\) is the exponentiated-G distribution with parameter \(u+r+1\). Therefore, we can derive properties of the distribution of the order statistics from the RB–TL–Gom-G family of distributions from those of the exponentiated-G distribution.
5.5 Stochastic Orderings
In this subsection, we present the commonly used stochastic orders for the RB–TL–Gom-G family of distributions including stochastic order, hazard rate order and the likelihood ratio order [37].
Let \(F_X(t)\) and \(F_Y(t)\) be the cdfs of two random variables X and Y, and define \({\overline{F}}_X(t)=1-F_X(t)\) and \({\overline{F}}_Y(t)=1-F_Y(t)\) as the corresponding reliability or survival functions. The random variable X is stochastically smaller than Y if, for all t, \(\displaystyle {\overline{F}}_X(t)\le {\overline{F}}_Y(t)\) (or \(\displaystyle {F}_X(t)\ge {F}_Y(t)\)). It is represented by \(\displaystyle X<_{st}Y\) or \({\displaystyle X\preceq Y}\). Moreover, if \({\displaystyle {\overline{F}}_X(t)< {\overline{F}}_Y(t)}\) for some t, then X is stochastically strictly less than Y and denoted as \({\displaystyle X\prec Y}\). In the hazard rate order given by \(\displaystyle X\preceq _{hr}Y\) X, \(h_X(t)\ge h_Y(t)\) for all t. Similarly, X is said to be smaller than Y in the likelihood ratio order denoted by \(\displaystyle X\preceq _{lr}Y\) if \(\displaystyle \frac{f_X(t)}{f_Y(t)}\) is decreasing in t. It has been shown that \(\displaystyle \displaystyle X\preceq _{lr}Y \implies X\preceq _{hr}Y \implies X\preceq Y\) [37].
Now consider two independent random variables \(X_1\) and \(X_2\) following RB–TL–Gom-G family of distributions with \(X_1\sim F_1(x; \delta _1,b, \theta , \psi )\) and \(X_2\sim F_2(x; \delta _2,b,\theta , \psi )\) and their pdfs given by
and
respectively. Then
Differentiating Eq. (36) with respect to x yields
which is negative if \(\delta _1<\delta _2\). Therefore, the likelihood ratio order \(\displaystyle X\preceq _{lr}Y\) exists, and we can conclude that the random variables \(X_1\) and \(X_2\) are stochastically ordered.
5.6 Probability Weighted Moments
Distributions may be characterized by the probability weighted moments (PWMs) [18], defined as
where the random variable X is distributed as \(\displaystyle F \equiv F(x)=P(X\le x)\), i, j and k are real numbers. When j and k are nonnegative integers, then the probability weighted moment of order (i, j, k) is proportional to the ith moment about the origin of the \((j + 1)\)th order statistic for a sample of size \(n=k + j + 1\). If X is a random variable that follows the RB–TL–Gom-G family of distributions, then
and apply a similar expansion in Eq. (33),
where
\(\displaystyle d_{0,j+l}=(w_1)^{j+l}\),
for \(u=1,2,\ldots\), and \(w_{_{u+1}}\) is given in Eq. (17). As a result, the PWMs of the RB–TL–Gom-G family of distributions can be written as
i.e., the PWMs of the RB–TL–Gom-G family of distributions can be obtained from the moments of the exponentiated-G distribution.
6 Maximum Likelihood Estimation
In this section, we estimate the unknown parameters of the RB–TL–Gom-G family of distributions using the maximum likelihood estimation (MLE) method. Assuming that the independent random sample \(\left( X_{1}, X_{2}, \ldots ,X_{n} \right)\) is observed from the RB–TL–Gom-G family of distributions with the vector of model parameters \({\mathbf {\Delta }}=( \delta , b, \theta , \psi )^{T}\). Then, the log-likelihood function \(\ell _{n}=\ell _{n}(\mathbf{{\Delta }})\) for the parameters from the observed values has the form
To obtain the maximum likelihood estimates (MLEs), we maximize the log-likelihood function \(\ell _{n}(\mathbf{{\Delta }})\) numerically. This can be obtained by setting the nonlinear system of equations \((\frac{\partial \ell _{n}}{\partial \delta },\frac{\partial \ell _{n}}{\partial b }, \frac{\partial \ell _{n}}{\partial \theta }, \frac{\partial \ell _{n}}{\partial \psi _{k}})^{T}={{\textbf{0}}}\) and solving them simultaneously. However, since these equations are not in closed form, the MLEs can be found by maximizing \(\ell _{n}(\mathbf{{\Delta }})\) numerically with respect to the parameters using a numerical method such as Newton–Raphson procedure. The partial derivatives of the log-likelihood function with respect to each component of the parameter vector are presented in the Supplementary Information.
7 Monte Carlo Simulations
This section is devoted to assessing the asymptotic convergence property of the estimated parameters of the RB–TL–Gom-LLoG distribution. A Monte Carlo simulation study was performed based on the following: N = 1000 samples of size n = 25, 50, 100, 200, 400, 800, 1600 generated from the RB–TL–Gom-LLoG distribution with different parameter values. The assessment was done based on the average bias (ABIAS) and root mean squared errors (RMSEs).
The ABIAS and RMSE for the estimated parameter, \({\hat{\theta }},\) are given by:
respectively.
The results of the simulation are summarized in Tables 1 and 2. These Tables report the average estimates (mean), average bias (ABIAS), and root mean squared errors (RMSEs). These results indicate the convergence of the mean estimations to the true parameters as the sample size (n) increases. Moreover, the RMSEs and ABIAS converge to zero as n increases further validating that the estimates are robust and consistent.
8 Applications
In this section we illustrate the flexibility and functionality of the RB–TL–Gom-G family of distributions by fitting its special case, the RB–TL–Gom-LLoG distribution, to several real-life data sets. The computations in the applications are based on the definitions in Eqs. (5) and (6). Parameters are estimated using the maximum likelihood estimation method, with the box-constrained optimization using PORT routines [15]. Moreover, we apply the Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm, a quasi-Newton method, to find the best-fit parameter sets and their corresponding goodness-of-fit. The performance of the new distribution is compared to that of other recent models including: Topp-Leone odd Burr III Log-logistic (TL-OBIII-LLoG) by [25], generalized Weibull log-logistic (GWLLoG) distribution by [12], a new gamma exponentiated Lindley-log-logistic (GELLoG) distribution by [24], heavy-tailed log-logistic distribution, named alpha power log-logistic distribution by [38], and alpha power Topp–Leone Weibull (APTLW) distribution by [9]. The pdf’s of the models of comparison are given in the Supplementary Information.
To assess the goodness-of-fit of all the fitted distributions, the well-known goodness-of-fit statistics such as −2log-likelihood statistic (\(-2\ln (L)\)), Akaike Information Criterion (\(AIC=2p-2\ln (L)\)), Consistent Akaike Information Criterion (\(CAIC=AIC+2\frac{p(p+1)}{n-p-1}\)), Bayesian Information Criterion (\(BIC=p\ln (n)-2\ln (L)\)), (n is the number of observations, and p is the number of estimated parameters), Cramér-von Mises (\(W^{*}\)) statistic, Anderson-Darling statistic (\(A^{*}\)), Kolmogorov-Smirnov (K-S) statistic and its \(p-\)value were performed. It is known that, except for the \(p-\)value of the K-S statistic, the smaller the values of all the goodness-of-fit statistics, the better the model for fitting the data set.
8.1 Survival Times of Guinea Pigs Data
This data set consists of the survival times of guinea pigs injected with tubercle bacilli and was analyzed by [23] and [19]. The observations are as follows:
12, 15, 22, 24, 24, 32, 32, 33, 34, 38, 38, 43, 44, 48, 52, 53, 54, 54, 55, 56, 57, 58, 58, 59, 60, 60, 60, 60, 61, 62, 63, 65, 65, 67, 68, 70, 70, 72, 73, 75, 76, 76, 81, 83, 84, 85, 87, 91, 95, 96, 98, 99, 109, 110, 121, 127, 129, 131, 143, 146, 146, 175, 175, 211, 233, 258, 258, 263, 297, 341, 341, 376.
Estimates of the parameters of the fitted distributions (standard error in parentheses), AIC, BIC, W\(^*\), and A\(^*\) are given in Table 3. Plots of the fitted densities along with the histogram and the observed probability vs predicted probability are presented in Fig. 4. We can conclude that the RB-TL-Gom-LLoG distribution is significantly better than other distributions since the values of all goodness-of-fit statistics are the smallest, and the p-value of the K-S statistic is the largest.
Graphical representations such as Kaplan–Meier (K–M) survival curve, theoretical and empirical cumulative distribution functions (ECDF), and total time on test (TTT) scaled are plotted in Fig. 5. It is clear that the fitted empirical and theoretical plots are close to each other, hence we conclude that our model provides a very good fit for the data. Moreover, the TTT scaled plot clearly demonstrates that the model fits a non-monotonic hazard rate structure.
8.2 Active Repair Times Data
The data set was reported by [22], and it represents active repair times for airborne communication transceivers. The observations are
0.50, 0.60, 0.60, 0.70, 0.70, 0.70, 0.80, 0.80, 1.00, 1.00, 1.00, 1.00, 1.10, 1.30, 1.50, 1.50, 1.50, 1.50, 2.00, 2.00, 2.20, 2.50, 2.70, 3.00, 3.00, 3.30, 4.00, 4.00, 4.50, 4.70, 5.00, 5.40, 5.40, 7.00, 7.50, 8.80, 9.00, 10.20, 22.00, 24.50.
Table 4 reports the maximum likelihood estimates (MLEs) of the parameters of all fitted models, as well as the standard errors (in parenthesis) and various goodness-of-fit statistics for the active repair times data. We can see that the RB–TL–Gom-LLoG distribution is the best model for describing the data set since it has the smallest values among all goodness-of-fit statistics and the largest p-value of the K–S statistic. Plots of the fitted densities and the histogram, observed probability against predicted probability are given in Fig. 6.
Moreover, the fitted Kaplan–Meier survival curves, theoretical and ECDF plots, and TTT scaled plot are presented in Fig. 7. From the Kaplan-Meier and ECDF plots, it is clear that the RB–TL–Gom-LLoG distribution provides a perfect description of the active repair times data. The TTT scaled plot demonstrates that the data follow a non-monotonic hazard rate shape.
8.3 Eruptions Data
The last data set was reported by Professor Jim Irish, which can be accessed at http://www.statsci.org/data/oz/kiama.html. Regarding the Kiama Blowhole eruptions, the following data is provided:
83, 51, 87, 60, 28, 95, 8, 27, 15, 10, 18, 16, 29, 54, 91, 8, 17, 55, 10, 35, 47, 77, 36, 17, 21, 36, 18, 40, 10, 7, 34, 27, 28, 56, 8, 25, 68, 146, 89, 18, 73, 69, 9, 37, 10, 82, 29, 8, 60, 61, 61, 18, 169, 25, 8, 26, 11, 83, 11, 42, 17, 14, 9, 12.
The parameter estimates (standard errors in parentheses), the goodness-of-fit statistics: AIC, BIC, CAIC, \(W^{*}\), \(A^{*}\), K-S statistic, and its p-value are given in Table 5. The numbers in Table 5 illustrate that the RB–TL–Gom-LLoG distribution has the smallest values for the goodness-of-fit statistics and the largest p-value of the K–S statistic. Thus, the RB–TL–Gom-LLoG distribution can fit the eruptions data better than the rest of the distributions. Plots of the fitted densities and the histogram, observed probability vs predicted probability are given in Fig. 8.
Figure 9 shows the observed and the fitted Kaplan-Meier survival curves, ECDF, and TTT scaled plot. We can see that the new proposed distribution follows the Kaplan-Meier survival and empirical cdf curves very closely. The TTT scaled plot demonstrates a non-monotonic hrf for the eruptions data.
9 Concluding Remarks
The Ristić–Balakrishnan–Topp–Leone–Gompertz-G (RB–TL–Gom-G) family of distributions has been proposed and studied. Major statistical properties of this novel distribution family are derived. Using the maximum likelihood estimation approach, the parameters of the RB–TL–Gom-G distribution family have been estimated. The accuracy and convergence of the maximum likelihood estimates are assessed using Monte Carlo simulations. By fitting three real-world data sets, the flexibility and properties of the RB–TL–Gom-G family of distributions are presented. In conclusion, the RB–TL–Gom-G family of distributions can leverage the performance of existing distributions to generate a range of densities and hazard rate functions with a variety of shapes.
Data Availibility
All data used in this paper are provided in the context.
Notes
The parameter \(\gamma\) was set to 1 in the Gompertz-G family of distributions (Eqn.(4)) to avoid issues of over-parameterization.
Abbreviations
- cdf:
-
Cumulative distribution function
- pdf:
-
Probability density function
- TL-G:
-
Topp–Leone-G
- RB:
-
Ristić and Balakrishnan
- Gom-G:
-
Gompertz-G
- RB–TL–Gom-G:
-
Ristić–Balakrishnan–Topp–Leone-G–Gompertz-G
- hrf:
-
Hazard rate function
- rhrf:
-
Reverse hazard function
- Exp-G:
-
Exponentiated-G
- PWMs:
-
Probability weighted moments
- MLE:
-
Maximum likelihood estimation
- ABIAS:
-
Average bias
- RMSEs:
-
Root mean squared errors
- TL-OBIII-LLoG:
-
Topp–Leone odd Burr III Log-logistic
- GWLLoG:
-
Generalized Weibull log-logistic
- GELLoG:
-
Gamma exponentiated Lindley-log-logistic
- APTLW:
-
Alpha power Topp–Leone Weibull
- AIC:
-
Akaike Information Criterion
- CAIC:
-
Consistent Akaike Information Criterion
- BIC:
-
Bayesian Information Criterion
- \(W^{*}\) :
-
Cramér–von Mises statistic
- \(A^{*}\) :
-
Anderson–Darling statistic
- K-S:
-
Kolmogorov–Smirnov statistic
References
Afify, A.Z., Al-Mofleh, H., Aljohani, H.M., Cordeiro, G.M.: The Marshall-Olkin-Weibull-H family: estimation, simulations, and applications to covid-19 data. J. King Saud Univ. Sci. 34(5), 102115 (2022)
Algarni, A., Almarashi, A.M., Elbatal, I., Hassan, A.S., Almetwally, E.M., Daghistani, A.M., Elgarhy, M.: Type I half logistic Burr X-G family: properties, Bayesian, and non-Bayesian estimation under censored samples and applications to covid-19 data. Math. Probl. Eng. (2021)
Alizadeh, M., Cordeiro, G.M., Pinho, L.G.B., Ghosh, I.: The Gompertz-G family of distributions. J. Stat. Theory Pract. 11(1), 179–207 (2017)
Al-Marzouki, S., Jamal, F., Chesneau, C., Elgarhy, M.: Topp–Leone odd Fréchet generated family of distributions with applications to covid-19 data sets. Comput. Model. Eng. Sci. 125(1), 437–458 (2020)
Almetwally, E.M.: The odd Weibull inverse Topp–Leone distribution with applications to covid-19 data. Ann. Data Sci. 9(1), 121–140 (2022)
Almongy, H.M., Almetwally, E.M., Aljohani, H.M., Alghamdi, A.S., Hafez, E.: A new extended Rayleigh distribution with applications of covid-19 data. Results Phys. 23, 104012 (2021)
Al-Shomrani, A., Arif, O., Shawky, A., Hanif, S., Shahbaz, M.Q.: Topp–Leone family of distributions: Some properties and application. Pak. J. Stat. Oper. Res. 12(3), 443–451 (2016)
Altun, E., Yousof, H.M., Chakraborty, S., Handique, L.: Zografos–Balakrishnan Burr XII distribution: regression modeling and applications. Int. J. Math. Stat. 19(3), 46–70 (2018)
Benkhelifa, L.: Alpha power Topp–Leone Weibull distribution: properties, characterizations, regression modeling and applications. J. Stat. Manag. Syst., 1–26 (2022)
Chipepa, F., Moakofi, T., Oluyede, B.: The Marshall–Olkin-odd power generalized Weibull-G family of distributions with applications of covid-19 data. J. Probab. Stat. Sci. 20(1), 1–20 (2022)
Cordeiro, G.M., Aristizábal, W.D., Suárez, D.M., Lozano, S.: The gamma modified Weibull distribution. Chil. J. Stat. 6(1), 37–48 (2015)
Cordeiro, G.M., Ortega, M.E.M., Ramires, T.G.: A new generalized Weibull family of distributions: mathematical properties and applications. J. Stat. Distrib. Appl. 2(1), 1–25 (2015)
Fagbamigbe, A.F., Melamu, P., Oluyede, B.O., Makubate, B.: The Ristić and Balakrishnan Lindley–Poisson distribution: model, theory and application. Afr. Stat. 13(4), 1837–1864 (2018)
Gabanakgosi, M., Moakofi, T., Oluyede, B., Makubate, B.: The gamma odd power generalized Weibull-G family of distributions with applications. J. Stat. Model. Theory Appl. 2(2), 79–101 (2021)
Gay, D.M.: Usage summary for selected optimization routines. Comput. Sci. Tech. Rep. 153, 1–21 (1990)
Gomes-Silva, F., de Andrade, T.A.N., Bourguignon, M.: Ristić–Balakrishnan extended exponential distribution. Acta Sci. Technol. 40, e34963 (2018)
Gradshteyn, I.S., Ryzhik, I.M.: Table of Integrals, Series, and Products. Academic press, New York (2014)
Greenwood, J.A., Landwehr, J.M., Matalas, N.C., Wallis, J.R.: Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour. Res. 15(5), 1049–1054 (1979)
Gupta, R.C., Kundu, D.: Weighted inverse Gaussian-a versatile lifetime model. J. Appl. Stat. 38(12), 2695–2708 (2011)
Gupta, R.C., Gupta, P.L., Gupta, R.D.: Modeling failure time data by Lehman alternatives. Commun. Stat. Theory Methods 27(4), 887–904 (1998)
Irshad, M.R., D’cruz, V., Maya, R.: The Zografos–Balakrishnan Lindley distribution: properties and applications. Statistica (Bologna) 81(1), 45–64 (2021)
Jorgensen, B.: Statistical Properties of the Generalized Inverse Gaussian Distribution, vol. 9. Springer, Berlin (2012)
Kundu, D., Gupta, R.D.: An extension of the generalized exponential distribution. Stat. Methodol. 8(6), 485–496 (2011)
Moakofi, T., Oluyede, B., Makubate, B.: A new gamma generalized Lindley-log-logistic distribution with applications. Afr. Stat. 15(4), 2451–2481 (2020)
Moakofi, T., Oluyede, B., Gabanakgosi, M.: The Topp–Leone odd Burr III-G family of distributions: model, properties and applications. Stat. Optim. Inf. Comput. 10(1), 236–262 (2022)
Mozafari, M., Afshari, M., Alizadeh, M., Karamikabir, H.: The Zografos–Balakrishnan odd log-logistic generalized half-normal distribution with mathematical properties and simulations. Stat. Optim. Inf. Comput. 7(1), 211–234 (2019)
Nadarajah, S., Cordeiro, G.M., Ortega, E.M.: The Zografos–Balakrishnan-G family of distributions: mathematical properties and applications. Commun. Stat. Theory Methods 44(1), 186–215 (2015)
Oluyede, B., Pu, S., Makubate, B, Qui, Y.: The gamma-Weibull-G family of distributions with applications. Aust. J. Stat. 47(1), 45–76 (2018)
Oluyede, B.O., Makubate, B., Wanduku, D., Elbatal, I., Sherina, V.: The gamma-generalized inverse Weibull distribution with applications to pricing and lifetime data. J. Comput. Model. 7(2), 1 (2017)
Peter, P.O., Oluyede, B., Bindele, H.F., Ndwapi, N., Mabikwa, O.: The gamma odd Burr III-G family of distributions: model, properties and applications. Rev. Colomb. Estad. 44(2), 331–368 (2021)
Pinho, L., Cordeiro, G., Nobre, J.: The gamma-exponentiated Weibull distribution. J. Stat. Theory Appl. 11(4), 379–395 (2012)
Pu, S., Oluyede, B.O., Qiu, Y., Linder, D.: A generalized class of exponentiated modified Weibull distribution with applications. J. Data Sci. 14(4), 585–613 (2016)
Rannona, K., Oluyede, B., Chamunorwa, S.: The Gompertz–Topp–Leone-G family of distributions with applications. J. Probab. Stat. Sci. 20(1), 108–126 (2022)
Rényi, A.: On measures of entropy and information. In: Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 547–561 (1961)
Ristić, M.M., Balakrishnan, N.: The gamma-exponentiated exponential distribution. J. Stat. Comput. Simul. 82(8), 1191–1206 (2012)
Shaked, M., Shanthikumar, J.G.: Stochastic Orders and Their Applications. Academic press, New York (1994)
Szekli, R.: Stochastic Ordering and Dependence in Applied Probability, vol. 97. Springer, Berlin (2012)
Teamah, A.E.A., Elbanna, A.A., Gemeay, A.M.: Heavy-tailed log-logistic distribution: properties, risk measures and applications. Stat. Optim. Inf. Comput. 9(4), 910–941 (2021)
Acknowledgements
The authors wish to thank Aaron Stringfellow and Dr. Achraf Cohen for their helpful comments on the manuscript.
Funding
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Contributions
All authors have contributed significantly to writing and editing the paper.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pu, S., Moakofi, T. & Oluyede, B. The Ristić–Balakrishnan–Topp–Leone–Gompertz-G Family of Distributions with Applications. J Stat Theory Appl 22, 116–150 (2023). https://doi.org/10.1007/s44199-023-00053-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s44199-023-00053-9
Keywords
- Generalized distributions
- Gamma generator
- Topp–Leone distribution
- Statistical properties
- Maximum likelihood estimation
- Goodness-of-fit tests