Abstract
In this paper, we investigate to propose a new statistical distribution based on power series. We introduce a new family of distributions which are constructed based on a latent complementary risk problem and are obtained by compounding Beta Exponential (BE) and Power Series distributions. The new distribution contains, as special sub-models, several important distributions which are discussed in the literature, such as Beta Exponential Poisson (BEP) distribution, Beta Exponential Geometric (BEG) distribution, Beta Exponential Logarithmic (BEL) distribution, Beta Exponential Binomial (BEB) distribution as special cases. The hazard function of the BEPS distributions can be increasing, decreasing or bathtub shaped among others. The comprehensive mathematical properties of the new distribution is provided such as closed-form expressions for the density, cumulative distribution, survival function, failure rate function, the r-th raw moment, maximum likelihood estimation and also the moments of order statistics. The proposed type of distributions is used to modeling simulated and real datasets.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In the analyzing of lifetime data, it is common that researchers use the Exponential and Generalized Exponential distributions. It is well-known that Exponential can have only constant hazard function and Generalized Exponential distribution can have only monotone hazard functions. Since these distributions are very well-known distributions for modeling lifetime data in reliability and medical studies, one of the interesting points in the data analysis can be searching for distributions that have some properties which enable them to describe the lifetime of some devices.
In the concept of lifetime data analysis, new classes of distributions were proposed based on the Beta Exponential distribution. Interesting reviews of some of these models is presented by [1]. Also, they defined the cumulative distribution function (CDF) of the Generalized Exponential (GE) distribution. The distribution introduced by them is also named the Exponentiated Exponential distribution. Clearly, the Exponential distribution is a particular case of the GE distribution, [1] also investigated some of their properties. The Exponentiated Weibull (EW) distribution, introduced by [2], extends the GE distribution. The Exponentiated Weibull also was studied by [3, 4] and [5]. Nadarajah and Kotz [6] introduced the exponentiated type of distributions such as Exponentiated Gumbel and Exponentiated Frechet distributions by generalizing the Gumbel, Frechet and other distributions in the same way that the GE distribution extends the Exponential distribution. They also provide some mathematical properties for each Exponentiated distribution. Eugene et al. [7] defined the Beta-Bormal (BN) distribution. General expressions for the moments of the BN distribution were derived by [8]. Nadarajah and Kotz [9] considered the Beta Gumbel (BG) distribution and provided closed-form expressions for the moments, the asymptotic distribution of the extreme order statistics and discussed the maximum likelihood estimation procedure. Moreover, [10] worked with the Beta Exponential (BE) distribution and obtained the moment generating function, the first four cumulants, the asymptotic distribution of the extreme order statistics and presented the maximum likelihood estimation. Barreto-Souza et al. [11] proposed the generalization of BE distribution and Raffiq et al. [12] proposed The Marshall–Olkin inverted Nadarajah–Haghighi distribution to analyze lifetime data. In addition, [13] introduced a new generalization of Gumbel–Weibull distribution with application in analysis COVID-19 data.
Recently, by pioneering statistical learning and its related concepts, data science played main role in the analyzing of lifetime data. The effect of data science was not restricted in only computational aspects. Also, it helped theoretical studies. Olsen et al. [14] and Shi et al. [15] discussed and introduced the role and usage of data mining in modern data analysis. Moreover, [16] proposed a new type of Lomax distribution using statistical learning literature. For more information about usage of statistical learning concepts see [17].
In this paper we propose a new lifetime model, called Beta Exponential Power Series distributions. We obtain some of its mathematical properties. Some structural properties of the new distribution are studied. The method of maximum likelihood is used for estimating the model parameters. We illustrate the usefulness of the proposed model by applications to real data. Accordingly in Sect. 2, the new class of BEPS distribution is introduced. In Sect. 3, some properties of this distribution is calculated such as quantiles and order statistics. In Sect. 3, the parameters are estimated by MLE method .In Sect. 4, the numerical experiments is provided for analysis of proposed distribution based on simulated datasets.
2 The Class of BEPS Distribution
The three-parameter distribution known as Beta Exponential (BE) distribution, was introduced by []. The cumulative distribution function (cdf) of the BE distribution with four parameters \( a>0 \), \( b>0 \), \( \lambda >0 \) is given by
and the density function of the BE distribution is given by
The beauty and importance of this distribution lies in its ability to model monotone as well as non-monotone failure rates, which are quite common in lifetime problems and reliability. Let N be a random variable denoting the number of failure causes, \(N = 1, 2, ... \) and considering N following a power series distribution (truncated at zero) with probability function given by
where \(a_{1},a_{2},...\) is a sequence of non-negative real numbers, where at least one of them is strictly positive, S is a positive number no greater than the ratio of convergence of the power series \(\sum _{n=1}^{\infty }a_{n}\theta ^{n}\) and \(C(\theta ) =\sum _{n=1}^{\infty }a_{n}\theta ^{n}\). The Beta Exponential Power Series distribution, denoted by BEPS \((\alpha ,a,b,\lambda ,\theta )\), is defined by the marginal cdf of \(X=\min \{Y_{1},Y_{2},\ldots ,Y_{N}\}\), i.e.,
where \(\alpha ,a ,b > 0\) ,\(\lambda>0,\theta > 0\). The pdf of the BEPS\((\alpha ,a ,b,\lambda ,\theta )\) is given by
The survival function and hazard rate function of the BEPS distribution are given, respectively, by
and
where
2.1 Beta Exponential Poisson Distribution
The Beta Exponential Poisson (BEP) distribution is a special case of BEPS distributions with \(a_{n}=\frac{1}{n!}\) and \(C(\theta )=e^{\theta }-1\). Using cdf (4), the CDF of Beta Exponential Poisson (BEP) distribution is given by
Where \(\alpha ,a ,b> 0, \lambda ,\theta >0\). The associated pdf and hazard rate function of this distribution are given, respectively, by
and
where
The plots of pdf, cdf and hazard rate function of BEP distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 1.
Models that present bathtub-shaped failure rate are very useful in survival analysis. The modeling and analysis of lifetimes is an important aspect of statistical work in a wide variety of scientific and technological fields. The new distribution due to its flexibility in accommodating all the forms of the risk function seems to be an important distribution that can be used in a variety of problems in modeling survival data.
2.2 Beta Exponential Geometric Distribution
The Beta Exponential Geometric (BEG) distribution is a special case of BEPS distributions with \(a_{n}=1\) and \(C(\theta )=\theta {(1-\theta )}^{-1}\). Using cdf (4), the cdf of Beta Exponential Geometric (BEG) distribution is given by
where \(\alpha ,a ,b , \lambda ,\theta >0\). The pdf is
The hazard rate function of BEG distribution is given by
where
The plots of pdf, cdf and hazard rate function of BEG distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 2.
2.3 Beta Exponential Logarithmic Distribution
The Beta Exponential Logarithmic (BEL) distribution is a special case of BEPS distributions with \(a_{n}=\frac{1}{n}\) and \(C(\theta )=-log(1-\theta )\). Using cdf (4), the cdf of Beta Exponential Logarithmic (BEL) distribution is given by
where \(\alpha ,a ,b ,\lambda ,\theta > 0\). The associated pdf and hazard rate functions are given, respectively, by
and
where
The plots of pdf, cdf and hazard rate function of BEL distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 3.
2.4 Beta Exponential Binomial Distribution
The Beta Exponential Binomial (BEB) distribution is a special case of the BEPS distribution with \(a_{n} ={m \atopwithdelims ()n}\) and \(C(\theta )={(\theta +1)}^{m}-1\) where m \((n\le m)\) is the number of replicates. The pdf and cdf of the BEB distribution is given, by
where \(\alpha ,a ,b , \lambda ,\theta \) The associated pdf and hazard rate functions are given, respectively, by
and
where
If \(m = 1\), then the density function in () changes to the density of BE distribution.
3 Statistical Properties
In this section, we proposed some of the basic statistical properties of the BEPS. For examples, we provide Quantiles and Order statistic
3.1 Quantiles, Moments and Order Statistics
The quantiles of a distribution can be used in data generation from a distribution. The quantile \(x_{q}\) of the BEPS\((\alpha ,a,b,\lambda ,\theta )\) is the real solution of the following equation:
The above equation has no closed form solution in \(x_{q}\), so we have to use a numerical technique such as a Newton-Raphson method to get the quantile. The pdf \(f_{i:n}\) of the ith order statistic for a random sample \(X_{1}, X_{2},. . . , X_{n}\) from the BEPS distribution is given by
and the cdf is
Carrasco et al. [18] obtained an infinite representation for the rth moment of the \(BE(a,b,\lambda )\) distribution. If Y has the \(BE(a,b,\lambda )\), the rth moment of Y say \(\mu ^{'}_{r}\), is given as follows
where \(A_{i_{1},...,i_{r}}=a_{i_{1}}...a_{i_{r}}\) , \(S_{r}=i_{1}+...+i_{r}\) , \(a_{i}=\frac{{(-1)}^{i+1}i^{i-2}\lambda ^{i-1}}{(i-1)!} \) and
Let that \(Y_{i:n}\) be the ith order statistic of a random sample from the BE distribution. The rth moment of BEPS \((\alpha ,a,b,\lambda ,\theta )\), is given as follows
According to th e Eq. (16), we can conclude that:
Hence, expectation and variance of BEPS can be immediately implied from (16). Also, based on the results given in (15), the measures of skewness and kurtosis of the BEPS \((\alpha ,a,b,\lambda ,\theta )\) can be obtained according to the following relations, respectively,
and
In Fig. 5, we show the behavior of the Galton’ skewness [19] and Moors’ kurtosis [20] as a functions of \(\theta \) for the values of \(\alpha \),a,b and \(\lambda \). The behavior of different types of BEPS is different against the variation of \(\theta \). But, skewness and kurtosis are generally decreasing meaningfully by increasing the values of \(\alpha \),a,b and \(\lambda \).
4 Estimation and Inference
Standard statistical techniques such as method of maximum likelihood can always be used for parametric estimation. The likelihood equations, given the complete or censored failure data set, can be derived and solved. Parameter estimation is usually a difficult problem even for a five parameter BEPS distribution. Methods like the maximum likelihood estimation will not yield a closed form solution. Different methods can be used to estimate the model parameters. Among these methods, the Maximum Likelihood Estimation method is the most commonly used method for model estimation. In this subsection, we use the maximum likelihood procedure to derive the point and interval estimates of the parameters. Calculating the first partial derivatives of L with respect to \(\alpha ,a,b,\sigma ,\theta \) and equating each to zero, we get the likelihood equations in the following system of nonlinear equations of \(\alpha ,a,b,\sigma \) and \(\theta \).
where
and
To find out the maximum likelihood estimators of \(\alpha ,a,b,\lambda \) and \(\theta \), we have to solve the above system of nonlinear equations with respect to \(\alpha ,a,b,\lambda \) and \(\theta \). As it seems, this system has no closed form solution in \(\alpha ,a,b,\lambda \) and \(\theta \). Then we have to use a numerical technique method, such as Newton-Raphson method, to obtain the solution.
5 Numerical Studies
5.1 Simulation Study
In the simulation study, we examine the precision of maximum likelihood estimators for \(BEPS(\alpha ,a,b,\lambda ,\theta )\). The mean of square error (MSE) for each parameter as comparison tool. We follow the following steps:
-
1.
Consider 4 sample size \(n=50,100,200,400\) for \((\alpha ,a,b,\lambda ,\theta )=(1.25,1,0.75, 1.75,0.5)\)
-
2.
Repeat the above step \(N=5000\) times
-
3.
Calculate the MSE of each parameter over the 100 iterations.
The results of generating random sample and calculating the MSE values are prepared in Table 2. As we can see, by increasing n, MSE decreases. Actually, when \(n=400\), MSE have the minimum amount except \(\alpha \) in BEL. Generally, based on MSE, we can clearly conclude that we can obtain more precise estimations when we have more sample data.
Now, we check the behavior of ML estimation of the model parameters when n tends to infinity. We simulate 1000 repetition of simulated data for \((\alpha ,a,b,\lambda ,\theta )=(1.25,1,0.75,1.75,1)\) with size of \(n=5000\). Then, we consider the average of their MSE over increasing of sample size. Figure 6 shows the effect of increasing sample size n on the amount of MSE for ML estimation of the model parameters. As it can be seen, the MSE decreases significantly by increasing n. In the most of the cases, it converges to 0 around \(n=5000\). Also, there is an exceptions, and the MSE increases for MLE of \(\lambda \) in BEP.
5.2 Real Data Applications
From the current study, it is hoped that BEPS distribution can be used more widely in both a theoretical and an applicable aspect. In this section, we analyze a real data sets to demonstrate the performance of the BEPS distribution in practice. This is a sample of 50 components taken from [21]. We fit the BEPS distributions family over the widely used Aarset data and analyze the obtained results. It is worth to note that the Aarset data was also reported and analyzed in [2, 4, 22, 23] studies and it contains lifetimes of 50 components, which possess a bathtub-shaped failure rate property. The data contains the times to failure of 50 devices put on life test at time 0, from [21].
The submodels of BEPS contained the exponential power series (proposed by [24]), beta exponential and exponential distributions are considered for comparison with the performance of the BEPS distributions family and also for applying likelihood ratio tests. Tables 3 and 4 shows values and Descriptive statistics of the Aarset data. In additions Fig. 7 show histogram and the approximation of density curve of the data.
Table 5 shows the MLEs of parameters of the models (distributions) fitted to Aarset data. Also, we can find the values of AIC, BIC and \(-2\log -likelihood\) (\(-2 log(L)\) ) for the fitted models which are considered for model comparison. According to AIC and \(-2\log (L)\), BEL is the best model. But, based on BIC, BE with a slight difference with BEL has a better performance. In continue, we analyze the performance of fitted models by other approaches to reach the best conclusion about the best fitted model on Aarset data.
For more comparison and reaching better decision about the best model fitted over the Aarset data, we apply likelihood ratio (LR) test. We should calculate the maximum amounts of log-likelihoods of null and alternative hypothesis to obtain the LR statistics for testing some sub-models of the BEPS distribution. In LR test, if \(\Theta \) be the parameter space of problem, we divide the parameter space as \(\Theta =(\Theta _0,\Theta _1)\). \(\Theta _0\) is the parameter space of model under null hypothesis (\(H_0\)) and \(\Theta _1\) is the parameter space of model under alternative hypothesis (\(H_1\)). For example, if we want to asses the BE model against BEL, \(\Theta _0\) is the parameter space under the assumption of BE distribution and \(\Theta _1\) is the parameter space when we consider BEL for data. \(\hat{\Theta _0}\) and \(\hat{\Theta _1}\) be MLE under \(H_0\) and \(H_1\), the LR statistic for this problem is calculated as:
Table 6 shows the information about LR test for some submodels against BEPS family. For the aforementioned example about testing BE versus BEL, \(p-value=0.024\) and we have another evidence to obtain BEL as the best model fitted over the Aarset data.
Now, we examine the goodness of fit for the fitted models over the Aarset data. we consider and compute the Kolmogorov–Smirnov (\(K--S\)), the Cramer–von Mises (\(W^*\)) and Anderson–Darling (\(AD^*\)) statistics for testing the goodness of fit. The smaller values of \(W^*\) and \(AD^*\) indicate the better fitting of the model. [25] provided detailed information about Anderson–Darling and Cramer–von Mises tests.
Table 7 presents the values of \(AD^*\), \(W*\) and \(K--S\) statistics, also it gives \(p-values\) of \(K--S\) test. According to the Table 7, BEL has the minimum values of \(AD^*\) and \(W*\) statistics. In addition, except EL, other distributions are fit for the data based on \(K--S\) test.
Figures 8 and 9, show the estimated survival function plot, TTT plot and Kaplan–Meier curve of the proposed models for the Aarset data. These figures show that the BEL distribution has better fit than the other involved distributions.
6 Conclusion
We introduce a five parameter lifetime family of distributions that is called Beta Exponential Power Series (BEPS). This type of distributions is a new mixed distribution of the Beta Exponential and power series distribution. Beta Exponential Poisson (BEP), beta Exponential Geometric (BEG), Beta Exponential Logarithmic (BEL), and Beta Exponential Binomial (BEB) are the distributions of this family of statistical distributions. Exponential power series (EP) and Beta Exponential (BE) distributions are the special cases of this type of distributions. Furthermore, we provide a mathematical treatment of this distribution including the order statistics. Also, we provide explicit expressions for the density function of the order statistics and their moments. Several properties of the BEPS distribution such as quantiles and moments are provided. BEPS has better fitting over Aarset data in comparison of submodels such as EP and BE.
For the future studies, the BEPS family can be extended by alpha power distributions as a new way to model lifetime data. Similarly, BE and EP can be considered for the extension by alpha power distributions.
Data Availability
The dataset is a sample of 50 components taken from Aarset (1987). The data can be found in the Table 1 in Aarset (1987) which is cited in the references of the paper.
Code Availability
Not Applicable.
References
Gupta RD, Kundu D (1999) Theory & methods: generalized exponential distributions. Austr NZ J Stat 41(2):173–188
Mudholkar GS, Srivastava DK (1993) Exponentiated weibull family for analyzing bathtub failure-rate data. IEEE Trans Eeliab 42(2):299–302
Mudholkar GS, Srivastava DK, Freimer M (1995) The exponentiated weibull family: a reanalysis of the bus-motor-failure data. Technometrics 37(4):436–445
Mudholkar GS, Hutson AD (1996) The exponentiated weibull family: some properties and a flood data application. Commun Stat-Theory Methods 25(12):3059–3083
Nassar MM, Eissa FH (2003) On the exponentiated weibull distribution. Commun Stat-Theory Methods 32(7):1317–1336
Nadarajah S, Kotz S (2006) The exponentiated type distributions. Acta Applicandae Math 92(2):97–111
Eugene N, Lee C, Famoye F (2002) Beta-normal distribution and its applications. Commun Stat-Theory Methods 31(4):497–512
Gupta AK, Nadarajah S (2005) On the moments of the beta normal distribution. Commun Stat-Theory Methods 33(1):1–13
Nadarajah S, Kotz S (2004) The beta gumbel distribution. Math Probl Eng 2004(4):323–332
Nadarajah S, Kotz S (2006) The beta exponential distribution. Reliab Eng Syst Saf 91(6):689–697
Barreto-Souza W, Santos AH, Cordeiro GM (2010) The beta generalized exponential distribution. J Stat Comput Simul 80(2):159–172
Raffiq G, Dar IS, Haq MAU, Ramos E (2020) The marshall–olkin inverted nadarajah–haghighi distribution: estimation and applications. Annals of Data Science, pp 1–16
Osatohanmwen P, Efe-Eyefia E, Oyegue FO, Osemwenkhae JE, Ogbonmwan SM, Afere BA (2022) The exponentiated gumbel–weibull \(\{\)Logistic\(\}\) distribution with application to nigeria’s covid-19 infections data. Annals of Data Science, pp 1–35
Olson DL, Shi Y, Shi Y (2007) Introduction to business data mining. McGraw-Hill/Irwin New York
Shi Y, Tian Y, Kou G, Peng Y, Li J (2011) Optimization based data mining: theory and applications. Springer
Dey S, Altun E, Kumar D and Ghosh I (2021) The reflected-shifted-truncated lomax distribution: associated inference with applications. Annals of Data Science, pp 1–24
Tien JM (2017) Internet of things, real-time decision making, and artificial intelligence. Ann Data Sci 4(2):149–178
Carrasco JM, Ortega EM, Cordeiro GM (2008) A generalized modified weibull distribution for lifetime modeling. Comput Stat Data Anal 53(2):450–462
Johnson NL, Kotz S, Balakrishnan N (1994) Continuous univariate distributions, second edition, vol 1. Wiley, Hoboken
Moors J (1988) A quantile alternative for kurtosis. J Royal Stat Soc: Series D (The Statistician) 37(1):25–32
Aarset MV (1987) How to identify a bathtub hazard rate. IEEE Trans Reliab 36(1):106–108
Wang F (2000) A new model with bathtub-shaped failure rate using an additive burr xii distribution. Reliab Eng Syst Saf 70(3):305–312
Choulakian V, Stephens MA (2001) Goodness-of-fit tests for the generalized pareto distribution. Technometrics 43(4):478–484
Chahkandi M, Ganjali M (2009) On some lifetime distributions with decreasing failure rate. Comput Stat Data Anal 53(12):4433–4440
Chen G, Balakrishnan N (1995) A general purpose approximate goodness-of-fit test. J Qual Technol 27(2):154–161
Acknowledgements
We acknowledged the Editor-in-Chief and reviewers for their thorough observations and useful comments that improved the quality of this paper.
Funding
No funding was received.
Author information
Authors and Affiliations
Contributions
NK: 35%, Methodology, Writing and preparation. EBS: 35%, Conceptualization, Investigation, Supervision and editing. AG: 30%, Visualization, Writing- Reviewing and Editing.
Corresponding author
Ethics declarations
Conflict of interest
Authors state no conflict of interest.
Ethical approval
The conducted research is not related to either human or animals use.
Ethical statements
All of the followed procedures were in accordance with the ethical and scientific standards. Authors declare that this manuscript is the result of their independent creation under the reviewers’ comment and this work does not cause any harm to human or society.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Proof of the formulas of Beta Exponential Power Series distribution, Beta Exponential Poisson distribution, Beta Exponential Geometric distribution, Beta Exponential Logarithmic distribution, Beta Exponential Binomial distribution, respectively,
Proof (BEPS):
Proof (BEP):
Proof (BEG):
Proof (BEL):
Proof(BEB):
Rights and permissions
About this article
Cite this article
Khojastehbakht, N., Ghatari, A. & Samani, E.B. The Beta Exponential Power Series Distribution. Ann. Data. Sci. 10, 1157–1178 (2023). https://doi.org/10.1007/s40745-022-00414-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40745-022-00414-8