1 Introduction

In the analyzing of lifetime data, it is common that researchers use the Exponential and Generalized Exponential distributions. It is well-known that Exponential can have only constant hazard function and Generalized Exponential distribution can have only monotone hazard functions. Since these distributions are very well-known distributions for modeling lifetime data in reliability and medical studies, one of the interesting points in the data analysis can be searching for distributions that have some properties which enable them to describe the lifetime of some devices.

In the concept of lifetime data analysis, new classes of distributions were proposed based on the Beta Exponential distribution. Interesting reviews of some of these models is presented by [1]. Also, they defined the cumulative distribution function (CDF) of the Generalized Exponential (GE) distribution. The distribution introduced by them is also named the Exponentiated Exponential distribution. Clearly, the Exponential distribution is a particular case of the GE distribution, [1] also investigated some of their properties. The Exponentiated Weibull (EW) distribution, introduced by [2], extends the GE distribution. The Exponentiated Weibull also was studied by [3, 4] and [5]. Nadarajah and Kotz [6] introduced the exponentiated type of distributions such as Exponentiated Gumbel and Exponentiated Frechet distributions by generalizing the Gumbel, Frechet and other distributions in the same way that the GE distribution extends the Exponential distribution. They also provide some mathematical properties for each Exponentiated distribution. Eugene et al. [7] defined the Beta-Bormal (BN) distribution. General expressions for the moments of the BN distribution were derived by [8]. Nadarajah and Kotz [9] considered the Beta Gumbel (BG) distribution and provided closed-form expressions for the moments, the asymptotic distribution of the extreme order statistics and discussed the maximum likelihood estimation procedure. Moreover, [10] worked with the Beta Exponential (BE) distribution and obtained the moment generating function, the first four cumulants, the asymptotic distribution of the extreme order statistics and presented the maximum likelihood estimation. Barreto-Souza et al. [11] proposed the generalization of BE distribution and Raffiq et al. [12] proposed The Marshall–Olkin inverted Nadarajah–Haghighi distribution to analyze lifetime data. In addition, [13] introduced a new generalization of Gumbel–Weibull distribution with application in analysis COVID-19 data.

Recently, by pioneering statistical learning and its related concepts, data science played main role in the analyzing of lifetime data. The effect of data science was not restricted in only computational aspects. Also, it helped theoretical studies. Olsen et al. [14] and Shi et al. [15] discussed and introduced the role and usage of data mining in modern data analysis. Moreover, [16] proposed a new type of Lomax distribution using statistical learning literature. For more information about usage of statistical learning concepts see [17].

In this paper we propose a new lifetime model, called Beta Exponential Power Series distributions. We obtain some of its mathematical properties. Some structural properties of the new distribution are studied. The method of maximum likelihood is used for estimating the model parameters. We illustrate the usefulness of the proposed model by applications to real data. Accordingly in Sect. 2, the new class of BEPS distribution is introduced. In Sect. 3, some properties of this distribution is calculated such as quantiles and order statistics. In Sect. 3, the parameters are estimated by MLE method .In Sect. 4, the numerical experiments is provided for analysis of proposed distribution based on simulated datasets.

2 The Class of BEPS Distribution

The three-parameter distribution known as Beta Exponential (BE) distribution, was introduced by []. The cumulative distribution function (cdf) of the BE distribution with four parameters \( a>0 \), \( b>0 \), \( \lambda >0 \) is given by

$$\begin{aligned} I_y(a,b)= & {} {B_y(a,b) \over B(a,b)}\nonumber \\ B_y(a,b)= & {} \int _0^{y} w^{a-1} (1-w)^{b-1} dw\nonumber \\ G(y)= & {} I_{1-exp(-\lambda x)} (a,b)={B_{1-exp(-\lambda x)} (a,b) \over B(a,b)}, \end{aligned}$$
(1)

and the density function of the BE distribution is given by

$$\begin{aligned} g(y)={\lambda exp(-b \lambda x) (1-exp(-\lambda x))^{a-1}\over B(a,b)}. \end{aligned}$$
(2)

The beauty and importance of this distribution lies in its ability to model monotone as well as non-monotone failure rates, which are quite common in lifetime problems and reliability. Let N be a random variable denoting the number of failure causes, \(N = 1, 2, ... \) and considering N following a power series distribution (truncated at zero) with probability function given by

$$\begin{aligned} p(N=n)=\frac{a_{n}\theta ^{n}}{C(\theta )}~,~n=1,2,...~~and~~\theta \in (0,S). \end{aligned}$$
(3)

where \(a_{1},a_{2},...\) is a sequence of non-negative real numbers, where at least one of them is strictly positive, S is a positive number no greater than the ratio of convergence of the power series \(\sum _{n=1}^{\infty }a_{n}\theta ^{n}\) and \(C(\theta ) =\sum _{n=1}^{\infty }a_{n}\theta ^{n}\). The Beta Exponential Power Series distribution, denoted by BEPS \((\alpha ,a,b,\lambda ,\theta )\), is defined by the marginal cdf of \(X=\min \{Y_{1},Y_{2},\ldots ,Y_{N}\}\), i.e.,

$$\begin{aligned} F(x)=1- {C(\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })) \over C(\theta )}. \end{aligned}$$
(4)

where \(\alpha ,a ,b > 0\) ,\(\lambda>0,\theta > 0\). The pdf of the BEPS\((\alpha ,a ,b,\lambda ,\theta )\) is given by

$$\begin{aligned} f(x)={\alpha \theta \lambda \over C(\theta ) B(a,b) } A. \end{aligned}$$
(5)

The survival function and hazard rate function of the BEPS distribution are given, respectively, by

$$\begin{aligned} s(x)={C(\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })) \over C(\theta )}, \end{aligned}$$

and

$$\begin{aligned} h(x)={{\alpha \theta \lambda } \over C(\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }))B(a,b) } A. \end{aligned}$$
(6)

where

$$\begin{aligned} A= & {} C^{\prime } \left( \theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })\left) \right( I_{1-exp(-\lambda x)} (a,b)\right) ^{\alpha -1}\\&exp( -\lambda x) (1-exp(-\lambda x ))^{a-1}exp(-\lambda x )^{b-1}, \end{aligned}$$
Table 1 Useful quantities of some power series distributions

2.1 Beta Exponential Poisson Distribution

The Beta Exponential Poisson (BEP) distribution is a special case of BEPS distributions with \(a_{n}=\frac{1}{n!}\) and \(C(\theta )=e^{\theta }-1\). Using cdf (4), the CDF of Beta Exponential Poisson (BEP) distribution is given by

$$\begin{aligned} F_{{BEP}} (x) = \left\{ {1 - \frac{{(exp(\theta - \theta [I_{{1 - exp( - \lambda x)}} (a,b)]^{\alpha } ) - 1)}}{{exp(\theta ) - 1}}} \right\} . . \end{aligned}$$
(7)

Where \(\alpha ,a ,b> 0, \lambda ,\theta >0\). The associated pdf and hazard rate function of this distribution are given, respectively, by

$$\begin{aligned} f_{BEP}(x)={\alpha \lambda \theta \over (exp(\theta ) -1) B(a,b)} B . \end{aligned}$$
(8)

and

$$\begin{aligned} h_{BEP}(x)= {{\alpha \lambda \theta }\over {(exp( \theta -\theta [I_{1-exp(-\lambda x)} (a,b)]^{\alpha }) -1 )B(a,b)}} B. \end{aligned}$$
(9)

where

$$\begin{aligned} B= & {} exp(\theta ( 1-[I_{1-exp(-\lambda X)} (a,b)]^{\alpha } )) exp(-\lambda x) (1-exp(-\lambda x))^{a-1}\\&exp(-\lambda x)^{ b-1} (I_{1-exp(-\lambda x)} (a,b))^{\alpha -1} \end{aligned}$$

The plots of pdf, cdf and hazard rate function of BEP distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 1.

Fig. 1
figure 1

Plots of density function, cumulative distribution function and hazard function for BEP

Models that present bathtub-shaped failure rate are very useful in survival analysis. The modeling and analysis of lifetimes is an important aspect of statistical work in a wide variety of scientific and technological fields. The new distribution due to its flexibility in accommodating all the forms of the risk function seems to be an important distribution that can be used in a variety of problems in modeling survival data.

2.2 Beta Exponential Geometric Distribution

The Beta Exponential Geometric (BEG) distribution is a special case of BEPS distributions with \(a_{n}=1\) and \(C(\theta )=\theta {(1-\theta )}^{-1}\). Using cdf (4), the cdf of Beta Exponential Geometric (BEG) distribution is given by

$$\begin{aligned} F_{BEG}(x)= 1- {(1- \theta )[1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }] \over 1 - { \theta [1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }]}}. \end{aligned}$$

where \(\alpha ,a ,b , \lambda ,\theta >0\). The pdf is

$$\begin{aligned} f_{BEG}(x)={\alpha \lambda (1-\theta ) \over (exp(\lambda ) -1)( 1 -{ \theta [1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }] )^2}B(a,b)} C. \end{aligned}$$

The hazard rate function of BEG distribution is given by

$$\begin{aligned} h_{BEG}(x)={\alpha \lambda \over [1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }] (1-{ \theta [1-(I_{1-exp(-\lambda x)}(a,b))^{\alpha }]})B(a,b)} C. \end{aligned}$$

where

$$\begin{aligned} C=exp(-\lambda x) (1-exp(-\lambda x))^{a-1} exp(-\lambda x)^{ b-1} (I_{1-exp(-\lambda x)} (a,b))^{\alpha -1} \end{aligned}$$

The plots of pdf, cdf and hazard rate function of BEG distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 2.

Fig. 2
figure 2

Plots of density function, cumulative distribution function and hazard function for BEG

2.3 Beta Exponential Logarithmic Distribution

The Beta Exponential Logarithmic (BEL) distribution is a special case of BEPS distributions with \(a_{n}=\frac{1}{n}\) and \(C(\theta )=-log(1-\theta )\). Using cdf (4), the cdf of Beta Exponential Logarithmic (BEL) distribution is given by

$$\begin{aligned} F_{BEL}(x)= 1- {log [ \theta (1-(I _{1-exp(-\lambda x)}(a,b)) ^{\alpha })]\over log(1-\theta )}, \end{aligned}$$
(10)

where \(\alpha ,a ,b ,\lambda ,\theta > 0\). The associated pdf and hazard rate functions are given, respectively, by

$$\begin{aligned} f_{BEL}(x)= {\lambda \alpha \over log(1-\theta ) [1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })] B(a,b)} D. \end{aligned}$$

and

$$\begin{aligned} h_{BEL}(x)={\lambda \alpha \over log(1-\theta ) [1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha }]log [ \theta (1-(I _{1-exp(-\lambda x)}(a,b)) ^{\alpha }] B(a,b)} D.\nonumber \\ \end{aligned}$$
(11)

where

$$\begin{aligned} D=exp(-\lambda x) (1-exp(-\lambda x))^{a-1} exp(-\lambda x)^{ b-1} (I_{1-exp(-\lambda x)} (a,b))^{\alpha -1} \end{aligned}$$

The plots of pdf, cdf and hazard rate function of BEL distribution for some values of \(\alpha ,a,b, \lambda ,\theta \) are given in Fig. 3.

Fig. 3
figure 3

Plots of density function, cumulative distribution function and hazard function for BEL

2.4 Beta Exponential Binomial Distribution

The Beta Exponential Binomial (BEB) distribution is a special case of the BEPS distribution with \(a_{n} ={m \atopwithdelims ()n}\) and \(C(\theta )={(\theta +1)}^{m}-1\) where m \((n\le m)\) is the number of replicates. The pdf and cdf of the BEB distribution is given, by

$$\begin{aligned} F_{BEB}(x)= {1-{[\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })}]^m -1 \over (\theta +1)^m -1}. \end{aligned}$$

where \(\alpha ,a ,b , \lambda ,\theta \) The associated pdf and hazard rate functions are given, respectively, by

$$\begin{aligned} f_{BEB}(x)= {\alpha \lambda \theta m \over (\theta +1)^{m -1} B(a,b)} E. \end{aligned}$$

and

$$\begin{aligned} h_{BEB}(x)={{\alpha \lambda \theta m } \over {([\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })]^m -1 })B(a,b)} E. \end{aligned}$$

where

$$\begin{aligned} E= & {} \lbrace [\theta (1-(I_{1-exp(-\lambda x)} (a,b))^{\alpha })]^{m-1} exp(-\lambda x) (1-exp(-\lambda x))^{a-1}\\&exp(-\lambda x)^{ b-1} (I_{1-exp(-\lambda x)} (a,b))^{\alpha -1} \rbrace \end{aligned}$$

If \(m = 1\), then the density function in () changes to the density of BE distribution.

Fig. 4
figure 4

Plots of density function, cumulative distribution function and hazard function for BEB

3 Statistical Properties

In this section, we proposed some of the basic statistical properties of the BEPS. For examples, we provide Quantiles and Order statistic

3.1 Quantiles, Moments and Order Statistics

The quantiles of a distribution can be used in data generation from a distribution. The quantile \(x_{q}\) of the BEPS\((\alpha ,a,b,\lambda ,\theta )\) is the real solution of the following equation:

$$\begin{aligned}&\int _{-\infty }^{\chi _q}f(x)dx=q,\nonumber \\&{C( \theta (1-(I_{1-exp(-\lambda x)} (a,b))^\alpha ) \over C(\theta ))}=q,\nonumber \\&C(\theta )= \theta (1-(I_{1-exp(-\lambda x)} (a,b))^\alpha ) \end{aligned}$$
(12)

The above equation has no closed form solution in \(x_{q}\), so we have to use a numerical technique such as a Newton-Raphson method to get the quantile. The pdf \(f_{i:n}\) of the ith order statistic for a random sample \(X_{1}, X_{2},. . . , X_{n}\) from the BEPS distribution is given by

$$\begin{aligned} f_{i:n}(x)= & {} \frac{1}{B(i,n-i+1)}f(x){F(x)}^{i-1}{[1-F(x)]}^{n-i}\nonumber \\= & {} \frac{1}{B(i,n-i+1)} \sum _{j=0}^{n-i}{n-i \atopwithdelims ()j} {(-1)}^{j} \bigg [\frac{C(\theta (1- [I_{1-exp(-\lambda x)} (a,b)]^{\alpha })) }{C(\theta )}\bigg ]^{j+i-1}\nonumber \\ \end{aligned}$$
(13)

and the cdf is

$$\begin{aligned} F_{i:n}(x)= & {} \sum _{k=i}^{n}{n \atopwithdelims ()k}{F(x)}^{k}{[1-F(x)]}^{n-k}\nonumber \\= & {} \sum _{k=i}^{n}\sum _{j=0}^{n-k}{n-k \atopwithdelims ()j}{n \atopwithdelims ()k} {(-1)}^{j} \bigg [\frac{C(\theta (1- [I_{1-exp(-\lambda x)} (a,b)]^{\alpha })) }{C(\theta )} \bigg ]^{j+k}. \end{aligned}$$
(14)

Carrasco et al. [18] obtained an infinite representation for the rth moment of the \(BE(a,b,\lambda )\) distribution. If Y has the \(BE(a,b,\lambda )\), the rth moment of Y say \(\mu ^{'}_{r}\), is given as follows

$$\begin{aligned} \mu ^{'}_{r}= \sum _{j=0}^{\infty }\sum _{i_1,...,i_r=1}^{\infty }w_{j}\frac{A_{i_1,...,i_r}\Gamma (S_r +1)}{\left( \alpha (b+j)\right) ^{S_r} } \end{aligned}$$
(15)

where \(A_{i_{1},...,i_{r}}=a_{i_{1}}...a_{i_{r}}\) , \(S_{r}=i_{1}+...+i_{r}\) , \(a_{i}=\frac{{(-1)}^{i+1}i^{i-2}\lambda ^{i-1}}{(i-1)!} \) and

$$\begin{aligned} w_{j}=w_{j}(a,b)=\frac{(-1)^j \Gamma (a)}{B(a,b)\Gamma (a-j)(b+j)j!} \end{aligned}$$

Let that \(Y_{i:n}\) be the ith order statistic of a random sample from the BE distribution. The rth moment of BEPS \((\alpha ,a,b,\lambda ,\theta )\), is given as follows

$$\begin{aligned} \mu _{r}=E(X^r)= & {} \sum _{i=1}^{\infty }P(N=n)E(Y_{(n)}^r) \nonumber \\= & {} \sum _{n=1}^{\infty }\frac{a_{n}\theta ^{n}}{C(\theta )}n \sum _{j=0}^{\infty }\frac{(-1)^j \Gamma (a)}{B(a,b)\Gamma (a-j)(b+j)j!}\sum _{i_1,...,i_r=1}^{\infty }\frac{A_{i_1,...,i_r}\Gamma (S_r+1)}{\left( \alpha (b+j)\right) ^{\frac{S_r}{\gamma }}}\nonumber \\ \end{aligned}$$
(16)

According to th e Eq. (16), we can conclude that:

$$\begin{aligned}&E_{B E P S}(X)=\mu _{1} \\&{Var}_{B E P S}(X)=\mu _{2}-\mu _{1}^{2}. \end{aligned}$$

Hence, expectation and variance of BEPS can be immediately implied from (16). Also, based on the results given in (15), the measures of skewness and kurtosis of the BEPS \((\alpha ,a,b,\lambda ,\theta )\) can be obtained according to the following relations, respectively,

$$\begin{aligned} Skewness_{B E P S}=\frac{\mu _{3}-3\mu _{1}\mu _{2}+2{\mu _{1}}^3 }{{(\mu _{2}-\mu _{1})}^{\frac{3}{2}}}, \end{aligned}$$

and

$$\begin{aligned} Kurtosis_{B E P S}=\frac{\mu _{4}-4\mu _{3}\mu _{1}+6{\mu _{1}}^{2}\mu _{2}-3{\mu _{1}}^4 }{{(\mu _{2}-\mu _{1})}^{2}}. \end{aligned}$$

In Fig. 5, we show the behavior of the Galton’ skewness [19] and Moors’ kurtosis [20] as a functions of \(\theta \) for the values of \(\alpha \),a,b and \(\lambda \). The behavior of different types of BEPS is different against the variation of \(\theta \). But, skewness and kurtosis are generally decreasing meaningfully by increasing the values of \(\alpha \),a,b and \(\lambda \).

Fig. 5
figure 5

The effect of \(\theta \) on Galton’ skewness and Moors’ kurtosis for different values of \(\alpha \),a,b and \(\lambda \)

4 Estimation and Inference

Standard statistical techniques such as method of maximum likelihood can always be used for parametric estimation. The likelihood equations, given the complete or censored failure data set, can be derived and solved. Parameter estimation is usually a difficult problem even for a five parameter BEPS distribution. Methods like the maximum likelihood estimation will not yield a closed form solution. Different methods can be used to estimate the model parameters. Among these methods, the Maximum Likelihood Estimation method is the most commonly used method for model estimation. In this subsection, we use the maximum likelihood procedure to derive the point and interval estimates of the parameters. Calculating the first partial derivatives of L with respect to \(\alpha ,a,b,\sigma ,\theta \) and equating each to zero, we get the likelihood equations in the following system of nonlinear equations of \(\alpha ,a,b,\sigma \) and \(\theta \).

$$\begin{aligned} \frac{\partial L}{\partial \alpha }= & {} \frac{n}{\alpha }+\sum _{i=1}^{n} \ln \Big (I_{U(x_i)} (a,b) \Big )-\theta \big (I_{U(x)}(a,b)\big )^\alpha \ln \big (I_{U(x)}(a,b)\big )T_{i1} . \end{aligned}$$
(17)
$$\begin{aligned} \frac{\partial L}{\partial \theta }= & {} \frac{n}{\theta }+\Big (1-\big (I_{U(x)}(a,b)\big )^\alpha \Big )-n \frac{C^{\prime }(\theta )}{C(\theta )} \end{aligned}$$
(18)
$$\begin{aligned} \frac{\partial L}{\partial a}= & {} \frac{\frac{\partial (B(a,b))^{-1}}{\partial a}}{(B(a,b))^{-1}}+(\alpha -1)\sum _{i=1}^{n} \frac{\frac{\partial (B(a,b))^{-1}}{\partial a}T_{i2}+(B(a,b))^{-1}T_{i3}}{I_{U(x)}(a,b)} \nonumber \\&-\sum _{i=1}^{n}\alpha \theta \big (I_{U(x)}(a,b)\big )^{\alpha -1}(\frac{\partial (B(a,b))^{-1}}{\partial a} T_{i2}+(B(a,b))^{-1}T_{i3}) T_{i1}. \qquad \end{aligned}$$
(19)
$$\begin{aligned} \frac{\partial L}{\partial b}= & {} \frac{\frac{\partial (B(a,b))^{-1}}{\partial b}}{(B(a,b))^{-1}}(\alpha -1)\sum _{i=1}^{n} \frac{\frac{\partial (B(a,b))^{-1}}{\partial b} T_{i2}+(B(a,b))^{-1} T_{i4}}{I_{U(x)}(a,b)}\nonumber \\&-\sum _{i=1}^{n}\alpha \theta \big (I_{U(x)}(a,b)\big )^{\alpha -1} (\frac{\partial (B(a,b))^{-1}}{\partial b}T_{i2}+(B(a,b))^{-1} T_{i4} )T_{i1}\qquad \end{aligned}$$
(20)

where

$$\begin{aligned} T_1= & {} \frac{C^{\prime \prime }\big (\theta \big (1-(I_{U(x)}(a,b))^\alpha \big )\big )}{C^{\prime } \big (\big (\theta \big (1-(I_{U(x)}(a,b))^\alpha \big )\big )}\\ T_2= & {} \int _{0}^{1-exp(- \lambda x)}t^{a-1}(1-t)^{b-1}dt\\ T_3= & {} \int _{0}^{1-exp(-\lambda x)}t^{a-1} \ln t (1-t)^{b-1}dt \end{aligned}$$

and

$$\begin{aligned} T_4= \int _{0}^{1-exp(-\lambda x)}t^{a-1} \ln (1-t) (1-t)^{b-1}dt \end{aligned}$$

To find out the maximum likelihood estimators of \(\alpha ,a,b,\lambda \) and \(\theta \), we have to solve the above system of nonlinear equations with respect to \(\alpha ,a,b,\lambda \) and \(\theta \). As it seems, this system has no closed form solution in \(\alpha ,a,b,\lambda \) and \(\theta \). Then we have to use a numerical technique method, such as Newton-Raphson method, to obtain the solution.

5 Numerical Studies

5.1 Simulation Study

In the simulation study, we examine the precision of maximum likelihood estimators for \(BEPS(\alpha ,a,b,\lambda ,\theta )\). The mean of square error (MSE) for each parameter as comparison tool. We follow the following steps:

  1. 1.

    Consider 4 sample size \(n=50,100,200,400\) for \((\alpha ,a,b,\lambda ,\theta )=(1.25,1,0.75, 1.75,0.5)\)

  2. 2.

    Repeat the above step \(N=5000\) times

  3. 3.

    Calculate the MSE of each parameter over the 100 iterations.

The results of generating random sample and calculating the MSE values are prepared in Table 2. As we can see, by increasing n, MSE decreases. Actually, when \(n=400\), MSE have the minimum amount except \(\alpha \) in BEL. Generally, based on MSE, we can clearly conclude that we can obtain more precise estimations when we have more sample data.

Table 2 MSE of ML estimation of parameters for different sample sizes

Now, we check the behavior of ML estimation of the model parameters when n tends to infinity. We simulate 1000 repetition of simulated data for \((\alpha ,a,b,\lambda ,\theta )=(1.25,1,0.75,1.75,1)\) with size of \(n=5000\). Then, we consider the average of their MSE over increasing of sample size. Figure 6 shows the effect of increasing sample size n on the amount of MSE for ML estimation of the model parameters. As it can be seen, the MSE decreases significantly by increasing n. In the most of the cases, it converges to 0 around \(n=5000\). Also, there is an exceptions, and the MSE increases for MLE of \(\lambda \) in BEP.

Fig. 6
figure 6

The values of MSE for each parameter against increasing sample sizes

5.2 Real Data Applications

From the current study, it is hoped that BEPS distribution can be used more widely in both a theoretical and an applicable aspect. In this section, we analyze a real data sets to demonstrate the performance of the BEPS distribution in practice. This is a sample of 50 components taken from [21]. We fit the BEPS distributions family over the widely used Aarset data and analyze the obtained results. It is worth to note that the Aarset data was also reported and analyzed in [2, 4, 22, 23] studies and it contains lifetimes of 50 components, which possess a bathtub-shaped failure rate property. The data contains the times to failure of 50 devices put on life test at time 0, from [21].

The submodels of BEPS contained the exponential power series (proposed by [24]), beta exponential and exponential distributions are considered for comparison with the performance of the BEPS distributions family and also for applying likelihood ratio tests. Tables 3 and 4 shows values and Descriptive statistics of the Aarset data. In additions Fig. 7 show histogram and the approximation of density curve of the data.

Table 3 Lifetimes of 50 devices
Table 4 Descriptive values of Aarset data
Fig. 7
figure 7

Plots of density function, cumulative distribution function and hazard function

Table 5 shows the MLEs of parameters of the models (distributions) fitted to Aarset data. Also, we can find the values of AIC, BIC and \(-2\log -likelihood\) (\(-2 log(L)\) ) for the fitted models which are considered for model comparison. According to AIC and \(-2\log (L)\), BEL is the best model. But, based on BIC, BE with a slight difference with BEL has a better performance. In continue, we analyze the performance of fitted models by other approaches to reach the best conclusion about the best fitted model on Aarset data.

Table 5 MLE values of the model parameters, and the values of AIC, BIC and \(-2 log(L)\) for Aarset data

For more comparison and reaching better decision about the best model fitted over the Aarset data, we apply likelihood ratio (LR) test. We should calculate the maximum amounts of log-likelihoods of null and alternative hypothesis to obtain the LR statistics for testing some sub-models of the BEPS distribution. In LR test, if \(\Theta \) be the parameter space of problem, we divide the parameter space as \(\Theta =(\Theta _0,\Theta _1)\). \(\Theta _0\) is the parameter space of model under null hypothesis (\(H_0\)) and \(\Theta _1\) is the parameter space of model under alternative hypothesis (\(H_1\)). For example, if we want to asses the BE model against BEL, \(\Theta _0\) is the parameter space under the assumption of BE distribution and \(\Theta _1\) is the parameter space when we consider BEL for data. \(\hat{\Theta _0}\) and \(\hat{\Theta _1}\) be MLE under \(H_0\) and \(H_1\), the LR statistic for this problem is calculated as:

$$\begin{aligned} L R=-2 \log \left[ \frac{L\left( {\hat{\Theta }}_{0}\right) }{L\left( {\hat{\Theta }}_{1}\right) }\right] =476.24-468.78=7.46. \end{aligned}$$

Table 6 shows the information about LR test for some submodels against BEPS family. For the aforementioned example about testing BE versus BEL, \(p-value=0.024\) and we have another evidence to obtain BEL as the best model fitted over the Aarset data.

Table 6 Likelihood ratio test results for Aarset data

Now, we examine the goodness of fit for the fitted models over the Aarset data. we consider and compute the Kolmogorov–Smirnov (\(K--S\)), the Cramer–von Mises (\(W^*\)) and Anderson–Darling (\(AD^*\)) statistics for testing the goodness of fit. The smaller values of \(W^*\) and \(AD^*\) indicate the better fitting of the model. [25] provided detailed information about Anderson–Darling and Cramer–von Mises tests.

Table 7 presents the values of \(AD^*\), \(W*\) and \(K--S\) statistics, also it gives \(p-values\) of \(K--S\) test. According to the Table 7, BEL has the minimum values of \(AD^*\) and \(W*\) statistics. In addition, except EL, other distributions are fit for the data based on \(K--S\) test.

Table 7 Goodness-of-fit tests for Aarset data

Figures 8 and 9, show the estimated survival function plot, TTT plot and Kaplan–Meier curve of the proposed models for the Aarset data. These figures show that the BEL distribution has better fit than the other involved distributions.

Fig. 8
figure 8

Estimated survival function and the empirical survival for Aarset data

Fig. 9
figure 9

Empirical TTT-plot (top left), estimated hazard rate function (top right), estimated survival functions (bottom) of three fitted models for Aarset data

6 Conclusion

We introduce a five parameter lifetime family of distributions that is called Beta Exponential Power Series (BEPS). This type of distributions is a new mixed distribution of the Beta Exponential and power series distribution. Beta Exponential Poisson (BEP), beta Exponential Geometric (BEG), Beta Exponential Logarithmic (BEL), and Beta Exponential Binomial (BEB) are the distributions of this family of statistical distributions. Exponential power series (EP) and Beta Exponential (BE) distributions are the special cases of this type of distributions. Furthermore, we provide a mathematical treatment of this distribution including the order statistics. Also, we provide explicit expressions for the density function of the order statistics and their moments. Several properties of the BEPS distribution such as quantiles and moments are provided. BEPS has better fitting over Aarset data in comparison of submodels such as EP and BE.

For the future studies, the BEPS family can be extended by alpha power distributions as a new way to model lifetime data. Similarly, BE and EP can be considered for the extension by alpha power distributions.