1 Introduction

Statistical modelling is an essential part of data science across scientific research and decision-making. A key task in this approach is selecting an appropriate distribution, and distributional properties become particularly important when dealing with large data sets (see, for example, [1,2,3]). In the medical and social sciences, modelling count variables is an everyday exercise [4], and the Poisson distribution plays a foremost part in count data analysis. Counts such as the number of suicide attempts, heart attacks, unprotected sexual encounters, days of alcohol drinking, days of missed primary activities, cigarettes smoked, hospitalizations, or unhealthy days during a period are common in medical and psychological research [5,6,7], and the Poisson distribution is widely used to model such count data [6]. In practice, however, data clustering or other sources of heterogeneity in study populations create extra variability, so that the variance exceeds the mean [6]; in such cases the Poisson distribution is inappropriate for data modelling. Medical and public health research often uses zero-inflated models when there is a large proportion of zeros [5]. More generally, whenever there is an excess of observed frequencies at a particular count, inflated models are used.
Too many observed frequencies at a particular count data point may occur in the following situations: (I) not all participants in the study area are governed by the Poisson process, so inflation occurs at a particular count; (II) some unavoidable problem in the sampling increases or decreases the number of participants entering the sample at a particular count, leading to inflation or deflation at that count; (III) as an extreme case of situation (II), no participant in the sample can fall at a particular count, which is called truncation at that count; (IV) a subpopulation follows the Poisson data-generating process as a combination of situations (I) and (III), while the remainder of the population, unaffected by that process, contributes excess observations at the particular count. Thus an inflated distribution is a mixture of a point mass at a particular count and some other count distribution supported on the non-negative integers [8].

In the statistical literature, the issue of zero inflation in count data has a long history. Neyman [9] and Feller [10] first introduced the concept of zero-inflation for data with an excess of zeros. Mullahy [11] introduced the zero-inflated Poisson (ZIP) distribution as a mixture of a Poisson distribution and a point mass at zero, with mixing probability \(\gamma\), denoted by \(ZIP\left( {\lambda ,\gamma } \right)\); its probability mass function is given by

$$P\left( {z;\lambda ,\gamma } \right) = \left\{ {\begin{array}{*{20}c} {\gamma + \left( {1 - \gamma } \right)e^{ - \lambda } ; \, z = 0} \\ {\left( {1 - \gamma } \right)\frac{{e^{ - \lambda } \lambda^{z} }}{z!}; \, z > 0} \\ \end{array} } \right.$$

where \(\gamma\) is a zero-inflation parameter \(\left( {0 < \gamma < 1} \right)\), \(\, \lambda \ge 0\) and if \(\gamma = 0\), ZIP distribution reduces to Poisson distribution.

Using an inflated Poisson distribution, Pandey [12] described counts of the number of flowers of Primula veris and exhibited a Poisson distribution inflated at the point eight, not at zero, owing to the excess number of plants with eight flowers. Keeping this example in view, inflated discrete distributions should be studied at any point \(k,\left( {k \ne 0} \right)\).

Johnson et al. [13] described the zero-inflated distribution as a mixture of any count distribution supported on the non-negative integers and a point mass at zero, defined as follows: a random variable \(Z\) is said to follow a zero-inflated distribution if its probability mass function is given by

$$g(z) = \left\{ {\begin{array}{*{20}c} {\gamma + (1 - \gamma )h(z;\Theta) ;} & {z = 0} \\ {(1 - \gamma )h(z;\Theta );} & {z > 0} \\ \end{array} } \right.$$

where \(\gamma\) is a zero-inflation parameter \(\left( {0 < \gamma < 1} \right)\) and \(h\left( {z;\Theta } \right)\) is the pmf of \(Z\) with parameter vector \(\Theta = \left\{ {\phi_{1} ,\phi_{2} ,...,\phi_{n} } \right\}\).

Gupta et al. [14] studied the structural properties of discrete distributions inflated at the point zero and obtained their MLEs. Murat and Szynal [15] extended the results of Gupta et al. [14] to discrete distributions inflated at any point \(j,\left( {j \ge 0} \right)\). Najundan et al. [16] estimated the parameters of the zero-inflated Poisson model by the method of moments and compared them with the maximum likelihood estimators. Using natural-calamity data, Beckett et al. [17] studied the zero-inflated Poisson model and compared MLEs and MMEs in terms of standardized bias and standardized mean squared error. The zero-inflated Poisson, zero-inflated binomial, zero-inflated negative binomial and zero-inflated geometric distributions were characterized by Najundan et al. [18], Najundan et al. [19], Suresh et al. [20] and Nagesh et al. [21], respectively.

Alshkaki [22] introduced the zero–one-inflated Poisson (ZOIP) distribution, defined as follows: a random variable \(Z\) is said to follow a zero–one-inflated Poisson distribution, denoted by \(ZOIP\left( {\lambda ,\gamma ,\psi } \right)\), if its probability mass function is given by

$$g(z) = \left\{ {\begin{array}{*{20}l} {\gamma + (1 - \gamma - \psi )e^{ - \lambda } ;} \hfill & {z = 0} \hfill \\ {\psi + (1 - \gamma - \psi )\lambda e^{ - \lambda } ;} \hfill & {z = 1} \hfill \\ {(1 - \gamma - \psi )\frac{{e^{ - \lambda } \lambda^{z} }}{z!};} \hfill & {z > 1} \hfill \\ \end{array} } \right.$$

where \(\gamma\) and \(\psi\) are the zero- and one-inflation parameters \(\left( 0 < \gamma < 1, \, 0 < \psi < 1, \, 0 < \gamma + \psi < 1 \right)\); if \(\psi = 0\), ZOIP reduces to ZIP, and if \(\gamma = \psi = 0\), ZOIP reduces to the Poisson distribution. Alshkaki studied its structural properties and estimated its parameters by the methods of maximum likelihood and moments. Using three real data sets, comprising rabbit stillbirth data, accident insurance claims data and heavy-vehicle traffic accident data, he showed that the zero–one-inflated Poisson distribution fits better than the zero-inflated Poisson distribution and that MLE provides better estimates than MME.

Singh et al. [23] introduced the two-inflated binomial distribution to investigate the mechanism of son preference by modelling the pattern of male children in Uttar Pradesh, where family size and sex composition are dominated by strong son preference. Mwalili et al. [24] studied the zero-inflated negative binomial model, an extension of the negative binomial distribution, to accommodate excess zeros.

Lambert [25] demonstrated, using a data set of the number of manufacturing defects on wiring boards, that a zero-inflated Poisson regression fits data with excessive zeros better than a Poisson regression. Many extensions and implementations of zero-inflated Poisson regression have been described (for details see [26,27,28,29,30], among others). Hall [31] derived a zero-inflated binomial model for upper-bounded count data, adapting Lambert's [25] methodology, with an example from horticulture. Famoye and Singh [32] proposed a zero-inflated generalized Poisson regression model, an extension of the generalized Poisson regression model.

In this paper, we propose a three-inflated Poisson (ThIP) distribution, derive its distributional properties and reliability characteristics, and consider the method of moments (MM) and maximum likelihood estimation (MLE) for its parameters. A simulation study is conducted to examine the behaviour of the MLEs. In the application part, a real-life data set of Covid-19-related suicides, covering the period from the lockdown to the first gradual relaxation of its terms (Unlock 1.0), is used to examine the pertinence of the proposed distribution. The proposed distribution is compared with the PD, ZIPD and ZOIPD using the log-likelihood, the Akaike information criterion (AIC) [33] and the Bayesian information criterion (BIC) [34] for model selection, and the Kolmogorov–Smirnov (K-S) test [35] P-values for goodness of fit. A likelihood ratio test is provided to discriminate between the Poisson distribution and our proposed distribution.

2 Three Inflated Poisson Distribution (ThIPD)

Definition 1

A random variable \(Z\) is said to be a three-inflated Poisson distribution, denoted by \(ThIPD\left( {\lambda ,\alpha } \right)\) if its probability mass function is given by

$$P(z;\lambda ,\alpha ) = P(Z = z) = \left\{ {\begin{array}{*{20}c} {\alpha + (1 - \alpha )\frac{{e^{ - \lambda } \lambda^{3} }}{3!};} & {z = 3} \\ {(1 - \alpha )\frac{{e^{ - \lambda } \lambda^{z} }}{z!};} & {z \in \left\{ {0,1,2,4,5,6,7...} \right\}} \\ \end{array} } \right.$$
(1)

where \(\alpha\) is a three-inflation parameter \(\left( {0 < \alpha < 1} \right)\) and \(\, \lambda \ge 0\).
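Equation (1) translates directly into code. The following minimal sketch (the function name `thip_pmf` is ours, not from the paper) evaluates the pmf and makes the two-component mixture structure explicit:

```python
import math

def thip_pmf(z, lam, alpha):
    """pmf of ThIPD(lam, alpha), Eq. (1): a point mass at 3 with
    weight alpha, mixed with a Poisson(lam) distribution."""
    pois = math.exp(-lam) * lam ** z / math.factorial(z)
    if z == 3:
        return alpha + (1 - alpha) * pois
    return (1 - alpha) * pois
```

With \(\alpha = 0\) the function returns the ordinary Poisson pmf, in line with particular case (i) below.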

Some particular cases:

(i) When \(\alpha \to 0\), \(ThIPD\left( {\lambda , \, \alpha } \right)\) reduces to \(Poisson\left( \lambda \right)\).

(ii) When \(\alpha \to 0\) and \(\lambda \to \infty\), \(ThIPD\left( {\lambda , \, \alpha } \right)\) tends to a normal distribution, by the usual normal approximation to the Poisson.

Theorem 1

For the same \(\lambda\), the probability of three under ThIPD is larger than under the ordinary Poisson distribution.

Proof

Since \(\frac{{e^{ - \lambda } \lambda^{3} }}{3!}\) is a Poisson probability, \(0 < \frac{{e^{ - \lambda } \lambda^{3} }}{3!} < 1\), and hence \(1 - \frac{{e^{ - \lambda } \lambda^{3} }}{3!} > 0\).

Multiplying both sides by \(\alpha > 0\), we get

$$\alpha - \alpha \frac{{e^{ - \lambda } \lambda^{3} }}{3!} > 0$$

Finally, adding \(\frac{{e^{ - \lambda } \lambda^{3} }}{3!}\) to both sides, we get

$$\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{3!} > \frac{{e^{ - \lambda } \lambda^{3} }}{3!}$$

Hence proved.

The pmf plots of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) for different choices of the parameters \(\lambda\) and \(\alpha\), illustrating the variety of shapes, are provided in Fig. 1.

Fig. 1

Shapes of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) with different choices of parameter

From the plots of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) in Fig. 1, it is observed that as \(\alpha\) increases the pmf becomes more peaked at \(z = 3\), while as \(\alpha\) decreases and \(\lambda\) increases the pmf tends to a normal curve.

3 Distributional Properties

3.1 Moments

Theorem 2

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then its \(r^{th}\)-order moment about zero is as follows

$$\mu^{\prime}_{r} = E\left( {z^{r} } \right) = 3^{r} \alpha + \left( {1 - \alpha } \right)\sum\limits_{j = 0}^{r} {S(r,j)\lambda^{j} }$$
(2)

Proof

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then the \(r^{th}\)-order moment about zero is

$$\begin{gathered} \mu^{\prime}_{r} = E\left( {Z^{r} } \right) \\ = \sum\limits_{z} {z^{r} P\left( {z;\lambda ,\alpha } \right)} \\ = 3^{r} \alpha + \left( {1 - \alpha } \right)\sum\limits_{z} {z^{r} \frac{{e^{ - \lambda } \lambda^{z} }}{z!}} \\ = 3^{r} \alpha + \left( {1 - \alpha } \right)\sum\limits_{j = 0}^{r} {S(r,j)\lambda^{j} } \, \left[ {\because \sum\limits_{z} {z^{r} \frac{{e^{ - \lambda } \lambda^{z} }}{z!}} = \sum\limits_{j = 0}^{r} {S(r,j)\lambda^{j} } {\text{, for details see }}\left[ {13} \right]} \right] \\ \end{gathered}$$

where \(S(r,j)\) is the Stirling number of the second kind. Hence proved.

In particular

$$\mu^{\prime}_{1} = E\left( Z \right) = 3\alpha + \left( {1 - \alpha } \right)\lambda \,$$
(3)
$$\mu^{\prime}_{2} = E\left( {Z^{2} } \right) = 9\alpha + \left( {1 - \alpha } \right)\left( {\lambda^{2} + \lambda \, } \right)$$
(4)
$$\mu^{\prime}_{3} = E\left( {Z^{3} } \right) = 27\alpha + \left( {1 - \alpha } \right)\left( {\lambda^{3} + 3\lambda^{2} + \lambda \, } \right)$$
(5)
$$\mu^{\prime}_{4} = E\left( {Z^{4} } \right) = 81\alpha + \left( {1 - \alpha } \right)\left( {\lambda^{4} + 6\lambda^{3} + 7\lambda^{2} + \lambda \, } \right)$$
(6)

Therefore,

$$V\left( Z \right) = \left( {1 - \alpha } \right)\left[ {\alpha \left( {\lambda - 3} \right)^{2} + \lambda } \right]$$
(7)
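As a quick numerical sanity check of Eqs. (3) and (7), the mean and variance can be recomputed by direct summation of the pmf. This is a sketch under our own naming; the truncation point 150 is an assumption that suffices for moderate \(\lambda\):

```python
import math

def thip_pmf(z, lam, alpha):
    # pmf of Eq. (1)
    pois = math.exp(-lam) * lam ** z / math.factorial(z)
    return alpha + (1 - alpha) * pois if z == 3 else (1 - alpha) * pois

def thip_mean_var(lam, alpha, zmax=150):
    """Mean and variance by direct summation, to compare with
    E(Z) = 3*alpha + (1-alpha)*lam              (Eq. (3)) and
    V(Z) = (1-alpha)*(alpha*(lam-3)**2 + lam)   (Eq. (7))."""
    m1 = sum(z * thip_pmf(z, lam, alpha) for z in range(zmax))
    m2 = sum(z * z * thip_pmf(z, lam, alpha) for z in range(zmax))
    return m1, m2 - m1 ** 2
```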

The plots of the mean and variance of the proposed distribution for different choices of parameters are shown in Figs. 2 and 3. From Figs. 2 and 3 it is clear that as \(\alpha\) tends to 0, the mean and variance of the proposed distribution tend to \(\lambda\), the mean and variance of the ordinary Poisson distribution.

Fig. 2

Plots of mean against \(\lambda\) for different values of \(\alpha\)

Fig. 3

Plots of variance against \(\lambda\) for different values of \(\alpha\)

3.1.1 Coefficient of Skewness

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then Pearson's \(\beta_{1}\) coefficient is as follows

$$\beta_{1} = \frac{{\mu_{3}^{2} }}{{\mu_{2}^{3} }} = \frac{{\left[ {\mu^{\prime}_{3} - 3\mu^{\prime}_{2} \mu^{\prime}_{1} + 2\mu_{1}^{\prime 3} } \right]^{2} }}{{\left[ {\mu^{\prime}_{2} - \mu_{1}^{\prime 2} } \right]^{3} }}$$
$$= \frac{{\left[ {\lambda + \alpha \left( {\lambda - 3} \right)\left( {2\alpha \left( {\lambda - 3} \right)^{2} - \left( {\lambda - 9} \right)\lambda - 9} \right)} \right]^{2} }}{{\left( {1 - \alpha } \right)\left( {\alpha \left( {\lambda - 3} \right)^{2} + \lambda } \right)^{3} }}$$
(8)

The plots of the coefficient of skewness of the proposed distribution for different choices of parameters are shown in Fig. 4.

Fig. 4

Plots of \(\beta_{1}\) against \(\lambda\) for different values of \(\alpha\)

From Fig. 4 it is observed that \(\beta_{1}\) increases as \(\alpha\) increases, and that \(\beta_{1}\) tends to zero as \(\lambda\) decreases or increases, for \(0 < \alpha < 1\).

Remark 1

As \(\alpha \to 0\) and \(\lambda \to \infty\), the coefficient of skewness \(\beta_{1} \to 0\), i.e. the proposed distribution tends to a symmetric distribution.

3.1.2 Coefficient of Kurtosis

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then Pearson's \(\beta_{2}\) coefficient is as follows

$$\beta_{2} = \frac{{\mu_{4} }}{{\mu_{2}^{2} }} = \frac{{\mu^{\prime}_{4} - 4\mu^{\prime}_{3} \mu^{\prime}_{1} + 6\mu^{\prime}_{2} \mu_{1}^{\prime 2} - 3\mu_{1}^{\prime 4} }}{{\left[ {\mu^{\prime}_{2} - \mu_{1}^{\prime 2} } \right]^{2} }}$$
$$= \frac{{\alpha \left( {\lambda - 3} \right)\left( {3\alpha^{2} \left( {\lambda - 3} \right)^{3} - 3\alpha \left( {\lambda - 3} \right)\left( {9 + \left( {\lambda - 8} \right)\lambda } \right) + \lambda \left( {31 + \left( {\lambda - 9} \right)\lambda } \right) - 27} \right) + \lambda^{2} \left( {3 + \frac{1}{\lambda }} \right)}}{{\left( {1 - \alpha } \right)\left( {\alpha \left( {\lambda - 3} \right)^{2} + \lambda } \right)^{2} }}$$
(9)
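The closed-form shape coefficients can be cross-checked numerically by computing the first four raw moments directly from the pmf and forming the central moments. This is our own verification sketch (function names and the truncation point 150 are assumptions); the closed forms in the test are algebraically equivalent to Eqs. (8)–(9):

```python
import math

def thip_pmf(z, lam, alpha):
    # pmf of Eq. (1)
    pois = math.exp(-lam) * lam ** z / math.factorial(z)
    return alpha + (1 - alpha) * pois if z == 3 else (1 - alpha) * pois

def beta1_beta2(lam, alpha, zmax=150):
    """Pearson's beta_1 and beta_2 from raw moments of the pmf."""
    m1, m2, m3, m4 = (sum(z ** r * thip_pmf(z, lam, alpha)
                          for z in range(zmax)) for r in (1, 2, 3, 4))
    mu2 = m2 - m1 ** 2
    mu3 = m3 - 3 * m2 * m1 + 2 * m1 ** 3
    mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1 ** 2 - 3 * m1 ** 4
    return mu3 ** 2 / mu2 ** 3, mu4 / mu2 ** 2
```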

The plots of the coefficient of kurtosis of the proposed distribution for different choices of parameters are shown in Fig. 5.

Fig. 5

Plots of \(\beta_{2}\) against \(\lambda\) for different values of \(\alpha\)

From Fig. 5 it is observed that \(\beta_{2}\) exceeds 3 as \(\alpha\) increases.

Remark 2

As \(\alpha \to 0\) and \(\lambda \to \infty\), the coefficient of kurtosis \(\beta_{2} \to 3\), i.e. the proposed distribution tends to normality.

3.2 Probability Generating Function

Theorem 3

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then its probability generating function (p.g.f.), \(P_{z} \left( S \right)\), is as follows

$$P_{z} (S) = \alpha s^{3} + \left( {1 - \alpha } \right)e^{{\lambda \left( {s - 1} \right)}}$$
(10)

Proof

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then the probability generating function \(P_{z} (S)\) is

$$\begin{gathered} P_{z} \left( S \right) = E\left( {S^{z} } \right) \\ = \sum\limits_{z = 0}^{\infty } {P\left( {z,\lambda ,\alpha } \right)} s^{z} \\ = \alpha s^{3} + \left( {1 - \alpha } \right)\sum\limits_{z = 0}^{\infty } {s^{z} \frac{{e^{ - \lambda } \lambda^{z} }}{z!}} \\ = \alpha s^{3} + \left( {1 - \alpha } \right)e^{{\lambda \left( {s - 1} \right)}} \\ \end{gathered}$$

Hence proved.
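The p.g.f. of Eq. (10) can be checked against direct summation of \(\sum_z P(z)s^{z}\); a small sketch (names ours):

```python
import math

def thip_pmf(z, lam, alpha):
    # pmf of Eq. (1)
    pois = math.exp(-lam) * lam ** z / math.factorial(z)
    return alpha + (1 - alpha) * pois if z == 3 else (1 - alpha) * pois

def thip_pgf(s, lam, alpha):
    """P_Z(s) of Eq. (10): alpha * s^3 + (1 - alpha) * exp(lam*(s - 1))."""
    return alpha * s ** 3 + (1 - alpha) * math.exp(lam * (s - 1))
```

Evaluating `thip_pgf(math.exp(t), lam, alpha)` gives the m.g.f. of Eq. (11), per Remark 3.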

Remark 3

Putting \(S = e^{t}\) in Eq. (10), the Moment Generating Function (m.g.f), \(M_{z} \left( t \right)\) of \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\) is as follows

$$M_{z} \left( t \right) = \alpha e^{3t} + \left( {1 - \alpha } \right)e^{{\lambda \left( {e^{t} - 1} \right)}}$$
(11)

Remark 4

Putting \(S = e^{it}\) in Eq. (10), the Characteristic Function, \(\varphi_{z} \left( t \right)\) of \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\) is as follows

$$\varphi_{z} \left( t \right) = \alpha e^{3it} + \left( {1 - \alpha } \right)e^{{\lambda \left( {e^{it} - 1} \right)}}$$
(12)

3.3 Cumulative Distribution Function (CDF)

Theorem 4

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then the CDF of \(Z\), for \(z \ge 3\), is as follows

$$F\left( z \right) = P\left( {Z \le z} \right) = \alpha + \left( {1 - \alpha } \right)\frac{{\Gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}}$$
(13)

Proof

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then, for \(z \ge 3\) (so that the point mass at 3 is included in the sum), its CDF is as follows

$$\begin{gathered} F\left( z \right) = P\left( {Z \le z} \right) \\ = \sum\limits_{t = 0}^{z} {P\left( {Z = t} \right)} \\ = \alpha + \left( {1 - \alpha } \right)\sum\limits_{t = 0}^{z} {\frac{{e^{ - \lambda } \lambda^{t} }}{t!}} \\ = \alpha + \left( {1 - \alpha } \right)\frac{{\Gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}} \, \left[ {\sum\limits_{t = 0}^{z} {\frac{{e^{ - \lambda } \lambda^{t} }}{t!}} = \frac{{\Gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}}{\text{, for details see }}\left[ {36} \right]} \right] \\ \end{gathered}$$

where \(\Gamma \left( {z + 1,\lambda } \right)\) is the upper incomplete gamma function and \(\Gamma \left( {z + 1} \right)\) is the (complete) gamma function. For \(z < 3\), the point mass at 3 does not contribute and the leading \(\alpha\) term is dropped.

Hence proved.
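For integer arguments, \(\Gamma(z+1,\lambda)/\Gamma(z+1)\) is a finite Poisson sum, so the CDF can be evaluated without a special-function library. A hedged sketch (function names ours; the branch at \(z = 3\) reflects the remark above that the \(\alpha\) mass enters only for \(z \ge 3\)):

```python
import math

def reg_upper_gamma(n, lam):
    """Gamma(n, lam) / Gamma(n) for integer n >= 1, via the identity
    Q(n, lam) = exp(-lam) * sum_{k=0}^{n-1} lam^k / k!"""
    term, total = math.exp(-lam), 0.0
    for k in range(n):
        total += term
        term *= lam / (k + 1)
    return total

def thip_cdf(z, lam, alpha):
    """CDF of Eq. (13); the point mass at 3 contributes only when z >= 3."""
    base = (1 - alpha) * reg_upper_gamma(z + 1, lam)
    return alpha + base if z >= 3 else base
```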

The plots of the CDF of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) for different choices of the parameters \(\lambda\) and \(\alpha\) are provided in Fig. 6.

Fig. 6

Shapes of CDF with different choices of parameter \(\lambda \, and \, \alpha\)

4 Reliability Characteristics

4.1 Survival Function (SF)

Theorem 5

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then the survival function (SF) of \(Z\), for \(z < 3\), is as follows

$$S(z) = P(Z > z) = \alpha + \left( {1 - \alpha } \right)\frac{{\gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}}$$
(14)

Proof

If \(Z\sim ThIPD \, \left( {\lambda , \, \alpha } \right)\), then, for \(z < 3\) (so that the point mass at 3 lies in the tail), its survival function (SF) is as follows

$$\begin{gathered} S(z) = P(Z > z) \\ = \sum\limits_{t = z + 1}^{\infty } {P(Z = t)} \\ = \alpha + \left( {1 - \alpha } \right)\sum\limits_{t = z + 1}^{\infty } {\frac{{e^{ - \lambda } \lambda^{t} }}{t!}} \\ \end{gathered}$$
$$= \alpha + \left( {1 - \alpha } \right)\frac{{\gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}} \, \left[ {\sum\limits_{t = z + 1}^{\infty } {\frac{{e^{ - \lambda } \lambda^{t} }}{t!}} = \frac{{\gamma \left( {z + 1,\lambda } \right)}}{{\Gamma \left( {z + 1} \right)}}{\text{, for details see }}\left[ {36} \right]} \right]$$

where \(\gamma \left( {z + 1,\lambda } \right)\) is the lower incomplete gamma function and \(\Gamma \left( {z + 1} \right)\) is the (complete) gamma function. For \(z \ge 3\), the point mass at 3 is no longer in the tail and the leading \(\alpha\) term is dropped.

Hence proved.
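The survival function mirrors the CDF computation, since \(\gamma(z+1,\lambda)/\Gamma(z+1) = 1 - \Gamma(z+1,\lambda)/\Gamma(z+1)\) for integer arguments. A hedged companion sketch (names ours):

```python
import math

def reg_upper_gamma(n, lam):
    # Gamma(n, lam)/Gamma(n) for integer n >= 1 (finite Poisson sum)
    term, total = math.exp(-lam), 0.0
    for k in range(n):
        total += term
        term *= lam / (k + 1)
    return total

def thip_sf(z, lam, alpha):
    """S(z) = P(Z > z) of Eq. (14); gamma(z+1, lam)/Gamma(z+1) is the
    regularized lower incomplete gamma, i.e. 1 - Q(z+1, lam).
    The point mass at 3 contributes only while z < 3."""
    base = (1 - alpha) * (1.0 - reg_upper_gamma(z + 1, lam))
    return alpha + base if z < 3 else base
```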

The plots of the survival function (SF) of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) for different choices of the parameters \(\lambda\) and \(\alpha\) are provided in Fig. 7.

Fig. 7

Shapes of SF with different choices of parameter \(\lambda \, and \, \alpha\)

4.2 Failure Rate (FR)

Let \(z_{1} ,z_{2} ,z_{3} ,...,z_{n}\) be a random sample from the three-inflated Poisson distribution given by Eq. (1).

Define \(Y\) as the indicator that an observation equals three, i.e. \(Y = 1\) if \(z_{i} = 3\) and \(Y = 0\) otherwise. Then Eq. (1) can be written as follows

$$P\left( {Z = z_{i} } \right) = \left[ {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{3!}} \right]^{Y} \left[ {\left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{{z_{i} }} }}{{z_{i} !}}} \right]^{1 - Y}$$

and, using \(S(z)\) from Eq. (14), the failure rate of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) is given by

$$R\left( z \right) = \frac{P\left( z \right)}{{S\left( z \right)}} = \frac{{\Gamma \left( {z + 1} \right)\left[ {\left( {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{3!}} \right)^{Y} \left( {\left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{{z_{i} }} }}{{z_{i} !}}} \right)^{1 - Y} } \right]}}{{\Gamma \left( {z + 1} \right)\alpha + \left( {1 - \alpha } \right)\gamma \left( {z + 1,\lambda } \right)}}$$
(15)

The plots of the failure rate (FR) of \(ThIPD \, \left( {\lambda , \, \alpha } \right)\) for different choices of the parameters \(\lambda\) and \(\alpha\) are provided in Fig. 8.

Fig. 8

Shapes of FR with different choices of parameter \(\lambda \, and \, \alpha\)

5 Parameter Estimation

5.1 Method of Moment Estimation (MM)

The parameters \(\lambda\) and \(\alpha\) of Eq. (1) can be obtained by the method of moments as follows:

Considering the first two moments from Eqs. (3) and (4)

$$\hat{\alpha } = \frac{{\mu^{\prime}_{1} - \lambda }}{{\left( {3 - \lambda } \right)}}$$
(16)
$$\mu^{\prime}_{2} - 3\mu^{\prime}_{1} = \left( {1 - \alpha } \right)\left( {\lambda^{2} - 2\lambda } \right)$$
(17)

Putting the value of \(\alpha\) from Eq. (16), the Eq. (17) reduces to

$$\mu^{\prime}_{2} - 3\mu^{\prime}_{1} = \left( {1 - \frac{{\mu^{\prime}_{1} - \lambda }}{{\left( {3 - \lambda } \right)}}} \right)\left( {\lambda^{2} - 2\lambda } \right)$$
$$\frac{{\mu^{\prime}_{2} - 3\mu^{\prime}_{1} }}{{3 - \mu^{\prime}_{1} }} = \frac{{\lambda^{2} - 2\lambda }}{3 - \lambda } = M{\text{ (say)}}$$
(18)

Then

$$M = \frac{{\lambda^{2} - 2\lambda }}{3 - \lambda }$$
$$\lambda^{2} - \lambda \left( {2 - M} \right) - 3M = 0$$
(19)

Solving the quadratic Eq. (19) gives the estimate of \(\lambda\), which is then used in Eq. (16) to estimate \(\alpha\); of the two roots, the one yielding \(0 < \hat{\alpha } < 1\) is retained.
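The moment estimation above can be sketched in code by replacing the population moments with their sample counterparts. This is a minimal sketch under our own naming; the root-selection rule (keep the root giving an admissible \(\hat{\alpha}\)) is our own addition:

```python
import math

def mm_estimates(sample):
    """Method-of-moments estimates for ThIPD(lam, alpha),
    following Eqs. (16)-(19)."""
    n = len(sample)
    m1 = sum(sample) / n                      # first sample moment
    m2 = sum(z * z for z in sample) / n       # second sample moment
    M = (m2 - 3 * m1) / (3 - m1)              # Eq. (18)
    # Eq. (19): lam^2 - (2 - M) lam - 3 M = 0
    disc = (2 - M) ** 2 + 12 * M
    for lam in ((2 - M + math.sqrt(disc)) / 2,
                (2 - M - math.sqrt(disc)) / 2):
        alpha = (m1 - lam) / (3 - lam)        # Eq. (16)
        if lam > 0 and 0 < alpha < 1:
            return lam, alpha
    raise ValueError("no admissible moment solution for this sample")
```

By construction, the returned pair reproduces the first two sample moments through Eqs. (3) and (4).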

5.2 Maximum Likelihood Estimation (MLE)

The parameters \(\lambda\) and \(\alpha\) of Eq. (1) can be estimated by the method of maximum likelihood as follows:

Let \(z_{1} ,z_{2} ,z_{3} ,...,z_{n}\) be a random sample from three inflated Poisson distribution as given by Eq. (1) and let for \(i = 1,2,3,...,n\)

$$a_{i} = \left\{ {\begin{array}{*{20}c} {1, \, if{\text{ z}}_{i} = 3} \\ { \, 0, \, otherwise} \\ \end{array} } \right.$$

Then for \(i = 1,2,3,......,n\), Eq. (1) can be described as follows

$$P\left( {Z = z_{i} } \right) = \left[ {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{3!}} \right]^{{a_{i} }} \left[ {\left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{{z_{i} }} }}{{z_{i} !}}} \right]^{{1 - a_{i} }}$$

Hence the likelihood function, \(L = L\left( {\lambda ,\alpha ;z_{1} ,z_{2} ,z_{3} ,......,z_{n} } \right)\) will be

$$L = \prod\limits_{i = 1}^{n} {\left[ {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{3!}} \right]^{{a_{i} }} \left[ {\left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{{z_{i} }} }}{{z_{i} !}}} \right]^{{1 - a_{i} }} }$$
$$= \left[ {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{6}} \right]^{{n_{0} }} \prod\limits_{i = 1}^{n} {\left[ {\left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{{z_{i} }} }}{{z_{i} !}}} \right]^{{k_{i} }} }$$

where \(k_{i} = 1 - a_{i}\), \(n_{0} = \sum\limits_{i = 1}^{n} {a_{i} }\). Note that \(n_{0}\) represents the number of three’s in the sample.

Therefore,

$$\log L = n_{0} \log \left[ {\alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{6}} \right] + \left( {n - n_{0} } \right)\log \left( {1 - \alpha } \right) - \left( {n - n_{0} } \right)\lambda + \sum\limits_{i = 1}^{n} {k_{i} } z_{i} \log \lambda - \sum\limits_{i = 1}^{n} {k_{i} } \log \left( {z_{i} !} \right)$$
$$\frac{\partial \log L}{{\partial \alpha }} = \frac{{n_{0} \left( {6 - e^{ - \lambda } \lambda^{3} } \right)}}{{6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} }} - \frac{{\left( {n - n_{0} } \right)}}{{\left( {1 - \alpha } \right)}}$$
(20)

Similarly,

$$\frac{\partial \log L}{{\partial \lambda }} = \frac{{n_{0} \left( {3e^{ - \lambda } \lambda^{2} - e^{ - \lambda } \lambda^{3} } \right)\left( {1 - \alpha } \right)}}{{6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} }} + \frac{{\sum\limits_{i = 1}^{n} {k_{i} } z_{i} }}{\lambda } - \left( {n - n_{0} } \right)$$
(21)

Let,

$$p = \alpha + \left( {1 - \alpha } \right)\frac{{e^{ - \lambda } \lambda^{3} }}{6}$$
(22)

Now, let \(\frac{\partial \log L}{{\partial \alpha }} = 0\). Then from Eq. (20) and using Eq. (22)

$$1 - \alpha = \frac{{6\left( {n - n_{0} } \right)}}{{\frac{{n_{0} }}{p}\left( {6 - e^{ - \lambda } \lambda^{3} } \right)}}$$
(23)

And letting \(\frac{\partial \log L}{{\partial \lambda }} = 0\) from Eq. (21) and using Eq. (22)

$$\frac{{n_{0} }}{6p}\left( {3e^{ - \lambda } \lambda^{2} - e^{ - \lambda } \lambda^{3} } \right)\left( {1 - \alpha } \right) + \frac{{\sum\limits_{i = 1}^{n} {k_{i} } z_{i} }}{\lambda } - \left( {n - n_{0} } \right) = 0$$
(24)

Now, replacing \(p\) by its sample estimate, the proportion of threes in the sample, \(\hat{p} = n_{0} /n\), Eqs. (23) and (24) reduce to

$$1 - \alpha = \frac{{6\left( {n - n_{0} } \right)}}{{n\left( {6 - e^{ - \lambda } \lambda^{3} } \right)}}$$
(25)

and

$$\frac{n}{6}\left( {3e^{ - \lambda } \lambda^{2} - e^{ - \lambda } \lambda^{3} } \right)\left( {1 - \alpha } \right) + \frac{{\sum\limits_{i = 1}^{n} {k_{i} } z_{i} }}{\lambda } - \left( {n - n_{0} } \right) = 0$$
(26)

Using Eq. (25), Eq. (26) reduces to

$$\sum\limits_{i = 1}^{n} {k_{i} } z_{i} \left( {6 - e^{ - \lambda } \lambda^{3} } \right) + \left( {n - n_{0} } \right)\left( {3e^{ - \lambda } \lambda^{3} - 6\lambda } \right) = 0$$
$$C\left( \lambda \right) = 0$$
(27)

where \(C\left( \lambda \right) = \sum\limits_{i = 1}^{n} {k_{i} } z_{i} \left( {6 - e^{ - \lambda } \lambda^{3} } \right) + \left( {n - n_{0} } \right)\left( {3e^{ - \lambda } \lambda^{3} - 6\lambda } \right)\).

Hence Eq. (27) can be solved by any numerical procedure, say the Newton–Raphson method, to obtain \(\hat{\lambda }\) numerically, i.e. \(C\left( {\hat{\lambda }} \right) = 0\).

Similarly using Eqs. (22) and (25), \(\alpha\) can be estimated as

$$\hat{\alpha } = \frac{1}{n}\left[ {n_{0} - \frac{{\left( {n - n_{0} } \right)e^{ - \lambda } \lambda^{3} }}{{6 - e^{ - \lambda } \lambda^{3} }}} \right]$$
(28)

Therefore, the maximum likelihood estimates (MLEs) of the parameters \(\lambda\) and \(\alpha\) are obtained by solving Eq. (27) numerically for \(\hat{\lambda }\) and then evaluating Eq. (28) for \(\hat{\alpha }\).
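The two-step MLE procedure can be sketched as follows. We use bisection rather than Newton–Raphson for the root of \(C(\lambda)\) purely for robustness of the sketch (the choice of solver is ours, as are the function names and the bracketing interval):

```python
import math

def mle_estimates(sample, lo=1e-8, hi=60.0):
    """ML estimates for ThIPD: solve C(lam) = 0 (Eq. (27)) by bisection,
    then evaluate alpha-hat from Eq. (28)."""
    n = len(sample)
    n0 = sum(1 for z in sample if z == 3)     # number of threes
    s = sum(z for z in sample if z != 3)      # sum of k_i * z_i

    def C(lam):                               # left-hand side of Eq. (27)
        g = math.exp(-lam) * lam ** 3
        return s * (6 - g) + (n - n0) * (3 * g - 6 * lam)

    a, b = lo, hi
    if C(a) * C(b) > 0:
        raise ValueError("root not bracketed; widen [lo, hi]")
    for _ in range(200):                      # bisection
        mid = 0.5 * (a + b)
        if C(a) * C(mid) <= 0:
            b = mid
        else:
            a = mid
    lam = 0.5 * (a + b)
    g = math.exp(-lam) * lam ** 3
    alpha = (n0 - (n - n0) * g / (6 - g)) / n  # Eq. (28)
    return lam, alpha
```

At the returned pair, both score equations (Eqs. (20) and (21) with \(p\) replaced by \(\hat p = n_0/n\)) vanish.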

For the computation of the asymptotic variance–covariance matrix of the estimates, the second-order derivatives of the log-likelihood function are furnished here

$$\frac{{\partial^{2} \log L}}{{\partial \alpha^{2} }} = - \frac{{n_{0} \left( {6 - e^{ - \lambda } \lambda^{3} } \right)^{2} }}{{\left[ {6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} } \right]^{2} }} - \frac{{\left( {n - n_{0} } \right)}}{{\left( {1 - \alpha } \right)^{2} }}$$
$$\frac{{\partial^{2} \log L}}{{\partial \lambda^{2} }} = n_{0} \left( {1 - \alpha } \right)\left[ {\frac{{3\lambda e^{ - \lambda } \left( {2 - \lambda } \right)}}{{6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} }} - \frac{{\lambda^{2} e^{ - \lambda } \left( {3 - \lambda } \right)\left[ {6\alpha + \left( {1 - \alpha } \right)3\lambda^{2} e^{ - \lambda } } \right]}}{{\left( {6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} } \right)^{2} }}} \right] - \frac{{\sum\limits_{i = 1}^{n} {k_{i} } z_{i} }}{{\lambda^{2} }}$$
$$\frac{{\partial^{2} \log L}}{{\partial \lambda \partial \alpha }} = \frac{{6n_{0} e^{ - \lambda } \lambda^{2} \left( {\lambda - 3} \right)}}{{\left( {6\alpha + \left( {1 - \alpha } \right)e^{ - \lambda } \lambda^{3} } \right)^{2} }}$$

The asymptotic variance–covariance matrix of the maximum likelihood estimates of \(\lambda\) and \(\alpha\) for \(ThIPD \, \left( {\lambda , \, \alpha } \right)\), can be acquired by inverting the Fisher information matrix (I), given by

$$I = \left[ {\begin{array}{*{20}c} {E\left( { - \frac{{\partial^{2} \log L}}{{\partial \alpha^{2} }}} \right)} & {E\left( { - \frac{{\partial^{2} \log L}}{\partial \alpha \partial \lambda }} \right)} \\ {E\left( { - \frac{{\partial^{2} \log L}}{\partial \lambda \partial \alpha }} \right)} & {E\left( { - \frac{{\partial^{2} \log L}}{{\partial \lambda^{2} }}} \right)} \\ \end{array} } \right]$$

The elements of the above Fisher information matrix can be obtained as

\(E\left( { - \frac{{\partial^{2} \log L}}{{\partial \alpha^{2} }}} \right) = \left. {\left( { - \frac{{\partial^{2} \log L}}{{\partial \alpha^{2} }}} \right)} \right|_{{\alpha = \hat{\alpha },\lambda = \hat{\lambda }}}\), and so on.

The asymptotic distribution of the maximum likelihood estimator \(\left( {\hat{\lambda },\hat{\alpha }} \right)^{\prime}\) is given by

$$\left( {\begin{array}{*{20}c} {\hat{\lambda }} \\ {\hat{\alpha }} \\ \end{array} } \right)_{MLE} \mathop{\longrightarrow}\limits^{L}AN\left( {\left( {\begin{array}{*{20}c} \lambda \\ \alpha \\ \end{array} } \right),I^{ - 1} } \right), \, {\text{ as }}n \to \infty$$
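In practice, the expectations in \(I\) are replaced by the observed information evaluated at the MLEs, which can also be approximated numerically. The following sketch (ours; the finite-difference step `h` is an assumed tuning value) forms the negative Hessian of the log-likelihood by central differences, ordered \((\alpha, \lambda)\) to match the matrix \(I\) above, and inverts it for the covariance estimate:

```python
import math

def thip_loglik(lam, alpha, sample):
    """Log-likelihood of ThIPD(lam, alpha) for the given sample."""
    ll = 0.0
    for z in sample:
        pois = math.exp(-lam) * lam ** z / math.factorial(z)
        p = alpha + (1 - alpha) * pois if z == 3 else (1 - alpha) * pois
        ll += math.log(p)
    return ll

def observed_info(lam, alpha, sample, h=1e-4):
    """2x2 observed information (negative Hessian of log L) at (alpha, lam),
    approximated by central finite differences."""
    def f(a, l):
        return thip_loglik(l, a, sample)
    daa = -(f(alpha + h, lam) - 2 * f(alpha, lam) + f(alpha - h, lam)) / h ** 2
    dll = -(f(alpha, lam + h) - 2 * f(alpha, lam) + f(alpha, lam - h)) / h ** 2
    dal = -(f(alpha + h, lam + h) - f(alpha + h, lam - h)
            - f(alpha - h, lam + h) + f(alpha - h, lam - h)) / (4 * h ** 2)
    return [[daa, dal], [dal, dll]]

def inv2x2(m):
    """Inverse of a 2x2 matrix: the asymptotic covariance estimate."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]
```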

6 Simulation Study

In this section, a simulation study is conducted to assess the performance of the estimated parameters. To generate random numbers \(Z\) from \(ThIPD \, \left( {\lambda , \, \alpha } \right)\), we applied acceptance–rejection sampling [37]. Random samples of sizes \(n = 30, \, 50, \, 100{\text{ and }}200\) are generated with different combinations of true parameter values \(\lambda\) and \(\alpha\), and the MLEs are computed using the optim function of the R software. The bias and MSE of the parameters given in Table 1 are calculated using the following formulae.

$$Bias\left( {\hat{\theta }} \right) = E\left( {\hat{\theta }} \right) - \theta$$
$$MSE\left( {\hat{\theta }} \right) = E\left( {\hat{\theta } - \theta } \right)^{2}$$
where \(\hat{\theta }\) is the estimated parameter and \(\theta\) the true parameter, with \(\theta = \left( {\lambda ,\alpha } \right)\) and \(\hat{\theta } = \left( {\hat{\lambda },\hat{\alpha }} \right)\).
Table 1 Results of Simulation

Here \(r\left( { = {\text{number of replications}}} \right) = 1000\).
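Although the paper used acceptance–rejection sampling [37], the mixture form of Eq. (1) admits an equivalent and simpler direct scheme: with probability \(\alpha\) return 3, otherwise draw from Poisson(\(\lambda\)). A sketch (function names, seed and the Knuth Poisson sampler are our choices); samples produced this way, fed to the estimators over many replications, yield bias/MSE figures of the kind tabulated in Table 1:

```python
import math
import random

def rpois(lam, rng):
    """Knuth's Poisson sampler (adequate for moderate lam)."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1

def rthip(lam, alpha, rng):
    """One draw from ThIPD(lam, alpha): with probability alpha return 3,
    otherwise draw Poisson(lam) -- the mixture of Eq. (1)."""
    return 3 if rng.random() < alpha else rpois(lam, rng)

def simulate(lam, alpha, n, seed=2024):
    rng = random.Random(seed)
    return [rthip(lam, alpha, rng) for _ in range(n)]
```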

From the bias and MSE values of the simulation study given in Table 1, it is observed that the bias and MSE of the estimators are small and that, as the sample size increases, both gradually decrease, as expected.

Furthermore, we checked the normality of the MLEs by normal Q–Q plots for all the parameters of each run. One such Q–Q plot, obtained for the case \(\left( {\lambda ,\alpha } \right) = \left( {5,0.3} \right)\), is presented as a demonstration (Fig. 9). From Fig. 9 it is observed that the MLEs of all the parameters are approximately normally distributed.

Fig. 9
figure 9

Q-Q plots of different parameter estimates

7 Real-life Applications

The Covid-19 virus originated in Wuhan, China, and has ushered in a new world order [38]. This new world order required the global community to set aside cultural differences and brainstorm mitigating measures, especially for sustaining the economy (for details see [39,40,41]). Struck as if by a bolt from the blue, India was unprepared to withstand the onslaught of Covid-19. In India, the first case of Covid-19 was reported on 30th January 2020 [42]. The disease accelerated to such a level that it prompted the Government of India to enforce an emergency lockdown (a clampdown on almost all human transactions and activities in an emergency), the fallout of which has been discernible in an emphatic manner [43]. The lockdown in India lasted from 25th March 2020 to 7th June 2020 [44] in a phased manner. The first phase began on 25th March 2020 for a period of twenty-one days [45]. After the first lockdown, the next three phases were announced with conditional relaxations and restrictions: the second phase from 15th April 2020 to 3rd May 2020, the third phase from 4th May 2020 to 17th May 2020, and the fourth phase from 18th May 2020 to 31st May 2020 [45], with the fourth phase extended to 7th June 2020. The hardest hit by the lockdowns have been those living on the edge and on the margins: daily wage earners, private-sector employees who lost their jobs, farmers who could not find markets for their agricultural produce, migrant workers left stranded, large numbers of underprivileged students, and those who opted for reverse migration [43, 46]. The most discernible corollary has therefore been poverty and starvation, which in turn have sapped the vitality and jolted the psychological and cognitive make-up of the hugely dense Indian populace [43].
Although billed as a biomedical disease, Covid-19 has provoked negative cognitive responses, and Asian countries have unfortunately borne the brunt of the exponential growth of Sars-Cov-2 transmission in densely populated areas of internal migrants. There have been alarming instances of socially irresponsible behaviour and panic attacks among internal migrant workers, who are in desperate need of psycho-social support [47].

After completion of the fourth phase of lockdown, the Government of India started unlocking the nation (relaxing the imposed lockdown) in a phased manner, with restrictions confined to containment zones, from 8th June 2020 [48]. The first phase of unlocking, Unlocking 1.0, ran from 8th June 2020 to 30th June 2020 [48]. Covid-19 has had a diverse array of effects, the worst being suicides, primarily triggered by uncertainty about living from hand to mouth in all aspects of life. The first Indian suicide related to Covid-19 took place on February 12, 2020 [49], followed by two more such suicides [50]. In addition, the first Covid-19 related student suicide was reported on June 2, 2020 [51]. The major causes of these suicides were financial distress, fear of infection, freezing of employment opportunities, lack of freedom of movement, withdrawal, etc. [47, 49, 50].

The data set of 298 Covid-19 pandemic related suicide cases during Lockdown to Unlocking 1.0 in India is collected from the web portal https://www.kaggle.com/. The age and sex distribution of the individuals who committed suicide during Lockdown and Unlocking 1.0 is presented in Fig. 10, which shows that, throughout both periods, the highest percentage of suicides was committed by male individuals and by those in the age group 21–40. The causes of suicide during Lockdown and Unlocking 1.0 are presented in Fig. 11, where it can be observed that most suicides occurred due to financial distress and fear of infection. The occupations of the individuals who committed suicide during Lockdown and Unlocking 1.0 are presented in Fig. 12, where it can be observed that most of the individuals were migrant workers, labourers and private-sector employees.

Fig. 10
figure 10

Age and sex of the individuals who committed suicide during Lockdown and Unlocking 1.0

Fig. 11
figure 11

Causes of suicide during Lockdown and Unlocking 1.0

Fig. 12
figure 12

Occupation of the individuals who committed suicide during Lockdown and Unlocking 1.0

In order to study the pattern of the suicide cases, the Poisson distribution is used. Since, during Lockdown to Unlocking 1.0, the proportion of days with three (3) deaths is inflated relative to the other counts, the three-inflated Poisson distribution \(ThIPD\left( {\lambda ,\alpha } \right)\) is used for fitting the real-life data set of Covid-19 related suicides, alongside the zero-inflated Poisson distribution \(ZIPD\left( {\lambda ,\gamma } \right)\) [11], the zero–one inflated Poisson distribution \(ZOIP\left( {\lambda ,\gamma ,\psi } \right)\) [22] and the standard Poisson distribution \(PD\left( \lambda \right)\). The MLEs of the parameters of the different distributions are estimated using the optim function in R. The log-likelihood, Akaike information criterion (AIC), Bayesian information criterion (BIC) and Kolmogorov–Smirnov (KS) test with p-values are summarized in Table 2 for the number of suicide cases during the 98 days of Lockdown and Unlocking 1.0 in India during the Covid-19 pandemic.
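The model-comparison step can be sketched as follows. The actual suicide counts are not reproduced here, so a synthetic stand-in with genuine inflation at 3 is used, and the ThIPD pmf form is assumed as before; the Poisson MLE is the sample mean in closed form, the ThIPD MLE is obtained numerically, and \(AIC = 2k - 2\log L\), \(BIC = k\log n - 2\log L\):

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import poisson

rng = np.random.default_rng(7)
# synthetic stand-in for 98 daily counts: Poisson(2) inflated at the point 3
x = rng.poisson(2.0, size=98)
x[rng.random(98) < 0.3] = 3

# --- standard Poisson PD(lambda): MLE is the sample mean ---
ll_pd = np.sum(poisson.logpmf(x, x.mean()))

# --- ThIPD(lambda, alpha): numerical MLE of the assumed mixture pmf ---
nll = lambda t: -np.sum(np.log(t[1] * (x == 3)
                               + (1 - t[1]) * poisson.pmf(x, t[0])))
res = minimize(nll, x0=[x.mean(), 0.1], method="L-BFGS-B",
               bounds=[(1e-6, None), (1e-6, 1 - 1e-6)])
ll_th = -res.fun

def aic(ll, k):
    return 2 * k - 2 * ll          # smaller is better

def bic(ll, k, n):
    return k * np.log(n) - 2 * ll  # smaller is better

aic_pd, aic_th = aic(ll_pd, 1), aic(ll_th, 2)
bic_pd, bic_th = bic(ll_pd, 1, len(x)), bic(ll_th, 2, len(x))
```

On data with a genuine excess of threes, the two-parameter ThIPD beats the plain Poisson on both criteria despite its extra parameter, mirroring the pattern in Table 2.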

Table 2 Distribution of number of suicides per day during Lockdown to Unlocking 1.0 in 98 observed Days of India

From Table 2 it is seen that the AIC and BIC of ThIPD are smaller than those of PD, ZIPD and ZOIPD, that the p-value of the KS statistic is highest for ThIPD, and that the expected frequencies of ThIPD are closest to the observed frequencies.

In Fig. 13 the observed histogram and the estimated pmf’s of PD, ZIPD, ZOIPD and ThIPD are plotted, which also validates our findings, and in Fig. 14 the observed ogive and the estimated cdf’s of PD, ZIPD, ZOIPD and ThIPD are plotted for visual comparison.

Fig. 13
figure 13

Plots of observed histogram and estimated pmf of PD, ZIPD, ZOIPD and ThIPD for the number of suicides cases during the 98 days of lockdown and Unlocking 1.0 in India during Covid-19 Pandemic

Fig. 14
figure 14

Plots of observed Ogive and estimated cdf’s of PD, ZIPD, ZOIPD and ThIPD for the number of suicides cases during the 98 days of lockdown and Unlocking 1.0 in India during Covid-19 Pandemic

The proposed three-inflated Poisson distribution (ThIPD) thus provides the best fit to the data set under all criteria considered.

8 Likelihood Ratio Test

Since \(Poi\left( \lambda \right)\) and \(ThIPD\left( {\lambda ,\alpha } \right)\) are nested models, the likelihood ratio (LR) test is used to discriminate between them. The LR test is carried out to test the hypothesis \(H_{0} :\alpha = 0\), i.e., the sample is drawn from \(Poi\left( \lambda \right)\), against the alternative \(H_{1} :\alpha \ne 0\), i.e., the sample is drawn from \(ThIPD\left( {\lambda ,\alpha } \right)\). The value of the LR test statistic for the above data set is given below in Table 3.

Table 3 The value of the LR test statistic for the Covid-19 related suicide dataset

The value of the LR test statistic for the data set is 28.542, which exceeds the critical value at the 5% level of significance for one (1) degree of freedom, namely 3.841. Thus the evidence supports the alternative hypothesis that the sample comes from \(ThIPD\left( {\lambda ,\alpha } \right)\) and not from \(Poi\left( \lambda \right)\).
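The decision rule can be reproduced directly from the reported statistic; scipy's chi-squared distribution supplies the critical value and an approximate p-value:

```python
from scipy.stats import chi2

lr = 28.542                    # reported LR statistic (Table 3)
crit = chi2.ppf(0.95, df=1)    # 5% critical value, 1 df: about 3.841
p_value = chi2.sf(lr, df=1)    # upper-tail probability of the LR statistic
reject_h0 = lr > crit          # True: the data favour ThIPD over Poisson
```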

9 Conclusion

A three-inflated Poisson distribution (ThIPD) is proposed and its distributional properties and reliability characteristics are studied. A simulation study is conducted to examine the behaviour of the MLEs. The appropriateness of the fitted distribution is assessed through a goodness-of-fit test and information criteria. The usefulness of the proposed distribution is exemplified by the data on the number of suicides that occurred from Lockdown to Unlocking 1.0 in India. For the real-life data set of Covid-19 related suicides considered here, the proposed three-inflated Poisson distribution (ThIPD) provides a better fit than the other distributions considered, viz. ZIPD, ZOIPD and the standard PD, in terms of the model selection criteria, namely AIC and BIC, and the goodness-of-fit test, namely the KS test. The plots presented above also validate these findings. Moreover, the LR test indicates that the sample comes from ThIPD, not from PD. Thus the proposed distribution provides a better fit than the competing distributions.