Zero-Inflated Models for Count Data: An Application to Number of Antenatal Care Service Visits

The risk of maternal death in developing countries is projected to be one in 61, while for developed countries it is estimated to be one in 2800. Antenatal care is a protective obstetric health care system aimed at improving the outcome of the pregnant fetus by routine pregnancy monitoring. One of the most important functions of antenatal care is to offer health information and services that can significantly improve the health of women and their infants. 6450 pregnant women from Ethiopian Demographic and Health Survey of 2016 were used to analyze the determinants of the barriers in number of antenatal care service visits among pregnant women in Ethiopia. The data were found to have excess zeros (35%); thus several count data models such as Poisson, Negative Binomial, Zero Inflated Poisson, Zero Inflated Negative Binomial and Hurdle regression models were modeled and fitted. From the exploratory analysis the results showed that among those eligible pregnant women, it was seen that 2240 (34.7%) of them did not visit antenatal care service during their periods of pregnancy months. The visualization of data using scatter plot depicts that all of the variables selected for modeling have an influence on the event of not visiting antenatal care cervices while each of these variables had opposite slope in non-zero number of such events in their respective categories. To select the model which best fits the data, models were compared based on their Akaike information criterion value by using the simulation study. The simulation experiment revealed that models for zero-inflated data such as; Zero Inflated Poisson, Zero Inflated Negative Binomial and Hurdle were models that fitted the data better than the classical models Poisson and Negative Binomial. Each of these zero-inflated models was compared using Voung test and Hurdle model was better fitted the data which was characterized by excess zeros and high variability in the non-zero outcome than any other zero-inflated models. In this study, maternal education, partner education level, age of mothers, religion of mothers and wealth index are major predictors of antenatal care service utilization. Through simulation experiment, it was found that Zero Inflated Poisson, Zero Inflated Negative Binomial and Hurdle models were better fitted zero-inflated data than Poisson and Negative Binomial. Voung test suggests that Hurdle model was better fitted zero-inflated (ZI) data than any other zero inflated models and therefore, it was selected as the best parsimonious model.


Introduction
Antenatal care is a preventive obstetric health care program aimed at improving the outcome of the maternal fetus by routine pregnancy monitoring [1]. The risk of maternal death in developing countries is projected to be one in 61, while for developed countries it is estimated to be one in 2800 [2]. Complications during pregnancy and childbirth are the leading cause of death and disability among women of reproductive age in developing countries. There are an estimated 529,000 maternal deaths each year, of which 99% occur in developing countries [3]. Millions of women lack access to adequate care during pregnancy in these countries. ANC is actually provided by 71 percent of women worldwide; more than 95 percent of pregnant women in industrialized countries have access to ANC. 69% of pregnant women in sub-Saharan Africa have at least one ANC visit [4].
Although various studies have examined various risk factors for antenatal care (ANC) and the country's use of delivery services, the Ethiopian Ministry of Health stated in 2014 that substantially 45 percent of Ethiopian women received one or more ANC visits, less than 11 percent received professional care and 19 percent received postnatal care [5]. Inadequate coverage and under-use of modern health care services in developing countries are major reasons for poor health [6].
A rising problem is this imbalance in women's health and well-being in the developing and developed world. Essential determinant of safe delivery is antenatal care (ANC) [7]. Although it is not possible to predict such obstetric emergencies via antenatal screening, women may be trained to identify and respond on signs that lead to potentially serious conditions [8]; this is one solution to maternal mortality reduction [9]. One of ANC's most important functions is to provide health information and services that can boost women's and their infants ' health significantly [10]. Additionally, ANC seems to have a positive impact on the use of postnatal health care services during pregnancy [11]. Empirical evidence suggests that four visits are adequate for uncomplicated pregnancies and more are needed only in complications [12]; thus, the World Health Organization is currently recommending at least four ANC visits throughout pregnancy.
This study, will provide a valuable information about count data models when the assumption of the standard Poisson regression is violated (when there is greater variability in the response counts than one would expect if the response distribution truly were Poisson). Counting data in many studies may have an excess of zeros. In the same process as other positive counts, zero counts may not occur. Lambert [13] discussed this issue and suggested a zero-inflated Poisson (ZIP) model, which Greene [14] also proposed to apply defects in manufacturing. There may be no equality of mean and variance between zero-inflated count results. In such situations, it is necessary to take into account over-dispersion (or under-dispersion). Several researchers have proposed a variety of mixture and compound distribution of Poisson, such as double Poisson, Poisson log-normal, Poisson-geometric, Poisson-negative binomial, Poisson-modified, Poisson-Pascal, Quasi-Poisson, mixed-Poisson, Poisson-binomial, and so on, to tackle the mean inequality and variance of the Poisson distribution.
However, it is suggested that a generalized Poisson distribution could take into account Poisson distribution over-dispersion [15]. It contains two parameters 1 and2 where2 can be positive, zero, or negative. If 2 is zero then it becomes the standard Poisson. Still there is a mean variance relationship but equality assumption becomes flexible. Later Consul and Famoye [16] improved the model for regression analysis. An extension of the generalized Poisson distribution proposed by Famoye and Singh [17] is zero-inflated generalized Poisson (ZIGP). In the presence of over-dispersion in the results, if Poisson mean has a gamma distribution, negative binomial model can be preferred. The zeroinflated negative binomial (ZINB) model discussed by Mwalili [18] is a natural extension of the negative binomial model to accommodate excess zeros in the results. Using truncated models is another popular approach to modeling the excess zeros in count data. Hurdle model, which is developed by Cragg [19] is an example of truncated models for count data.
In this study, three zero-inflated models, ZIP, ZINB and Hurdle model were considered, to observe whether there is any effect of proportion of zeros in the performance of the models with given overall rate of the counts. Two classical data count models were also considered for comparison, Poisson and negative binomial. This study's main motivation was that the real life data was often skewed with zero counts and may or may not be over-dispersed (or under-dispersed). In this study, data generating processes were used from Poisson inflated at zero by excessively varying zero counts (from 30% to 80%). The study tried to investigate how the model fit changes for different rate of counts with different proportion of zeros in the simulation experiment. The literature contains numerous applications of ZI models. In this research, these models were designed and fitted on the number of antenatal care visits to Ethiopia.

Sources of Data and Variables in the Model
The data used for this study was taken from the 2016 EDHS, a nationally representative survey of fertile age women (15-49 years of age) groups taken from Ethiopia's Central Statistical Agency (CSA). This survey was the fourth compressive survey designed to provide estimates for the health and demographic variables of interest for the whole urban and rural areas of Ethiopia as a domain. Women who had 9 months pregnancy during the survey interview were included in the analysis. The study includes 15,684 women of fertile age group in the country, who had at least 9 months of pregnancy period during survey in 2016 EDHS. There were cases in which information on the relevant variables was missing and these cases were excluded from the analysis. Thus, the analysis presented in this study on the factors affecting ANC visit was based on the 6,450 women in the fertile age . The study's response variable was the number of pregnant women's antenatal care visits from early pregnancy to their 9-month pregnancy period. Thus, this paper attempted to include socioeconomic, demographic and environmental related factors that are assumed as a potential determinants for the barriers in the number of antenatal care service visits, adopted from literature reviews and their theoretical justification. Detailed descriptions of these factors are presented in Table 1.

Statistical Models
In this section, a non-exhaustive list of commonly used regression models for count and zero-inflated count data were briefly outlined. The following five regression models, Poisson regression, Negative Binomial regression, zero-inflated Poisson (ZIP) regression, zero-inflated negative binomial (ZINB) regression, hurdle regression are used to model count and zero-inflated count data.

Poisson Regression Model
Given that the dependent variable (number of ANC visits) is a non-negative integer, most of the recent thinking in the field is the use of Poisson regression model as a starting point. In a standard Poisson regression model, the probability of pregnant women i having y i antenatal care service visits until her nine (9) months of pregnancy period (where y i is a non-negative integer) is given by: where p(y i ) is the probability of (9) month pregnant women entity i having y i antenatal care service visits in nine (9) months of pregnancy period and i is the Poisson parameter for pregnant women i, which is equal to pregnant women entity i ′ s expected number of antenatal care service visits in nine (9) months, E(y i ).

Negative Binomial Regression Model
The negative binomial (or Poisson-gamma) model is a Poisson model extension to address possible data over-dispersion. This model assumes that the Poisson parameter follows a gamma probability distribution. The negative binomial model is derived by rewriting the Poisson parameter for each observation i where i = exp( X i + i ) where exp( i ) is a gamma-distributed error term with mean 1 and variance K. The addition of this term allows the variance to differ from the mean as: The probability mass function for the negative binomial distribution is: The parameter p is the probability of success in each trial and it is calculated as: p = r i +r where, i = exp(y) is mean of the observations and r is inverse of the dispersion parameter (i.e r = 1 k ). The Poisson regression model is a limiting model of the negative binomial regression model as k approaches zero, which means that the selection between these two models is dependent upon the value of k. The parameter k is often referred to as the overdispersion parameter. Although the negative binomial model can solve an over dispersion problem, it may not be enough flexible to handle when there are excess zeros. In such cases, one can use the zero-inflated models as well as hurdle models to solve the problem.

Zero-Inflated Models
There are cases where the predominance of zero counts is a major source of overdispersion and the resulting over-dispersion can not be adequately modeled with negative binomial model. In such cases, it is possible to fit the data with zeroinflated Poisson or zero-inflated negative binomial model [21]. According to Lord [22], Zero-inflated techniques permit the researcher to answer two questions that pertain to low base rate-dependent variables: (a) what predicts whether or not the event occurs, and (b) if the event occurs, what predicts frequency of occurrence? In other words, two regression equations are created: one predicting the occurrence of the count and the other predicting the occurrence of the count [22]. Moreover, zero-inflated models have statistical advantage to standard Poisson and negative binomial models in such a way that they model the preponderance of zeros as well as the distribution of positive counts simultaneously [23].

Zero-Inflated Poisson (ZIP) Regression Model
ZIP model operates on the principle that the excess zero density that cannot be accommodated by a traditional count structure. A splitting regime that models a woman who is not visited for ANC versus a woman who visited for ANC during her pregnancy time determined by a binary logit or probit model that account for the likelihood of an ANC visitation entity being in zero or non-zero states [24,25]. In one regime (R 1 ) the outcome is always a zero count, while in the other regime (R 2 ) the counts follow a standard Poisson process. Suppose that: Thus, the occurrence of Y i follows the following distributions: The mean and variance of ZIP distribution are exp( . It is clear that this reduces to the standard Poisson model when i = 0.

Zero-Inflated Negative Binomial (ZINB) Regression Model
Zero-Inflated Negative Binomial (ZINB) regression model also assumes two distinct data generation processes. The ZINB distribution is a mixture distribution, similar to ZIP distribution, where the probability i for excess zeros and with probability (1 − i ) the rest of the counts followed negative binomial distribution. Note that the negative binomial distribution is a mixture of Poisson distributions, Annals of Data Science (2021) 8(4):683-708 which allows the Poisson mean to be distributed as Gamma, and in this way over dispersion is modelled. The ZINB distribution is given by: The mean and variance of the ZINB distribution are E( , respectively. It is to be noted that this distribution approaches the ZIP distribution and the negative binomial distribution as → ∞ and i → 0 , respectively. If both 1∕ and i ≈ 0 then ZINB distribution reduces to Poisson distribution [18,26].

Hurdle Regression Models
Hurdle regression is also known as two-part model which is originally developed by Mullahy [23]. Mullahy says, "The idea behind the hurdle formulations is that a model of binomial probability controls the binary outcome of whether a variable count has a zero or a positive outcome. If the realization is non-zero (positive), the "hurdle is crossed", and the conditional distribution of the positives is governed by a truncatedat-zero count data model." The attraction of Hurdle regression is that it reflects a twostage decision-making process in most human behaviors and therefore has an appealing interpretation [27]. For instance, it is pregnant mother's decision whether to contact the doctor's office and to make the initial visit. However, after the pregnant mother's first visit, doctor plays a more important role in determining if the pregnant mother needs to make follow-up visits. Therefore, in a regression setting, a Logit or Probit regression may represent the first decision, while a truncated Poisson or Negative binomial regression may evaluate the second. In addition, in each decision process, different explanatory variables are allowed to have different impacts.

Hurdle Poisson (HP) Regression Model
The most popular formulation of a Hurdle regression is called Logit-Poisson model, which is the combination of a Logit regression modeling zero versus non-zero outcomes and a truncated Poisson regression modeling positive counts conditional on nonzero outcomes.Its probability density function is given as:

Hurdle NB Regressions Model
We consider a hurdle negative binomial (HNB) regression model in which the response variable y i has the distribution: where ( > 0) is a dispersion parameter that is assumed not to depend on covariates [23]. In addition, we suppose 0 < i < 1 and Where Ψ i and B i are i th row of covariate matrix Ψ and B and as well as and are the independent variables in the regression model [27].

Voung Test
The Vuong test is a non-nested test based on a contrast between the expected probabilities of two non-nested models [28]. For example, using Voung test comparisons can be made with ordinary Poisson between Zero-inflated count models, or Zeroinflated negative binomial against ordinary negative binomial model. This test is used for model comparison.

Exploratory Analysis of ANC Visits
Only those women who were in their reproductive Age (15-49) have been considered. Table 2 presents the descriptive statistics of the variables.

Why it is Important to Model Zero Counts and Positive Counts Separately
From the policy making view point, it is important to investigate why some mothers never visit antenatal care while some mothers experienced such events. These causes of not attending ANC may be influenced by socioeconomic characteristics in individual level and household level, such as, mother's health seeking behavior, partner level of education, accessibility of health services, and strength of the health system of that region. In this study we were considered the socioeconomic characteristics in individual level and household level to find the causal association with ANC visit.
In this study three zero-inflated count data models and two classical count data models have been considered. Table 2 presents the number ANC visit of 6450 women surveyed in EDHS 2016. The data seems to be a good candidate to be fitted by zero-inflated count data models where nearly 35% of the women never experienced ANC visit. The descriptive statistics given in Table 2 below shows the number and percentage of ANC visits that the pregnant mothers in the sample have encountered in their nine months of pregnancy period. It can be seen that 2240 (34.7%) of the pregnant mothers have not visited ANC service during their periods of pregnancy months, whereas 313 (4.9%) of them visited only once, 522 (8.1%) of them visited twice, 1073 (16.6%) visited three times, 1014 (15.7%) visited four times and the proportion of the rest counts can be expressed in a similar ways. Table 3 below presents summary statistics of the variables that are assumed to affect the number of ANC visits and its distributions for each levels of the variables. Accordingly, 3971 (61.6%) of the mothers are not educated, 1713 (26.6%) of mothers are in primary education, 495 (7.7%) of mothers are in secondary education and 271 (4.2%) of mothers are in higher education. From Fig. 1, age of mothers may have effect on the degree of not experiencing ANC visits. Women with higher age are less likely to experience ANC visits compared to the other age groups. Also not visiting antenatal care may be influenced by exposure to media. The proportion of mothers attending antenatal care for those who do not exposed to media (36%) is slightly higher than those mothers exposed to media (24%). However, both variables, mothers age and exposure to media, may have influence on experiencing ANC visits with opposite slope for both variables. Figure 2 shows women from developing region (Afar and Somali) accounts for higher proportion (60% and 65% respectively) of not attending antenatal care service, whereas percentage of not visiting antenatal care for women residing in the Mother's place of residence may have effects on the degree of not experiencing ANC visits. In EDHS 2016 survey, we have found that around 40% of rural mothers did not attend antenatal care at all while only 10% of mothers from urban area did not visit antenatal care. Also mother's employment status may have effect on the proportion of not attending ANC visit. Unemployed mothers have higher proportion (37.5%) of not attending antenatal care compared to employed mothers. Also,

3
Annals of Data Science (2021) 8(4):683-708 Fig. 1 Percentage of women not visiting antenatal care by age and exposure to media: how zero counts differ than the rests Fig. 2 Percentage of mother's not visiting antenatal care by region: how zero counts differ than the rests the proportion of mothers who attended ANC visit showed similar trend with opposite slope (Fig. 3). This finding was motivational to employ zero-inflated count data models where factors affecting zero counts can be modeled separately. Figure 4 shows that women whom their religion is Orthodox have experienced lower proportion (20%) of not visiting antenatal care compared to women that followed traditional religion which have substantially higher proportion (76%) of not visiting antenatal care. Also, the slope for the percentage of women who experienced ANC visits are showing a trend with opposite slope.
Mother's level of education may have effects on the degree of not experiencing ANC visits. In EDHS 2016 survey, we have found that 50% of mothers who have no education did not attend antenatal care at all while only 1% of mothers with higher education did not visit antenatal care. Similar pattern is found for partner education level. Also, mothers who attended ANC showed similar trend with opposite slope (Fig. 5). This finding was also motivational to employ zero-inflated count data models where factors affecting zero counts can be modeled separately.
Wealth index (also known as asset quintile) is another factor which have substantial effect on ANC visits. Women who are from those households that belongs to the poor 50% according to the wealth index had higher experience of not visiting antenatal care than the women who are from those households that belongs to the rich quintile (13%). Again, the slope for the percentage of women who experienced ANC visits, are showing a trend with opposite slope of women who had at least one experience of visiting antenatal care (Fig. 6). There is also slight proportional difference of not attending antenatal care between married and single women. Married women had a bit higher proportion (36.5%) of not attending antenatal care compared to single women. Similar trend follow for marital status as that of wealth index for women who had at least one experience of visiting antenatal care which shows an opposite slope.

The Simulation Experiment on Zero-Inflated Count Data Generated from Zero-Inflated Poisson Model
In this study, zero-inflated count data which accounted three covariates with random values for and was simulated. The underlying interest of doing this was to see the performance of different count data models for different parameter values. Three popular zero-inflated count models, ZIP, ZINB and Hurdle models were considered, and classical count data models, Poisson and Negative Binomial regression model. Our assumption was zero-inflated count data models will have superior performance in model fitting if proportion of zeros is large enough (say, more than 30 % than the rest of the counts) regardless of the values of the parameters and . Three binary covariates were considered, which were generated from binomial distribution with success probability 0.5, 0.6, 0.7 respectively. Random values for the parameters within a specified boundary in each trial of the simulations were considered. = (0.30 to 0.7, 0.02 to 0.69, − 1.70 to − 1.20, 0.30 to 0.91) and = (0.04 to 0.28, 0.01 to 0.94, 0.9 to 1.12, − 0.93 to − 0.63) were considered. In each trial it takes a random set of parameters to generate i and p i where = exp( X) and = logit(p) . The response variable y i was generated from zero inflated Poisson model. The trial was repeated for 1000 times for sample size of 100. ZIP, ZINB and Hurdle regression models using zeroinfl() and hurdle() functions through pscl package were fitted using R.
In this study, we were interested to explore how the available zero-inflated count data models behave with different sets of parameters. AIC statistic was considered to compare efficiency of each models. Figure 7 shows that each zero-inflated models have superior performance than the classical count data models, that is, Poisson and negative binomial models. And it does not affected by neither nor p for any of the zero-inflated models. Moreover, if lambda increases but proportion of zeros remain high then zero-inflated models perform much better than the classical models.
As we can see from Fig. 7 the AIC value for classical models, Poisson and Negative Binomial is too high compared to the zero-inflated models which depicts the appropriateness of zero-inflated models for ZI data. Among the three zero-inflated models, their performance are indistinguishable. In fact, ZIP and ZINB have identical performance as the dispersion was one for data generated from Poisson with inflated zeros. But ZINB is sensitive to convergence issue. However, Hurdle models also perform along with the rest of the two zero-inflated models. Also, from Fig. 8, two histogram of ZI count data generated from Poisson and negative binomial for sample size 1000 with same mean 7, dispersion 1 and 8, respectively, and proportion of excess zeros 35% have shown. It is hard to differentiate between two graphs. This motivated us to check model efficiency of zero-inflated models using Voung test.

Parameter Estimates and Interpretation
We have fitted ZIP, ZINB and Hurdle regression models on the number of ANC visit data as around 35% of the mothers in EDHS 2016 survey have never experienced antenatal care visit. The data seems to be a good candidate for the ZI models. All the three ZI models also can be modeled with different link functions, namely logit, probit, cloglog (complementary log log), and Cauchy to account the zero counts. For this study we have considered logit link. The link functions neither improved the model nor have practical inclination with this kind of study. For positive counts we have considered Poisson and negative binomial distribution for ZIP and ZINB models, respectively.
Hurdle model has a bit more flexibility. We have discussed it earlier that it is a two component model and zero counts can be modeled separately with the mentioned link functions while the positive counts can be modeled using Poisson, negative binomial or geometric distribution. To determine which distribution works better for Hurdle model, we performed Wald test (also log-likelihood ratio test, though both produce same conclusion) with different distribution for positive counts and zero counts in Hurdle model. We also had to determine which variables can take account of excess zeros. As we have shown in pervious section all of the variables selected for modeling have influence on the event of antenatal care visit of a mother while each of these variables have opposite slope in number of such events of a mother. Such exploration motivated us to consider all variables for excess zeros and also for positive counts. We also fitted classical count data models, Poisson and Negative Binomial to compare our results. Table 4 shows the parameter estimates of each models with associated standard error. Table 4 shows that mother's level of education, partner level of education, age of mothers, exposure to media and wealth index are those factors that motivate mothers to follow antenatal care visit compared to their respective reference group. Age of mothers, mother's religion and place of residence are factors that hinder mothers to attend antenatal care. All the ZI models showed a consensus on that with slightly different estimate of the coefficients but with same directions. Notice that ZIP and ZINB with logit link have almost similar estimates of the parameters. It might be due to the fact that there is nearly unit dispersion in the data. On the other hand, based on the Voung test, the Hurdle model with negative binomial distribution for positive counts and logit link for the zero counts is the best model that fitted ZI data very well. Estimates of the parameters for zero counts is only available for ZI models. Mother's level of education, partner level of education, and wealth index (the lower the better), are the significant covariates for a mother not to have zero antenatal care visit.
While age of mothers, mother's religion and place of residence are the responsible covariates for a mother not attending antenatal care. Marital status is not significant in both zero and positive counts while exposure to media is only significant for positive counts and employment status is insignificant for positive counts for zero-inflated models. Notice that, Hurdle model have different sign for the estimates parameters for zero counts than the ZIP and ZINB. This is because of the way the models are defined. The zero hurdle component represents the likelihood of a positive count for the hurdle model, whereas the zero-inflation component estimates the likelihood of experiencing a negative count from the point mass element for the ZIP and ZINB models (Table 5).

Model Comparison
To compare the performance of each models, we may use Voung test as the models are non-nested [28]. Table 6 shows that Hurdle model is the superior one than the rests. Also ZIP and ZINB has almost similar performance and Voung test inferred that these two models does not have significant difference. It might be a reason of statistical insignificance of the estimate of dispersion parameter, log (theta).

Hurdle Model Parameter Estimate
To find the best model, we have considered the Voung test and hurdle regression model was chosen as the most parsimonious model which fits the data better than the other possible candidate models. The hurdle model suggested that educated pregnant mothers have a higher probability of visiting ANC service and a higher Table 4 Estimation of coefficients for positive counts using different count data models for number of ANC visit expected number of non-zero visits than uneducated pregnant mothers. The estimated odds that the number of ANC visits become non-zero with educated (mothers with higher education level) is e 0.149 = 1.160 times the estimated odds for uneducated mothers. Place of residence is significantly associated with decreased odds of at least one ANC visits.  That means the estimated odds that mothers from rural area visit at least one ANC is e −0.297 = 0.743 times estimated odds of those mothers from urban. Wealth index is also another interesting factor that associates to number of ANC service. The estimated odds that the mothers belongs to rich wealth index visit antenatal care is e 0.123 = 1.130 times odds of mothers whose their wealth index is poor. Mothers with age 25-34 and 35-49 is less likely to have a non-zero counts of ANC service in comparison to mothers with age 15-24. Mothers who belongs to Catholic, Protestant, Muslim and Traditional religious group are less likely to experience at least one ANC service visit in comparison to mothers from Orthodox religion. Also note that the model showed that partner education level has significant effect on mothers ANC service. Mothers whose their partner is illiterate are less likely to attend ANC service compared with mothers whose their partner is literate. As we can see from Hurdle model, marital and employment status of mothers are insignificant factors that do not associated with ANC service of mothers (Table 7).

Discussion
This study was aimed at modeling the determinants of antenatal care visits in Ethiopia. As a preliminary analysis, assortments of summary statistics were employed to explore the association between the response variable of interest and available covariates. It should be well-known that there is inconsistency in the conclusion from the analysis of various summary statistics. Therefore,the analysis was extended to other statistical methods by taking into account zero-inflated count data. The data were then analyzed using two model families, classical models (Poisson and NB) for count data, and zero-inflated models (ZIP, ZINP and Hurdle) for zero-inflated count data.
From simulation experiment, we have seen that zero-inflated models, ZIP, ZINB and Hurdle, are consistent over the changes of the model parameters compared with classical models which agrees with actual data set of ANC visit. Specification of the correct model is relevant. Sometimes over-dispersion of a data may not be significant if the percentage of zeros is too high and in such case ZIP and ZINB have nearly identical estimate of the parameters. But ZIP and ZINB do not fit the data well, if there is over-dispersion with moderate percentage of zeros. Hurdle model has a higher flexibility to fit a model with mixture of distribution for zeros and   [29]. Number of ANC service of a mother was a good candidate to apply zero-inflated count data models. From simulation study, we have found that the ZI models have better performance than the Poisson and Negative Binomial models in terms of AIC statistic. Although ANC shows increasing trend over time in Ethiopia, still it is low compared to developed countries. Finding from this study revealed women education was significant predictor of ANC visit in which educated pregnant mothers have a higher odds of visiting ANC service and a higher expected number of nonzero visits than uneducated pregnant mothers. This finding agrees with study conducted by Nisar and White [30]. Place of residence is significantly associated with decreased odds of ANC visits for rural women. That means women in urban areas were more likely to visit ANC than rural women.
This finding consistently agrees with study conducted by Kwast and Liff [31]. This is because, health facilities are more accessible in urban than rural areas. Findings from this study showed that age was a significant predictor of ANC visit in which older women visited ANC more than younger ones. This finding agrees with study carried out by Dairo and Owoyokun [32]. In line with a similar study by Dairo and Owoyokun [32], we found wealth index and religion to be associated with ANC visit such that women in the richest category and with Orthodox * * * P < 0.001 , * * P < 0.01 , * P < 0.05 , S.E. = Standard error, Ref = Reference group

Conclusions
In conclusion, the ANC service utilization rate in Ethiopia was lower than the national figures available to date. It was worth nothing that majority of the mothers who attend ANC did not receive adequate number of visits recommended by the World Health Organization. Furthermore, maternal education, partner education level, age of mothers, religion of mothers and wealth index were major predictors of ANC service utilization. Therefore, efforts to bring about changes in these major predictors at individual and community level through behavioral change communication are recommended. In this study, through simulation experiment, it was found that ZIP, ZINB and Hurdle models were better fitted the data than Poisson and negative binomial. Voung test shows that Hurdle model was better fitted the data which is characterized by excess zeros and high variability in the non-zero outcome than any other ZI models and therefore it was selected as the best parsimonious model.