Introduction

Infant mortality is the death of infants under the age of one. Infant mortality is measured by the infant mortality rate (IMR). It refers to any death occurring after birth but prior to the child’s first birthday. Infant mortality rate is an indicator of national health because it is linked to a number of factors including maternal health, health-care quality and access, socioeconomic conditions, and public health practices. The socioeconomic status of the family has a greater impact on infant survival than any other age group in the population [1,2,3].

In 2020, 2.4 million children died in their first month of life worldwide. Every day, approximately 6700 newborns die, accounting for 47% of all child deaths under the age of five, up from 40% in 1990. The number of newborn deaths worldwide has decreased from 5 million in 1990 to 2.4 million in 2020. However, the decline in neonatal mortality between 1990 and 2020 was slower than the decline in neonatal mortality among children under the age of five. In 2020, neonatal mortality rate were highest in Sub-Saharan Africa 27 (25–32) deaths per 1000 live births), followed by Central and Southern Asia, which had 23 (21–25) deaths per 1000 live births [4]. Children born in Sub-Saharan Africa are ten times more likely than children born in high-income countries to die within the first month of life. In 2020, nationally infant mortality rates will range from 1 to 44 per 1000 live births, with the risk of babies dying before the age of 28 days being about the lowest in the world. It happened 56 times [5, 6].

Worldwide, 50 % of the deaths occurred in six countries, including Ethiopia, and 1500 children died every day. The majority of children died from preventable or treatable causes such as birth complications, pneumonia, diarrhea, neonatal sepsis, and malaria [7, 8]. Every day, more than 704 children die from diseases that could be avoided. If current trends continue, more than 3,084,000 children will die by 2030 [9]. There have been regional variations in child mortality in Ethiopia. Child mortality rates in Ethiopia range from 39 per 1000 live births in Addis Abeba to 125 per 1000 live births in Afar. Infant mortality rates are higher. Infant mortality risks are higher during the first few months of life in particular. This also holds true in Sub-Saharan Africa [10]. According to the 2016 Ethiopian Demographic and Health Survey (EDHS), the infant mortality rate for the 5 years preceding the survey was 48 deaths per 1000 live births. Nearly 72% of under-five deaths occur before the age of one in our country. The magnitude of infant mortality varies greatly across the country [11].

To estimate the risk factor of infant mortality, various countries conducted studies [12,13,14,15,16]. There have been many small-scale surveys that investigated a specific set of variables. The analyses in these studies looked at binary logistic and survival factors [16,17,18,19]. While binary logistic regression undercounts the total number of deaths since multiple deaths are rolled into a single unit to meet the criteria for binary logistic regression, it provides sufficient information to study multiple child deaths. As a method of analysis, count regression was chosen in this study. The response variable is the number of infant deaths (count), and the main objective is to estimate how this number changes with the increase in explanatory variables. Modeling count data using Poisson regression is the most widely known method. Despite its underlying assumption of equality in distribution (i.e., the mean and variance are equal), its use is limited in some real-world cases with uneven distribution (i.e., the variance is greater than the mean or smaller than the mean). In the event of excessive variation, parameter estimates, standard errors, tests, and confidence intervals may be distorted. Over dispersion has a variety of causes. Excessive zero counts or censorship are two examples. Over-dispersed count data are common in many fields, prompting the development of statistical methodologies for modeling over-dispersed data [20, 21]. The negative binomial distribution resembles the Poisson distribution, except that it has a longer, fatter tail when the variance greater than the mean. Depending on the degree of over dispersion, the negative binomial model may capture more zeros than the Poisson model [22, 23]. Conversely, the model may be insufficient for empirical applications with zero values in the data. By allowing over dispersion, zero inflated models can model the excessive proportion of zero values. When the number of zeros is large, it performs better than the Poisson or negative binomial models [24]. Studies on infant mortality often used prevalence alone, Chi square associations, or logistic and survival modeling [15, 25,26,27,28,29]. Logistic regression divides the dependent variable into two categories, dead or alive, undercounting the rate of infant mortality. Using this approach, multiple child deaths will be considered as one death for the purpose of a logistic regression. A multi-index count model involves a two-part model with extensive (zero) and intensive (positive) margins, representing nonoccurrence and occurrence of death, respectively. Combining the zero-point mass distribution and the nonzero-point distribution results in a finite mixture, an alternative to the two-part process. As a solution, we proposed a two-part random effects model for infant mortality data. This study uses Multilevel Log Linear Modeling to estimate the risk factors for infant mortality in Ethiopia. A model’s main strength is that it accounts for survey design features such as clustering, and stratification, which could lead to inaccurate statistical estimations.

Contribution

As far as we know, this is the first study to use multilevel count regression models such as ZINB at a multi-regional level to assess the impact of various covariates on infant mortality. Furthermore, the estimation of the specific risk factors with infant mortality will help to prioritize interventions and reveal patterns that will help to improve outcomes. Finally, to provide researchers with detailed information about how to use over dispersed, under dispersed, and zero inflated regression models.

Methods

Study design, area and sampling

For this study, a cross sectional secondary data from the Ethiopian Demographic and Health Survey (EDHS) of 2016 were used. In response to a reasonable request, and after ensuring that the data was licensed, survey information was downloaded from the DHS website. EDHS 2016 is a part of the global level DHS venture that is funded by the USA agency for global development (USAID) and carried out by using the Ethiopian vital Statistical enterprise. The DHS is accomplished each 5 years, and the 2016 survey is Ethiopia’s fourth, overlaying all nine areas and two administrative city [11]. The Ethiopian Demographics and Health Survey (EDHS) program collected data from nationally representative samples of all age groups and key indicators. The survey collected data on socio-demographic, socioeconomic, and maternal variables. The stratified two-stage cluster sampling method was used to select study participants. In the 2016 survey, a total of 645 EA (202 urban, 443 rural) were chosen. The survey included 15,683 women of childbearing age and 18,008 households in these census areas. The Ethiopian Demographics and Health Survey Report contains detailed sampling methods, which are available on the Measure DHS website (www.dhsprogram.com) [11]. In this study, a total weighted sample of 10,547 mothers from reproductive age mothers who were interviewed about infant deaths in the preceding 5 years before the survey were included in this analysis for this study (Fig. 1).

Fig. 1
figure 1

Sample selection procedure

Study variable

Dependent variable

The number of infant deaths per mother, which refers to the death of an infant before his or her first birthday, was the dependent variable.

Independent variable

Possible predictors of infant death use such as region, mother’s age, father’s education level, mother’s education level, father’s occupation, mother’s occupation, family size, mother’s age at first birth, religion, source of water, toilet facility, wealth index, Sex of child, Marital status, Contraceptive use, birth order, preceding birth interval, child twin, birth weight, place of delivery antenatal visit, breastfeeding, and residence were potential predictors of infant death.

Statistical software

SPSS software version 23 was used for recording the secondary data, which was exported to R software version 3.4.1 and analyzed with R software.

Statistical models

Sometimes, the response of interest in epidemiological studies involves a count; such as number of infant deaths. In these instances, a multilevel count regression model is most appropriate due to the nature of the data set. A multilevel count regression model can account for lack of independence across levels of nested data (in this case, individual mothers nested within regions). Multilevel models used to assess whether the effects of predictors vary from region to region. A number of multilevel count regression models are applied to this study, including Poison, Negative Binomial, Zero-Inflated Poisson, Zero-Inflated Negative Binomial, Hurdle Poisson, and Hurdle Negative Binomial.

Multilevel Poisson regression model

In practice, although the value of goodness of fit tests as measured by R Square indicates that the model can adequately explain the response variable, there is a violation of the assumption that such problems as heteroscedasticity and dependence error exist. The multilevel Poisson regression model for a count Yij for ith individual mothers in jth region can be written as:

$${y}_{ij}\left/ {\lambda}_{ij}\right.= poisson\left({m}_{ij},{\lambda}_{ij}\right)$$

The standard link function for the Poisson distribution is the logarithm, and we get

$${\mu}_{ij}=\ln \left({\lambda}_{ij}\right)$$
$${\mu}_{ij}=\ln \left({\lambda}_{ij}\right)={\beta}_0+{\sum}_{h=1}^k{\beta}_h{X}_{hij}+{u}_{oj}+\sum_{i=1}^k{u}_{ij}{X}_{hij},\kern1.5em \mathrm{h}=1,2,3,\dots, \mathrm{k}\kern1.75em$$

The first part of the equation, \({\beta}_0+{\sum}_{h=1}^k{\beta}_h{X}_{hij},\) is called the fixed part of the model. The second, \({u}_{oj}+\sum_{i=1}^k{u}_{ij}{X}_{hij},\) is called the random part. Where, λij= the expected/mean number of death for ith mother in jth region, Xhij = x1ij, x2ij, x3ij, …, xkij represent the first and the second level covariates, β = β0, β1, β2, …. βk are regression coefficients, u0j, u1j, u2j, …. . , ukj , are the random effect of model parameter at level two (region level) and assume a distributed with means zero and has a multivariate normal distribution with a constant variance matrix [30, 31].

Multilevel negative binomial regression model

NB regression model is a popular approach to analysis over-desperation data. Often, because of the hierarchical study design or the data collection procedure, over-desperation and lack of independence may occur simultaneously, which render the standards NB model inadequate. Multilevel negative binomial regression model is given by [32]:

$$\Pr \left({Y}_{ij}={y}_{ij}\right)=\frac{\varGamma \left({y}_{ij}+v\right)}{y_{ij}!\varGamma (v)}\ X\ \frac{v^v{\mu_{ij}}^{\ast {y}_{ij}}}{{\left(v+{\mu_{ij}}^{\ast}\right)}^{v+{y}_{ij}}},\kern2em {y}_{ij}=0,1,2,\dots \dots \dots$$

With mean and variance given, respectively, as follows:

$$E\left({Y}_{ij}\right)={\lambda_{ij}}^{\ast }=\log \left({\mu}_{ij}\right), and\ \mathit{\operatorname{var}}\left({Y}_{ij}\right)={\lambda_{ij}}^{\ast }+\alpha {\left({\lambda_{ij}}^{\ast}\right)}^2$$

Where

$${\mu}_{ij}={\beta}_{0j}+{\beta}_{1j}{X}_{1 ij}+{\beta}_{2j}{X}_{2 ij}+\dots \dots \dots +{\beta}_{kj}{X}_{kij}$$

Multilevel ZIP regression model

When the non-zero part is a discrete random variable, a popular approach to analyze count data with excess zeros is to use a zero-inflated Poisson (ZIP) regression model. Often, because of the hierarchical study design or the data collection procedure, zero inflation and lack of independence may be present simultaneously as a con-sequence of the inherent correlation structure and underlying heterogeneity. A multi-level ZIP regression model developed to handle correlated count data with extra zeros. A multi-level ZIP regression model incorporating random effects to account for the data dependency [31, 33]. Consider the hierarchical state where Yij for ith individual mothers in jth region. The observations may be taken to be independent between regions, but certain within-household and within-individual correlations are anticipated, which can be modelled explicitly through random effects attached to the linear predictors:

$$\log \left({\lambda}_{ij}\right)={X_{ij}}^{\prime}\beta +{X_{ij}}^{\prime }U$$
$$\log \left(\frac{\varnothing_{ij}}{\left(1-{\varnothing}_{ij}\right)}\right)={\xi}_{ij}={Z_{ij}}^T\gamma +{V}_{0j}+\sum_{h=1}^k{V}_{hj}{Z}_{hij}$$

Here, the covariates Xij and Zij appearing in respective Poisson and logistic components are not necessarily the same and β and γ are corresponding vectors of regression coefficient [34, 35].

Multilevel zero-inflated negative binomial (ZINB) regression model

Count data with excess zeros and dispersion occur in many areas. Although zero-inflated negative binomial (ZINB) regression is useful for modeling such data, over dispersion, zero-inflation, and correlation may occur concurrently due to the hierarchical study design or the data collection procedure [34, 36]. A multilevel ZINB regression incorporating random effects to account for data dependency and over-dispersion is used. Let yij(i = 1, 2, …, m; j = 1, 2, ……, ni) be a count say, the death of the ith infant in the jth province/region. Then the two level ZINB regression model can be expressed in vector form as:

$$\log \left(\frac{\varnothing }{\left(1-\varnothing \right)}\right)=\xi = A\alpha +{R}_ww+{R}_ss$$
$$\log \left(\mu \right)= X\beta +{R}_uu+{R}_vv$$

Where, A, X, Rw, Rs, Ru, and Rv are design matrices. For simplicity of presentation, the random effects w, s, u, and v, are assumed to be indipendant and normally distributed with mean zero and variance δw2, δs2, δu2, and δv2,respectively [34].

Multilevel hurdle regression model

The Poisson-hurdle model is composed of two sub models in such a way that the total likelihood is decomposed into two separate likelihoods that can be maximized independently. The proportion of zero counts is described by a binary model with covariate dependence, which is commonly modeled with the logit link, but the probit and log–log links are also used. Second, the positive counts are represented by a zero-truncated Poisson distribution. Thus, the Poisson-hurdle model is a non-overlapping mixture of two Poisson distributions: a degenerate Poisson with mean zero and a truncated Poisson defined on positive integers [37]. A popular approach to the analysis of such data is to use a Hurdle Poisson regression model [38]. Let yij(i = 1, 2, …, m; j = 1, 2, ……, ni) be a count say, the death of the ith infant in the jth province/region The probability that a particular count occurs at the jth region for the ith infant under the Poisson-hurdle model is given by:

$$\Pr \left({Y}_{ij}=0\right)={p}_{ij}$$
$$\Pr \left({Y}_{ij}=k\right)=\left(1-{p}_{ij}\right).\frac{e^{-{\mu}_{ij}\left(\frac{{\mu_{ij}}^k}{k!}\right)}}{1-{e}^{-{\mu}_{ij}}}$$

Further, the probabilities pij and the means of the truncated Poisson distribution, μij, might depend on covariates. In the Poisson-hurdle model each individual belongs to the one of the two components of the mixture, i.e. zero-part or truncated Poisson part. The repeated measures Poisson-hurdle model implies that the same individual might belong to the zero part at one time point, while it can change the class at another visit, e.g. when a positive count is observed. Therefore, the repeated measures Poisson-hurdle model is defined for each time point [37].

Parameter estimation and model comparison

The most popular methods used to estimate the parameter of a count regression model is maximum likelihood estimation technique (MLE). MLE estimation technique determine the parameters by maximize likelihood of the sample data. MLE methods are versatile and apply to most models and to different types of data [32]. Model comparison was made using deviance information criteria (DIC), Akaike’s Information Criterion (AIC), and Bayesian’s Information Criterion (BIC) [30].

Result

A total of 10,547 women were included in the study, with 7570 (71.8%) of the mothers having no infant deaths and 18 (0.2%) having lost 6 infants. Further examination of the number of infant deaths revealed that the variance (0.994) is greater than the mean (0.526), indicating over-dispersion. This indicates that the data could be better fitted by count data models that account for excess zeroes (Table 1).

Table 1 Frequency distribution of number of infant death per mothers

Socio-demographic and economic characteristics of infant death per mother in Ethiopia

Somalia has the highest (0.69) mean infant death per mother, while Addis Abeba has the lowest (0.089). Rural areas had a higher mean number of infant death per mother (mean = 0.593) than urban areas (mean = 0.23). The infant death per mother with a primary education is (0.57), which is higher than for mothers with a secondary or higher education (0.184). Similarly, the average number of infant deaths for the poorest mothers (0.65) is higher than for the richest mothers (0.34). As a result, a woman living in a better and standard economic level had a lower mean number of infant deaths than a woman living in a high income economic level. Male infant death per mother is higher (0.53) than female infant mortality. Infants born with birth orders five and above had the highest mean (0.95) number of deaths. The infant death rate per mother is high for infants born at more than one interval of birth (0.43). It was also discovered that families who did not use toilets had the highest mean number of infant deaths per mother, which was around 0.6(Table 2).

Table 2 Characteristics of variables with infant deaths per mother in Ethiopia, 2016 EDHS

Model selection criteria

The results in Table 3 were used to identify the appropriate count model among the six commonly considered count models using AIC, BIC, and Deviance statistics. A lower value for any of these criteria indicates a better fit or that the model is a parsimonious model. Therefore, multilevel ZINB regression model has a lower deviance, AIC, and BIC value. As a result, we conclude that the multilevel ZINB regression model fits the data better than the other count regression models (Table 3).

Table 3 Model selection criteria for the multilevel count regression models

Risk factors associated with infant mortality in Ethiopia

When counting infant deaths per mother, there are often excess zeros. In such cases, zero-inflated Poisson regression (ZIP) and zero-inflated negative binomial regression (ZINB) can be used, but zero-inflation and correlation may occur simultaneously because of hierarchical study design. It is still possible to use ZIP or ZINB to overcome these challenges. In this study, we overcome these problems through multilevel ZINB regression. To account for both the hierarchical structure of the dataset and excessive zeros, a ZINB model with random effects (mixed ZINB) was used as shown in Table 4, and two random effects parameters were estimated. Compared to mothers who live in rural areas, the expected log count for infant death for mothers who live in urban areas was 1.22 (exp (0.2)) times the expected log count for mothers who live in rural areas. These findings indicate that the risk of infant death decreases with each additional year a mother has lived at the time of giving birth. Consequently, it follows that if the age of the mother at the time of birth increases by one unit, the log count of infant deaths will decrease by 0.98 (exp (− 0.022)). As well, for each unit increase in age at first birth, the expected log count of infant mortality decreased by 0.97(exp(− 0.03)). The expected log count of infant mortality for infants born to birth order 5 and above is 5.16(exp(1.641)) times that of infants born to birth order 1. It has been estimated that infant mortality is 2.45 (exp(0.9)) times higher among those born between 2 and 4 than those born first. In comparison with breast-feeding mothers, the expected log count of not breast-fed faced a higher mortality rate 1.433(exp(0.36)) than the expected log count of mothers who breastfed their babies. Compared to mothers without contraceptive use, the expected log count of infant death is 0.86 (exp(− 0.2025)) times lower. In comparison to single birth mothers, the expected log count of infant deaths was 1.37 (exp(0.314)) times higher with multiple birth mothers. Additionally, through the use of inflated models, it was shown that the odds of the infant mortality rate per mother being zero when living in an urban area were 1.584(exp(0.46) times higher than when living in rural areas. The estimated odds of infant deaths with non-breastfed mothers were 1.433(exp(0.36)) times higher than those with breastfed infants (Table 4).

Table 4 The results of random coefficients ZINB model on infant mortality in Ethiopia, 2016 EDHS

Discussion

A multilevel log linear model was used in this study to estimate the risk factors related to infant mortality in Ethiopia, using a 2016 EDHS dataset. Estimating these risk factors are useful for policymakers in identifying improvements to the lives of people and tracking progress toward achieving the MDGs. The results of this study shed light on the causes of infant mortality in Ethiopia. Compared to infants living in rural areas, the study estimated odds that the infant mortality rate per mother would become zero when the infant was in an urban area. It is evident from this result that urban mothers experience a lower infant mortality rate than their rural counterparts. This result in lined with the previous study that [13, 39,40,41]. Infant mortality is a major issue, even in rural India [42], the location of the infant’s residence (rural versus urban) may also affect their health. In rural areas, lacking infrastructures such as electricity, running water, toilets, and basic health care facilities could cause infants to become seriously ill. The problem stems from the lack of adequate health facilities in rural areas. As a result, rural areas do not have the same level of infrastructure as urban areas, which prevents mothers from accessing better health care in urban areas.

According to the age of the mother at the time of birth, infant deaths are more likely to occur when the age of the mother at birth is younger than an increase in the mother’s age. Both the mother and the child may be at risk of being in danger when many births occur at a young age This finding is supported by a study conducted in low- and middle-income countries [43], Ethiopia [42], Kenya [13], and Nepal [41] which found that both teenage and elderly maternal ages predispose infants to increased mortality risks during infancy, with the risk being higher for teenage mothers. Additionally, researchers found a higher risk of death for children whose mothers were younger, confirming previous research findings that suggest child mortality can be influenced by birth age in a number of ways [43,44,45,46]. Most importantly, the covariate age of the mother at a time of birth was discovered to significantly increase the risk of death of an infant born at a young age of the mother and there is evidence that prematurity increases the risk of infant mortality among mothers [42, 43]. The increased risk of child death among younger mothers is due to immature reproductive systems and less stability in dealing with the complexities of childbirth.

This study also discovered that household size has a significant impact on the number of infant deaths. The risk of infant death increased significantly as household size increased. Then the covariate household size was also identified statistically that infants born to mother live with large family members have more likely to die, this study in line with [17, 25, 47,48,49], infants born to mothers with a large household size were more likely to die than infants born to mothers with a small household size. Too many siblings at home may result in insufficient and improper care for infants. The infant death is often correlated with infant birth weight. For every unit increase in infant birth weight, the expected number of infant deaths decreases. This study supported the previous study [13, 50, 51]. Smaller than average sized infants are at a higher risk than larger than average sized infants because their bodies are weaker, making them more susceptible to illness that could kill them later in life. Possibly, decreasing birth weight increases the risk of infant mortality [43]. The highest mortality risk is associated with births of order six or higher with a short preceding interval. There is a greater mortality risk for infants born to mothers with birth orders second and above than for infants born to mothers with birth orders one. It indicates that as birth order increases the likelihood death of infant death become wide large. The variable birth interval has a significant effect on infant mortality, however. This study was consistent with the done by [13, 23, 25], and shows that the risk of mortality was higher for children born in a higher birth order. It is evident that the child death rate falls with increasing birth intervals. The chance that a higher income mother will produce a healthy child is greater, however the number of infant death is lower. The findings of this study were consistent with previous research [21,22,23,24,25,26], which found that infants born to poor mothers are at a higher risk of dying than those who are richest. Perhaps it is because these mothers can afford good nutrition for their babies and have time to spend with them.

According to this study, infants who have not been breastfed face a greater risk of death than mothers who are breastfeeding. This study supported a previous study [52, 53], which suggested that the risk of death for infants who were not breastfed was higher than for those who were breastfed. It is probable that breastfed babies are protected from childhood illnesses [42]. The risk of infant mortality is reduced when parents are more educated. It confirms the finding from the previous study: the higher the level of education of the parents, the lower their infant death [13, 23, 43, 54, 55]. This is because more educated parents are more aware of the importance of providing better care for their child and his wife during the antenatal and postnatal periods. Moreover, his study found that the risk of an infant death from a contraceptive-using mother was significantly lower than the death of an infant from a non-contraceptive-using mother, which agrees with previous research findings, use of contraceptive is the issue for reducing infant mortality and birth interval [17, 23]. In a similar study, vaccinated infants had a lower risk of mortality than non-vaccinated infants [48]. This could be because those mothers who use contraception seek advice from a physician in a health or non-health institution. A study we conducted found that multiple-birth infants are at greater risk of death than single-birth infants. Having multiple births is associated with numerous negative birth outcomes, which are linked to a higher rate of infant mortality. The risk of infant mortality is higher with multiple births than with single births; similar to previous studies, this study determines that birth type affects the risk of infant mortality [23, 25, 56]. Multiple births increase the economic burden of the family, which can lead to poor nutrition and health care for the infant.

Strengths and limitations

Although the EDHS is a nationally representative, large-scale dataset, there may be some limitations to this analysis. Participants were asked to recall events from the 5 years before the survey, and they may have forgotten some details. Another limitation is the cross-sectional nature of the data, which prevents the identification of causal relationships between outcome and exposure variables. As well, since this study is based on self-reported values taken from a nationally representative survey, it is subject to several sources of error: estimates for specific countries may change over time as a result of these limitations, and these should be taken with caution. However, the EDHS is a nationally representative dataset and is designed and deployed rigorously by the Centers for Disease Control and Prevention using a global framework. This allows the results to be generalized throughout the country, even with these limitations. Due to the use of similar instruments across countries, there will also be an opportunity to compare the results internationally.

Conclusion

This study aimed to estimate the risk factors associated with infant mortality in Ethiopia using multilevel log linear model. As a result, we discovered an excess of zeroes in the dataset as well as several sources of heterogeneity. This study included 10,547 mothers, and the results show that 71.8% of the mothers did not experience infant death, while 0.2% experienced at least 6 infant deaths per mother in their lifetime prior to the survey. Multilevel Poisson model analysis determined ZINB to be most parsimonious to estimate the risk factors associated with infant mortality in Ethiopia. According to the findings, the number of infant deaths per mother varies by region. Residents’, age of the mother at the time of birth, household size, mother’s age at first childbirth, breast feeding, child’s birth weight, contraceptive use, birth order, wealth index, father education level, multiple birth, and birth interval were found to be significant risk factors for infant mortality. The findings of this study suggest that, more efforts are needed to expand educational programs to educate mothers about the advantages of contraception, breast feeding, and spacing birth intervals in order to reduce infant death per mother.