Introduction

Neonatal mortality is the death of a live-born baby within the first 28 days of birth. For the selected households, neonatal mortality was collected from children aged 0–28 days and women aged 15–49. The neonatal period is a significant 4-week period in human life because it carries a great risk of mortality1. In comparison, the probability of dying after the first month and before reaching age 1 was estimated at 11 deaths per 1000, and the probability of dying after reaching age 1 and before reaching age 5 was estimated at 10 deaths per 1000 in 20192. The average daily mortality rate during the neonatal period is close to 30-fold higher than during the postnatal period (1 month to 1 year of age)3.

Globally, 4 million neonates die every year before they reach the age of 1 month4. Approximately two-thirds of all neonatal deaths worldwide occur in only 10 countries. Even though Ethiopia is among the countries with the largest percentage decline in the neonatal mortality rate from 1990 to 2013, 3% of global neonatal deaths (84,000 in 2013) occurred in the country, which places it among the top six countries, following India, China, Pakistan, Nigeria, and Bangladesh, with the highest number of neonatal deaths4. Moreover, the most recent growth reports on global trends in neonate mortality have shown alarmingly slow progress in curbing mortality rates among neonates, the slowest being in sub-Saharan Africa5.

The reduction in neonatal mortality rates has been slower than both under five and child mortality rates after the first month of life6. The good news is that neonatal mortality is on the decline globally. The world’s neonatal mortality rate fell from 33 deaths per 1000 live births in 1990 to 21 per 1000 in 2012. All regions showed drops, with lower percentage reductions in South Asia and sub-Saharan Africa (39% and 28%, respectively) than the other regions, even though the smaller regional reductions represent significant progress. The overall result was a reduction in global neonatal deaths from 4.6 million in 1990 to 2.9 million in 20127.

The average neonatal mortality rate (NMR) in low-income countries (36/1000) was nine times higher than the average NMR in high-income countries (4/1000) and 1.5 times the global NMR (24/1000). According to the World Health Organization, 140 million babies are born worldwide, 90% in low-income countries and 10% in high-income countries. Approximately 99% of neonatal deaths occur in low-income countries and 1% in high-income countries; every minute, seven babies die worldwide (415 babies every hour)3.

In Ethiopia, infant mortality declined by 39% over the 15 years between the 2000 and 2011 EDHS, from 97 to 59 deaths per 1000 live births. Under-five mortality declined by 47% over the same period, from 166 deaths per 1000 live births to 88 deaths per 1000 live births. Neonatal mortality has also decreased from 49 deaths per 1000 live births according to EDHS 2000 to 39 deaths per 1000 live births in EDHS 2005; it remained stable at 37 deaths per 1000, as reported in EDHS 20118.

The causes of neonatal death may also vary within the same country depending upon the socioeconomic status of the regional population and access to health care services. Maternal health before, during and after pregnancy. Conditions during labor, delivery and postnatal care of babies play a significant role in decreasing neonatal mortality9. In Ethiopia, many studies were conducted about neonatal mortality using different statistical models. In this study, the response variable is the number of death occurrences, which is limited to non-negative values. There are two problems with applying the usual linear regression model to these data. The first problem in many count data distributions is positively skewed with many observations having 0 values. The second reason is that the regression model will likely produce negative predicted values, which are theoretically impossible.

Hence, we attempted to identify the determinants of neonatal mortality in Ethiopia, focusing on socioeconomic, demographic, and environmental factors using count regression models.

Neonatal mortality is generally regarded as an important national indicator of health because it is particularly sensitive to general structural factors, like socio-economic development and basic living conditions. This study's results identified risk factors indicating the need for more intensified prenatal care and improved transport of high-risk neonates, especially in rural areas.

Participants and methods

This cross-sectional study used about 10,641 live births born within the 5 years preceding EDHS 2016 from September to December 2015. This study is approved by the ethical review committee at Bahir Dar University, College of Science. All procedures were performed in accordance with the ethical standards of the Bahir Dar University ethics review committee declaration. No prospective studies with human participants or animals were performed by any authors for this article. Given the cross-sectional nature of this work, the need to obtain informed consent was waived for the individual participants included in the study, following the standards of the Bahir Dar University ethical review committee.

Methods

The data source used in this study was the 2016 Ethiopia Demographic and Health Survey, conducted in Ethiopia Central Statistics Agencies (CSA) http://www.dhsprogram.com. All neonates in Ethiopia born within the 5 years preceding the survey were the source population for this study. In contrast, all neonates in Ethiopia who were in the selected enumeration areas (EAs) were considered as the study population.

A two-stage stratified cluster sampling technique was selected from a population and housing census frame for EDHS 2016. In the first stage, 645 clusters (202 in urban and 443 in rural areas) were selected with probability proportional to the cluster size and independent selection in each sampling stratum. A household listing operation was carried out in all the selected enumerations from September to December 2015. In the second stage of choice, a fixed number of 28 households per cluster were selected with an equal probability of systematic selection from the newly created household list. Finally, a nationally representative sample of 16,650 suitable households and 15,683 women was interviewed10. All neonatal mortality of women in Ethiopia born within the preceding 5 years of 2016 EDHS was included, while among women who did not experience neonatal mortality in the household, two EAs were not included initially in the EDHS coordinate file.

Statistical data analysis

Different statistical techniques have been used in the analysis of collecting data. However, the method of data analysis depends on the nature of the variables included in the study and the data type of the basic variables. Depending on the above point, the study used descriptive and inferential statistics. The outcome variable of this study was neonatal mortality (number of death) and the independent factors for the outcome variable were socioeconomic variables (residence, region, source of water supply, toilet, wealth index, and mothers occupation), neonatal factors (sex of a child, birth order, a child is a twin), health care service factors (breastfeeding, place of delivery, contraceptive, number of antenatal visit during pregnancy, distance to health facility) and maternal factors (educational attainment, marital status, sex of households and mother’s age at child birth).

Count regression models

Count regression models were developed to model data with integer outcome variables. These models can be employed to examine the occurrence and frequency of occurrence. The most popular model for count data is the Poisson model, which is based on the property that the mean and variance of the dependent variable are assumed to be equal11. However, this is not always the case, as the variance sometimes exceeds the mean. This is referred to as over dispersion12. Over-dispersion can be modeled using a negative binomial (NB) regression model, but more models accounting for over-dispersion exist. The negative binomial regression model assumes a gamma distribution for the Poisson mean with variation over the subjects12.

Furthermore, the response variable can be observed to show excess zero counts, contrary to what is expected, based on Poisson or negative binomial distribution. According to Ref.13, this implies that the count data are zero-inflated. Zero-inflated models allow for over-dispersion as well as modeling zero-inflated count data. The frequently used models for zero-inflated count data are zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB). This study employed count regressions to identify the determinant factors of neonatal mortality.

The ZIP model may often fail to fit such data over dispersion concerning the Poisson distribution. We extend the ZIP mixed regression model to the ZINB mixed regression model. The zero-inflated negative binomial (ZINB) regression model is an extension of the regression model. If the number of zeroes in the count distribution was excessive, then the ZIP or ZINB model more accurately fit the data than the negative binomial or Poisson model. ZINB regression is used for count data that exhibit over dispersion and excess zeros14. The over-dispersed data are characterized by “excess zeros”, excessively large outcomes or both. Therefore, the ZINB model accounts for “excess zeros” and extra heterogeneity in a positive outcome. The ZINB distribution is a general model for counts that nests the Poisson, NB and ZIP models15. The probability density function of a zero-inflated negative binomial distribution is a simple modification of the ZIP.

Suppose \(yi\) is the number of neonatal deaths per household; then, the probability mass function of ZINB is given by:

$$P\left({Y}_{i}={y}_{i}\right)=\left\{\begin{array}{c}{\Pi }_{i}+\left(1-{\Pi }_{i}\right){\left(1+{\varvec{\delta}}{{\varvec{\mu}}}_{{\varvec{i}}}\right)}^{\frac{-1}{{\varvec{\delta}}}}; if y=0\\ \left(1-{\Pi }_{i}\right)\frac{\mathbf{ \Gamma }({{\varvec{y}}}_{{\varvec{i}}}+\frac{1}{{\varvec{\delta}}})({\varvec{\delta}}{{{\varvec{\mu}}}_{{\varvec{i}}}{\varvec{i}})}^{{{\varvec{y}}}_{{\varvec{i}}}}}{\mathbf{ \Gamma }\left({{\varvec{y}}}_{{\varvec{i}}}+1\right)\mathbf{ \Gamma }\left(\frac{1}{{\varvec{\delta}}}\right)}{\left(1+{\varvec{\delta}}{{\varvec{\mu}}}_{{\varvec{i}}}\right)}^{{{\varvec{y}}}_{{\varvec{i}}}}+\frac{1}{{\varvec{\delta}}}if {y}_{i}>0\end{array}\right.,\quad where; \; 0\le {\Pi }_{i}\le 1,$$

where; \({\mu }_{i}\) is the mean of the underlying negative binomial distribution, \({\Pi }_{i}\) is the logistic link function and \(\delta\) is the over dispersion parameter. The ZINB distribution reduces to the ZIP distribution when the dispersion parameter \(\delta\) →0. The ZINB model is a special case of a two-class finite mixture model such as the ZIP model with mean and variance16.

$$E({y}_{i})= (1-{\Pi }_{i}){\mu }_{i} \; \mathrm{ and },$$
$$var\left({y}_{i}\right)= \left(1-{\Pi }_{i}\right){\mu }_{i}\left(1+{\Pi }_{i}{\mu }_{i}\right).$$

The maximum likelihood parameter estimation method was used to determine the parameters that maximize the probability (likelihood) of the sample data. MLE methods are versatile and apply to most models and different data types. The generalized linear models (GLM) have no closed-form of the maximum likelihood function. To approximate the MLEs of GLM, we rely on the Newton–Raphson algorithm. The log-likelihood functions of β is:

$$\begin{aligned} l\left( {\beta ,\phi } \right) & = \sum \left\{ {\frac{{y_{i} \theta_{i} - b\left( {\theta_{i} } \right)}}{\phi } + c(y_{i} ,\phi } \right\} \\ & = \sum l_{i} (\theta_{i} ,\phi ) = \sum l_{i} , \\ \end{aligned}$$

where; \(\mathrm{\theta i}=\uptheta \left({\mathrm{X}}^{\mathrm{^{\prime}}},\upbeta \right)=\uptheta (\mathrm{\eta i})\).

The MLEs of β are obtained by maximizing the log-likelihood functions L(β, ϕ). Where θ is a known and monotone function, the likelihood function of GLM depends on β only through the linear predictor of η.

Results

Among the 10,641 live births in this study, approximately 364 (3.4%) neonates were died. Of the experienced neonatal mortality, 316 (3%), 36 (0.3%) and 12 (0.1%) were one, two and three children died per mother, respectively. Therefore, there are a large number of zero outcomes. However, large observations (a large number of neonatal mortality per mother) are less frequently observed. The mean number of neonatal deaths per mother was 0.04, and the variance was 0.052. Therefore, the variance in neonatal mortality is greater than the mean. This greater variance indicates the dispersion of the data (Table 1)

Table 1 Distribution of neonatal mortality per mother.

Experienced neonatal mortality in urban was 40 (0.376%) from these 38 (0.357%), 2 (0.0187%) number of child mortality per mother were died at one and two neonatal periods, respectively. Regarding neonatal mortality in rural areas, 324 (3.045%) of 278 (2.612%), 34 (0.319%), and 12 (0.113%) children died per mother were at one, two and three neonatal times, respectively. Therefore, there was high neonatal mortality in rural areas compared to urban areas in one, two and three neonatal periods.

Concerning regions, Somali, Oromiya and SNNPR had the highest proportion of neonatal mortality 69 (0.648%), 50 (0.470%) and 38 (0.357%), respectively), while the Addis Ababa city administration had the lowest proportion of neonatal mortality 10 (0.094%).

In Somali region, 59 (0.554%), 4 (0.037%) and 6 (0.056), Oromiya 46 (0.432%), 4 (0.037%) and 0 (0.0%) and SNNPR 35 (0.330%), 0 (0.0%) and 3 (0.0282%) neonatal mortality were occurred at 1, 2 and 3 neonatal periods respectively (Table 2).

Table 2 Socioeconomic variables in the study.

Of the total neonatal mortality, 236 (2.218%) were male, and 128 (1.203%) were female. Of the male neonates, 207 (1.945%), 24 (0.225%), and 5 (0.047%) death were occurred at one, two and three neonatal time, respectively. In female neonates, 128 (1.203%) were 109 (1.024%), 12 (0.113%), and 7 (0.066%) children, mortality per mother, happened in one, two and three neonatal periods, respectively. Therefore, male 236 (2.218%) had greater than 128 (1.203%) female neonatal mortality (Table 3).

Table 3 Neonatal factors.

The total number of neonatal mortality where places of delivery at home were 260 (2.443%), the public sector was 91 (0.855%), and the private sector was 13 (0.122%). From delivery at home 221 (2.077%), 27 (0.254%) and 12 (0.113%), at public sector 84 (0.789%), 7 (0.066%) and 0 (0.0%), and private sector also 11 (0.103%), 2 (0.019%) and 0 (0.0%) number of children per mothers died one, two and three neonatal period respectively. Therefore, delivery at home has the highest neonatal mortality compared to the private and public sectors of delivery.

The total number of neonatal deaths not experiencing breastfeeding was 221 (2.077%), and 143 (1.34%) were experiencing breastfeeding. From not experiencing breastfeeding 195 (1.832%), 20 (0.210%), and 6 (0.056%) and experiencing breastfeeding 121 (1.137%), 16 (0.150%), and 6 (0.056%) children died in one, two and three neonatal periods respectively. Therefore, 221 (2.077%) women who did not experience breastfeeding had greater neonatal mortality than 143 (1.344%) women who experienced breastfeeding. From our study findings, we had observed a difference between facilities-based birth and home-based birth in terms of neonatal mortality, which indicates about 221 (2.1%) death has occurred in home-based delivery services. In comparison, 95 (0.9%) of death occurred in facilities-based birth delivery in the first neonatal period. This difference implies neonatal mortality was higher in home-based birth delivery compared to facilities-based birth delivery in all neonatal periods (Table 4).

Table 4 Health care service factors.

Of the total number of neonatal deaths for uneducated mothers, 6838 (64.261%) were primary, 2678 (25.167%) were secondary, and more than 1,125 (10.572%). Child mortality was about 218 (2.049%), 28 (0.263%), 9 (0.085%) among uneducated mothers in one, two and three neonatal periods, respectively. Similarly, among primary education mothers, there were about 73 (0.686%), 4 (0.037%) and 3 (0.028%) death of neonatal at one, two and three neonatal period, respectively, and among mothers whose education were secondary and above 25 (0.235%), 4 (0.037%) and 0 (0.0%) number of died children per mother occurred at one, two and three neonatal time respectively. There was a decreasing trend in neonatal mortality concerning the mother's educational level (Table 5).

Table 5 Demographic (maternal) factors.

The model selection methods include information criteria AIC, BIC and log-likelihood values. The models are described to fit the data. AIC, BIC and the corresponding log-likelihood value of each model were presented in Table 6. The Poisson, negative binomial, zero-inflated Poisson regression models had the largest value of AIC and BIC, demonstrating a poor fit to the data. The remaining ZINB model had smaller AIC, BIC and the largest log-likelihood value. Therefore, ZINB models are appropriate and preferred for the number of Neonatal Mortality per mother.

Table 6 Model selection criteria for the count regression model.

According to the result of Table 7, in ZINB model, the variables breastfeeding, child twin, number of antenatal visits during pregnancy, the number of birth order, place of residence, sex of household head and region are statistically significant factors of neonatal mortality per mother in Ethiopia.

Table 7 Parameter estimates for the ZINB model for the count data.

The average number of neonatal deaths experienced during breastfeeding was lower (IRR = 0.38) than that experienced by mothers who did not experience breastfeeding their child. The average neonatal mortality difference in rural residences was increased by IRR = 3.99 times compared to the urban residence of mothers. The average neonatal mortality difference in multiple births was increased by IRR = 8.53 times compared with a single birth. The average neonatal mortality differences at antenatal visits (1–4) and (5–10) during pregnancy were decreased by IRR = 0.48 and IRR = 0.72, respectively, compared to those at no antenatal visits during pregnancy. The average neonatal mortality difference in the female household head was decreased by IRR = 0.66 times compared to that in the male household head. The average number of neonatal mortality differences in the first, second up to the third number of birth order has decreased by (IRR = 0.38, IRR = 0.48) times, respectively, as compared to six and above the number of birth order. The average neonatal mortality difference in Dire Dawa and Harari increased by IRR = 3.085 and IRR = 2.53, respectively, compared to that in the Tigray region (Table 7).

The number of neonatal mortalities in the zero-inflated model was modeled by including the covariates antenatal visits during pregnancy, breastfeeding, sex of a child, distance to health facilities and mother’s occupation, which had statistically significant effects on the probability of being in the zero number of neonatal mortalities. The estimated average number of neonatal mortalities that became zero when experiencing breastfeeding was lower by IRR = 0.045 than when not experiencing breastfeeding, while the other variables remained constant. The average number of neonatal deaths becomes zero when having a female child is lower by IRR = 0.49 compared to a male child, keeping other variables constant. The average number of neonatal deaths becomes zero, having (1–4) fewer antenatal visits during pregnancy (IRR = 0.42) than no antenatal visit during pregnancy, keeping other variables constant. The average number of neonatal deaths becomes zero, with a large problem higher by IRR = 0.8 compared to a non-large problem to reach health facilities, keeping other variables constant. The average number of neonatal deaths becomes zero when having employed mothers is higher by IRR = 1.68 as compared to not employing mothers, keeping other variables constant (Table 8).

Table 8 Zero inflation parameter estimates.

Discussion

Neonatal mortality erodes the potential economic labor force, thus plunging the country into a human resource disaster and retarding development. Children are the human resource bank of every nation and nationality of the world. However, they are more exposed to diseases and other health risks. Therefore, this study attempted to identify the socioeconomic, demographic, health care service and environmental-related factors of neonatal mortality based on EDHS 2016 data.

From the result of this study, the prevalence of neonatal mortality was about 364 (3.4%), which was in line with the study conducted in Ethiopia17. But this study's result is lower than the result shown in Ethiopia18. The possible reason for the difference in the prevalence was due to the difference in the sample size included in the study, the study period difference and also the statistical methodology we applied.

The results of this study indicated that breastfeeding, child twin, number of antenatal visits during pregnancy, number of birth orders, place of residence, sex of household head, sex of a child, distance to health facilities, mother’s occupation and region had a statistically significant influence on the number of neonatal deaths in Ethiopia. The risk of neonatal mortality associated with multiple births is very high relative to single births, and this study is consistent with other studies1. The risk of neonatal mortality associated with rural residence is very high relative to an urban residence, and this study is consistent with a previous study17. The reason might be that mothers living in rural areas have limited access to health facilities, maternal healthcare services, and health care professionals for neonatal mortality in rural areas1.

The study result showed that; experienced breastfeeding children have a lower risk of neonatal mortality than children who do not experience breastfeeding. This finding seemed to be consistent with other studies19. The possible reason might be that the harmful effects of infections related to neonatal deaths can be prevented by breast milk, especially when practicing breast milk feeding within the first 1 h after birth. This practice minimizes the risk of death and helps wealth and brain development20. Additionally, increasing the number of antenatal visits during pregnancy reduces the risk of neonatal mortality, and this finding seemed to be consistent with other studies1. This might be due to the lack of access to health care, especially basic and emergency obstetric care, related to neonatal mortality in Ethiopia21.

In this study, rural neonatal were experienced higher mortality than their urban counterparts. The main reason might be a low maternal health service utilization level amongst rural women compared with their urban counterparts. In addition, health institutions are typically concentrated in urban areas and women in urban tended to benefit from better awareness and access to services than rural mothers. Moreover, previous studies have found that rural women are more readily influenced by traditional practices contrary to modern health care. These elements potentially explain the close connection between residence and neonatal mortality. This result was similar to the study result of Refs.22,23.

The risk of neonatal mortality associated with the Dire Dawa and Hareri regions is very high relative to the Tigray region, and this study is consistent with other studies17. This might be due to the difference in the health facility among regions of the country.

The study also showed that children born from employed mothers have a higher mortality risk than non-employed mothers. This finding is consistent with previous studies24,25. The difference could be attributed to the reasons that most employed mothers were engaged in the informal sector like agriculture and sales, which this study has further confirmed to be risky for children's survival. And also, this negative relationship of mother employment and neonatal mortality could be due to the non-availability of adequate time for childcare among working mothers.

Furthermore, birth order has been associated with a variety of adverse life outcomes for children, including morbidity and mortality. This study result found that short preceding birth intervals (first, second to third birth orders) have higher odds of neonatal mortality, a finding that is consistent with most previous findings. This study is consistent with an earlier study26. However, in contrast to previous findings, our analyses found that long preceding birth intervals have lower odds of mortality. The possible argument for the impact of birth order on health and mortality risks has been formulated within the biological depletion hypothesis. According to this hypothesis, later-born children are more likely than their older siblings to be less healthy because they are born to older mothers who have already given birth to a number of children before and therefore are physiologically less able to produce stronger children. However, there is some evidence to the contrary that has found t older siblings, particularly first-borns, are at greater risk of diseases and death because they are typically born to younger women who are neither mature and experienced enough in mothering nor economically well enough off to provide adequate emotional and material resources to their newborn babies27.

The risk of neonatal mortality associated with female children is lower than that associated with male children, and this study is consistent with a previous study28. The study conducted in India and Pakistan also confirmed that neonatal mortality risks were higher among male infants than their female counterparts29. This may be due to biological differences between male and female neonates. That is, female neonates had a stronger immune system than male neonates’ due to genetic differences between females and males. Newborn girls have a biological advantage in survival over newborn boys27,30.

Moreover, the study revealed that children born from the big problem of distance to a health facility have a higher risk of neonatal mortality than those not with big problems with reaching a health facility. This study is consistent with the previous study21. The possible reasons might be lack of ambulance or transportation problems due to low safety of roads which turn results in a delay of the ambulance to referrals hospitals or clinics, most especially for remote areas.

Limitation of the study

All research has its limitation, and it is very important to present the limitation of our study to our audience (other researchers, journal editors and article reviewers).

Therefore, the limitations of this study are that the data are from secondary sources and cannot be centrally extrapolated between neonatal death and their determinants, nor can they explain geographical variation. Future studies need to address spatial effects to explain the fluctuations in neonatal death and prevalence caused by geographic differences.

Conclusion

The main objective of this study was to identify the determinant factor of neonatal mortality in Ethiopia based on EDHS 2016 data with the application of a spatial and count regression model. The study was based on secondary data obtained from the central statistical agency of Ethiopia. For this study, the ZINB regression model was the most appropriate. This result suggested that the ZINB model is most suitable to deal with the problem of over dispersion and excess zero values simultaneously.

The prevalence of neonatal mortality in Ethiopia was higher (3.4%). This study also investigated high-risk areas of neonatal mortality in Somali, Afar, border of Tigray and afar, border of Amhara and Afar, Oromia, border of Amhara and Oromiya, and Hararie and Dire Dawa regions of the country. In this study, the risk factors for neonatal mortality were multiple birth types, rural residences, health facilities, environmental places of Dire Dawa administrative city and Hararie Region, and employed mothers as significant factors of neonatal mortality that increased the risk of death during the neonatal period. However, having early breastfeeding, a female household head, antenatal visits (1–4) and (5–10) during pregnancy and having the first and two to three birth orders decreased the risk of neonatal death.

Based on the finding of this study, we strongly recommend that attention be paid to socio-economic, demographic, environmental, and maternal health service factors to ensure a significant reduction in neonatal mortality. The Federal Ministry of Health needs to prioritize the high risk of neonatal death and improve early maternal and child health care. Efforts are underway to expand educational programs to educate mothers about the benefits of breastfeeding, prenatal screening, birth order, mother's profession, and out-of-hospital residences to reduce neonatal mortality.