Background

Plasmodium falciparum malaria is a leading infectious disease, accounting for approximately 300 to 500 million clinical cases each year and causing over one million deaths, mostly in African children younger than 5 years. Insecticide treated nets (ITN) have been proven to be an effective tool in reducing the burden of malaria [13]. Numerous trials all over the world have shown that such nets can reduce child mortality in endemic areas in Africa by 17% and roughly halve the number of clinical malaria episodes [4]. These results were later confirmed under programme implementation [5, 6]. It is well known that the use of ITNs provides significant individual protection, but direct and indirect effects on malaria transmission of treated and untreated nets on the wider community of bednet users and non-users are still little understood, despite some recent progresses [7]. Randomised trials in different malaria transmission regions examined the effect of ITNs on mortality of children without bednets. A study carried out in northern Ghana estimated that mortality risk in individuals without insecticide nets increased by 6.7% with every 100 m shift away from the nearest intervention compound [8]. In western Kenya households without ITNs but within 300 m of ITN villages received nearly full protection [9]. These results conflict with those found from studies in The Gambia which concluded that protection against malaria seen in children using ITN is due to personal rather than community effect [1012]. A better understanding of these spatial effects in real-life situations is paramount for setting control targets, especially for understanding equity issues since these spatial effects mainly improve the situation of unprotected individuals, who are on average poorer. Moreover, the spatial effects of ITNs on non-bednet users in relation with the degree of density of bednets will indicate the type and level of bednet coverage that control programs need to achieve in order to maximize protection of non-bednet users. Here we present for the first time results for the spatial effects of mosquito nets in a "real-life" programme. One of the limitations of previous studies is that they used standard statistical methods which assume independence between observations. When these methods are applied to spatially correlated data, they underestimate the standard errors and thus overestimate the statistical significance of the covariates [13]. In this paper we analyzed data from a large follow-up study in a highly malaria endemic area in southern Tanzania. Making use of a demographic surveillance system (DSS) we tracked child mortality prospectively and assessed the relation between all-cause child mortality rates and the spatial effect of bednet density. To account for spatial clustering we fitted Bayesian geostatistical models with household-specific random effects. Models for geostatistical data introduce the spatial correlation in the covariance matrix of the household-specific random effects and model fit is based on Markov chain Monte Carlo methods (MCMC). MCMC estimation requires repeated inversions of the covariance matrix which, for large number of locations is computationally intensive and time consuming. To address this problem we propose a convolution model for the underlying spatial process which replaces large matrix inversion by the inversion of much smaller matrices.

Methods

Study area and population

The study was carried out from October 2001 to September 2003 in the 25 villages covered by a demographic surveillance system (DSS) in the Kilombero Valley, southern Tanzania. The DSS updates every 4 months demographic information on a population of about 73, 000 people living in 12, 000 dispersed households (Figure 1) in two districts – Kilombero and Ulanga [14]. Most residents practice subsistence farming with rice and maize being the predominant crops. The climate is marked by a rainy season from November to May with annual rainfall ranging from 1200 to 1800 mm. Malaria is the foremost health problem, for both adults and children [15]. The prevailing malaria vectors in this region are Anopheles gambiae and Anopheles funestus with an estimated average entomological inoculation rate estimated of over 360 infective bites per person a year [16]. A large-scale social marketing programme of ITNs for malaria control has been running in this area since 1997 [6].

Figure 1
figure 1

Distribution of the DSS households according to their socio-economic status. Socio-economic status of the DSS households: from light red to dark red: most poor, very poor, poor, less poor and least poor.

Data collection

Mortality data were obtained prospectively and continuously over a two-year period from the DSS, which allowed us to register age and sex data, births and migrations in and out the study area. Exact procedures are described in [14].

An additional survey was carried out in the DSS population in 2002 to collect socio-economic information. The survey questionnaire included a list of household assets (e.g. bednet), housing characteristics (e.g. type of roofing material) and type of energy and light. Although information on ITNs ownership was also collected, we did not use these data in our analysis since it was shown [17] that in this area two-thirds of the nets that were reported as having been re-treated within the last 12 months had insufficient insecticide to be effective.

Households and health facilities were geolocated using a hand-held Global Positioning System (Garmin GPS 12, Garmin corp.) and Euclidean distances between houses and the health facilities were calculated. Oral informed consent was obtained from the heads of the households included in the study.

Statistical analysis

Bednet density was defined as the number of bednets per person within a certain radius around each household. The following radii were chosen: 0 m (bednet coverage at household level), 50 m, 100 m, 150 m, 200 m, 300 m, 400 m, 500 m and 600 m.

A wealth index was calculated as a weighted sum of household assets. It has been shown that there is an inverse relationship between mortality and socio-economic status [18]; therefore the weights of the wealth index were obtained from the coeffcients of a negative binomial model which estimated the effect of assets on all-age mortality. The weight of asset i was calculated as w i = b i i b i 2 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4DaC3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpjuaGdaWcaaqaaiabdkgaInaaBaaabaGaemyAaKgabeaaaeaadaGcaaqaamaaqababaGaemOyai2aa0baaeaacqWGPbqAaeaacqaIYaGmaaaabaGaemyAaKgabeGaeyyeIuoaaeqaaaaaaaa@3A3C@ , where b i is the regression coeffcient corresponding to asset i. The wealth index was divided into quintiles corresponding to poorest, very poor, poor, less poor and least poor groups of the population.

Negative binomial models were fitted to assess the effect of different bednet density measures on child mortality after adjusting for possible confounders: sex, wealth index and distance to the nearest health facility, using STATA v. 9.0 (Stata Corporation, College Station, TX, USA).

To estimate the effect of bednet density on the mortality of children without nets we performed a similar analysis. In particular, we defined bednet density as above, considering as index households the ones without any bednet. We then fitted the negative binomial models adjusted for the above mentioned confounders.

The household mortality data are correlated in space since common environmental risk factors, proximity to breeding sites and socio-economic exposures may influence the mortality outcome similarly in households within the same geographical area. The independence assumption of the standard negative binomial models may result in overestimation of the significance of the bednet coverage covariate. To address this problem Bayesian geostatistical negative binomial models were fitted with household-level random effects. Spatial correlation was modeled by assuming that the random effects are distributed according to a multivariate normal distribution with variance-covariance matrix related to an exponential correlation function between household locations, i.e. σ 2 exp (-d ij ρ), where d ij is the Euclidean distance between households i and j, σ 2 is the geographic variability known as the sill and ρ is the rate of correlation decay. The distribution of random effect defines the so called Gaussian spatial process. Model fit requires the inversion of a covariance matrix with the same size as the sample size. Due to the large number of observations in our dataset, the estimation of model parameters becomes unstable and unfeasible. To overcome this problem we propose a model based on a convolution representation that is, we approximate the spatial random process by a weighted sum of a small number of stationary spatial processes. The size of the covariance matrix that needs to be inverted is then much smaller, therefore the method is computationally efficient. We employed Markov chain Monte Carlo simulation to estimate the model parameters. Further details on this modeling approach are given in the appendix. The analysis was implemented using a software written by the authors in FORTRAN 95 (Compaq Visual FORTRAN Professional 6.6.0) using standard numerical libraries (NAG, The Numerical Algorithm Group Ltd.).

Results

A total number of 11, 134 children from 7, 403 households with children had information available on both geolocation and socio-economic covariates.

The pooled data revealed an overall all-age crude mortality rate of 9.5 per 1000 person-years and an overall child mortality of 26.2 per 1000 person-years with no difference between the two districts (P = 0.98 and P = 0.73, respectively).

The insecticide treatment status of the nets was difficult to ascertain, therefore the results reported in this section refer to bednets only, whether treated or not. The mean bednet density in Kilombero Valley was 270 nets per 1000 inhabitants. 10, 160 households (85%) had at least one bednet and the mean number of bednets per household was 1.64.

Table 1 shows the overall child mortality rates together with district-specific child mortality rates by sex, socio-economic status, distance to the nearest health facility and bednet density at household level. Since there were no significant differences between child mortality rates in Kilombero and Ulanga Districts, all further analysis was done by pooling the data of the two districts. Males had a slightly lower mortality rate than females, but sex was not significantly associated with childhood mortality rates (Incidence-Rate Ratios (IRR) = 0.90, P = 0.216). Similarly, socio-economic status was not significantly associated with child mortality (P = 0.124), but we could notice a trend for children from the relatively better off households to have a lower mortality rate than their poorer counterparts. No significant association was observed with distance to the nearest health facility, but children living ≥ 1 km away from the nearest health facility tended to have higher mortality rates than those living in close proximity.

Table 1 Overall and district-specific child mortality rates by sex, socio-economic status, distance to the nearest health facility and bednet density at household level

A simple bivariate analysis showed that bednet density at household level was significantly associated with child mortality (IRR = 0.50, P = 0.020). There was a tendency for mortality rates to decrease for children living in households with at least 30% bednet density coverage.

The effect of various bednet density measures on child mortality after adjusting for possible confounders is shown in Table 2. Surprisingly, the only measure significantly associated with child mortality was the bednet density at household level (R 0) (IRR = 0.53, P = 0.037). We noted that the mean bednet density was similar for all radii, whereas the standard deviation tended to become smaller as the radius was increasing.

Table 2 Summary of bednet density measures and estimates of the effect of bednet measures on child mortality, adjusted by sex, socio-economic status and distance to the nearest health facility

The results of the bivariate and multivariate non-spatial negative binomial models are shown in Table 3. None of the explanatory variables were significantly associated with child mortality, except the fourth wealth quintile. After taking into account the spatial correlation present in the data, the effect of the covariates remained non-significant. However, the confidence intervals became wider, confirming the importance of taking into account spatial correlation when analyzing geographical data [19]. The parameters σ 2 and ρ shown in Table 3 measure the spatial variance and the rate of correlation decay (smoothing parameter), respectively. The estimates of the smoothing parameter ρ indicate a low spatial correlation in the child mortality rate data. In fact ρ was estimated to be 774.5, which in our exponential setting is translated to a minimum distance for which spatial correlation decrease to 0.05 of only around 0.43 km.

Table 3 Results of the association of sex, socio-economic status, bednet density at household level and distance to nearest health facility with child mortality, resulting from the bivariate and multivariate non-spatial models and spatial model

Table 4 depicts the effect of different bednet density measures on the mortality of children without any bednet after adjusting for sex, socio-economic status and distance to the nearest facility. The results show no significant association between any bednet density measure and mortality of children without nets, indicating no detectable community effect.

Table 4 Estimated effect of bednet measures on mortality of children without nets, adjusted by sex, socioeconomic status and distance to the nearest health facility, obtained by fitting negative binomial models

Pearson's correlation coeffcient between bednet density and bednet usage was 0.83, indicating a strong correlation between the two measures. Hence, the results regarding the bednet density could be extended to bednet usage.

Conclusion and discussion

We examined the effect of a variety of factors on child mortality in an area of high perennial malaria transmission in southern Tanzania and identified that the density of household bed net ownership was the only factor significantly associated with child mortality reduction. The spatial effects of bednets on all-cause child mortality in an area of high perennial malaria transmission in southern Tanzania have been presented here. The effect of different bednet density measures was estimated after adjusting for possible confounders like sex, socio-economic status and distance to the nearest health facility. We concentrated on all-cause child mortality because in rural Africa it is difficult to assess malaria-specific mortality. Most deaths occur at home and verbal autopsy is the only tool available to determine the cause of mortality. It has been shown [20, 21] that this is an inaccurate method to detect malaria, having a low sensitivity and specificity.

Our results indicated an apparent lack of community effect of bednets on childhood mortality. This conclusion is based on the fact that only the bednet density at household level had a significant protective effect on child mortality. When net density within ≥ 50 m was considered, the risk of child mortality increased slightly but the relation was not significant. Our findings contrast with previous studies in Africa, which demonstrated a strong community-wide effect of ITNs on child mortality [8, 9]. However, our study differed from the studies mentioned above in a number of ways.

Firstly, the epidemiological studies that demonstrated the mass effect of ITNs on child mortality were all designed as community-trial interventions, ensuring a uniformly high coverage of treated nets in the intervention group, with a control group almost not using any sort of nets. This creates a strong gradient of ITN at the margins use, which allows a good measure of spatial effects. By contrast, net usage, treated or not, was uniformly high in our study area, with the result that any sort of spatial effects would be more difficult to detect unless there would be heterogeneity in coverage, which was not the case.

Secondly, we were not able to distinguish between treated and untreated nets in the field because there is no reliable testing method to do this at present. [17, 22] showed that in our study area compliance with insecticide re-treatment is relatively low, with only 32% of the nets having enough insecticide to ensure an entomological impact. Since untreated nets are less effective than treated ones [4, 5, 23], this had certainly an impact on the analysis by reducing differences between users and non-users. However, despite these limitations, our study showed that mosquito nets still show a protective effect on child mortality.

Lastly, as specific data on bednet use was not available for the whole sample, we created a different measure of the impact of bednets: the "bednet density" defined as the ratio between the number of bednets owned and the number of people living in a specific area. Previous studies in this region showed that on average 2 people sleep under a bednet with an overall bednet use of about 75% [16]. A limitation of our study consists in linking child mortality data across 2 years (2001 – 2003) with data on mosquito net ownership collected at a single time point (2002).

All analyzes of bednets effect on different malaria-related outcomes so far have been based on the assumption of independence between observations. However, household mortality data are spatially correlated due to common exposures. When the spatial correlation present in the data is ignored, the statistical significance of the covariates is overestimated. We could control for that by using a Bayesian geostatistical approach to assess the child mortality-bednet density relation. Bayesian computation implemented via MCMC enabled simultaneous estimation of all model parameters together with their standard errors, a feature that is not available in the maximum likelihood based framework. Fitting geostatistical models for non-Gaussian data requires repeated inversions of the covariance matrix of the spatial random effects. These computations are not feasible when analyzing DSS mortality data collected at very large number of locations. A convolution model for the underlying spatial process has been suggested for handling large spatial data sets. This approach can be further applied for modeling mortality data coming from other DSS sites.

Despite these limitations, our results are consistent with the analysis of ITN' protective efficacy against malaria transmission in Kilombero Valley [24], which predicted little community-level protection for the individuals not using ITNs. The most likely explanations for this were the small proportion of re-treated nets and the insufficient concentration of insecticide present in the bednets. A recently developed model for the transmission of malaria using data collected in Tanzania [7] predicted that modest bednet coverage (35% – 65%) of the entire population, rather than just high-risk groups (pregnant women and young children) is needed to achieve community-wide protection similar to, or greater than, individual protection. Hence, there is clearly a strong case for improving the status of insecticide treatment through the introduction of long-lasting insecticidal nets (LLINs) which are now becoming increasingly available [25] and for the wide-use of ITNs and LLINs by the whole population. We expect that achieving a high coverage with LLINs will result in further substantial reductions of malaria transmission and hence malaria-related mortality and morbidity for both users and non-users.

Appendix

Let Y il be the mortality outcome of child l at site s i i = 1, ... , n taking value 1 if the child is dead and 0 otherwise. We assume that Y il arises from a negative binomial distribution, that is Y il ~NegBin(p il , r), where p il is the probability that child l at location s i is dead and r is the parameter that quantifies the amount of extra Poisson variation. To account for spatial variation in the data, location-specific random effects were integrated in the negative binomial model. The probability p il , is modeled as p i l = r r + z i l MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiCaa3aaSbaaSqaaiabdMgaPjabdYgaSbqabaGccqGH9aqpjuaGdaWcaaqaaiabdkhaYbqaaiabdkhaYjabgUcaRiabdQha6naaBaaabaGaemyAaKMaemiBaWgabeaaaaaaaa@39EA@ , with log(z il ) = log(pyrs il ) + X i l T β MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemiwaG1aa0baaSqaaiabdMgaPjabdYgaSbqaaiabdsfaubaakiabek7aIbaa@32D3@ + ϕ i , where X il is the vector of associated covariates, β are the regression coeffcients and ϕ i 's are the spatial random effects. pyrs il represents the number of person-years corresponding to child l at location s i and log(pyrs il ) is considered as covariate with regression coeffcient fixed to 1 and is referred as offset.

The standard approach to model the spatial dependence is to assume that the covariance of ϕ i 's at every two locations s i and s j decreases with their distance d ij , that is Σ ij = σ 2 f (d ij ; ρ) with f(d ij ; ρ) = exp(-d ij ρ), where ρ > 0 is a smoothing parameter that controls the rate of correlation decay with increasing distance and σ 2 quantifies the amount of spatially structured variation. Estimation of the location-specific random effects and of the spatial parameters requires repeated inversions of the covariance matrix Σ. Due to the large number of locations in our dataset (7, 403), matrix inversion is computationally intensive and is not feasible within practical time constrains. To overcome this issue we develop a convolution model for the underlying spatial process. In particular, we choose a small number of locations t k , k = 1, ... , K over the study region, assume a stationary spatial process ω k over these locations and we model the spatial random effect ϕ i at each data location s i as a weighted sum of the fixed location stationary processes. That is, ϕ i = k = 1 K a ( i , k ) ω k MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeqy1dy2aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpdaaeWaqaaiabdggaHjabcIcaOiabdMgaPjabcYcaSiabdUgaRjabcMcaPiabeM8a3naaBaaaleaacqWGRbWAaeqaaaqaaiabdUgaRjabg2da9iabigdaXaqaaiabdUealbqdcqGHris5aaaa@408E@ , where the weights a(i, k) are decreasing functions of the distance between data location s i and the fixed location t k and ω k ~N (0, Σ k ), with (Σ k ) hl = σ 2 exp(-d hl ρ), where d hl is the distance between the fixed locations t h and t l . This approach avoids the inversion of the large covariance matrix nxn, reducing the problem to the inversion of a much smaller size matrix KxK. For this specific analysis we have chosen K = 200.

For the correlation function chosen, the minimum distance for which spatial correlation between locations is below 5% is 3/ρ (range). The above specification of spatial correlation is isotropic, assuming that correlation is the same in all directions.

Following a Bayesian model specification, we adopt prior distributions for the model parameters as follows: non-informative uniform prior distributions for the regression coeffcients β, inverse gamma prior distribution for σ 2 and gamma prior distribution for the decay parameter ρ and the over-dispersion parameter r.

We estimate the model parameters using Markov chain Monte Carlo simulation. In particular we implemented Gibbs sampler (Gelfand and Smith, 1990), which requires simulating from the full conditional distributions of all parameters iteratively until convergence. The full conditional distribution of σ 2 is inverse gamma distribution and it is straightforward to simulate from. The conditional posterior distribution of β, ρ, and r do not have known forms. We simulate from these distributions using the Metropolis algorithm with a Normal proposal distribution having the mean equal to the parameter estimate from the previous Gibbs iteration and the variance equal to a fixed number, iteratively adapted to optimize the acceptance rates. We have run a five-chain sampler with a burn-in of 10, 000 iterations and we assessed the convergence by inspection of ergodic averages of selected model parameters after 200, 000 iterations.