Abstract
Background
Geographically weighted Poisson regression (GWPR) was applied to the relation between cervical cancer disease incidence rates in England and socioeconomic deprivation, social status and family structure covariates. Local parameters were estimated which describe the spatial variation in the relations between incidence and socioeconomic covariates.
Results
A global (stationary) regression model revealed a significant correlation between cervical cancer incidence rates and social status. However, a local (nonstationary) GWPR model provided a better fit with less spatial correlation (positive autocorrelation) in the residuals. Moreover, the GWPR model was able to represent local variation in the relations between cervical cancer incidence and socioeconomic covariates across space, whereas the global model represented only the overall (or average) relation for the whole of England. The global model could lead to misinterpretation of the relations between cervical cancer incidence and socioeconomic covariates locally.
Conclusions
Cervical cancer incidence was shown to have a nonstationary relationship with spatially varying covariates that are available through national datasets. As a result, it was shown that if low social status sectors of the population are to be targeted preferentially, this targeting should be done on a regionbyregion basis such as to optimize health outcomes. While such a strategy may be difficult to implement in practice, the research does highlight the inequalities inherent in a uniform intervention approach.
Similar content being viewed by others
Background
Regression is a well known statistical tool for exploring the relationship between target and explanatory variables [1]. Different types of regression models are used widely in ecological and disease research, for example, global regression modelling, multilevel modelling and Bayesian modelling for small area studies [2]. For example, regression has been used to explore the relations between limiting longterm illness, ethnicity and income in London [3]. However, global regression models are stationary in the parameters and, thus, geographical variation in the relations is ignored. Geographically weighted regression (GWR) is a well established technique that relaxes the stationarity decision implicit in global models, thereby allowing parameters to vary spatially [4–6]. This amounts to a nonstationarity decision. GWR can, thus, be used to examine spatial variation in relations (i.e., in the parameters that define those relations) and reveal spatial patterns in parameters. Information on local spatial variation in parameters can lead to greater understanding of the relations between the target and explanatory variables.
Global regression models have an important role in disease studies [7]. However, in such studies, it is assumed that the relation between disease rate (or disease incidence) and explanatory variables is spatially constant, which may not be the case. The decision to ignore potential local spatial variation in parameters can lead to biased results which may in turn lead to poor guidance being provided to healthcare practitioners and the general population. Local spatial variation can be important and meaningful in disease analysis, pointing to the key local risk factors associated with disease incidence. Such information may have important implications for policy makers.
Geographical information systems (GIS) are commonly applied in disease studies [2, 8, 9]. GIS facilitate the handling of spatially referenced data and allow visualisation of spatial patterns in disease and identification of local hotspots. The geographical referencing of data that allows application of GIS also allows application of GWR. GWR is well developed for different statistical modelling frameworks (e.g., Gaussian and Poisson models). In the context of disease studies, Gaussian GWR has previously been applied to longterm limiting illness in the northeast of England, and the results showed regional variation in the regression parameters [10]. Geographically weighted Poisson regression (GWPR) can be applied to model disease counts and incidence rates (the focus of this paper, and a common focus in disease studies).
Many studies have shown that ill health issues are related to the surrounding socioeconomic and socioeconomic deprivation conditions [11–13]. For example, children in Bangladesh with a working mother have been found to have a higher chance of suffering from diarrhoea than those who have mothers who stay at home [14]. Other studies have shown that such relations may also vary between regions and that such variation should be taken into account [15] to provide more representative modelling and more accurate prediction. One reason postulated for the importance of local variation in such relations has been local variation in ability to access healthcare services [16]. Illhealth condition may also be related to human behaviour which may be a function of social background as well as educational level.
The Black report [17, 18] suggested that higher income populations commonly made better use of health services, and there are significant social inequalities in using local health services in England [19]. Some research showed evidence of inequalities in health care access due to age distribution, sex structure, local deprivation conditions, and ethnic mix [16, 19, 20]. Such factors may explain variation in willingness to attend regular screening, and such factors may vary spatially. Therefore, socioeconomic conditions and deprivation may be correlated with illhealth condition either directly, or through the effect of social conditions on poor service uptake [17].
Cancer is a common cause of death globally, with cervical cancer the second most common cancer for women worldwide [21, 22]. The number of cases of cervical cancer is increasing, with about 471,000 new diagnostic cervical cancer cases per year worldwide [23]. About 80% of cervical cancer incidence cases occur in low income countries [22] while 70% of all cancer deaths in 2007 occurred in low and middleincome countries [24].
The National Statistics Report revealed differences in incidence in cervical cancer in the UK between manual and nonmanual social classes, with a higher incidence in manual social classes [25]. In 2006 there were 2,873 new diagnostic cases and by 2007 there were 2,828 new diagnostic cases in the UK [23, 26]. It is, thus, important to understand the risk factors associated with cervical cancer. Sexual behaviour is considered to be one of the main risk factors, as research has revealed an association between Human Papilloma Virus (HPV) and cervical cancer development [27]. In particular, HPV 16 and 18 are highly related to cervical cancer development [28–30]. It is estimated that 99% of cervical cancer cases are related to HPV infection [22]. Age is considered to be one of the risk factors associated with cervical cancer incidence, while other causal factors include family history, and female reproductive history. It is likely that cervical cancer development is also related to further associated factors.
Given the above evidence, it is important to understand the relations between cervical cancer disease risk and deprivation conditions, social status and family structure factors. Knowledge of such relations may be of use in planning screening programmes to reduce risk through early detection. In addition, such knowledge may be used to underpin resource allocation and service access design in relation to observed inequalities (e.g., screening programmes).
The aim of screening programmes is to detect abnormal or cancerous cells at an early stage because patients are expected to respond better to treatment at early disease stages. A screening programme can increase the chances of detecting cancerous and especially precancerous cells at early disease stages so that the cancer incidence rate may be reduced and, thus, the likelihood of survival may be increased [21, 23]. Early detection is a costeffective and life saving strategy for chronic disease when the disease is still highly curable or preventable at early disease stages. The survival rate for cervical cancer in England and Wales between 1971 to 1999 was up to 80% for a one year period, 5060% for a five year period and 4050% for a 10 year period [31]. Importantly, the NHS Annual Screening Review Report [32] and the Cervical Screening Pocket Guide [23, 33] suggested that the UK's cervical cancer screening programme can prevent about 75% of cervical cancer cases on average if all female patients attend screening regularly [34]. However, there has been concern that (i) the highest risk population is not tested sufficiently frequently and (ii) those with a positive test result are not followedup and treated properly [33].
The aim of this research was to explore the spatial pattern in the relations between cervical cancer incidence and a set of socioeconomic and deprivation conditions, social status and family structure factors in England using GWPR. The analysis has implications for the UK National Cervical Cancer Screening Programme.
Methods
Poisson regression
When modelling disease cases (count data) and particularly for rare diseases with low numbers of cases, the Poisson model is an appropriate regression model [35, 36]. Many disease analysis studies over small areas have applied the Poisson model to describe the disease distribution [2, 8, 36].
In practice, the standardized mortality ratio (SMR) is commonly used to measure and compare regional mortality rates. In this research, the property of interest is the incidence rate rather than mortality rate, and so the standardized incidence rate (SIR) was used [13]. The SIR is defined as [13, 37];
Where y_{ i } is the number of observed incidence cases, and E_{ i } is the expected number of cases for region i, where i = 1, 2, ..., N. The expected number of cases E_{ i } is based on the overall incidence rate r_{ g } applied to the demographic structure [37]. The expected number of cases was calculated by using the normalized incidence rate r_{ g } per age group. This rate was normalized by multiplying by the ratio (~2.4) between total cervical cancer cases (the data used here) and new diagnosed cases (the data used in the Cancer Research UK rates). The normalized rate was then multiplied by the female population p_{ gi } within that age group in region i, where g is the age group. The female population per age group was determined from the 2001 UK Census of those aged between 2529, 3034, ..., 8084 and is defined as;
where, g is the age group
Since SIR is a standardized indicator of incidence rate, it varies around one; if the rate is above one, the observed incidence is greater than expected; if the rate is less than one, the observed incidence is less than expected. The Poisson regression model can be written as [4, 6];
The link between the target variable and K covariates can be described by a function f(x_{ i } ). H_{ i } is the offset variable, which is a measurement unit of exposure for region i. Most disease studies based on the Poisson distribution framework use the expected number of cases E_{ i } as the offset variable.
Geographically Weighted Poisson Regression (GWPR)
GWR is a well established technique that can be used to examine spatial variation in relations (i.e., nonstationary regression parameters). Information on local variation in parameters can lead to greater understanding of the relations between the target and explanatory variables. When GWR was first developed, the Gaussian model was used in disease studies [10]. This section expands on GWR to describe the GWPR method. The theory and materials in this section are covered by Fotheringham et al.[4] and Nakaya et al.[6] and so only a summary is provided. The traditional linear regression model is generally defined as below in equation (5);
Where β_{0} is the intercept, the β_{ k } are the coefficients of the covariates k and ε_{ i } is the error term for region i = 1, ..., N. The estimated parameters are constant over space. GWR is an extension of the above traditional model in which all parameters are allowed to vary over space. The model framework is defined as below [4, 6];
where, (u_{ i } , v_{ i } ) is the coordinate of the i th region, which describes the location of i. For polygons, such coordinates are normally defined as the centroid of region i recorded as a twodimensional vector. β_{0} (u_{ i } , v_{ i } ) is the intercept for location i, β_{ k } (u_{ i } , v_{ i } ) is a realisation of the continuous function of β_{ k } at region i and ε_{ i } represents the error term and it is assumed to follow a Gaussian distribution with mean zero and variance σ^{2} .
As discussed at the beginning of this section, the Poisson model is generally more appropriate for disease data. The GWPR model can be written as [4, 6];
Such a model allows the parameters to vary geographically [6]. The model can be calibrated based on a kernel regression method, which allows users to estimate the geographical variation in model parameters with a given spatially weighted kernel. The optimal size of the kernel is usually estimated through calibration.
To estimate the GWPR parameters, a local likelihood methodology was applied to estimate the local parameters at location i by maximizing the geographically weighted loglikelihood function in equation (8) [4, 6, 38]:
where, w_{ ij } is the weighted value and (u_{ i } , v_{ i } )  (u_{ j } , v_{ j } ) represents the distance between regression point i and data point j.
The weighting function is defined by the kernel type and the size of kernel (referred to here as the bandwidth). The weighting function w_{ ij } determines the geographical weight of the j th observation at the i th regression point. In theory, the weight should decrease gradually as the distance between i and j increases, eventually, converging to or reaching zero. Parameter estimates are highly related to kernel size such that choice of kernel is an important consideration. There are two commonly employed types of kernel: (i) the Gaussian kernel and (ii) the bisquare kernel:

(i)
Gaussian kernel with fixed bandwidth in which each local regression model has the same spatial size of kernel, but each kernel may cover a different number of data points. The function is defined as [4];
{w}_{ij}=exp\left(\frac{1}{2}\frac{{d}_{ij}}{d}\right)(9)
where d_{ ij } is the distance between regression point i and data point j and d is a nonlinear parameter (bandwidth). The closer a data point j to regression point i, the larger the weight given [4].

(ii)
Adaptive method with bisquare kernel, in which the bandwidth covers the same number of data points with nonzero weight within each regression model. Any points outside the bandwidth d have zero weight and are excluded from the local regression. This adaptive kernel is a common choice, especially, when the sampling density varies greatly across space. The function is given as [4];
{w}_{ij}=\left\{\begin{array}{cc}\hfill {\left[1{\left(\frac{{d}_{ij}}{d}\right)}^{2}\right]}^{2}\hfill & \hfill {d}_{ij}<d\hfill \\ \hfill 0\hfill & \hfill otherwise\hfill \end{array}\right.(10)
The choice of bandwidth has an important role in relation to the level of smoothing of the outputs. A larger bandwidth generally produces greater smoothing. An optimal bandwidth may be selected in terms of some criterion. In practice there are three common means of choosing the bandwidth, (i) subjective, (ii) smallest crossvalidation error and (iii) smallest Akaike information criterion (AIC). In this paper, parameter estimates were calibrated in a pointwise way, and the kernel size with minimum adjusted Akaike Information Criterion (AICc) was selected as optimal (and the Bayesian Information Criterion (BIC) was also considered).
Geographically weighted Poisson regression statistics
In GWPR, all parameter estimates are made using an iterative procedure that continues until convergence; once the prediction at location i has changed, the prediction at j may also be affected if j is within the bandwidth of the regression point i. Therefore, it is necessary to maximise equation (8). The method for solving this equation is to apply a type of local Fisher scoring procedure, which is called iteratively reweighted least squares [6]. All the methods in this section are covered by [4, 6] and only a brief summary is given.
The estimation of local parameters is given in equation (11),
where, z(u_{ i } , v_{ i } )^{(l)}is a vector of adjusted dependent variables, A(u_{ i } , v_{ i } )^{(l)}is the variance weights matrix associated with Fisher scoring for each location i, W (u_{ i } , v_{ i } ) is the diagonal spatial weights matrix for location i, X is the design matrix, X ^{t} is the transpose of X, and l represents the number of iterations. Finally, the parameters are estimated for each location i, until the estimates converge.
Standard error
Since the aim is to estimate local parameters, it is important to calculate the local standard errors. Such standard errors take account of variation in the data, which can be used to compare the estimates. If there are only a few points within the regression bandwidth area or those regression points are far away from the regression point the local error may be large. The local standard errors are highly related to the j points (data), which lie within the regression bandwidth. So the locations of the regression point i and data points j determine the standard error of the parameters.
In this section, estimation of the local standard errors is considered. The local parameter estimates are defined as in equation (11). When the estimation process has converged the number of iterations l can be ignored and the equation redefined as [4, 6];
where, A(u_{ i } , v_{ i } ) and z(u_{ i } , v_{ i } ) are calculated based on the converged estimates of \widehat{\beta}\left({u}_{i},{v}_{i}\right). The z(u_{ i } , v_{ i } ), are assumed to follow a normal distribution with zero mean and variancecovariance A(u_{ i } , v_{ i } )^{1}. The asymptomatic variancecovariance of the k th parameter estimate is given by,
where, the standard error of the k th parameter estimation is given by,
Model measurement and comparison for GWPR
The coefficients vary continuously over space. Therefore, it is almost impossible to achieve universally accurate estimation. Models with very few data points lead to large standard errors in local parameter estimation. On the other hand, a model with a large number of data points can provide more reliable local parameter estimation. However, such models may contain a large amount of bias as the distances between regression point i and data points j increase. Thus, it is important to obtain a balance between the bias and variance of the parameters being estimated.
An optimal size of bandwidth is needed to provide unbiased estimation of the local parameters. There are many indicators available, such as the Akaike information criterion (AIC) and Bayesian information criterion (BIC). For GWR, it is common to use the AIC to assess the performance of the fitted model with certain covariates and for a given bandwidth. One way of achieving the right balance is to use some model selection indicators. There are many available indicators. In this study, the adjusted AIC was used to assess the performance of the bandwidth size and BIC was used as an alternative measurement. AIC was developed by Akaike in 1974 [39, 40] to measure the performance of statistical models. The AIC of the model with bandwidth d is given as [4, 6];
The deviance is represented by D and the effective number of parameters is represented by K with bandwidth d. The model with the smallest AIC (i.e., the model with optimal bandwidth) is called the minimum AIC estimator (MAICE). In practice, if the difference in AIC between two models is less than or equal to two, there is no significant difference between the two models.
In some situations the AIC can perform poorly or may even be biased, for example, when there are too many parameters with a small number of observations [40, 41]. To avoid such biased estimation from AIC, Sugiura [41] derived a second order variant of AIC which is called the cAIC, and Hurvich and Tsai [42] incorporated a small sample (second order) bias adjustment which led to a criterion called AICc.
The other bandwidth selection criterion that can be used in GWPR is called the BIC [4], the calculation of which is given by,
where, L is denoted as the model likelihood, K is the effective number of parameters and N is the total number of regions. BIC was derived from Bayesian theory, where each of a discrete number of candidate models have equal prior probabilities (the prior distributions on the model parameters). The model with the smallest BIC is the better fitted model compared to the other candidate models. AICc was used here to compare candidate models, and BIC was used as an alternative.
Data collection
Any locations with a small number of incidence cases and deaths per district or unitary authority (i.e. 05) were represented as missing data (NAs) for reasons of confidentiality. In the modelling, such NAs were treated as truncated data. Further details of the data collection are described below.
Cervical cancer and socioeconomic data
Two sets of data were included for analysis; cervical cancer count data for 2004 and explanatory variables for 2001. The cervical cancer count data were provided by the Association of Public Health Observatories (APHO), which represents the set of nine Public Health Observatories (PHO) in England (Table 1). In total, 7179 cervical cancer cases and 2391 deaths were recorded in 2004. The data are represented at the district and unitary authority levels of the Cancer Registries in England. At the beginning of this study the data were subjected to a Chisquare goodness of fit test, which showed that the data approximately follow a Poisson distribution.
In this research, the Townsend index was chosen to measure deprivation. It is a common index and it has been used widely in health studies. The Townsend index comprises four scores (Figure 1) that represent socioeconomic deprivation (Table 2).
The calculation of the Townsend score for each variable is defined below. Let V_{ ih } be the value of each socioeconomic variable, for variables h = 1 to 4 and i = 1 to N areal units in the data. The Townsend score z_{ ih } is a standardized measure for each of the four deprivation variables obtained by subtracting from V_{ ih } the mean m_{ ih } and dividing by the standard deviation σ_{ ih } as below.
Both variables (i) unemployed population and (iv) overcrowded housing were transformed by a natural log q = ln(s + 1), where q is the value after the transformation and s is the observed value of the socioeconomic variables, to make the variables approximately normally distributed.
The Townsend index is calculated from the sum of z_{ ih } as follows:
The greater the Z value the greater the deprivation.
Other variables were added to represent family structure and social status. Thus, the set of explanatory variables comprises the Townsend Index plus family structure (the proportion of married females, proportion of single females, proportion of lone parents including male and female parents, and proportion of female lone parents) and low social status. All explanatory variables are listed in Table 2 and mapped in Figure 2. All data were downloaded from the UK Census of 2001. Since the census is carried out once every ten years the closest matched year to 2004 was 2001.
Truncated data
Because of confidentiality restrictions, data were aggregated as summary counts for regions rather than being provided for individuals. For districts with less than or equal to five cases, the number of incidence cases was not disclosed in order to protect the patients' privacy. Closed, missing or truncated data are common in disease studies. While it is possible to exclude or remove truncated data from analysis such an approach would amount to information being discarded, potentially reducing predictive power [43]. Therefore, within the analysis all the counts between 05 were treated as truncated data. In total, these truncations were applied to 21 regions (6%).
One way of dealing with truncation is to estimate the basic true mean of the Poisson distribution from the data, including the missing data. Accepting the estimated means to be true, then a random number can be drawn from the Poisson distribution for each of the regions. For each region, a stream of random numbers was drawn and the first 100 random numbers between 05 were used to replace the missing data. 100 sets of missing values were produced, and the GWPR models were fitted 100 times, each time with a different set of missing values. Then the mean and variance of the predictions was calculated.
Since there were 100 different sets of realisations for the missing data, the GWPR model was fitted 100 times and the average of the predictions for the 100 models was estimated:
where, n is the number of GWPR models, {\u0177}_{in} is the prediction for the n th model, and E\left({\u0177}_{i}\right) is the expectation of {\u0177}_{i}, the overall average prediction from the 100 GWPR model predictions.
The variance for the 100 predictions was estimated to characterise the overall variation resulting as a function of the uncertainty due to truncating the distribution. The variance provides information on prediction uncertainty and parameter estimation uncertainty. The variance was calculated as;
where, \mathsf{\text{var}}\left({\u0177}_{i}\right) is the overall variance between the n = 100 models.
Results
Global model
To examine the possible determinants of the geographical patterns in cervical cancer incidence, a traditional global Poisson regression model was fitted with an offset equal to the expected number of cases based on the demographic composition of each region. All covariates were significant to the observed incidence. For full details of measurements of all other candidate models please refer to Table 3. The final fitted global Poisson regression model is defined as:
where, x_{ G }_{45}_{ i } represents the proportion of low social grade population (i.e., from social status IV and V) (Table 2). x_{ G }_{45}_{ i } was found to be significant and associated to cervical cancer incidence rate at the global level. The AICc of this model is 612.97 and BIC is 620.67 (Model 6 in Table 3), which can be used to compare this model with other models. The proportion of low social status population includes semiskilled manual workers, unskilled manual workers, people on state benefit, the unemployed and the lowest grade workers (Table 2). When this proportion of the population increases the incidence rate is likely to increase. The ratio between the likelihood of cervical cancer for the low social grade population and that for the high and medium social grade population is about 2.8. The x_{ G }_{45}_{ i } variable may reflect the amount of general knowledge about personal illhealth issues or the ability or willingness to access local healthcare services including attending regularly the National Cervical Cancer Screening Programme.
GWPR analysis
The global regression model showed that the proportion of low social status population is significant covariate of incidence rate at the global level, but such a model may mask potential local spatial variation in the relation between incidence rate and the covariates. Thus, the GWPR model was applied.
As summarized above, the use of an adaptive weighting function and the optimal bandwidth were selected based on the smallest AICc in Table 3. Figure 3 shows the AICc plotted against kernel size, and that the optimal bandwidth is 91 regions. The size is relatively large, which may be due to the sample size being small in most of the regions. Therefore, a large bandwidth was required to cover sufficient data to predict reliably.
As described in the methods section, the models were compared using the AICc; the smallest AICc values were assumed to provide the best fitting model from the candidate models. One of the datasets from the set of 100 randomly imputed datasets is shown in Table 3. From Table 3, it can be concluded that the best fitting model is the GWPR model with the proportion of low social status population as a covariate (model 6 in Table 3). The final fitted model is given as (model 6 in Table 3);
The overall means and variances of the predictions of SIR from the 100 local models are displayed in Figure 4b and 4c). The average estimated means and variances of {\widehat{\beta}}_{0}\left({u}_{i},{v}_{i}\right) and {\widehat{\beta}}_{1}\left({u}_{i},{v}_{i}\right) from the 100 samples are displayed in Figure 5.
The GWPR analysis revealed an interesting positive local relationship between incidence rate and the proportion of low social status population, which was hidden from the global model. The raw data (Figure 4a) and the mean of the 100 predictions (Figure 4b) have a similar spatial pattern and the variance in Figure 4c reveals only a small amount of variation between the 100 models, meaning that data truncation had little effect on the results. The map of the mean of the 100 maps of residuals from the 100 local models (Figure 6a) exhibits little spatial correlation and the variance of these residuals (Figure 6b) again reveals only a small amount of variation between the 100 models. The R^{2} values of the local models in Figure 7a are generally large, between 0.78 to 0.98, and the variance of the R^{2} values from the 100 models is relatively small between 0.00075 to 0.0045 (Figure 7b).
The correlation is generally positive, as for the global model, but the effect of and contribution from the low social status variable (i.e., the estimated coefficient) varies between 0.07 and 4.40 times across England. There is a greater contribution from this variable in the south and northeast of England (see Figure 5c). Low social status population had far less effect in the west of England. This might be related to population structure, due to the higher proportion of elderly population in the west of England compared to the national average. Two regions exhibit a negative relation (0.22 and 0.04 times) which are Penwith (South West England) and Scilly. From (Figure 5a and Figure 5c), it is clear that the contribution from low social status varies over space, and when β_{0i}decreases then β_{1i}increases. The intercept β_{0i}varies between 1.18 and 0.44.
Stationary parameters
It is interesting to examine which explanatory variables are fitted adequately using a model with stationary parameters, and which variables required a nonstationary model. The method used to answer this question is to compare the interquartile range at the local level and the standard error at the global level. If the local interquartile range is twice the global standard error, then the variable requires a nonstationary model to represent it adequately [4] (Table 4).
Table 4 shows that both β_{0i}(intercept) and β_{1i}(low social grade population G4 and G5) have an interquartile range more than twice the global standard error. This indicates that the socioeconomic variables (i.e. low social grade population G4 and G5) are better fitted by a nonstationary model in GWPR than using a global regression. A nonstationary model allows greater prediction power, and as a result, leads to greater understanding of the relation between incidence rate and the proportion of low social grade population and how it varies over space in relation to deprivation conditions regionally.
Discussion
From the above results it is clear that the relation between incidence rate and proportion of low social status population varies spatially. Specifically, the estimated parameters mapped in (Figure 5a and Figure 5c) vary spatially. Low social status was the most significant factor related to cervical cancer incidence rate. The local coefficient {\widehat{\beta}}_{1i} (Figure 5c) showed how the proportion of low social status of population contributed to the incidence rate. The coefficient mapped in Figure 5c revealed different contributions across England of between 0.07 and 4.40 times. A larger contribution is evident in the south and northwest of England than in the west of England. This suggests that the proportion of low social status in south and northwest England has a greater effect than in the Midlands and southwest of England. Therefore, global models are not suitable to describe the relationships between cervical cancer risk and explanatory variables. Southwest England (e.g. Cornwall) has a lower incidence rate and also a small estimated coefficient. In particular, Penwith and Scilly had a negative relation with incidence rate. This might have arisen due to the population structure; the proportion of elderly people is larger in the west of England than the south of England. It might also arise because the locations are relatively isolated, which hinders prediction due to lack of local data points.
In terms of prediction, some regions were underpredicted while others were overpredicted. There are two important cases (i) those regions that are relatively large (i.e. the size of the cell polygon is large), and (ii) those regions that include extreme cases. For the first case, when the regions are relatively large, the distance between the data point i and regression point j is large, so that the accuracy of the prediction may be reduced. In the second case, the prediction can be biased because of the influence of extreme neighbours.
In Figure 6(c), the residual values from the global model seem to exhibit a small amount of autocorrelation. This autocorrelation was measured using Moran's I index as 0.04 with a zscore of 6.60 from Table 5. Moran's I index was also calculated based on restricting the local distance to 100 km, in which case Moran's I was 0.08, with a zscore equal to 6.78. These results suggest that the global model based on proportion of low social status cannot explain the spatially correlated variation in incidence rate. Thus, some potentially explanatory variables may be missing from the model (e.g., sexual behaviour, personal HPV history, family history etc.). The residual values from the local model (Figure 6a) exhibit a random pattern with a Moran's I index of 0.0012 and zscore 0.63 (Table 5). Moran's I was also calculated based on restricting the local distance to 100 km, in which case Moran's I was  0.0021 with a zscore equal to 0.012. Thus, the GWPR model removed the problem of autocorrelation in the residuals.
A disadvantage of using the set of socioeconomic covariates used in the Townsend index is that they account for socioeconomic conditions, but no family structure information is included. Thus, in this research information was added on family structure. However, it is not possible to distinguish between individuals who are not able to buy a car and those who do not need a car. Those people who live in a main city (e.g. London) may not need a car since public transportation is more convenient. Similarly, it is not possible to distinguish between individuals who are not able to buy a property and those who do not want to buy a property. For example, it is more common for people who live in a main city to rent a flat than buy a house.
Most deprivation indices used commonly in the UK are obtained from the UK census. However, there are limitations in the use of census data, most notably that the data are aggregated into areal units (i.e. data are not available at the individual level) and some information is not available (e.g. personal income, environmental conditions). The analysis presented in this paper is, therefore, valid at the census unit scale only. Since deprivation indices are commonly represented on areal units and the units are likely to vary over space, some units may be relatively larger than others. For this reason the covariates may be sensitive to the size of denominator.
The Townsend variables used here measure local welfare and local socioeconomic behaviour which can be useful in health care studies. These variables are also adopted in other deprivation indices (e.g. Carstairs and DoE81) [44]. Alternative indices (e.g. the multiple deprivation index) [45] could be applied to explore the impact on cervical cancer incidence of income, employment, education deprivation and living environment deprivation. However, since some variables (i.e. income and living environment deprivation) were not available at the national level for the study period, such analysis is beyond the present scope and will be considered in further work.
In the present research, a simple method was applied to deal with truncated data. Random numbers were drawn from a Poisson distribution to replace the missing data. However, Figures 4(c) and (Figure 5b and Figures 5d) show very limited variance around the predictions for those areas with missing data, which suggests that the results are not affected greatly by truncation. Further research is needed to explore other possible methods to solve the truncation problem. For example, [46] demonstrated an approximate Bayesian bootstrap method which would be interesting to apply here.
It is possible to go beyond determining which variables can be described adequately by a stationary process and which are best fitted by nonstationary models by applying a mixed GWPR. A mixed GWPR is a semiparametric GWPR model; it allows some variables to vary spatially and others to remain constant in a single approach. This will be explored in future research.
Many studies [3, 9, 20, 47] have suggested that poor health outcomes often appear in the most deprived areas. Some studies have demonstrated health care inequality in terms of patient needs and access to NHS services in England [16, 20, 30]. The relation between health outcomes and social status should be a concern to all governments that espouse ideals of equality. This research demonstrated a locally varying relation between cervical cancer incidence and low social status. This relation may be associated to some patient factors including (i) personal understanding of the cervical cancer programme, (ii) misunderstanding of the current screening policy with regard to age criteria between groups and (iii) lack of knowledge about preventing cervical cancer at early disease stages. However, further analysis is required to explore the underlying causes.
The GWPR results may be useful for policy makers engaged with reviewing current policy and services. For example, it is possible to target patients in at least two ways: (i) divide the population into risk groups according to their age and social status (e.g. low, medium and high risk), or (ii) divide the study area (England) into several regions with similar social status. Each of the groups might then be allocated a different screening programme (e.g. a different screening test or screening interval). From the financial viewpoint such a strategy may save resources or make better use of available limited resources. From the patient's point of view the benefit may be an increase in the chances of detecting and preventing longterm disease. In practice, it is unlikely to be practical to divide the population into risk groups or divide England into several regions with varying risk levels. However, the analysis does provide evidence for the inequality of cervical cancer screening at the local level.
An NHS study showed that the number of cases with Cervical Intraepithelial Neoplasia (CIN3) has increased for women aged between 20 and 24 because of trends in sexual behaviour, with increasing numbers of young people becoming more active sexually when they are still in their midteens [48]. A recent study discussed the poor use of cervical cancer screening resources within current NHS practice [49]. This change in sexual behaviour arises partly because of socioeconomic changes through time and from place to place. If that is true, then recognizing the associated risk factors may be useful for developing a longterm prevention strategy for cervical cancer. For example, it may be possible to improve sex education in local schools, teaching midteen pupils about protective sex and sexually transmitted infectious diseases.
A HPV vaccine is available, and some clinical studies in Italy and Germany showed that use of the vaccine significantly reduces the incidence of cervical cancer. Thus, the vaccine might be considered as a means of achieving increased efficacy and costeffectiveness in screening programmes in the future [50, 51], particularly for younger age groups.
For interventions such as the national screening programme, sex education in schools and for vaccination, it may be considered desirable to target preferentially the low social status sectors of the population (the global model). The results of this paper show that if such targeting were to be considered then it should be done on a regionbyregion basis (the GWPR model).
Conclusions
Traditionally, global regression models have been used to explore the relations between health outcomes and explanatory variables. However, such techniques do not account for spatial variation in the relations. This research demonstrated the use of GWPR to examine the relations between cervical cancer incidence rates and socioeconomic covariates across England. Cervical cancer incidence rates were found to vary spatially across England (e.g. Cornwall and the North of England had low incidence rates compared to the rest of England). Moreover, cervical cancer incidence was found to be associated with low social status and, importantly, this relation was found to vary spatially across England.
Spatial variation in the relations between incidence and socioeconomic covariates means that in some places socioeconomic status has a greater effect on incidence than in other places. This may reflect differences in personal behaviour, local differences in educational levels across the social classes, or differences in screening uptake rates over space. Ignoring such spatial variation could lead to inefficient resource usage nationally. To maximise the benefits of the national cervical cancer screening programme this research suggests that the low socioeconomic status sectors of the population should be targeted, and in some places more so than in others.
References
Chandler R: On the use of generalized linear models for interpreting climate variability. Environmetrics. 2005, 16: 699715. 10.1002/env.731.
Green P, Richardson S: Hidden Markov models and disease mapping. Journal of the American statistical association. 2002, 460: 116.
Jackson C, Best N, Richardson S: Improving ecological inference using individuallevel data. Statistics in medicine. 2006, 25: 21362159. 10.1002/sim.2370.
Fotheringham A, Brunsdon C, Charlton M: Geographically weighted regression: the analysis of the spatially varying relationships. 2002, Chichester; John Wiley & Sons
Nakaya T: Local spatial interaction modelling based on the geographically weighted regression approach. Geo Journal. 2001, 53: 347358.
Nakaya T, Fotheringham A, Brunsdon C, Charlton M: Geographically weighted Poisson regression for disease association mapping. Statistics In Medicine. 2005, 24: 26952717. 10.1002/sim.2129.
Best N, Ickstant K, Wolpert R: Spatial Poisson regression for health and exposure data measured at disparate resolutions. Journal of the American statistical association. 2000, 95: 10761088. 10.2307/2669744.
Lawson A, Clark A: Spatial mixture relative risk models applied to disease mapping. Statistics in medicine. 2002, 21: 359370. 10.1002/sim.1022.
Richardson S, Thomson A, Best N, Elliott P: Interpreting posterior relative risk estimates in diseasemapping studies. Environmental health perspective. 2004, 112: 10161025. 10.1289/ehp.6740.
Fotheringham A, Brundson C, Charlton M: Geographically weighted regression: a natural evolution of the expansion method for spatial data analysis. Environment and planning A. 1998, 30: 19051927. 10.1068/a301905.
Abellan J, Fecht D, Best N, Richardson S, Briggs D: Bayesian analysis of the multivariate geographical distribution of the socioeconomic environment in England. Environmetrics. 2007, 18: 745758. 10.1002/env.872.
Carstairs V: Socioeconomic factors at areal level and their relationship with health. Spatial Epidemiology: methods and applications. Edited by: Elliott P, Wakefield JC, Best NG, Briggs DJ. 2000, Oxford: Oxford University Press, 5167.
Jarup L, Best N, Toledano M, Wakefield J, Elliott P: Geographical epidemiology of prostate cancer in Great Britain. International journal of cancer. 2002, 97: 695699. 10.1002/ijc.10113.
Smith J: The relationship between maternal work and other socioeconomic factors and child health in Bangladesh. Public health. 1999, 113: 299302. 10.1016/S00333506(99)001845.
Wakefield JC, Best NG, Richardson S, Bernardinelli L, Staines A: Statistical issues in the analysis of disease mapping data. Statistics in Medicine. 2000, 19: 24932519. 10.1002/10970258(20000915/30)19:17/18<2493::AIDSIM584>3.0.CO;2D.
Goddard M, Smith P: Equity of access to health care services: Theory and evidence from the UK. Social Science and Medicine. 2001, 53: 11491162. 10.1016/S02779536(00)004159.
Townsend P, Davidson N, Black D, Morris JN, Smith C: Inequalities in health: the black report. 1982, Penguins: Pelican, 1
Townsend P, Phillimore P, Beattie A: Health and Deprivation: inequalities and the North. 1988, London: Croom helm
Judge A, Welton N, Sandhu J, BenShlomo Y: Equity in access to total joint replacement of the hip and knee in England: cross sectional study. British Medical Journal. 2010, 341: c409210.1136/bmj.c4092.
Raine R, Mclvor M: 9 years on: what progress has been made on achieving UK healthcare equity?. Lance. 2006, 368: 15421545. 10.1016/S01406736(06)692568.
Whynes DK, Philips Z, Avis M: Why do women participate in the English cervical cancer screening programme. Journal of Health Economics. 2007, 26: 306325. 10.1016/j.jhealeco.2006.08.007.
World Health Organization, Cervical cancer.http://www.who.int/reproductivehealth/topics/cancers/en/http://www.who.int/reproductivehealth/topics/cancers/en/
NHS Cervical Screening Programme: A pocket Guide. Sheffield. 2004
World Health Organization, Cancer.http://www.who.int/mediacentre/factsheets/fs297/en/http://www.who.int/mediacentre/factsheets/fs297/en/
Office for National Statistics, Incidence of Health of the Nation cancers by social class.http://info.cancerresearchuk.org/cancerstats/types/cervix/riskfactors/cervicalcancerriskfactorshttp://info.cancerresearchuk.org/cancerstats/types/cervix/riskfactors/cervicalcancerriskfactors
England cervical cancer incidence rate.http://info.cancerresearchuk.org/cancerstats/types/cervix/incidence/http://info.cancerresearchuk.org/cancerstats/types/cervix/incidence/
Sing A, Monaghan J: Lower genital tract precancer: colpocopy, pathology and treatment. 2000, Oxford: WileyBlackwell, Second
AriasPulido H, Joste NE, Vargas H, Wheeler CM: Human papillomavirus type 16 integration in cervical carcinoma in situ and in invasive cervical cancer. Journal of clinical microbiology. 2006, 44: 17551762. 10.1128/JCM.44.5.17551762.2006.
Goldie SJ, Grima D, Kohli M, Wright TC, Weinstein M, Franco E: A comprehensive natural history model of HPV infection and cervical cancer to estimate the clinical impact of a prophylactic HPV 16/18 vaccine. International Journal of cancer. 2003, 106: 896904. 10.1002/ijc.11334.
Jenkins D, SherlawJohnson C, Gallivan S: Can papilloma virus testing be used to improve cervical canceer screening?. International Journal of cancer. 1996, 65: 768773. 10.1002/(SICI)10970215(19960315)65:6<768::AIDIJC10>3.0.CO;20.
England cervical cancer survival rate.http://info.cancerresearchuk.org/cancerstats/types/cervix/survival/?a=5441http://info.cancerresearchuk.org/cancerstats/types/cervix/survival/?a=5441
NHS Cervical Cancer Screening Programmes: Annual Review Report. Sheffield. 2008
NHS Cervical Screening Programme: A pocket Guide. Sheffield. 2009
NHS Cervical Cancer Screening Programmes: Celebrating 20 years of Screening. Sheffield. 2008
Gelman A, Carlin J, Stem H, Rubin D: Bayesian data analysis. 2003, Chapman and Hall, CRC
Lawson A, Browne W, Rodeiro C: Disease mapping with WinBUGS and MLwiN. 2003, Chichester; Wiley
Waller LA, Gotway CA: Applied Spatial Statistics for Public Health Data. 2004, John Wiley & Sons, Inc
Loader C: Local regression and likelihood. 1999, New York; Springer
Akaike H: A new look at the statistical model identification. Anal Biochem. 1987, 162: 156159.
Akaike information criterion statistics: KTK Scientific Publishers. Tokyo. 1986
Sugiura N: Further analysis of the data by Akaike's information criterion and the finite corrections. Communications in Statistics, theory and methods. 1978, A7: 1326.
Hurvich C, Tsai C: Regression and time series model selection in small samples. Biometrika. 1989, 76: 297307. 10.1093/biomet/76.2.297.
Lunn D, Whittaker J, Best N: A Bayesian toolkit for genetic association studies. Genetic Epidemiology. 2006, 30: 231247. 10.1002/gepi.20140.
The English Indices of Deprivation: Technical report, Office of the Deputy Prime Minister (ODPM). 2004, Neighbourhood Renewal Unit, London
Wright G, Noble M: The South African Index of Multiple Deprivation 2007 at Municipality level. 2009, Department: Social Development Republic of South Africa
Lavori P, Dawson R, Shera D: A multiple imputations strategy for clinicaltrials with truncation of patient data. Statistics in medicine. 1995, 14: 19131925. 10.1002/sim.4780141707.
Jackson C, Best N, Richardson S: Hierarchical related regression for combining aggregate and individual data in studies of socioeconomic disease risk factors. Journal of the royal statistical society series A. 2008, 171: 159178.
Herbert A, Smith J: Cervical screening  Women under 25 should be offered screening. British medical journal. 2007, 334: 273273.
Raffle AE: Cervical screening: recent changes in policy regarding age and frequency are a poor use of resources. British Medical Journal. 2004, 328: 12721273. 10.1136/bmj.328.7451.1272.
Ferko N, Debicki D, Barnfi F, Marocco A, Mantovani K: Estimating the longterm health and economic impact of a prophylactic cervical cancer vaccine on the burden of cervical disease in Italy. Value in health. 2007, 10: A440A441.
Hammerschmidt T, Siebery U, Schwarz T, Schneider A, Rogoza R, Ferko N, Welte R: A costeffectiveness analysis of a prophylactic cervical cancer vaccine in Germany: results from a health economic model. Value in health. 2007, 10: A441
Acknowledgements
We thank the APHO for permission to use the cervical cancer incidence data and the cancer registries that supply the data. Prof. Stewart Fotheringham and Dr. Martin Charlton are thanked for help with the GWR software. Dr Jeganathan Chockalingam is thanked for providing GIS support to the research. The GeoData Institute provided support in GIS, and the GIS shape file was provided by EDINA UKBORDERS. This work is based on data provided through EDINA UKBORDERS with the support of the ESRC and JISC and uses boundary material which is copyright of the Crown.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors' contributions
PMA conceived the original idea. PMA and MYEC designed the study, including the choice of methods. MYEC undertook the statistical analysis and drafted an initial manuscript. PMA, AKS and MYEC contributed to writing the final manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Cheng, E.M., Atkinson, P.M. & Shahani, A.K. Elucidating the spatially varying relation between cervical cancer and socioeconomic conditions in England. Int J Health Geogr 10, 51 (2011). https://doi.org/10.1186/1476072X1051
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/1476072X1051