Skip to main content
Log in

Investigating spatio-temporal similarities in the epidemiology of childhood leukaemia and diabetes

  • Pediatric Epidemiology
  • Published:
European Journal of Epidemiology Aims and scope Submit manuscript

Abstract

Childhood acute lymphoblastic leukaemia (ALL) and Type 1 diabetes (T1D) share some common epidemiological features, including rising incidence rates and links with an infectious aetiology. Previous work has shown a significant positive correlation in incidence between the two conditions both at the international and small-area level. The aim was to extend the methodology by including shared spatial and temporal trends using a more extensive dataset among individuals diagnosed with ALL and T1D in Yorkshire (UK) aged 0–14 years from 1978–2003. Cases with ALL and T1D were ascertained from 2 high quality population-based disease registers covering the Yorkshire region of the UK and linked to an electoral ward from the 1991 UK census. A Bayesian model was fitted where similarities and differences in risk profiles of the two diseases were captured by the shared and disease-specific components using a shared-component model, with space-time interactions. The extended model revealed a positive correlation of at least 0.70 between diseases across all time periods, and an increasing risk across time for both diseases, which was more evident for T1D. Furthermore, both diseases exhibited lower rates in the more urban county of West Yorkshire and higher rates in the more rural northern and eastern part of the region. A differential effect of T1D over ALL was found in the south-eastern part of the region, which had a more pronounced association with population mixing than with population density or deprivation. Our approach has demonstrated the utility in modelling temporally and spatially varying disease incidence patterns across small geographical areas. The findings suggest searching for environmental factors that exhibit similar geographical-temporal variation in prevalence may help in the development and testing of plausible aetiological hypotheses. Furthermore, identifying environmental exposures specific to the south-eastern part of the region, especially locally varying risk factors which may differentially affect the development of T1D and ALL, may also be fruitful.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Draper GJ, Kroll ME, Stiller CA. Childhood cancer. In: Doll R, Fraumeni JF, Muir CS, editors. Cancer surveys vol 19/20: trends in cancer incidence and mortality. New York: Cold Spring Harbor Laboratory; 1994. p. 493–517.

    Google Scholar 

  2. Linet MS, Ries LA, Smith MA, Tarone RE, Devesa SS. Cancer surveillance series: recent trends in childhood cancer incidence and mortality in the United States. J Natl Cancer Inst. 1999;91:1051–8.

    Article  CAS  PubMed  Google Scholar 

  3. Onkamo P, Väänänen S, Karvonen M, Tuomilehto J. Worldwide increase in incidence of type I diabetes—the analysis of the data on published incidence trends. Diabetologia. 1999;42:1395–403.

    Article  CAS  PubMed  Google Scholar 

  4. Green A, Patterson CC. Trends in the incidence of childhood-onset diabetes in Europe 1989–1998. Diabetologia. 2001;44(Suppl 3):B3–8.

    Article  PubMed  Google Scholar 

  5. Greaves M. Childhood leukemia. BMJ. 2002;324:283–7.

    Article  PubMed  Google Scholar 

  6. EURODIAB Substudy 2 Study Group. Infections and vaccinations as risk factors for childhood type 1 (insulin dependent) diabetes mellitus: a multi-centre case-control investigation. Diabetologia. 2000;43:47–53.

    Article  Google Scholar 

  7. Feltbower RG, McKinney PA, Greaves MF, Parslow RC, Bodansky HJ. International parallels in leukaemia and diabetes epidemiology. Arch Dis Child. 2004;89:54–6.

    Article  CAS  PubMed  Google Scholar 

  8. Feltbower RG, Manda SOM, Gilthorpe MS, Greaves MF, Parslow RC, Kinsey SE, et al. Detecting small-area similarities in the epidemiology of childhood acute lymphoblastic leukemia and diabetes mellitus, Type 1: a Bayesian approach. Am J Epidemiol. 2005;161:1168–80.

    Article  PubMed  Google Scholar 

  9. Richardson S, Abellan JJ, Best N. Bayesian spatio-temporal analysis of joint patterns of male and female lung cancer risks in Yorkshire (UK). Stat Methods Med Res. 2006;15:385–407.

    Article  PubMed  Google Scholar 

  10. Nobre AA, Schmidt AM, Lopes HF. Spatio-temporal models for mapping the incidence of malaria in Para. Environmetrics. 2005;16:291–304.

    Article  Google Scholar 

  11. McKinney PA, Parslow RC, Lane SA, et al. Epidemiology of childhood brain tumors in Yorkshire, UK 1974–1995: changing patterns of occurrence. Br J Cancer. 1998;78:974–9.

    CAS  PubMed  Google Scholar 

  12. Feltbower RG, McKinney PA, Parslow RC, Stephenson CR, Bodansky HJ. Type 1 diabetes in Yorkshire, UK: time trends in 0–14 and 15–29 year olds, age at onset and age-period-cohort modeling. Diabetic Med. 2003;20:437–41.

    Article  CAS  PubMed  Google Scholar 

  13. Feltbower RG, McNally RJ, Kinsey SE, Lewis IJ, Picton SV, Proctor SJ, et al. Epidemiology of leukaemia and lymphoma in children and young adults from the north of England, 1990–2002. Eur J Cancer. 2009;45:420–7.

    Article  PubMed  Google Scholar 

  14. Besag J, York J, Mollie A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Statist Math. 1991;43:1–59.

    Article  Google Scholar 

  15. Leyland AH, Langford IH, Rabash J, Goldstein H. Multivariate spatial models for event data. Stat Med. 2000;19:2469–78.

    Article  CAS  PubMed  Google Scholar 

  16. Langford IH, Leyland AH, Rasbash J, Goldstein H. Multilevel modeling of the geographical distributions of diseases. J R Stat Soc C. 1999;48:253–68.

    Article  CAS  Google Scholar 

  17. Knorr-Held L, Best NG. A shared component model for detecting joint and selective clustering of two diseases. J R Stat Soc A. 2001;164:73–85.

    Article  Google Scholar 

  18. Held L, Natario I, Fenton SE, Rue H, Becker N. Towards joint disease mapping. Stat Methods Med Res. 2006;14:61–82.

    Article  Google Scholar 

  19. Spiegelhalter D, Thomas A, Best N, Lunn D. BUGS: Bayesian inference using Gibbs sampling, version 1.4. Cambridge: MRC Biostatistics Unit; 2003.

    Google Scholar 

  20. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit (with discussion). J R Stat Soc B. 2002;64:583–640.

    Article  Google Scholar 

  21. Ranta J, Penttinen A. Probabilistic small area risk assessment using GIS-based data: a case study of Finnish childhood diabetes. Stat Med. 2000;19:2345–59.

    Article  CAS  PubMed  Google Scholar 

  22. Greaves M. Etiology of acute leukemia. Lancet. 1997;349:344–9.

    Article  CAS  PubMed  Google Scholar 

  23. Kinlen L. Evidence for an infective cause of childhood leukemia: comparison of a Scottish new town with nuclear reprocessing sites in Britain. Lancet. 1988;2:1323–7.

    Article  CAS  PubMed  Google Scholar 

  24. Wills-Karp M, Santeliz J, Karp CL. The germless theory of allergic disease: revisiting the hygiene hypothesis. Nat Rev Immunol. 2001;1:69–75.

    Article  CAS  PubMed  Google Scholar 

  25. Kline RB. Principles of structural equation modelling. LEA; 2005.

  26. Mollie A. Bayesian mapping of disease. In: Gilks WR, Richardson S, Spiegelhater DJ, editors. Markov chain Monte Carlo in practice. London: Chapman & Hall; 1996. p. 359–79.

    Google Scholar 

  27. Waller LA, Carlin BP, Xia H, Gelfand AE. Hierachical spatio-temporal mapping of disease rates. J Am Stat Assoc. 1997;92:607–17.

    Article  Google Scholar 

  28. Gill P. Spatio-temporal modelling and mapping of teenage birth rates. [Online], available: http://people.ok.ubc.ca/gillpara/research/Poster.pdf. Accessed 13 Dec 2006.

Download references

Acknowledgments

We thank the Office for National Statistics for the provision of population data and special migration statistics. We are grateful to Margaret Buchan, Carolyn Stephenson, and Sheila Jones for data collection and the co-operation of all pediatricians, pediatric oncologists, physicians, Diabetes Specialist Nurses and General Practitioners in Yorkshire. We thank the Candlelighters Trust and Leeds Teaching Hospitals NHS Trust for supporting the costs of running the cancer and diabetes registers. This diabetes registry work was also undertaken by the University of Leeds who received funding from the Department of Health. The views expressed in the publication are those of the authors and not necessarily those of the Department of Health.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samuel O. M. Manda.

Appendix

Appendix

The two diseases spatial temporal model

In the presence of longitudinal data for two diseases, an extension of the BYM model is given by:

$$ \begin{gathered} \log (R_{it1} ) = \alpha_{0t1} + \beta_{t1}^{T} x_{it} + u_{it1} + v_{it1} \hfill \\ \log (R_{it2} ) = \alpha_{0t2} + \beta_{t2}^{T'} x_{it} + u_{it2} + v_{it2} \hfill \\ \end{gathered} $$
(1)

where α 0tj is an intercept term on the log relative risk scale for disease j at time t; x it is a covariate risk vector with the corresponding disease-specific coefficient parameter vector β tj at time t; and u itj and v itj represent unstructured and spatially structured random effects, respectively, in area i at time t for disease j. The unstructured random effects are assumed to follow a normal distribution with time varying variance such that \( u_{itj} \sim N\left( {0,\sigma_{tj}^{2} } \right) \) for j = 1, 2. The spatially structured effects are modelled by the intrinsic conditional autoregressive normal (CAR Normal) prior [14, 26], which specifies a conditional distribution for a specific spatially structured effect v itj as:

$$ f(v_{itj} |v_{ktj} ,k \ne i) = N\left( {{\frac{{\sum\nolimits_{{k \in \Uptheta_{i} }} {W_{iktj} v_{ktj} } }}{{\sum\nolimits_{{k \in \Uptheta_{i} }} {W_{iktj} } }}},{\frac{{\lambda_{tj}^{2} }}{{\sum\nolimits_{{k \in \Uptheta_{i} }} {W_{iktj} } }}}} \right) $$
(2)

at time t for disease j, where Θ i is the set of areas adjacent to area i; W iktj is the weight reflecting spatial dependence between area i and k at time t for disease j and \( \lambda_{tj}^{2} \) is the (conditional) structured variance. For simplicity, we assume that the weighting factor W iktj is constant over time and disease, which seems reasonable as we do not expect the influence of neighbouring areas to change over time and to be different between the two diseases, i.e. W iktj  = W ik . The most common and simplest adjacent specification is to set W ik  = 1 if areas i and k are neighbours that share a common boundary and W ik  = 0 otherwise. Thus, for disease j at time t, a CAR Normal prior specifies the conditional distribution of each area-specific effect v itj , given all the other v’s, to be a normal distribution with mean equal to the average of the v’s of its neighbours, and variance inversely proportional to the number of neighbours; the more neighbours an area has, the greater the precision for that area effect. This scenario is reasonable for population density, as urban areas are likely to have more neighbours that sparsely populated rural areas. For brevity, following [14, 26], a conditional autoregressive normal prior on the spatially structured effects will be denoted by

$$ v_{itj} \sim {\text{CAR Normal}}\left( {W,\lambda_{tj}^{2} } \right) $$

where W is simply the neighbourhood adjacency weight matrix.

A common and specific spatial-temporal model

The spatio-temporal model we consider to estimate relative risks of the two diseases in space and time is

$$ \begin{gathered} \log (R_{it1} ) = \alpha_{01} + \beta_{t1}^{T} x_{it} + \phi_{i} \kappa + \psi_{t} \omega + \zeta_{it} + \varepsilon_{it1} \hfill \\ \log (R_{it2} ) = \alpha_{02} + \beta_{t2}^{T} x_{it} + {\frac{{\phi_{i} }}{\kappa }} + {\frac{{\psi_{t} }}{\omega }} + \zeta_{it} + \theta_{i} + \gamma_{t} + \varepsilon_{it2} \hfill \\ \end{gathered} $$
(3)

where now α 0j is the overall risk for disease j (j = 1, 2), φ i and ψ t are the main shared spatial and temporal effects, respectively; θ i and γ t represent disease 2 (T1D) specific spatial and temporal components, which capture the differential effect between the two diseases; ζ it are the shared space-time interaction effects, capturing departure from space and time main effects, which may highlight space-time clusters of risk, and ɛ itj are the disease-specific heterogeneous effects, capturing possible extra-Poisson variation not explained by terms included above. Thus, the shared-temporal factor model (3) partitions the risk profile for the two diseases into disease-specific components and shared spatial and time components. Parameters κ and ω are included to allow for a differential gradient on the main shared spatial and temporal components, respectively. The ratio κ 2 compares the risk of disease 1 (ALL) to the risk of disease 2 (T1D) associated with the main shared spatial component and the ratio ω 2 compares the risk associated with the main temporal effect.

Ideally, in model (3) we could have split the main shared spatial components into unstructured and structured effects or the main shared temporal component into unstructured and structured time trends or similarly the diseases-specific components. This could have resulted into 8 × 5 = 40 extra random effects, as well as the space-time interaction terms. This seems overly complex and could lead to identifiability problems, so we decided to minimise the number of random effects. We could have also included specific components for disease 1, but such a symmetric formulation can result in identifiability problems [9].

Prior specification

There are various choices of prior distributions for the space-time interaction effects, ζ it . They can simply be taken to be independent for every period and time with a constant variance over time, as in [9], or, depending on the length of the time periods, can evolve smoothly over time with a polynomial temporal trend as in [27, 28]. Alternatively, they can be allowed to vary independently or be spatially structured for every period, with variances that are either independently and identically distributed or temporally structured as proposed in Nobre et al. [10]. In the present study, we define only five periods, too few to show any reliable space-time jumps in risk. Thus, we assume a spatially structured prior distribution at every period for the shared interaction term, \( \zeta_{it} \sim {\text{CAR Normal}}\left( {W,\sigma_{\zeta t}^{2} } \right) \) (i.e. a time-independent prior but linked in space) with variances that are allowed to change smoothly over time,\( \log \sigma_{\varsigma t}^{2} = \log \sigma_{\varsigma (t - 1)}^{2} + e_{t} \), where \( e_{t} \sim N\left( {0,\sigma_{e}^{2} } \right) \). The pair of disease-specific heterogeneity terms \( (\varepsilon_{it1} ,\varepsilon_{it2} )^{T} \)are also assumed to evolve smoothly over time according to a first-order autoregressive (AR(1)) multivariate normal prior distribution with covariance matrix Σ to allow for correlations between the ALL and T1D risks in each space-time unit. Thus, \( (\varepsilon_{i11} ,\varepsilon_{i12} )^{T} \sim {\text{MVN}}(0,\Upsigma ) \) and \( (\varepsilon_{it1} ,\varepsilon_{it2} )^{T} \sim {\text{MVN}}\left( {(\varepsilon_{i(t - 1)1} ,\varepsilon_{i(t - 1)2} ),\Upsigma } \right);t = 2, \ldots ,5 \). Since we are using the CAR Normal prior, with sum-to-zero constraints on the random effect terms, we assign a flat prior on the overall disease risk terms, α 0j .

Rights and permissions

Reprints and permissions

About this article

Cite this article

Manda, S.O.M., Feltbower, R.G. & Gilthorpe, M.S. Investigating spatio-temporal similarities in the epidemiology of childhood leukaemia and diabetes. Eur J Epidemiol 24, 743–752 (2009). https://doi.org/10.1007/s10654-009-9391-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10654-009-9391-2

Keywords

Navigation