Abstract
Population-based cancer registry data are invaluable tools for cancer epidemiology. However, they present some challenges for the statistical analysis to derive cancer-specific quantities useful for patients, carers and policymakers. In this chapter, we present these challenges as well as some of the statistical methods used and developed in this context, referred to as ‘relative survival’ methods. We start by detailing the main assumption behind the relative survival methods. We present the non-parametric estimation of ‘net survival’, which can be interpreted as the survival probability of cancer patients once other causes of death have been removed. Then, we detail the general principles of fitting hazard-based regression models for modelling the excess mortality hazard. Finally, we present an extension of regression models for the excess mortality hazard when there is a hierarchical structure in the data, which proves to be useful when the main aim is to investigate socioeconomic disparities in cancer survival. We illustrate the methods using real data for 15–75-year-old men diagnosed with a lip–oral cavity–pharynx cancer between 1997 and 2010 in a French region.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Antunes L, Mendonça D, Bento MJ, Rachet B. No inequalities in survival from colorectal cancer by education and socioeconomic deprivation - a population-based study in the North Region of Portugal, 2000–2002. BMC Cancer. 2016;16:608. https://doi.org/10.1186/s12885-016-2639-9.
Belot A, Remontet L, Rachet B, et al. Describing the association between socioeconomic inequalities and cancer survival: methodological guidelines and illustration with population-based data. Clin Epidemiol. 2018;10:561–73. https://doi.org/10.2147/CLEP.S150848.
Belot A, Ndiaye A, Luque-Fernandez M-A, et al. Summarizing and communicating on survival data according to the audience: a tutorial on different measures illustrated with population-based cancer registry data. Clin Epidemiol. 2019. https://www.dovepress.com/summarising-and-communicating-on-survival-data-according-to-the-audien-peer-reviewed-fulltext-article-CLEP. Accessed 4 Feb 2020.
Berkson J, Gage RP. Calculation of survival rates for cancer. Proc Staff Meet Mayo Clin. 1950;25:270–86.
Bolard P, Quantin C, Esteve J, et al. Modelling time-dependent hazard ratios in relative survival: application to colon cancer. J Clin Epidemiol. 2001;54:986–96. https://doi.org/10.1016/S0895-4356(01)00363-8.
Brenner H, Hakulinen T. On crude and age-adjusted relative survival rates. J Clin Epidemiol. 2003;56:1185–91. https://doi.org/10.1016/S0895-4356(03)00209-9.
Brenner H, Hakulinen T. Age adjustment of cancer survival rates: methods, point estimates and standard errors. Br J Cancer. 2005;93:372–5. https://doi.org/10.1038/sj.bjc.6602704.
Brewster DH, Coebergh J-W, Storm HH. Population-based cancer registries: the invisible key to cancer control. Lancet Oncol. 2005;6:193–5. https://doi.org/10.1016/S1470-2045(05)70071-1.
Buckland ST, Burnham KP, Augustin NH. Model selection: an integral part of inference. Biometrics. 1997;53:603–18. https://doi.org/10.2307/2533961.
Buckley JD. Additive and multiplicative models for relative survival rates. Biometrics. 1984;40:51–62.
Burnham KP, Anderson DR. Model selection and multimodel inference: a practical information-theoretic approach. 2nd ed. New York: Springer-Verlag; 2002.
Carstairs V, Morris R. Deprivation: explaining differences in mortality between Scotland and England and Wales. BMJ. 1989;299:886–9.
Charvat H, Remontet L, Bossard N, et al. A multilevel excess hazard model to estimate net survival on hierarchical data allowing for non-linear and non-proportional effects of covariates. Stat Med. 2016;35:3066–84. https://doi.org/10.1002/sim.6881.
Corazziari I, Quinn M, Capocaccia R. Standard cancer patient population for age standardising survival ratios. Eur J Cancer Oxf Engl. 2004;1990(40):2307–16. https://doi.org/10.1016/j.ejca.2004.07.002.
Cortese G, Scheike TH. Dynamic regression hazards models for relative survival. Stat Med. 2008;27:3563–84. https://doi.org/10.1002/sim.3242.
Danieli C, Remontet L, Bossard N, et al. Estimating net survival: the importance of allowing for informative censoring. Stat Med. 2012;31:775–86. https://doi.org/10.1002/sim.4464.
Danieli C, Bossard N, Roche L, et al. Performance of two formal tests based on martingales residuals to check the proportional hazard assumption and the functional form of the prognostic factors in flexible parametric excess hazard models. Biostatistics. 2017;18:505–20. https://doi.org/10.1093/biostatistics/kxw056.
De Angelis R, Francisci S, Baili P, et al. The EUROCARE-4 database on cancer survival in Europe: data standardisation, quality control and methods of statistical analysis. Eur J Cancer. 2009;45:909–30. https://doi.org/10.1016/j.ejca.2008.11.003.
Dickman PW, Sloggett A, Hills M, Hakulinen T. Regression models for relative survival. Stat Med. 2004;23:51–64. https://doi.org/10.1002/sim.1597.
Diez Roux AV. Investigating neighborhood and area effects on health. Am J Public Health. 2001;91:1783–9.
Diez Roux AV. A glossary for multilevel analysis. J Epidemiol Community Health. 2002;56:588–94. https://doi.org/10.1136/jech.56.8.588.
Diez-Roux AV. Bringing context back into epidemiology: variables and fallacies in multilevel analysis. Am J Public Health. 1998;88:216–22.
Diggle P, Heagerty P, Liang K-Y, Zeger S. Analysis of longitudinal data. 2nd ed. Oxford/New York: Oxford University Press; 2002.
Duchateau L, Janssen P. The frailty model. New York: Springer-Verlag; 2008.
Ederer F, Axtell LM, Cutler SJ. The relative survival rate: a statistical methodology. Natl Cancer Inst Monogr. 1961;6:101–21.
Efron B. Estimation and accuracy after model selection. J Am Stat Assoc. 2014;109:991–1007. https://doi.org/10.1080/01621459.2013.823775.
Estève J, Benhamou E, Croasdale M, Raymond L. Relative survival and the estimation of net survival: elements for further discussion. Stat Med. 1990;9:529–38. https://doi.org/10.1002/sim.4780090506.
Fauvernier M, Roche L, Uhry Z, et al. Multi-dimensional penalized hazard model with continuous covariates: applications for studying trends and social inequalities in cancer survival. J R Stat Soc Ser C Appl Stat. 2019;68:1233–57. https://doi.org/10.1111/rssc.12368.
Gauthier J, Wu QV, Gooley TA. Cubic splines to model relationships between continuous variables and outcomes: a guide for clinicians. Bone Marrow Transplant. 2019. https://doi.org/10.1038/s41409-019-0679-x.
Giorgi R, Abrahamowicz M, Quantin C, et al. A relative survival regression model using B-spline functions to model non-proportional hazards. Stat Med. 2003;22:2767–84. https://doi.org/10.1002/sim.1484.
Gondos A, Parkin DM, Chokunonga E, Brenner H. Calculating age-adjusted cancer survival estimates when age-specific data are sparse: an empirical evaluation of various methods. Br J Cancer. 2006;94:450–4. https://doi.org/10.1038/sj.bjc.6602976.
Goungounga JA, Touraine C, Grafféo N, et al. Correcting for misclassification and selection effects in estimating net survival in clinical trials. BMC Med Res Methodol. 2019;19:104. https://doi.org/10.1186/s12874-019-0747-3.
Grafféo N, Castell F, Belot A, Giorgi R. A log-rank-type test to compare net survival distributions. Biometrics. 2016;72:760–9. https://doi.org/10.1111/biom.12477.
Guillaume E, Pornet C, Dejardin O, et al. Development of a cross-cultural deprivation index in five European countries. J Epidemiol Community Health. 2016;70:493–9. https://doi.org/10.1136/jech-2015-205729.
Hakulinen T. Cancer survival corrected for heterogeneity in patient withdrawal. Biometrics. 1982;38:933–42.
Hakulinen T, Tenkanen L. Regression analysis of relative survival rates. J R Stat Soc Ser C Appl Stat. 1987;36:309–17. https://doi.org/10.2307/2347789.
Hardin JW, Hilbe JM. Generalized estimating equations. 2nd ed. Boca Raton: CRC Press; 2013.
Hu FB, Goldberg J, Hedeker D, et al. Comparison of population-averaged and subject-specific approaches for analyzing repeated binary outcomes. Am J Epidemiol. 1998;147:694–703. https://doi.org/10.1093/oxfordjournals.aje.a009511.
Hubbard AE, Ahern J, Fleischer NL, et al. To GEE or not to GEE: comparing population average and mixed models for estimating the associations between neighborhood risk factors and health. Epidemiology Camb Mass. 2010;21:467–74. https://doi.org/10.1097/EDE.0b013e3181caeb90.
Ito Y, Nakaya T, Nakayama T, et al. Socioeconomic inequalities in cancer survival: a population-based study of adult patients diagnosed in Osaka, Japan, during the period 1993–2004. Acta Oncol. 2014;53:1423–33. https://doi.org/10.3109/0284186X.2014.912350.
Johnson CJ, Hahn CG, Fink AK, German RR. Variability in cancer death certificate accuracy by characteristics of death certifiers. Am J Forensic Med Pathol. 2012;33:137–42. https://doi.org/10.1097/PAF.0b013e318219877e.
Lambert PC, Smith LK, Jones DR, Botha JL. Additive and multiplicative covariate regression models for relative survival incorporating fractional polynomials for time-dependent effects. Stat Med. 2005;24:3871–85. https://doi.org/10.1002/sim.2399.
Launoy G, Launay L, Dejardin O, et al. European Deprivation Index: designed to tackle socioeconomic inequalities in cancer in Europe. Eur J Public Health. 2018;28. https://doi.org/10.1093/eurpub/cky213.625.
Li R, Daniel R, Rachet B. How much do tumor stage and treatment explain socioeconomic inequalities in breast cancer survival? Applying causal mediation analysis to population-based data. Eur J Epidemiol. 2016;31:603–11. https://doi.org/10.1007/s10654-016-0155-5.
Mant J, Wilson S, Parry J, et al. Clinicians didn’t reliably distinguish between different causes of cardiac death using case histories. J Clin Epidemiol. 2006;59:862–7. https://doi.org/10.1016/j.jclinepi.2005.11.021.
Maringe C, Belot A, Rubio FJ, Rachet B. Comparison of model-building strategies for excess hazard regression models in the context of cancer epidemiology. BMC Med Res Methodol. 2019;19:210. https://doi.org/10.1186/s12874-019-0830-9.
Morris R, Carstairs V. Which deprivation? A comparison of selected deprivation indexes. J Public Health Med. 1991;13:318–26.
Nelson CP, Lambert PC, Squire IB, Jones DR. Flexible parametric models for relative survival, with application in coronary heart disease. Stat Med. 2007;26:5486–98. https://doi.org/10.1002/sim.3064.
Oliveira IRC, Molenberghs G, Demétrio CGB, et al. Quantifying intraclass correlations for count and time-to-event data. Biom J Biom Z. 2016;58:852–67. https://doi.org/10.1002/bimj.201500093.
Pavlič K, Perme MP. On comparison of net survival curves. BMC Med Res Methodol. 2017;17:79. https://doi.org/10.1186/s12874-017-0351-3.
Percy C, Stanek E, Gloeckler L. Accuracy of cancer death certificates and its effect on cancer mortality statistics. Am J Public Health. 1981;71:242–50.
Perme MP, Pavlic K. Nonparametric relative survival analysis with the R package relsurv. J Stat Softw. 2018;87:1–27. https://doi.org/10.18637/jss.v087.i08.
Perme MP, Stare J, Estève J. On estimation in relative survival. Biometrics. 2012;68:113–20. https://doi.org/10.1111/j.1541-0420.2011.01640.x.
Perperoglou A, Sauerbrei W, Abrahamowicz M, Schmid M. A review of spline function procedures in R. BMC Med Res Methodol. 2019;19. https://doi.org/10.1186/s12874-019-0666-3.
Piffaretti C, Moreno-Betancur M, Lamarche-Vadel A, Rey G. Quantifying cause-related mortality by weighting multiple causes of death. Bull World Health Organ. 2016;94:870–9. https://doi.org/10.2471/BLT.16.172189.
Pohar Perme M, Estève J, Rachet B. Analysing population-based cancer survival - settling the controversies. BMC Cancer. 2016;16:933. https://doi.org/10.1186/s12885-016-2967-9.
Pornet C, Delpierre C, Dejardin O, et al. Construction of an adaptable European transnational ecological deprivation index: the French version. J Epidemiol Community Health. 2012;66:982–9. https://doi.org/10.1136/jech-2011-200311.
Price DL, Manatunga AK. Modelling relative survival using transformation methods. Stat Med. 2004;23:2463–77. https://doi.org/10.1002/sim.1832.
Rabe-Hesketh S, Skrondal A. Multilevel and longitudinal modeling using Stata. In: Volume I: continuous responses. 3rd ed. College Station: Stata Press Publication; 2012a.
Rabe-Hesketh S, Skrondal A. Multilevel and longitudinal modeling using Stata. In: Volume II: categorical responses, counts, and survival. 3rd ed. College Station: Stata Press Publication; 2012b.
Remontet L, Bossard N, Belot A, Estève J. An overall strategy based on regression models to estimate relative survival and model the effects of prognostic factors in cancer survival studies. Stat Med. 2007;26:2214–28. https://doi.org/10.1002/sim.2656.
Royston P, Sauerbrei W. Multivariable modeling with cubic regression splines: a principled approach. Stata J. 2007. https://doi.org/10.1177/1536867X0700700103.
Rubio FJ, Rachet B, Giorgi R, et al. On models for the estimation of the excess mortality hazard in case of insufficiently stratified life tables. Biostatistics kxz017. 2019a. https://doi.org/10.1093/biostatistics/kxz017.
Rubio FJ, Remontet L, Jewell NP, Belot A. On a general structure for hazard-based regression models: an application to population-based cancer research. Stat Methods Med Res. 2019b;28:2404–17. https://doi.org/10.1177/0962280218782293.
Sasieni P, Brentnall AR. On standardized relative survival. Biometrics. 2017;73:473–82. https://doi.org/10.1111/biom.12578.
Sloggett A, Young H, Grundy E. The association of cancer survival with four socioeconomic indicators: a longitudinal study of the older population of England and Wales 1981–2000. BMC Cancer. 2007;7:20. https://doi.org/10.1186/1471-2407-7-20.
Stare J, Pohar M, Henderson R. Goodness of fit of relative survival models. Stat Med. 2005;24:3911–25. https://doi.org/10.1002/sim.2414.
Subramanian SV. The relevance of multilevel statistical methods for identifying causal neighborhood effects. Soc Sci Med. 2004;58:1961–7. https://doi.org/10.1016/S0277-9536(03)00415-5.
Therneau TM, Grambsch PM. Modeling survival data: extending the cox model. New York: Springer New York; 2000.
Touraine C, Grafféo N, Giorgi R, CENSUR Working Survival Group. More accurate cancer-related excess mortality through correcting background mortality for extra variables. Stat Methods Med Res. 2020;29:122–36. https://doi.org/10.1177/0962280218823234.
Townsend P, Phillimore P, Beattie A. Health and deprivation: inequality and the north. Croom Helm; 1988.
Weller EA, Feuer EJ, Frey CM, Wesley MN. Parametric relative survival regression using generalized linear models with application to Hodgkin’s lymphoma. J R Stat Soc Ser C Appl Stat. 1999;48:79–89. https://doi.org/10.1111/1467-9876.00141.
Woods LM, Rachet B, Coleman MP. Choice of geographic unit influences socioeconomic inequalities in breast cancer survival. Br J Cancer. 2005;92:1279–82. https://doi.org/10.1038/sj.bjc.6602506.
Wynant W, Abrahamowicz M. Impact of the model-building strategy on inference about nonlinear and time-dependent covariate effects in survival analysis. Stat Med. 2014;33:3318–37. https://doi.org/10.1002/sim.6178.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Belot, A., Pohar-Perme, M. (2021). Social Disparities in Cancer Survival: Methodological Considerations. In: Launoy, G., Zadnik, V., Coleman, M.P. (eds) Social Environment and Cancer in Europe. Springer, Cham. https://doi.org/10.1007/978-3-030-69329-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-69329-9_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-69328-2
Online ISBN: 978-3-030-69329-9
eBook Packages: MedicineMedicine (R0)