Is it possible to estimate the incidence of breast cancer from medico-administrative databases?
One approach to estimating cancer incidence in the French Départements is to quantify the relationship between data in cancer registries and data obtained from the PMSI (Programme de Médicalisation des Systèmes d’Information Médicale). This relationship may then be used in Départements without registries to infer the incidence from local PMSI data. We present here some methodological solutions for applying this approach. Data on invasive breast cancer for 2002 were obtained from 12 Départemental registries. The number of hospital stays was obtained from the national PMSI using two different algorithms, one based on the main diagnosis only (Algorithm 1) and the other on that diagnosis combined with a mention of “resection” (Algorithm 2). Taking registry data as the gold standard, a calibration approach was used to model the ratio of the number of hospital stays to the number of incident cases. In Départements with registries, the predictions were validated through cross-validation. In Départements without registries, validation relied on a study of the homogeneity of the mean number of hospital stays per patient. Cross-validation showed that the estimates predicted by the model were accurate with data extracted by Algorithm 1 but not with data extracted by Algorithm 2. However, with Algorithm 1, the mean number of hospital stays per patient was markedly heterogeneous between French Départements, which had a substantial impact on the estimates. In the near future, the method will allow medico-administrative data (after calibration with registry data) to be used to estimate the Départemental incidence of selected cancers.
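The calibration idea described above can be illustrated with a minimal sketch. It assumes a single pooled ratio of hospital stays to incident cases, which is a deliberate simplification and not the statistical model used in the study; the function names and all counts are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: a pooled stays-to-cases ratio with leave-one-out
# cross-validation. This is NOT the authors' calibration model, and every
# number below is made up.

def pooled_ratio(stays, cases):
    """Ratio of total hospital stays to total registry incident cases."""
    return sum(stays) / sum(cases)


def predict_cases(stays, ratio):
    """Infer the number of incident cases from a count of hospital stays."""
    return stays / ratio


def leave_one_out(stays, cases):
    """Hold out each Département in turn, calibrate on the others,
    and return (predicted cases, relative error) for the held-out one."""
    results = []
    for i in range(len(stays)):
        train_stays = stays[:i] + stays[i + 1:]
        train_cases = cases[:i] + cases[i + 1:]
        ratio = pooled_ratio(train_stays, train_cases)
        predicted = predict_cases(stays[i], ratio)
        rel_error = (predicted - cases[i]) / cases[i]
        results.append((predicted, rel_error))
    return results


# Hypothetical Départements with a registry (stays from PMSI, cases from registry).
registry_stays = [1200, 800, 1500, 950]
registry_cases = [600, 400, 750, 500]

# Step 1: check the calibration where the gold standard is available.
for predicted, rel_error in leave_one_out(registry_stays, registry_cases):
    print(f"predicted={predicted:.0f} relative error={rel_error:+.1%}")

# Step 2: apply the calibrated ratio to a hypothetical Département without a registry.
ratio = pooled_ratio(registry_stays, registry_cases)
print(f"estimated incident cases: {predict_cases(1100, ratio):.0f}")
```

A single pooled ratio is only trustworthy if the mean number of hospital stays per patient is similar across Départements, which is the homogeneity condition the abstract identifies as problematic for Algorithm 1.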
Keywords: Breast cancer; Incidence; Cancer registries; Claims database; Prediction; Statistical modelling
PMSI: Programme de Médicalisation des Systèmes d’Information Médicale
ICD-O: International Classification of Diseases-Oncology