Lessons Learned Regarding Missing Clinical Stage in the National Cancer Database

Hoskin, Tanya L.; Boughey, Judy C.; Day, Courtney N.; Habermann, Elizabeth B.

doi:10.1245/s10434-018-07128-3

Health Services Research and Global Oncology
Published: 04 January 2019

Annals of Surgical Oncology, Volume 26, pages 739–745 (2019)

Tanya L. Hoskin MS¹, Judy C. Boughey MD², Courtney N. Day BS¹ & Elizabeth B. Habermann PhD¹,²,³

Abstract

Background

The National Cancer Database (NCDB) is a valuable resource for studying national cancer treatment patterns. However, data abstraction rules from 2004 to 2007 resulted in missing clinical stage for a high percentage of cases. We investigated how this missingness can bias results in breast cancer studies including patients treated with neoadjuvant chemotherapy (NAC).

Methods

The impact of missing clinical stage on the estimated percentage of breast cancers treated with NAC versus adjuvant chemotherapy (AC) was examined from 2004 to 2013. Trends in NAC use were presented, excluding those cases with missing clinical stage, and compared with trends after multiple imputation, performed using the chained equations approach with predictive mean matching.
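As a rough illustration of predictive mean matching, the following Python sketch imputes a missing ordinal stage from a single predictor. It is not the authors' code: the tumor sizes, the stage values, the function names `fit_line` and `pmm_impute`, and the choice of k = 3 donors are all hypothetical, and a bootstrap refit stands in for the parameter draw used by full implementations. With only one incomplete variable, the cycling across variables that gives chained equations its name is not shown.

```python
import random
import statistics


def fit_line(xs, ys):
    """Ordinary least squares for one predictor; returns (intercept, slope)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    if sxx == 0:  # degenerate bootstrap sample: fall back to a flat line
        return my, 0.0
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    return my - slope * mx, slope


def pmm_impute(xs, ys, k=3, rng=random):
    """Return a copy of ys with None entries filled by predictive mean matching."""
    observed = [(x, y) for x, y in zip(xs, ys) if y is not None]
    # Refit on a bootstrap sample so that each imputation uses a different model
    # (a simple stand-in for drawing regression parameters from their posterior).
    boot = [rng.choice(observed) for _ in observed]
    intercept, slope = fit_line([x for x, _ in boot], [y for _, y in boot])

    def predict(x):
        return intercept + slope * x

    filled = []
    for x, y in zip(xs, ys):
        if y is not None:
            filled.append(y)
            continue
        # Donors: the k observed cases whose predicted value is closest.
        donors = sorted(observed, key=lambda d: abs(predict(d[0]) - predict(x)))[:k]
        # Copy the observed value of one randomly chosen donor.
        filled.append(rng.choice(donors)[1])
    return filled


# Hypothetical data: tumor size (mm) and clinical stage, None = missing stage.
sizes = [8, 12, 15, 22, 25, 31, 38, 45, 52, 60, 10, 28, 48]
stages = [1, 1, 1, 2, 2, 2, 2, 3, 3, 3, None, None, None]

rng = random.Random(0)
imputations = [pmm_impute(sizes, stages, k=3, rng=rng) for _ in range(5)]

# Point estimate per completed data set, then a simple average across them.
stage3_shares = [imp.count(3) / len(imp) for imp in imputations]
pooled_stage3_share = statistics.fmean(stage3_shares)
print(f"Pooled stage III share: {pooled_stage3_share:.3f}")
```

Because each imputed value is copied from an observed donor, imputed stages always fall within the observed categories. A full analysis would also combine the variances across the completed data sets, typically with Rubin's rules; the sketch averages point estimates only.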

Results

Clinical stage was missing for 56% of cases in 2004–2007 versus 12% in 2008–2013, and was missing more than twice as often for AC patients as for NAC patients (31% vs. 12% overall), with the largest difference occurring in 2004–2007 (60% vs. 27% missing). Because stage was more frequently missing in AC patients, excluding cases with missing clinical stage introduced bias when considering NAC versus AC trends. With multiple imputation, significant increases in NAC use were identified between 2004 and 2013 for each stage: from 2% to 5% for stage I, from 11% to 24% for stage II, and from 34% to 46% for stage III. In contrast, an analysis excluding cases with missing stage suggested little or no increase within any stage.
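The direction of this bias can be checked with simple arithmetic. The Python sketch below applies the missingness rates reported above to a hypothetical true NAC share of 10%; that share, the function name `complete_case_nac_share`, and the assumption that missingness depends only on treatment group are illustrative choices, not results from the study.

```python
def complete_case_nac_share(true_nac_share, miss_nac, miss_ac):
    """NAC share observed after dropping cases with missing clinical stage."""
    nac_kept = true_nac_share * (1.0 - miss_nac)
    ac_kept = (1.0 - true_nac_share) * (1.0 - miss_ac)
    return nac_kept / (nac_kept + ac_kept)


TRUE_SHARE = 0.10  # hypothetical true proportion of chemotherapy patients given NAC

# Missingness rates taken from the Results: NAC vs. AC patients.
early = complete_case_nac_share(TRUE_SHARE, miss_nac=0.27, miss_ac=0.60)    # 2004-2007
overall = complete_case_nac_share(TRUE_SHARE, miss_nac=0.12, miss_ac=0.31)  # 2004-2013

print(f"True NAC share:                    {TRUE_SHARE:.3f}")
print(f"Complete-case share, 2004-2007:    {early:.3f}")    # 0.169
print(f"Complete-case share, overall rate: {overall:.3f}")  # 0.124
```

Under these assumptions, dropping incomplete cases inflates the apparent NAC share most in the early years, when the gap in missingness is largest. An inflated early baseline is one way a real upward trend could appear flat in a complete-case analysis.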

Conclusion

NCDB data abstraction rules from 2004 to 2007 resulted in missing clinical stage for more than 50% of breast cancers, which may introduce substantial bias. Multiple imputation or exclusion of the years 2004–2007 should be considered to mitigate the problem of missing clinical stage in the NCDB.

Acknowledgment

The NCDB is a joint project of the Commission on Cancer (CoC) of the American College of Surgeons and the American Cancer Society. The CoC’s NCDB, and the hospitals participating in the CoC NCDB, are the source of the de-identified data used herein; they have not verified and are not responsible for the statistical validity of the data analysis or the conclusions derived by the authors.

Funding

The Mayo Clinic Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery provides salary support for Dr. Habermann, Ms. Hoskin, and Ms. Day. No external funding was used.

Author information

Authors and Affiliations

Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Tanya L. Hoskin MS, Courtney N. Day BS & Elizabeth B. Habermann PhD
Department of Surgery, Mayo Clinic, Rochester, MN, USA
Judy C. Boughey MD & Elizabeth B. Habermann PhD
Robert D. and Patricia E. Kern Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA
Elizabeth B. Habermann PhD

Corresponding author

Correspondence to Tanya L. Hoskin MS.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Hoskin, T.L., Boughey, J.C., Day, C.N. et al. Lessons Learned Regarding Missing Clinical Stage in the National Cancer Database. Ann Surg Oncol 26, 739–745 (2019). https://doi.org/10.1245/s10434-018-07128-3

Received: 24 July 2018
Published: 04 January 2019
Issue Date: 15 March 2019
DOI: https://doi.org/10.1245/s10434-018-07128-3
