An Implementation and Visualization of the Tree-Based Scan Statistic for Safety Event Monitoring in Longitudinal Electronic Health Data
- 66 Downloads
Longitudinal electronic healthcare data hold great potential for drug safety surveillance. The tree-based scan statistic (TBSS), as implemented by the TreeScan® software, allows for hypothesis-free signal detection in longitudinal data by grouping safety events according to branching, hierarchical data coding systems, and then identifying signals of disproportionate recording (SDRs) among the singular events or event groups.
The objective of this analysis was to identify and visualize SDRs with the TBSS in historical data from patients using two antifungal drugs, itraconazole or terbinafine. By examining patients who used either itraconazole or terbinafine, we provide a conceptual replication of a previous TBSS analyses by varying methodological choices and using a data source that had not been previously used with the TBSS, i.e., the Optum Clinformatics™ claims database. With this analysis, we aimed to test a parsimonious design that could be the basis of a broadly applicable method for multiple drug and safety event pairs.
The TBSS analysis was used to examine incident events and any itraconazole or terbinafine use among US-based patients from 2002 through 2007. Event frequencies before and after the first day of drug exposure were compared over 14- and 56-day periods of observation in a Bernoulli model with a self-controlled design. Safety events were classified into a hierarchical tree structure using the Clinical Classifications Software (CCS) which mapped International Classification of Diseases, 9th Revision (ICD-9) codes to 879 diagnostic groups. Using the TBSS, the log likelihood ratio of observed versus expected events in all groups along the CCS hierarchy were compared, and groups of events that occurred at disproportionally high frequencies were identified as potential SDRs; p-values for the potential SDRs were estimated with Monte-Carlo permutation based methods. Output from TreeScan® was visualized and plotted as a network which followed the CCS tree structure.
Terbinafine use (n = 223,968) was associated with SDRs for diseases of the circulatory system (14- and 56-day p = 0.001) and heart (14-day p = 0.026 and 56-day p = 0.001) as well as coronary atherosclerosis and other heart disease (14-day p = 0.003 and 56-day p = 0.004). For itraconazole use (n = 36,025), the TBSS identified SDRs for coronary atherosclerosis and other heart disease (p = 0.002) and complications of an implanted or grafted device (14-day p = 0.001 and 56-day p < 0.05). Use of both drugs was associated with SDRs for diseases of the digestive system at 14 days (p < 0.05) and this SDR had been observed among terbinafine users in a previous TBSS analysis with a different data source. The TreeScan® visualization facilitated the identification of the atherosclerosis and other heart disease SDRs as well as highlighting the consistency of the SDR for diseases of the digestive system across drugs and data sources.
With the TBSS, we identified potential SDRs related to the circulatory system that may reflect the cardiac risk that was described in the itraconazole product label. SDRs for diseases of the digestive system among terbinafine users were also reported in a previous signal detection analysis, although other SDRs from the previous publications were not replicated. The TBSS visualizations aided in the understanding and interpretation of the TBSS output, including the comparisons to the previous publications. In this conceptual replication, differences in the results observed in our analysis and the previous analyses could be attributable to variation in modeling and design choices as well as factors that were intrinsic to the underlying data sources. The broad consistency, but far from perfect concordance, of our results with the known safety profile of these antifungals including the risks from the itraconazole product label supports the rationale for continued investigations of signal detection methods across differing data sources and populations.
We would like to thank Richard Gong for his support in the creation of the TreeScan® visualizations and careful inspection of the SAS® code that generated the input data for TreeScan®.
Compliance with Ethical Standards
Conflict of interest
Stephen Edward Schachterle, Qing Liu, Kenneth R. Petronis, and Andrew Bate are full-time employees of Pfizer and hold Pfizer stocks and stock options. Sharon Hurley was a full-time contract employee of Pfizer at the time of her contribution.
No sources of external funding were used to assist in the preparation of this study.
- 8.Maro JC, Brown JS, Dal Pan GJ, Kulldorff M. Minimizing signal detection time in postmarket sequential analysis: balancing positive predictive value and sensitivity. Pharmacoepidemiol Drug Saf. 2014;23(8):839–48.Google Scholar
- 13.Yih KW, Nguyen M, Maro JC, Baker M, Balsbaugh C, Brown J, et al. Mini-sentinel CBER/PRISM methods protocol: pilot of self-controlled tree-temporal scan analysis for gardasil vaccine. Version 2.0. https://www.sentinelinitiative.org/sites/default/files/Methods/Mini-Sentinel_PRISM_Pilot-Self-Controlled-Tree-Temporal-Scan-Analysis-Gardasil-Vaccine-Protocol_0.pdf. Accessed 20 Dec 2015.
- 15.Maro JC, Dashevsky I, Kulldorff M. Postlicensure medical product safety data-mining: power calculations for Bernoulli Data. Sentinel Methods Report. 2017. https://www.sentinelinitiative.org/sites/default/files/vaccines-blood-biologics/assessments/TreeScanPower_FinalReport.pdf. Accessed 16 Nov 2018.
- 17.Wang SV, Schneeweiss S, Berger ML, Brown J, de Vries F, Douglas I, et al.; Joint ISPE-ISPOR Special Task Force on Real World Evidence in Health Care Decision Making. Reporting to improve reproducibility and facilitate validity assessment for healthcare database studies V1.0. Pharmacoepidemiol Drug Saf. 2017;26(9):1018–32.Google Scholar
- 18.R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2014. http://www.R-project.org/. Accessed 12 Sept 2018.
- 19.Stergachis A, Saunders KW, Davis RL, Kimmel SE, Schinnar R, Chan KA, et al. Examples of automated databases. In: Strom B, Kimmel SE, editors. Textbook of pharmacoepidemiology. Chichester: Wiley; 2013. p. 173–214.Google Scholar
- 21.Elixhauser A, Steiner CA, Whittington C, et al. Clinical classifications for health policy research: hospital inpatient statistics, 1995. Healthcare Cost and Utilization Project, HCUP 3 Research Note. Rockville, MD: Agency for Health Care Policy and Research; 1998. AHCPR Pub. No. 98-0049.Google Scholar
- 22.Elixhauser A, Steiner CA. Hospital inpatient statistics, 1996. Healthcare Cost and Utilization Project (HCUP) Research Note. Rockville, MD: Agency for Health Care Policy and Research; 1999. AHCPR Pub. No. 99-0034.Google Scholar
- 23.Cowen ME, Dusseau DJ, Toth BG, Guisinger C, Zodet MW, Shyr Y. Casemix adjustment of managed care claims data using the clinical classification for health policy research method. Med Care. 1998:1108–13.Google Scholar
- 24.Elixhauser A, McCarthy E. Clinical classifications for health policy research, version 2: hospital inpatient statistics. Rockville: US Department of Health and Human Services, Public Health Service, Agency for Health Care Policy and Research; 1996.Google Scholar
- 25.Duffy S, Elixhauser A, Sommers JP. Diagnosis and procedure combinations in hospital inpatient data. Rockville: US Department of Health and Human Services, Public Health Service, Agency for Health Care Policy and Research; 1996.Google Scholar
- 27.Lamisil (terbinafine hydrochloride) [package insert]. New Jersey: Novartis Pharmaceuticals Corporation; 2011.Google Scholar
- 28.Sporanox (itraconazole) [package insert]. Beerse: Janssen Pharmaceuticals, Inc.; 2003.Google Scholar
- 39.ICD-9-CM: International classification of diseases, 9th revision, clinical modification. Salt Lake City: Medicode; 1996.Google Scholar