Background

Gastric cancer continues to be a major global health threat, having accounted for 0.8 million deaths and 19.1 million disability-adjusted life-years (DALYs) in 2017 [1]. Results from a recent meta-analysis, which demonstrated a 40% reduction in risk of death from gastric cancer with endoscopic screening [2], shed light on the opportunity to reduce the burden of gastric cancer through effective screening policy. With the third-highest rate of gastric cancer incidence globally [3], Japan introduced a national endoscopic screening program in 2015, offering biennial and triennial endoscopic screening for people older than 50 years [4]. Understanding the trade-offs in lifetime benefits and costs of current screening guidelines, as opposed to alternative screening strategies, at the population level is a vital input into dialogues on cancer control policy. However, with a paucity of empirical evidence and longitudinal data, the lifetime cost-effectiveness of population-wide screening strategies with different screening intervals at various starting and stopping ages remains unclear.

To inform screening policy in a timely fashion, microsimulation decision models can estimate the long-term consequences of a large number of potential policies that are not routinely examined in empirical studies [5, 6]. Here, to identify which strategies might deliver cost-effective care, we developed a microsimulation model which incorporates the best available data to estimate the lifetime cost-effectiveness of various national endoscopic screening scenarios while accounting for individual-level heterogeneity in gastric cancer risk.

Methods

Model description

We synthesized information from nationally representative data sets on demographics; prevalence of risk exposure; cancer incidence, mortality, and survival; and endoscopic screening (Additional file 1: Table S1) [7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55]. Using these data, we then developed a population-based microsimulation model of gastric cancer and created a virtual population with individual risk profiles and life expectancy which were representative of the population of Japan (Fig. 1).

Fig. 1
figure 1

Overview of the modeling logic. Note: Parallelograms represent data input, stadiums represent the process of modeling, diamonds represent intervention, and square indicates model output

The natural history of disease progression was simulated on the basis of Correa’s cascade of gastric carcinogenesis (Fig. 2) [11]. As a simulated individual ages, precancerous lesions (atrophic gastritis, intestinal metaplasia, or dysplasia) may develop. The model focused on non-cardia intestinal-type gastric adenocarcinoma (NCGA), the major histologic type of gastric cancer [12]. The model allows individual risk profiles (smoking behavior and Helicobacter pylori infection) to change dynamically and to affect the probability of disease progression over time. Simulated smoking behavior at the individual level depended on respective age, sex, and calendar year and was updated and tracked annually throughout a lifetime [29, 33]. To reflect the secular trend in prevalence across birth cohorts, H. pylori infection status was generated according to the birth year when a simulated individual entered the model (Additional file 1: Figure S1) [34]. Preclinical cancer may either become symptomatic, be detected by screening, or progress to a more advanced preclinical cancerous stage. Using long-term survival data from population-based cancer registries, survival time of individuals after cancer diagnosis was simulated by sex, clinical stage, and year after diagnosis [26]. Competing risk of mortality was modeled by respective age, sex, and calendar year [9, 10, 40].

Fig. 2
figure 2

Conceptual framework on the Markov chain of the natural history module. Note: Arrows indicate the transition from one health state to another

To ground our model in an empirical context, we calibrated and validated the model using population-based cancer registries, which include individual-level, de-identified data on 1.2 million gastric cancer cases diagnosed from 1994 to 2013 [12]. The model was calibrated to the age- and sex-specific incidence and stage distribution of gastric cancer from 2006 to 2008 [12]. We defined the initial search bounds for calibration by conducting a systematic review and explored parameter space systematically by performing 6000 independent searches with 1,000,000 individuals in each search. Of these 6000 resamples, we applied the least-squares method to identify the top 50 best fitted parameter sets and reported model outputs using these 50 parameter sets as uncertainty intervals. To validate the model, we assessed its predictive ability on data not used in the calibration process, namely gastric cancer mortality on vital statistics from 1994 to 2013 [12, 39]. The model was developed using TreeAge Pro 2019 (TreeAge Software Inc., Williamstown, MA). Data was analyzed using STATA 15 (Stata Corp., College Station, TX, USA). The details of the model are described in Additional file 1 and Table S1–S4.

Scenarios

In addition to current screening recommendations (biennial and triennial endoscopic screening from age 50 with no stopping age), we evaluated 12 strategies with varying starting ages (40, 45, and 50 years), stopping ages (75 and 80 years), and screening intervals (2 and 3 years). Strategies in this report are denoted by starting age-stopping age-screening interval. The baseline scenario was modeled to project the trend of gastric cancer in the absence of a national endoscopic screening policy. After introduction of the endoscopic screening scenarios, the natural history of gastric cancer could then be altered due to the detection of preclinical cancer, or detection or removal of dysplasia, depending on the sensitivity and specificity of the screening tests (Table 1). Individuals with a biopsy result of dysplasia were assumed to be treated by endoscopic submucosal dissection (ESD) and offered yearly surveillance endoscopy for 5 years [37, 69]. Perfect adherence was assumed in all scenarios.

Table 1 Key input parameters included in our analysis

Cost-effectiveness analysis

The cost-effectiveness analysis was conducted from a societal perspective in which the model repeatedly simulated 10 million individuals born between 1965 and 1985 in all screening scenarios and followed them from age 20 years until either death or age 100 years. Lifetime screening effectiveness (reduction in gastric cancer mortality, and quality-adjusted life-years [QALYs] gained) and resources (endoscopies and costs) were simulated for each screening strategy. Incremental cost-effectiveness ratios were calculated by dividing the incremental cost by the QALYs gained. QALYs were defined as the product of health utility and time. In this study, one QALY was equivalent to 1 year of perfect health. Incremental cost-effectiveness ratio (ICER) is a metric designed to inform decision-makers of trade-offs when allocating resources to an intervention. In the present study, ICER was calculated by dividing the incremental cost by the QALYs gained [70]. Strategies were ranked by increasing costs. A strategy was dominated if it was more costly but yielded fewer QALYs than its adjacent strategy or had a higher ICER than a more effective strategy. Dominated strategies were excluded, and ICERs were calculated for non-dominated strategies [7]. Costs and QALYs were discounted at an annual rate of 3% [63]. A willingness-to-pay (WTP) threshold of $50,000 US dollars per QALY saved was applied [71]. Costs for gastric cancer screening, diagnosis, and treatment were obtained from the Japanese diagnosis procedure combination-based payment system and published literature and are presented in 2015 US dollars (Table 1) [72]. Table S5 in the Additional file 1 presents a Consolidated Health Economic Evaluation Reporting Standards (CHEERS) checklist.

Sensitivity analysis

To assess the robustness of results to changes in individual parameters, we performed multiple deterministic sensitivity analyses by varying the sensitivity and specificity of endoscopy and the resection rate of ESD using the reported lower and upper 95% uncertainty bounds (Table 1). The effects of uncertainty surrounding cost inputs of the endoscopic screening examinations, ESD, and cancer treatments at different stages were assessed by varying the costs by ± 20% (Table 1).

In the base case analysis, we applied a discount rate of 3% per annum to both costs and effects, but explored uncertainty as recommended by WHO-choice guidelines using a discount rate of 0% for effects and 6% for costs [63]. We also evaluated differential discounting (1.5% for effects and 3.5% for costs) in sensitivity analysis.

A second-order probabilistic sensitivity analysis was done with a Monte Carlo simulation to investigate the effect of parameter uncertainty on the cost-effectiveness results. The model was run 1000 times, each taking random draws from all inputs with the prespecified uncertainty distributions listed in Table 1.

Results

Accuracy of the simulation model

Our microsimulation model accurately reproduced the age- and sex-specific incidence rates and stage distributions to the observed trends in population-based cancer registries from 2006 to 2008 (Fig. 3 and Additional file 1: Figure S2–S5). The external-validation analyses also demonstrated long-term coverage estimates of 100% for predicted mortality rates from 1994 to 2013 (Additional file 1: Figure S6).

Fig. 3
figure 3

Model predicted gastric cancer incidence. Note: Model calibration shows that the observed and model predicted gastric cancer incidence in a 2006, b 2007, and c 2008 for individuals aged 25 years or above. The observed age-specific gastric cancer incidence is indicated by black hollow dots. The dotted lines indicate the mean predicted age-specific incidence of gastric cancer between 2006 and 2008, and shaded areas indicate uncertainty intervals

Base case analysis

The model estimated that no endoscopic screening resulted in 9.1 gastric cancer mortality, and 47,252 QALYs per 1000 simulated individuals over a lifetime course. Compared with no endoscopic screening, all strategies conferred more QALYs (25.5–32.7 QALYs gained per 1000 individuals) and resulted in reduced simulated rates of gastric cancer mortality (5.7–7.3 events per 1000 individuals). Among 14 screening strategies, lifetime costs were highest in a biennial endoscopic screening program targeting individuals aged 40 to 80 years ($4.2 million per 1000 individuals) and lowest in triennial screening for individuals aged 50 to 75 years ($1.9 million per 1000 individuals). Lifetime number of endoscopies ranged from 9694 per 1000 individuals with the 50-75-3 strategy to 22,358 per 1000 individuals with the 40-80-2 strategy. Detailed base case results are shown in Table 2.

Table 2 Estimated lifetime effects and cost-effectiveness of gastric cancer screening strategies

We computed incremental cost-effectiveness ratios (ICERs) for the non-dominated screening strategies and present the cost-effectiveness frontier in Fig. 4. This frontier was comprised of three triennial screening scenarios: the 50-75-3 strategy, 50-80-3 strategy, and 45-80-3 strategy. Incremental cost-effectiveness ratios were $45,665 for the 50-75-3 strategy compared with no screening, $60,731 for the 50-80-3 strategy compared with the 50-75-3 strategy, and $130,149 for the 45-80-3 strategy compared with the 50-80-3 strategy. Using a WTP threshold of $50,000 per QALY, only triennial screening for individuals aged 50–75 years was cost-effective. This strategy prevented 63% of gastric cancer mortality at an expense of $1.9 million and 9694 endoscopies per 1000 simulated individuals over a lifetime course.

Fig. 4
figure 4

Lifetime cost and quality-adjusted life-year (QALY) of gastric cancer screening strategies compared with no screening. Note: Screening strategies are indicated as starting age-stopping age-screening interval. Costs and QALYs were discounted at an annual rate of 3%. An incremental cost-effectiveness ratio is shown for each strategy on the cost-effectiveness frontier

Scenarios simulating the current national endoscopic screening guidelines in Japan, namely biennial and triennial endoscopic screening from age 50 with no stopping age, were not on the efficient frontier. In comparison with no screening, the current biennial screening program would prevent 80.5% of gastric cancer mortality and result in a cost of $3.1 million and 21,379 endoscopies per 1000 simulated individuals. The current triennial screening program would yield a reduction of 74.8% gastric cancer mortality at an expense of $2.2 million and 14,516 endoscopies per 1000 simulated individuals.

Sensitivity analysis

The 50-75-3 strategy remained cost-effective under one-way changes to key assumptions. The cost-effectiveness ratio varied between $39,181 and $49,994 per QALY gained (Fig. 5a). The parameters that affect the ICER of the 50-75-3 strategy, from most to least, were the direct cost of endoscopy, direct cost of ESD, sensitivity of endoscopy, complete resection rate of ESD, specificity of endoscopy, first-year medical cost for local cancer, and first-year medical cost for distal cancer and regional cancer. The projected cost-effectiveness results were insensitive to both the use of a 0% discount rate for effects and 6% for the costs ($2736 per QALY gained) and a discount rate of 1.5% for effects and 3.5% for costs ($14,312 per QALY gained).

Fig. 5
figure 5

One-way sensitivity analyses and probabilistic sensitivity analysis. Note: ESD, endoscopic submucosal dissection; ICER, incremental cost-effectiveness ratio; QALY, quality-adjusted life-year; WTP, willingness-to-pay threshold. a Tornado plot showing ICER estimates for one-way sensitivity analyses. Details of changes to parameter values are given in Table 1. The dotted line indicates ICER from the base case analysis. b Result of 1000 bootstraps was generated in the probabilistic sensitivity analysis. Each dot represents the lifetime discounted incremental cost and QALYs of one bootstrap sample. The dotted line indicates willingness-to-pay threshold of US$50000. c Cost-effectiveness acceptability curves showing probability that the 50-75-3 strategy is cost-effective across a range of threshold values. The 50-75-3 strategy indicates a triennial screening strategy from 50 to 75 years

In our probabilistic sensitivity analysis (Fig. 5b), 100% of the simulations conferred a positive ICER, suggesting that, on a national scale, the 50-75-3 strategy is associated with greater QALYs, although at the expense of a higher cost than no screening. The 50-75-3 strategy was cost-effective in 92.6% of simulations using a WTP threshold of $50,000 per QALY, and in 100% using a WTP threshold of $100,000 per QALY (Fig. 5c).

Discussion

We developed a well-calibrated and validated microsimulation model to simulate an average-risk population over the lifetime course under 15 unique screening scenarios. Comprehensive modeling showed that the current national endoscopic screening program in Japan is unlikely to be cost-effective. In contrast, a more favorable option would be triennial endoscopic screening of individuals aged 50 to 75 years. This would result in an estimated 27.2 QALYs gained per 1000 individuals and a reduction in the lifetime risk of gastric cancer mortality by 63% with a corresponding ICER of $45,665.

The present study highlights several important policy questions. To date, no recommended age for endoscopic screening cessation has been available. However, with the population aging rapidly, Japan is likely to experience an increased demand for cancer screening despite limited resources. Comprehensive examination of the added value of extending screening practices in older populations is clearly paramount. Our modeling results demonstrate that the benefits of continuing endoscopic screening in people beyond age 75 years do not justify the additional endoscopic screenings performed, owing to diminishing returns. In terms of competing causes of death, adverse events caused by screening, and reduced eligibility for curative surgery in older individuals, determining whether to screen individuals aged over 75 years should be done on an individualized basis, such as with regard to individual risk profile, previous screening history, and the individual’s values and preferences. Further research to explore optimal screening strategies at higher age limits is warranted.

In this analysis, the efficient frontier was dominated by strategies which initiated endoscopic screening at age 50 years; these provided a more favorable option in terms of cost-effectiveness. Our results are in line with the gastric cancer burden in Japan. Data from the population-based cancer registry revealed that the age-specific incidence rate among people aged 40 to 49 years (10.4 per 100,000) was lower than that in people aged 50 to 59 years (65.8 per 100,000) in 2014 [12]. Gastric cancer incidence in these two populations has declined steadily over the past two decades, with reductions in men and women since 1993 of 62.8% and 51.3% at 40 to 49 years compared with 35.8% and 33.4% at 50 to 59 years, respectively (Additional file 1: Figure S7) [12]. For screening effectiveness, age-specific analysis from the Korean national cancer screening program demonstrated that endoscopic screening is related to reduced risk of gastric cancer mortality in the population aged 40 to 74 years [73]. The burden of disease and screening effectiveness, together with the modeling results, indicated that initiating screening at the age of 50 years would be a reasonable option.

To our knowledge, this is the first study to use microsimulation decision modeling to comprehensively project the lifetime cost-effectiveness of national gastric cancer control measures in a high-incidence setting. The strength of this study is that a comprehensive approach was used during formulation of the microsimulation model. This approach incorporated detailed gastric cancer natural history findings and the impact of dynamic individual risk profiles on disease progression by synthesizing the best available data from nationally representative surveys and meta-analyses. This model generates population-level estimates that can hardly be achieved by simpler models, since it also preserves individual-level heterogeneity. In addition, it has been extensively calibrated to the nationally representative observed data. Validation analyses across the period 1994 to 2013 have shown that the secular trends of model predictions are consistent with the observed mortality data. We rigorously modeled 15 clinically relevant scenarios to explore potential lifetime effects across the endoscopic screening spectrum. Further, comprehensive sensitivity analyses were performed, adding robustness to the efficient frontier in this study.

Our study also has several limitations. First, the model assumed full adherence with screening and diagnostic evaluations for all strategies. The current analysis was designed to inform population guidelines; therefore, this assumption allowed the model to predict the maximum achievable benefit of a public health action. Furthermore, to facilitate comparisons, all screening scenarios were based on an identical assumption. Second, because this study focused on current policy and its alternatives, we did not evaluate risk stratification approaches to gastric cancer screening. The combination test of serum pepsinogen and H. pylori antibody has been proposed to be a potential tool for predicting gastric cancer development [74,75,76]. However, the specificity for both single and combination tests of serum pepsinogen and H. pylori antibody was shown to be low in one population-based cohort study in Japan [77]. In the future, given that the predictive accuracy of biomarkers could be improved, the risk stratification approach remains a future opportunity which may lead to further enhancement of the cost-effectiveness in a gastric cancer screening context. Lastly, since costs were Japan-based, it is unclear how generalizable our cost efficacy results are to other healthcare systems. However, we have provided both the health benefits and number of endoscopies needed, which are more likely to be generalizable.

Conclusions

Screening policy could lead to the arrival of a propitious moment in the advancement of gastric cancer control. However, in this microsimulation modeling study, it was estimated that the current national endoscopic screening program in Japan may be less economically attractive than the model-recommended strategy. These findings clearly underpin the need to re-evaluate the current guidelines to develop an efficient policy in Japan.