Identified Studies (Single Technology Appraisals and Published Papers)
An overview of the identification of relevant studies is presented as a Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) diagram in Fig. 1. After excluding duplicates and ineligible publications, a total of 100 NICE STAs and 124 published studies were identified as relevant.
Overview of Identified Studies
Table 2 provides a summary of the identified studies. Across both the literature and the NICE STAs, there appears to be an increasing number of oncology cost-effectiveness models published each year. Note that one identified STA and four papers from the published literature reported on multiple economic models within one publication.
Table 2 Summary of identified studies Figure 2 presents the model structures employed across the NICE STAs and the literature. PartSAs were demonstrated to be the most popular structure across the NICE STAs (n = 54), followed by discrete-time state transition structures (n = 41) and decision tree or other cohort-based combination structures (n = 6). State transition models most commonly applied the Markov assumption (n = 19). However, 16 STAs relaxed this assumption via a semi-Markov approach. Five STAs did not provide any further detail on their state transition approach (i.e. Markov or semi-Markov). The remaining models considered simulation approaches, which also fall under the state transition bracket and were described as patient-level simulation (n = 1) and time series individual simulation (n = 1). Of the four combination structures, two considered a decision tree populating a state transition model and two considered a decision tree populating a PartSA.
In comparison, discrete-time state transition models were demonstrated to be the most popular structure in the published literature (n = 102), followed by a PartSA (n = 15) and either a decision tree or other cohort-based combination structures (n = 10). Five of the published papers did not describe their model structure. State transition models were most often reported as applying a Markovian (memoryless) assumption (n = 88), with only four relaxing this assumption and considering a semi-Markov approach. Models reporting funding from pharmaceutical companies were more frequently PartSA structures (13/40; 33%) than were those with other funding sources. Patient-level simulations were described as discrete event simulation (n = 4), patient-level simulation (n = 2), micro-simulation (n = 1), time series individual simulation (n = 1), simulation (n = 1) and a macro-simulation Markov (n = 1) models. Combination structures relate to those which consider more than one type of structure within the model; in terms of the identified published papers, five papers considered a decision tree feeding into a state transition Markov model.
When reviewing each of the NICE STAs and published papers, it was evident that there was some confusion between state transition and PartSA structures. This was also previously reported in the review detailed in NICE DSU TSD 19 where authors found that ten of the PartSA models had been erroneously labelled as Markov models [6]. Whilst there is insufficient information provided in most studies to conclusively identify model misspecifications, there were some clear examples where the model was labelled as a Markov or semi-Markov state transition model but further description indicated these were in fact PartSAs. For example, health state occupancy calculations were informed by partitioning the overall survival (OS) curve using intermediate outcomes such as progression-free survival (PFS) or event-free survival (EFS)—occupancy in the progressed disease state was then calculated by the difference between the OS and intermediate event curves, indicating a PartSA structure. Other examples included reported parameters focused on the coefficients relevant to parametric curves fit to the OS and intermediate outcomes data rather than health state-specific transitions, as well as the application of hazard ratios to baseline OS and intermediate outcomes curves with no description on how the competing events reflected by the intermediate outcome (e.g. PFS) were separated, as required in a state transition model. Consequently, we believe that many of the reported state transition models were actually implemented as a PartSA. The inability to conclusively identify model structures from the literature highlights both the confusion between state transition and PartSA structures as well as the lack of transparency in reporting.
Description of Most Common Modelling Approaches
Our review identified two modelling approaches as the most commonly used in oncology in recent years: the PartSA and the discrete-time state transition approaches. Detailed descriptions of each of these approaches can be found in the literature, as well as the relative advantages and limitations of each method [6, 14, 15]. A summary of these methods is given in Sects. 3.3.1 and 3.3.2.
Partitioned-Survival Analysis
PartSA models are characterised by health states with state membership determined by a series of (usually) independently modelled, non-mutually exclusive survival curves; N represents the number of health states in the model and to which N−1 survival curves are required. The PartSA structure is most commonly applied as a three-state model, which makes use of two modelled survival curves for OS and PFS. The area underneath the PFS curve represents the proportion of patients who are yet to progress, and the area between the OS and PFS curves represents the proportion of patients who have progressed and are still alive. The remaining proportion of patients (i.e. patients who are neither progressed nor yet to progress) represents those who have died.
In this three-state model, the assumption of independent survival curves necessitates that OS and PFS are independent of each other, i.e. progression is not considered to be a prognostic factor of death within the fitted OS model. The dependency between OS and PFS is captured in the within-trial period. However, this is not reflected in the extrapolation beyond this point if independence is assumed. Therefore, the more immature data, the less dependence captured between OS and PFS in the long-term extrapolations.
State Transition Model
State transition models are defined by distinct events or health states that individuals experience and transition between; the speed at which transitions occur are called transition probabilities or rates. These health states should be mutually exclusive and exhaustive. Unlike the PartSA approach, state transition models capture the dependency between events; the extent to which this dependence is captured depends on the specific state transition model applied. In the example of a three-state transition model (pre-progression, progressed disease and death), there are three transitions which are estimated: (1) pre-progression to progressed disease; (2) pre-progression to death; and (3) progressed disease to death. Therefore, the dependence between progressed disease and death is captured in the extrapolations through the evolving proportion of patients in the progressed disease health state.
Most commonly in oncology, discrete time periods have been considered employing either a Markov or semi-Markov approach. The discrete time periods are implemented as model cycles—fixed time periods over which model transitions are calculated (e.g. daily, weekly, monthly). The semi-Markov approach relaxes the ‘memoryless’ feature of the Markovian assumption by allowing transitions to depend on time spent within an intermediate health state; for example, the transition from progressed disease to death may depend on how long a patient has spent in the progressed disease health state.
Justification and Criticism of Modelling Approaches
Within the NICE STA process, companies are required to provide a detailed explanation for the selected modelling approach. The role of the independent ERG is to critically appraise the companies’ rationale as part of the appraisal process, and ultimately determine if the model is appropriate to inform decision-making or not.
The most common strength of the submitted model structure noted by the company was the similarity of the chosen structure to previous appraisals conducted by NICE and/or other published literature (n = 75). The second most common strength noted was the ability to use data directly from the pivotal trial(s) to inform health state occupancy within the submitted model (n = 20)—this was primarily raised as a strength of the PartSA structure as the endpoint of many trials of cancer treatments is either OS or PFS. Other strengths raised included the ability to reflect time dependency (n = 6), applicability of the structure to a treatment pathway with multiple treatment lines (n = 6), being aligned with the irreversible nature of transitions (n = 5), providing a straightforward and/or transparent approach (n = 5), avoiding specific issues with the extrapolation of OS (n = 4), allowing for incorporation of external data (n = 4), avoiding the need for tunnel states to be included (n = 2), exhibiting intuitive interpretation (n = 2), and permitting calibration of model transitions (n = 2).
The ERGs rarely commented on the merits of the companies’ submitted model structures. We found no explicit instances where the ERG commented on the appropriateness of the company’s model structure beyond a general comment that the model was either aligned with convention, appropriate to inform decision-making, or was considered transparent and simple.
In only seven of the 104 STAs, the company noted at least one limitation of their chosen modelling approach (i.e. choice of PartSA, state transition or another model structure). The most common limitation highlighted by the submitting company was that the PartSA structure was considered ‘rigid’, as individual transitions are not modelled. Despite reporting limitations, none of the NICE STAs explored the impact of structural uncertainty through an alternative model structure. One appraisal submitted with two different models (TA284; one semi-Markov state transition model and one PartSA). However, data from different populations informed these models and they were viewed in isolation [16].
Criticisms raised by ERGs in relation to the model structure used by the company included the lack of calibration between outcome measures (e.g. through independently fitted survival curves) (n = 5), an ‘over-simplification’ of the final health state (such as ‘progressed disease’) (n = 4), seemingly ‘counterintuitive’ results (e.g. an improvement in PFS leading to a higher ICER) (n = 4) and structural assumptions that were considered ‘inappropriate’ (such as a constant hazard of a given event occurring) (n = 3). It was noted that relatively few ERG reports specifically flagged issues relating to the model structure, but we suspect some of the criticisms may have either been raised during the appraisal committee meetings or within other sections of the appraisal documentation (e.g. in relation to the incorporation of clinical efficacy data).
Within the published literature, there was limited description provided around the choice, justification and design of model structure. A minority of published studies (22%; 27/124) reported any strengths or limitations associated with the chosen model structure, and, where reported, the information provided was very brief. Two papers from the published literature reported four different Markov state transition models in each publication and compared the results across these, emphasising the need to assess structural uncertainty [17, 18]. A further two papers from the published literature reported on multiple model structures; however, these reflected different stages in the disease pathway and did not provide an assessment of structural uncertainty.