Cost-utility models for infantile- and later-onset SMA were based on Markov health-state structures. Development of the model structures was informed by both clinical and health economic experts. One UK clinical expert, chosen because of the clinician’s involvement in clinical trials of nusinersen, provided initial input, while seven UK paediatric neuromuscular disease specialists were chosen to attend an advisory board because of their experience in treating patients with SMA. Health economic input was sought via face-to-face interviews with two UK health economists, who were chosen for their expertise in knowing the requirements for economic submissions to health technology assessment bodies throughout the world. Cost and health-state utility assumptions were validated by one Swedish clinical expert. Data inputs were informed by clinical and economic systematic literature reviews and reviewed by UK and Swedish clinical experts. The economic systematic literature review [Table S1 and Fig. S1 of the Electronic Supplementary Material (ESM)] identified limited resource use and cost data and did not identify existing published economic models or utility weights. Therefore, de novo models for infantile-onset and later-onset SMA were developed and additional studies were performed to derive country-specific cost estimates and health-state utilities.
Model Structures
Infantile-Onset Spinal Muscular Atrophy
The infantile-onset SMA model structure comprised ten health states (Fig. 1a). A response-based section of the model structure defined patients by their change in CHOP INTEND score from baseline (worsened, stabilisation of baseline function and improvement health states). Scores on the CHOP INTEND range from 0 to 64, with higher scores indicating better motor function [34]. Improvement is defined as at least a 4-point increase from baseline (same response definition used in the ENDEAR trial), and worsened is defined as at least a 4-point decrease from baseline (stabilisation of baseline function is defined as a change from baseline between − 3 and 3 points). Although the literature does not report a defined minimal clinically important difference for the CHOP INTEND score change, results from the ENDEAR trial support the view that a 4-point change increase is rarely observed in natural history, with a significantly higher percentage of infants in the nusinersen group achieving response compared with the SoC arm (71% vs. 3%) [22]. A natural history study by Finkel et al. [42] reported a CHOP INTEND score decline of 1.27 [95% confidence interval (CI) − 2.33 to − 0.21] points per annum for patients with infantile-onset SMA.
A second section divided patients into motor milestone-defined health states (according to Sect. 2 of the Hammersmith Infant Neurological Examination scale. The scores ranged from 0 to 26 [through the achievement of motor milestones], with higher scores indicating better motor function [33]) not usually observed in patients with infantile-onset SMA (sits without support, stands with assistance, walks with assistance and stands/walks unaided health states) and a health state (loss of later-onset motor function) that captured patients who had previously been able to sit without support but then lost that ability. Patients entered the model in the stabilisation of baseline function health state. The model structure included health states uncharacteristic of patients with infantile-onset SMA because interim data from ENDEAR and 26-month data from CS3A in symptomatic patients with infantile-onset SMA showed that patients treated with nusinersen achieved motor milestones typical of later types of SMA.
A patient remained in the response-based section of the model until the patient achieved a milestone characteristic of later-onset SMA (sits without support). Once a patient has achieved a later-onset milestone (as observed in the ENDEAR trial for patients in the nusinersen arm), the patient could either continue to achieve higher motor milestones or could lose the ability to sit without support and move to the loss of later-onset motor function health state. This latter health state was included to explore scenarios where the quality of life and costs associated with a patient who was not able to sit without support differ between patients that at some point had achieved the sitting without support milestone (or higher) and patients that never achieved that milestone.
Later-Onset Spinal Muscular Atrophy
The model structure for later-onset SMA incorporated changes from baseline HFMSE scores (scores range from 0 to 66, with higher scores indicating better motor function [35]) to determine the membership in the response-based health states (worse than baseline, stabilisation of baseline function, mild increase HFMSE score and moderate increase in HFMSE score) and achievement of motor milestones characteristic of patients with SMA type III, as assessed by the WHO motor milestones criteria (stands/walks with assistance, stands unaided and walks unaided) (Fig. 1b) [37]. The WHO motor milestones are commonly used in clinical practice. The later-onset SMA model structure also included one health state that captured patients who had previously been able to stand or walk with assistance but then lost that ability (lost ambulation with/without assistance health state) and the dead health state.
In the model, an increase from baseline in the HFMSE score of less than 3 points is considered mild, a score increase of 3 or more points is considered moderate and a score decrease of 1 or more points is considered worsening (stabilisation of baseline function is defined as no change from baseline). The definition of mild and moderate improvement was included to differentiate patients who had score improvements not normally observed in natural history (i.e. those with moderate improvement). A study by Mercuri et al. [43] reported that an increase of greater than two points was unlikely in patients with later-onset SMA. In addition, patients and caregivers considered a 1-point increase or even stabilisation as meaningful. [44, 45]
A patient remained in the response-based section of the model until the patient achieved a motor milestone uncharacteristic of patients with SMA type II (stands/walks with assistance). Once a patient has achieved the stands/walks with assistance milestone, the patient could either continue to achieve higher motor milestones (characteristic of SMA type III) or move to the loss of ambulation with/without assistance health state. This latter health state was included to explore scenarios where the quality of life and costs associated with a patient who was not able to stand/walk with assistance differ between patients that at some point had achieved the stand/walk with assistance milestone (or higher) and patients who never achieved that milestone.
Patient Characteristics
The model populations were patients with infantile-onset and later-onset SMA. The patient cohort for the infantile-onset model had the same starting age as that of patients who received their first dose in the ENDEAR trial (Table 1). The starting age of the later-onset model cohort matched the mean age at screening observed in the CHERISH trial (Table 2).
Table 1 Base-case model input parameters: infantile-onset spinal muscular atrophy (SMA) Table 2 Base-case model input parameters: later-onset spinal muscular atrophy (SMA) Analytical Perspective and Time Horizon
The models were developed from the societal perspective in Sweden (direct medical costs [primary and secondary care], patients’ reimbursed transportation costs and informal care costs; Dental and Pharmaceutical Benefits Agency [TLV] guidelines [46]). The payer perspective also was considered in sensitivity analyses.
A time horizon of 40 years was used in the infantile-onset model because data from clinical trials suggested that treatment with nusinersen had a significant effect on survival [the ENDEAR trial reported an OS hazard ratio of 0.37 (95% CI 0.32–0.89; p = 0.004) and the phase II dose-escalation trial indicated a significant differentiation in age at death or permanent ventilation compared with a natural history case series (log-rank test; p = 0.0014)], [22, 23] which potentially could be associated with the achievement of motor milestones similar to those achieved by patients in the later-onset model. The later-onset model used a default time horizon of 80 years, informed by data from Zerres et al. [47] that were used to guide OS after the trial follow-up.
In both models, the model cycles during the trial follow-up were aligned with the motor function assessment time points (ENDEAR: days 1, 64, 183, 302 and 394; CHERISH: days 1, 92, 169, 274, 365 and 456). After the trial follow-up, the models used a cycle length of 4 months to match the 4-month maintenance dosing interval specified by the European Medicines Agency [40]. Costs other than drug costs after the trial follow-up were based on cost-per-year estimates. A half-cycle correction was applied for both models.
Both costs and outcomes were discounted at 3.0% (Pharmaceutical Benefits Board [46]; TLV guidelines). Model outcomes included incremental cost per quality-adjusted life-year (QALY) gained (base-case scenario included QALYs gained for both patients and caregivers) and OS.
Intervention and Comparators
The costs and outcomes for nusinersen and SoC were compared with those for SoC alone, as nusinersen is currently the only approved treatment for SMA. Safety and efficacy of alternative treatments are being evaluated in phase I–III studies [48].
Disease Progression and Death: Trial Follow-up Period
Probabilities of transitioning to health states other than death within the trial follow-up period were calculated from the individual patient’s motor function score and motor milestone achievement at the different assessment points in the ENDEAR and CHERISH trials (Tables S9–S30 of the ESM).
For predicting OS for patients with infantile-onset SMA, parametric survival functions were fitted to observed OS data in the ENDEAR trial in accordance with TLV guidelines and based on the Decision Support Unit guidance for fitting and selecting survival functions from the UK National Institute for Health and Care Excellence [49]. Alternative assumptions regarding the continuation of the treatment effect after the trial follow-up and treatment discontinuation were explored.
The clinical and biological plausibility of OS prediction was explored by comparison with external long-term survival data [7, 50]. Potential limitations associated with the use of these data include possible differences in SoC, patient characteristics and improvements in respiratory management. Fitting parametric survival functions to the ENDEAR data without the use of external data was not considered appropriate for this analysis (OS predictions did not appear clinically plausible).
For patients with later-onset SMA, the model assumed that, within the trial follow-up period, nusinersen had no effect on improving the mortality rates for these patients, as no patients in either treatment arm in the CHERISH trial died. Survival after the trial follow-up was guided by long-term OS data presented by Zerres et al. [47] (see Sect. 2.7.2).
Disease Progression to Health States Other Than Dead After the Trial Follow-Up
Infantile-Onset Spinal Muscular Atrophy
The model assigned a CHOP INTEND score to each health state based on the mean CHOP INTEND score of the patients in each health state at the end of the ENDEAR trial follow-up. Because patients in the ENDEAR trial did not achieve the walks with assistance and the stand/walks unaided milestones, data from the last follow-up in the CS3A trial were used to assign the CHOP INTEND score to those health states. To estimate the probability of transitioning to the next best or next worst health state, the model used the rate of CHOP INTEND score change. For patients in the nusinersen arm, the score increased 0.251 (95% CI 0.19–0.32) points per week, while the score for patients in the SoC arm decreased 0.364 (95% CI 0.27–0.46) points per week, which is consistent with natural history [42]. Therefore, the base-case analysis assumed that patients in the nusinersen arm continue to improve after the trial follow-up and that patients in the SoC arm transition to worse health states after the trial follow-up according to the corresponding rate of the CHOP INTEND score change. A scenario analysis assumed that a proportion of patients in the nusinersen arm reached an improvement plateau, with some of those patients progressing as in the SoC arm.
Later-Onset Spinal Muscular Atrophy
Similar to the infantile-onset model, the transition probabilities to health states other than death were based on the rate of motor function score change. The model assigned a HFMSE score to each health state based on the mean HFMSE score of the patients in each health state at the end of the CHERISH trial follow-up. The HFMSE score of the walking unaided health state was determined using 35-month long-term data from the CS2/12 trial. Patients in the nusinersen arm in the CHERISH trial had a change from baseline of 3.9 (95% CI 3.00–4.90) points at 15 months, while the mean HFMSE score change from baseline in the SoC arm was − 1.0 (95% CI − 2.50 to 0.50) points at 15 months [26]. The HFMSE score change at the interim data cut was included as a scenario analysis (Table S7 of the ESM). Hence, after the trial follow-up in the base-case analysis, patients in the nusinersen arm moved to the next best health state while patients in the SoC arm transitioned to the next worse health state.
External Long-Term Overall Survival Data After the Trial Follow-up
Infantile-Onset Spinal Muscular Atrophy
External long-term data identified in the literature have shown large differences in survival between countries [5, 7, 50, 51]. This could be attributed to heterogenous patient populations and the type of care provided (i.e. invasive or not invasive respiratory support). Gregoretti et al. [50] reported OS for three groups of patients receiving different types of respiratory support. Survival at 24 months for the no respiratory support group, the non-invasive ventilation group and the invasive ventilation group was 1.3% (95% CI, 0.1–6), 67.7% (95% CI 46.7–82) and 95% (95% CI 81.8–8.8), respectively. Survival at 24 months in other studies [5, 7, 51] ranged from 32 to 74%.
Feedback from clinical experts has shown that invasive respiration is generally not used in Sweden, resulting in an expected survival of less than 3 years (usually 1 year; TLV, 2017 [52]); therefore, survival data for respiratory care without invasive ventilation were used in the base case. For this scenario, the method of least squares [53] was used to fit parametric functions to the Kaplan–Meier data presented by Gregoretti et al. [50] for a group of patients that did not receive respiratory support.
After the trial follow-up, patients in the SoC arm were assumed to have the same hazard rate as that from the function fitted to data in Gregoretti et al. [50] (Fig. 2). Different scenarios were used for the nusinersen arm, including assuming an indefinite treatment effect, no residual treatment effect and a gradual decline. The base-case analysis assumed that the treatment effect continues indefinitely.
The base case also included an assumption that patients achieving motor milestones consistent with later-onset SMA have a lower risk of death than patients who never achieve these milestones. Zerres et al. [47] reported OS for patients with SMA type II. To estimate the risk of death for patients with later-onset SMA, a flexible spline-based Weibull (2 knots) function was fitted to the Kaplan–Meier from Zerres et al. [47]. The risk of death for patients achieving later-onset milestones was assumed to be between the risk of death of a patient with infantile-onset SMA and the risk of death of a patient with later-onset SMA. The model used an adjustment factor between 0 and 1, where 0 resulted in the same risk of death for patients with infantile-onset SMA and 1 resulted in the same risk of death for patients with later-onset SMA. The base case used an adjustment factor of 0.9 based on clinical opinion (Fig. 2). Time-dependent transition probabilities to death were calculated from the parametric survival functions.
Later-Onset Spinal Muscular Atrophy
No deaths occurred in the CHERISH trial. The later-onset SMA model used long-term OS data for patients with later-onset SMA (type II) presented by Zerres et al. [47]. From the studies reporting long-term survival for patients with SMA type II [5, 47, 54, 55], only Zerres et al. [47] and Farrar et al. [5] reported survival estimates beyond 10 years. However, the model used the data from Zerres et al. [47] as the sample in that study was more robust than the sample in Farrar et al. [5] (240 patients in Germany and Poland compared with 30 patients in Australia). Parametric models were selected based on Akaike information criterion results, Bayesian information criterion results, integrated Brier score through bootstrap cross validation, the visual fit of the function to the Kaplan–Meier data and the assessment of the clinical plausibility of the extrapolated portion of the survival curve. [56] The flexible spline-based Weibull function with two knots was selected as the base case because it provided the best visual fit.
After the trial follow-up, patients on the SoC arm continued to die according to OS data from Zerres et al. [47]. Scenarios explored for the nusinersen arm included assumptions that there was no treatment effect (base-case analysis), the treatment effect continued (using a placeholder hazard ratio) and a tapering of the treatment effect over time.
The base-case analysis assumed that patients achieving advanced motor milestones characteristic of later-onset SMA (i.e. stands unaided, walks unaided) had a lower risk for death than patients in other health states. Patients with SMA and advanced motor milestone characteristics have a life expectancy that is not reported to be significantly less that the general population [5]. It was assumed in the base-case analysis that the risk of death for patients with advanced motor milestone characteristics is between that of patients with later-onset SMA (type II) and that of the general population (adjustment factor of 0.5 was applied) (Fig. 3). Time-dependent transition probabilities to death were calculated from the parametric survival functions.
Resource Use and Costs
Resource use data for the models were collected from interviews with two Swedish clinical experts, as the economic literature review (Table S1 and Fig. S1 of the ESM) presented resource use and costs from only a German study [57] and a Spanish study [58]. In accordance with Swedish guidelines for health economic evaluations, the societal perspective was applied, including caregiver production loss for one caregiver in accordance with clinical expert input. Production loss was estimated based on the monthly salary and the assumption (according to the Swedish clinical experts) that a caregiver of a patient with infantile-onset SMA would give up work completely, while a caregiver of a patient with type II SMA would give up work for 6 months during the first year and for about 3 months the following year. Country-specific unit costs were mainly collected from hospital price lists. When available, prices were gathered for 2016 and inflated to the year 2018 level according to the Swedish Consumer Price Index. The resulting per-patient annual resource use and total costs of infantile- and later-onset SMA in Sweden for the first and the following years are summarised in Tables S2–4 of the ESM. These per-patient annual costs were then assigned to each health state in the model (Tables 1, 2: health-state costs). If the health state was associated with milestones characteristic of SMA type I, the model assigned the infantile-onset SMA costs; for health states associated with milestones characteristic of SMA type II, the model assigned the later-onset SMA costs; and for health states associated with milestones characteristic of SMA type III, the model assigned 25% of the later-onset SMA costs (assumption validated by Swedish clinical experts).
Nusinersen is administered via lumbar puncture with four loading doses and a maintenance dose every 4 months thereafter [40]. With a list price of 800,757 SEK (Swedish krona) per vial (€78,300Footnote 1), [59] the first year of treatment with nusinersen was calculated to be 4.8 million SEK (€469,500) per patient and 2.4 million SEK (€234,800) per year per patient in subsequent years. Because nusinersen is a new medicine, it was assumed to be administered in inpatient care, which was verified by Swedish experts. Hence, the procedure cost for lumbar puncture was retrieved from the hospital price list for the Southern Health Care Region in Sweden, which amounted to 3308 SEK (code TAB00; Södra Regionvårdsnämnden [60]).
The cost for production loss should be added to administration costs, according to TLV guidelines. [46] However, it was assumed that production loss as a result of drug administration is already taken into account as caregiver production loss.
The base-case analysis assumed that patients treated with nusinersen spend less time in hospital and require less ventilatory support than patients in the SoC arm. Results from the ENDEAR trial supported this assumption, as the overall proportion of time spent hospitalised was significantly lower in the nusinersen arm than the SoC arm (least-square mean treatment difference − 0.093; 95% CI − 0.151 to − 0.034; p = 0.0022). In addition, the proportion of time spent hospitalised for respiratory reasons was significantly lower in the nusinersen arm when compared with the SoC arm (least-square mean treatment difference − 8.638%; 95% CI − 14.190 to − 3.086; p = 0.0026) [61]. To adjust costs, the model used the ratio of ventilation use observed in ENDEAR (0.66; 95% CI 0.32–1.37) [22]) as an adjustment factor applied to the proportion of each respiratory care cost item. The ENDEAR rate of hospitalisation (0.759; 95% CI 0.548–1.051 [61]) was used as an adjustment factor applied to the hospitalisation cost item. The proportions between each type of care used in the model (respiratory, gastrointestinal, nutritional or orthopaedic/neurologic) were delimited based on estimated Swedish resource use (Tables S5 and S6 of the ESM).
Utility Weights
The ENDEAR and CHERISH trials did not collect utility values from patients or caregivers. The economic systematic literature review did not identify utility weights for patients with SMA. Therefore, a vignette study was commissioned to elicit utilities for the health states in the infantile- and later-onset models. In the study, five UK clinical experts reviewed case study descriptions for each health state and were asked to provide an assessment of how health-related quality of life would be affected in each state using the EQ-5D (youth version) [62]. The EQ-5D (youth version) was scored using the EQ-5D-3L UK preference weights. Although it is common in vignette studies that members from the general public rate the severity of the health states, Lloyd et al. [62] used clinical experts because this allowed them to include clinical and health-related quality-of-life information in the descriptions of the health state, and to produce a range of experience of each clinical expert when rating each state. The descriptions of health states were tailored to match the health states in the economic models. The base-case utility weights for the infantile- and later-onset models are presented in Tables 1 and 2, respectively. As the CHERISH study collected the Pediatric Quality-of-Life Inventory (PedsQL™) and a mapping algorithm for PedsQL™ to EQ-5D was available [63], EQ-5D utility values were estimated for the later-onset model health states with the limitation that the patients in the CHERISH trial were substantially younger than the patients used to develop the algorithm [63]. However, these utility values were not explored in the scenario analysis, as they lacked face validity (the utility estimated for the worst health state was 0.73, while the valuation elicited from clinicians in the study by Lloyd et al. [62] for the same health state was −0.13). The model also included disutility for caregivers, as the burden on caregivers of children with SMA is substantial. The impact of using utility data from diseases with similar clinical characteristics was explored in scenario analyses (Tables S7 and S8 of the ESM).
Adverse Events
No treatment-related adverse events occurred in the ENDEAR trial. In the CHERISH trial, one adverse event experienced by one patient (post-sedation nausea) was considered to be related to treatment. Treatment-related adverse events were not included in the base-case analysis for the two models.
Base-Case Parameters
Base-case parameters for the infantile- and later-onset models are presented in Tables 1 and 2, respectively.
Sensitivity and Scenario Analyses
Univariate sensitivity analyses were performed for all parameters to identify those with most influence on the incremental cost-effectiveness ratio (ICER). The probabilistic sensitivity analysis included all parameters (the model was run for 1000 simulations). Estimates of uncertainty were based on the uncertainty in the source data (data permitting). Parameters were sampled from appropriate statistical distributions [64].
Model Validation
Model validation was performed in alignment with best practices [65] and included face validity (review of model structure and input parameters by two clinical experts and two independent academic health economic experts), internal validity (verification of input data against original sources and programming validation), dependent external validity (comparing model predictions with outcomes in studies used to build the model [7, 47, 50]) and independent external validity (comparing model predictions with outcomes in studies not used to build the model [5, 51, 54, 55]).