Introduction

As defined by an international consortium in 2017 [1], “primary mitochondrial myopathies (PMM) are genetically defined disorders leading to defects of oxidative phosphorylation affecting predominantly, but not exclusively, skeletal muscle”. In our previous study, we characterized the functional scales and biomarkers of an Italian cohort of 118 PMM [2]. PMM natural history and evolution over time of outcome measures are unknown; a better knowledge of PMM natural history is of pivotal importance also considering current and future PMM trials. We have therefore collected, basally and at 12 months of follow-up, data of the functional scales already defined in the previous study, trying to: (1) define their evolution over time; (2) trace disease trajectories; (3) explore genotype—phenotype correlations.

Patients and methods

We have collected the clinical data, the outcome measures and the quality-of-life questionnaires at 12 months (T1) of 117 adult Italian patients with a diagnosis of PMM due to either mtDNA or nDNA mutations affecting mitochondrial oxidative phosphorylation, registered in the Nation-wide Italian Collaborative Network of Mitochondrial Diseases. One patient of the original cohort developed Parkinsonism and, thus, was excluded from the cohort. The baseline characterization of the cohort (T0), as well as the used outcome measures and scales, have been published [2]. Here, we are only recalling the outcome measures performed at 12 months:

  1. 1.

    Clinician-reported outcome measures—clinical scales: the Newcastle Mitochondrial Disease Scale for Adults (NMDAS). We have evaluated global scores and the subitems myopathy and exercise intolerance.

  2. 2.

    Functional tests: 6-min walk test (6MWT), triple timed up-and-go test (3TUG), 5X sit-to-stand test (5XSST), timed water swallow test (TWST), and test of masticating and swallowing Solids (TOMASS).

  3. 3.

    Performance outcome measures: spirometry.

  4. 4.

    Patient-reported outcome measures: Fatigue Severity Scale (FSS) and West Haven-Yale Multidimensional Pain Inventory (WHYMPI).

Starting from the clinical observation, we have now differentiated PMM into three phenotypes: mitochondrial myopathy (MiMy) without extraocular muscles involvement, pure chronic progressive external ophthalmoplegia (PEO) and PEO&MiMy. PEO was defined as a pure ocular myopathy with ptosis and progressive ophthalmoplegia without significant objective muscle weakness and/or exercise intolerance (NMDAS subitem myopathy and exercise tolerance both score of 0/5) or multisystem involvement; PEO patients could refer with subjective muscle pain. On the contrary, PEO&MiMy had ptosis and ophthalmoplegia with other features of muscle involvement like muscle weakness, exercise intolerance (NMDAS subitem myopathy and/or exercise tolerance score ≥ 1/5) and other features of myopathic involvement (dysphagia, restrictive respiratory syndrome). MiMy are those patients without ocular myopathy and with other myopathic signs (proximal/distal or axial muscle weakness, isolated exercise intolerance, myoglobinuria triggered by exercise) and/or other signs of myopathic involvement.

Standard protocol approvals, registrations, and patient written informed consent was obtained from all participants, and the ethics committees of each centre approved the study.

Statistical analysis

Frequencies, average, median, SD, standard error, and percentiles were calculated for each feature. Values were reported as mean ± SD for variables with normal distribution, as median and interquartile range (IQR) for variables with skewed distribution, and as a percentage for categorical data. To verify the distribution of each parameter, the Kolmogorov–Smirnov test was performed. For continuous variables, the independent Student t test and Mann–Whitney U test (Wilcoxon's test) were applied to find differences between 2 groups. For comparisons between paired data (T0 and T12), we used Student t-test for data with normal distribution and Wilcoxon test for data with non-normal distribution. Proportions were analysed by Fisher’s exact. Differences among patients with the three different phenotypes were evaluated using analysis of variance one-way and Bonferroni post hoc tests for data with normal distribution and Kruskal–Wallis and Dunn-Bonferroni post hoc tests for data with skewed distribution, after assessing the equality of variances for each variable using parametric and nonparametric Levene tests. In all cases, a p value of less than 0.05 was regarded as significant; a lower value was indicated if it was found. Biostatistical analysis was performed with IBM SPSS 20.0.0 program.

Results

The predominant phenotype was PEO&MiMy (44.9%), followed by MiMy (36.4%) and PEO (17.8%) (Table 1), both at baseline and follow-up.

Table 1 PMM phenotype distribution

PEO and PEO&MiMy presented a significantly higher proportion of single deletion vs MiMy (respectively 76.2% and 58.5% vs 2.3%, both p < 0.005), whereas PEO&MiMy exhibited the highest proportion of nuclear DNA mutation although not statistically significant when compared with the other two phenotypes (47.4% with POLG mutations). The predominant genotype of MiMy (81.4%) was a mtDNA point mutation [18 of 35 (51.4%) m.3243A>G and 11 of 35 (31.4%) m.8344A>G]. The prevalence of mtDNA mutations in MiMy were significantly higher than in PEO and PEO&MiMy (respectively 81.4% vs 9.5% and 5.7%, both p < 0.005 (Fig. 1). The distributions of mutations in our cohort are shown in supplementary Table 1.

Fig. 1
figure 1

The histogram represents the different proportion of genotypes in the 3 PMM phenotypes. The histogram represents the different proportion of genotypes in the three PMM phenotypes described. PEO progressive external ophthalmoplegia, MiMy mitochondrial myopathy

PEO patients showed significantly better performance than PEO&MiMy and MiMy in 6MWT (p < 0.0005) and several other outcome measures (3TUG p < 0.0005, 5XSST p < 0.005, NMDAS total score p < 0.005 and subitems p < 0.0005) and less fatigue and pain (FSS p < 0.005 and WHIMPY p < 0.0005), both at T0 and T1 (Fig. 2). PEO&MiMy and MiMy did not differ significantly in functional outcome measures results.

Fig. 2
figure 2

Difference between PMM phenotypes (red plots T0, blue blots T1). 6MWT 6-min walk test, 3TUG triple timed up-and-go test, 5XSST 5X sit-to-stand test, FSS Fatigue Severity Scale, NMDAS The Newcastle Mitochondrial Disease Scale for Adults, WHYMPI West Haven-Yale Multidimensional Pain Inventory, TWST timed water swallow test, TOMASS test of masticating and swallowing solids

Table 2 shows the paired data (T0 and T1) for each functional outcome measure in the whole cohort. NMDAS total score and its subitems (myopathy and exercise intolerance), TWST, TOMASS and pain severity showed a significant worsening, while apparently in the entire cohort 6MWD (6 min walking distance) seems to show a significant improvement (17.98 m). NMDAS total score, subitem myopathy and exercise tolerance declined from T0 to T1 of 0.9, 0.43 and 0.33 respectively.

Table 2 Distribution of paired values in T0 and T1 evaluation for all patients

In Table 3, we show paired values at T0 and T1 evaluation according to PMM phenotype, and only data with significant variation are shown. 6MWT significantly improved in PEO, but was substantially stable in MiMy and PEO&MiMy.

Table 3 Paired values at T0 and T1 evaluation according to phenotype distribution

To better analyse the apparent increase in the 6MWD, which however, we could explain by a training effect or by the intrinsic variability of the test (discussed later [3]), we conducted a sub-analysis in two ways: by dividing patients by phenotype and by comparing the lower percentiles with the higher ones. Analysing by percentiles (Table 4), we found a stability of 6MWT in the lower percentiles and a significant 6MWT increase in the two higher percentiles. The lower and the higher two percentiles had a different distribution of phenotypes: the higher percentiles showed significantly more PEO patients (most of them with single deletion) and significantly fewer MiMy (Table 5).

Table 4 Paired values in T0 and T1 evaluation according to median of results obtained by patients in 6MWT in T1
Table 5 The table shows the frequency of PMM phenotypes in the lower two percentiles (6MWT < 408M) and in the higher two percentiles (6MWT > 408M)

Furthermore, the lower percentiles showed worst score on both NMDAS subitem myopathy and exercise tolerance, worst score on 5XSST (Fig. 3).

Fig. 3
figure 3

Differences in NMDAS subitem, 5XXST and FSS between higher and lower percentiles based on T1 6MWT (T0 red plot, T1 blue plot). 5X sit-to-stand test, FSS Fatigue Severity Scale, NMDAS The Newcastle Mitochondrial Disease Scale for Adults, 6MWT 6-min walk test

We therefore hypothesized that the apparent increase of 6MWD in the whole cohort could be explained by the increased variability of 6MWD in the PEO phenotype and in patients with better physical performance.

In a subset of 17 patients (at T1), we have assessed the 6MWT fatigability (slope) (first and last minute speed comparison): the comparison was statistically significant (first minute 72.7 ± 24.3 m vs last minute 47.5 ± 24 m; mean speed 1.2 m/s first minute vs 0.8 m/s last minute, p < 0.005) (Fig. 4).

Fig. 4
figure 4

6MWT slope: in 17 patients, we observed a significant decline in 6MWT speed (m/s) between the first and the last minute. 6MWT 6-min walk test

6MWT is recognized to have significant test–retest variability, so, as previously done for other neuromuscular disorders like Pompe disease or Duchenne muscular dystrophy [4, 5], we calculated the MCID (minimal clinically important difference) [6]: baseline SD/3; MCID for the whole cohort was 33.3 m.

Differences found at baseline on mtDNA, nuclear DNA and single deletion [2] were confirmed at follow-up; single deletion showed better performance on 6MWT but worst score of 5XSST when compared with nuclear DNA mutations; NMDAS total score was not significantly different between genotypes, mtDNA single deletion had better NMDAS subitem exercise tolerance vs nDNA and other mtDNA mutations and better NMDAS subitem myopathy vs mtDNA mutation (Fig. 5).

Fig. 5
figure 5

Differences among genotypes. T0 red plot, T1 blue plot. 6MWT 6-min walk test, 5XSST 5X sit-to-stand test, NMDAS The Newcastle Mitochondrial Disease Scale for Adults, mtDNA mitochondrial DNA, nDNA nuclear DNA

Discussion

We described the 12-month evolution of selected outcome measures in a cohort of 117 PMM patients. The three PMM phenotypes showed a different proportion of genotypes: we confirm the previous observation of a positive association between ocular myopathy (both PEO and PEO&MiMy) with single mtDNA deletion and nuclear DNA (mainly POLG) mutations [7], while MiMy phenotype is associated with other mtDNA mutations, mainly m.3243A>G and m.8344A>G.

The three PMM phenotypes also showed significant differences in outcome measures, as PEO had significantly better value of 6MWT compared to MiMy and PEO&MiMy. Moreover, PEO patients showed significantly better performance than PEO&MiMy and MiMy in other outcome measures (3TUG, 5XSST, NMDAS total score and subitems) and less fatigue and pain, at both T0 and T1. PEO patients walked more at the 6MWT at 12 months follow-up, while in MiMy and PEO&MiMy this was not observed. In the whole cohort NMDAS total score, subitem myopathy and exercise tolerance declined from T0 to T1 of 0.9, 0.43 and 0.33, respectively, whereas these declines are not evident in PEO.

It is difficult to compare our data with those in the literature, as the published series have shorter follow-up times. Madsen et al. in the MOTOR trial stated a “large interparticipant and intraparticipant variation”, even in the placebo group, with a significant variation (up to 130 m in two placebo treated PMM) [8]; moreover, the 6MWD median value was high (442 ± 133 in the placebo group, 421 ± 130 in the omaveloxone group) and similar to our two higher percentiles. A similar 6MWD increment was also described in the elamipretide trial, in which an improvement of 20.9 m was observed in the placebo group at 5 days [9].

The interparticipant and intraparticipant variability of 6MWD is well known; 6MWT has a significant variability across different tests and observers, both in healthy people and in several diseases, including PMM [3, 10]. On the other hand, the 6MWT has been used as the main outcome measure in several trials of neuromuscular diseases, including Pompe disease, mucopolysaccharidosis, Duchenne dystrophy and spinal muscular atrophy [6, 11,12,13,14].

Its role in assessing motor performance and exercise intolerance has been evaluated in PMM, and some researchers proposed other measures, like 12MWT or 6MWT slope across minutes as described in RYR related myopathies [10, 15]. Flickinger et al. recently showed firstly 6MWT slop as a measure of exercise intolerance [10]. Our data, but also the MOTOR trial, showed a great variability in 6MWT, in our cohort especially in the higher percentiles with more PEO patients and single deletion. On the other hand, the 6MWT in the lower two percentiles was stable at 12 months.

In the whole cohort, the apparent increase in 6MWD (17.98 m) is striking; however, this increase is lower than the MCID (33.3 m). The MCID is defined as “the smallest difference in score in the domain of interest which patients perceive as beneficial and which would mandate, in the absence of troublesome side effects and excessive cost, a change in patient management” [16]. Therefore, this increase in 6MWT is probably not to be considered clinically significant. Under this consideration, we can affirm a substantial stability of the 6MWT at 12 months.

Based on our data, we consider useful for a clinical trial set-up our clinical and genetic subgroup PMM classification. Although some genotype–phenotype correlations were observed, these are not as strong as is often the case in mitochondrial medicine; therefore, it would be advisable to evaluate the effect of therapies on specific phenotypes rather than specific genotypes.

We did not find any difference between MiMy and PEO&MiMy populations in the analysed outcome measures; however, we do believe that MiMy, with or without PEO, should not be considered as a single entity in future clinical trials because, as shown in Table 5, the presence of PEO in the MiMy patients may lead to a milder phenotype as demonstrated by the 6MWD above 408 m in 50% of cases (Table 5).

Overall, 6MWT is a good outcome measure in PEO&MiMy and MiMy walking less than 408 m, but not in PEO and in those walking above 408 m, providing that the primary endpoint of an interventional trial is not the stability at 12 months, which is intrinsic to the natural history of the disease. It would be interesting to evaluate the effect of therapies on the 6MWD slope even if we offer only a few supporting observations. We have no biomarkers (FGF-21 and GDF-15) follow-up data; their role in future trials is still unclear, although in TK2 myopathy a reduction in GDF-15 levels after treatment has recently been observed [17].

Unfortunately, because of the COVID-19 pandemic, we were not able to re-evaluate most of our patients at 24 months, as originally planned. Further natural history studies, with prolonged monitoring of appropriate outcome measures, are needed. As the literature indicates, some PMM patients may also develop additional system or organs involvements, which was the case for our excluded patient developing parkinsonism, and this is a typical feature of primary mitochondrial diseases, impinging not only on their phenotypic classification but, most importantly, changing their responses to specific outcome measures.

This work represents a real-life picture of a cohort of patients with PMM monitored for twelve months, which provides important and useful information for the planning of clinical trials. For example, from this work we may affirm that a clinical trial on PMM cannot have the stability of 6MWT at 12 months as endpoint, since this is intrinsic in the PMM natural history. Furthermore, inclusion criteria that allow the enrollment of all PMM, despite different phenotypes (PEO and PEO&MiMy or MiMy) that evolve differently in a twelve-month period, may lead to a strong methodological bias and failure of the trial.

Data availability statement

Current article data are accessible from Michelangelo Mancuso, University of Pisa. In accordance with the data protection legislation in Europe (General Data Protection Regulation), to share the data of the Italian Network, it is necessary to stipulate an agreement between the University of Pisa and the applicant institution. Study data can be requested by contacting Michelangelo Mancuso (michelangelo.mancuso@unipi.it).