Background

Chronic Fatigue (CF) affects a substantial proportion of the population. In adolescents, about 20% of girls and 6.5% of boys report to have been severely fatigued during the last month [1, 2]. The label Chronic Fatigue Syndrome (CFS), sometimes referred to as Myalgic Encephalomyelitis (ME), may be appropriate if the fatigue is unexplained, long lasting, disabling and accompanied by other symptoms such as post exertional malaise, musculoskeletal pain, orthostatic intolerance, and cognitive problems [3]. Adolescent CFS prevalence is estimated at 0.1 to 1.0% [4,5,6], and CFS may have detrimental effects on psychosocial and academic development [7, 8], as well as family functioning [8].

The diagnostic criteria of CFS has been a scientific controversy for decades. As no diagnostic biomarker has been discovered, the diagnosis depends upon specific constellation of symptoms. One part of the scientific community has promoted wide diagnostic criteria [9,10,11], and also maintained that CFS is most properly understood as a variant belonging to an even broader category, such as Functional Somatic Syndrome [12] or Bodily Distress Syndrome [13]. This “lumping together” tendency has been strongly opposed by another part of the scientific community, advocating CFS as a heterogeneous group of patients with different diseases and pathophysiological features, e.g., ME is claimed as a distinct unique entity different from other fatiguing conditions, such as reflected in the Canadian diagnostic criteria of CFS (sometimes referred to as the International Consensus Criteria of ME/CFS) [14, 15]. The commonly used Fukuda-criteria [16] as well as the more recently proposed SEID-criteria [3] may be taken to represent pragmatic compromises.

Nevertheless, the net result has been a confusing existence of at least 20 case definitions. Most of them require between 3 and 6 months of unexplained fatigue but vary considerably regarding requirement of additional symptoms. In a systematic review from 2014, Brurberg et al. could not draw firm conclusions concerning the validity of any of these criteria due to weak methodology and inconsistent results of the 38 included validation studies [17]. Accordingly, studies from our own institution question the validity of the Canadian-criteria [18] as well as the SEID-criteria [19] for diagnosing CFS in adolescents.

In an attempt of investigating possible subgroups within widely defined CFS cohorts, latent class analyses have been applied; recent reports suggest the presence of discrete endophenotypes [20,21,22,23]. However, these approaches still rely on subjective reporting of symptoms, and it remains unclear to what extent a specific endophenotype corresponds with certain pathophysiological mechanisms or etiological factors. The presence or not of such correspondence may be considered essential for a proper understanding of the underlying disease mechanisms of CFS.

Despite the absence of diagnostic biomarkers, associations between CFS and candidate genetic markers as well as certain aberrations of immunological, autonomic, neuroendocrine, cognitive and sensory processing functions have been firmly established in previous research [24]. As for genetic markers, a single nucleotide polymorphisms (SNP) in the gene encoding the catecholaminergic breakdown enzyme COMT (catechol-O-methyltransferase) has been linked to CFS in several reports [25, 26]. In addition, mutations in the serotoninergic system are one of the most consistently reported findings in genetic studies of CFS [27,28,29]. As for immunological aberration, the most consistent finding appears to be a tendency towards low-grade systemic inflammation, as reflected in elevated serum C-reactive protein (CRP) [30], elevated pro-inflammatory cytokines [31, 32], and increased levels of innate immunity gene products in whole blood gene expression analyses [33]. Also, low-grade inflammation has been hypothesized as a common pathophysiological phenomenon across fatigue states in general [34]. As for autonomic aberrations, most studies suggest a sympathetic predominance, reflected in increased sympathetic cardiovascular activity [35,36,37,38], decreased parasympathetic (vagal) heart rate control [39], altered sympathetic thermoregulatory responses [39], and increased plasma and urine catecholamines [30, 40]. This sympathetic predominance may be the underlying cause of the Postural Orthostatic Intolerance Syndrome (POTS) phenomenon, which is frequently observed among CFS patients [3]. As for neuroendocrine aberrations, attenuated hypothalamus–pituitary–adrenal [HPA] axis dynamics is a consistent finding across adult and adolescent CFS studies [30, 41,42,43,44]. Interestingly, normalization of HPA responses may be associated with improvement of symptoms and functional disabilities [43, 45]. As for cognitive functions, previous research has provided evidence of aberrations in the domains of attention, memory and reaction time [46,47,48]. Studies specifically addressing executive functions in adolescent CFS patients have reported impaired interference control [49, 50], cognitive flexibility [51], and working memory [50, 52]. Finally, as for sensory processing functions, three studies have reported strongly reduced pressure pain thresholds [53,54,55], suggestive of central sensitization to afferent sensory stimuli [56]. Accordingly, functional brain imaging studies have demonstrated differences across CFS patients and healthy controls [57].

Thus, an alternative approach for delineating possible CFS subgroups would be to use the above-mentioned biological aberrations as a point of departure instead of subjective symptoms when performing subgroup-generating statistical analyses. To the best of our knowledge, such an approach is novel in the field of CFS. In the present study, we aimed to: a) Explore possible subgroups based on biological aberrations within a widely defined cohort of adolescent CFS patients; b) Investigate to what extent these subgroups are associated with constitutional factors (including genetic markers), diagnostic criteria, subjective symptoms and prognosis.

Methods

Study design and ethics

This study is part of the Norwegian Study of Chronic Fatigue Syndrome in Adolescents: Pathophysiology and Intervention Trial (NorCAPITAL) (ClinicalTrials ID: NCT01040429), which is a combined cross-sectional and randomized controlled trial of low-dose clonidine in adolescent CFS; the design has been described in detail elsewhere [30]. In the present study, we used baseline data and follow-up data from week 30, collected between March 2010 and October 2012. The study was approved by the Regional Committee for Medical and Health Research Ethics for South-East Norway and the Norwegian Medicines Agency and adhered to the Declaration of Helsinki. Informed, written consent was obtained from all participants and from parents or next-of-kin if required.

Recruitment of CFS patients

All 20 hospital paediatric departments in Norway primary care paediatricians and general practitioners were invited to refer adolescents with CFS aged 12 to 18 years consecutively to the Department of Paediatrics at Oslo University Hospital, which served as a national referral center for young patients with CFS. To be eligible for the NorCAPITAL project, we required 3 months of unexplained chronic/relapsing fatigue of new onset. The patients were not required to meet any additional symptom criteria, in line with clinical Paediatric guidelines [9]. A standard form required the referral unit to confirm the result of clinical investigations considered compulsory to diagnose pediatric CFS (specialist evaluation, extensive hematology and biochemistry work-up, chest X-ray, abdominal ultrasound, and brain MRI). Also, the referring units were required to confirm that the patient (a) was hindered from normal school attendance due to fatigue; (b) was not permanently bedridden; (c) was not stroked by a medical or psychiatric disorder (including depression) and/or did not go through any concurrent demanding life event; and (d) did not use medicines (including hormone contraceptives) regularly. Patients considered eligible were summoned to our study center; a final decision on inclusion was made after a separate clinical examination combined with quality assessment of the previously conducted screening program. Details of the recruitment procedure and inclusion/exclusion criteria are described elsewhere [29].

All participants underwent an identical investigational program at baseline, 8 weeks and 30 weeks, which included a one day in-hospital assessment encompassing clinical examination, blood sampling, autonomic testing, and cognitive testing. Immediately afterwards, daily physical activity was monitored during seven consecutive days, and a self-administered questionnaire was completed.

Markers of biological aberrations

All methods for assessing markers of biological aberrations have been thoroughly described in previous publications from the NorCAPITAL project [30, 45, 50, 55, 58, 59]; a brief description is provided below.

Immunological markers were investigated by examining plasma CRP level through a high-sensitive assay (Roche Diagnostics, Indianapolis, IN, USA), and by measuring 27 plasma cytokines, including interleukins, chemokines and growth factors, using a multiplex technique (Bio-Plex Human Cytokine 27-Plex; Bio-Rad Laboratories Inc., Hercules, CA, USA) [58].

Autonomic markers were investigated using the Task Force Monitor ® (Model 3040i, CNSystems Medizintechnic, Graz, Austria), a combined hardware and software device for noninvasive continuous recording of autonomic cardiovascular control [60]. Supine values as well as responses to a low intensity 20 deg. head-up tilt test (HUT) are reported [59]. Power spectral analysis of heart rate variability (HRV) was calculated in the Low Frequency (LF) range (0.05 to 0.17 Hz), and High Frequency (HF) range (0.17 to 0.4 Hz) [61]. Vagal (parasympathetic) activity is the main contributor to HF variability, whereas both vagal and sympathetic activity contributes to LF variability.

Neuroendocrine markers included plasma and urine norepinephrine and epinephrine. These markers were assayed by high-performance liquid chromatography (HPLC) with a reversed-phase column and glassy carbon electrochemical detector (Antec, Leyden Deacade II SCC, Zoeterwoude, The Netherlands), using a commercial kit (Chromsystems, München, Germany) [62]. Urine free cortisol (non-conjugated cortisol) was assayed by solid phase competitive luminescence immunoassay (LIA) (type Immulite® 2000, Siemens Healthcare Diagnostics, NY, USA) after extraction from the urine sample with ether [63]. Plasma cortisol, adrenocorticotrophic hormone (ACTH), thyroid-stimulating hormone (TSH), and free thyroxine (FT4), as well as serum Insulin-like Growth Factor 1 (IGF1), were determined by routine assays at the accredited Hormone laboratory at Oslo University Hospital, Norway.

Cognitive function was assessed using the digit span test from the Wechsler Intelligence Scale for Children, 4th edition (WISC-IV) [64], the conditions 1–3 of Color-Word Interference test from the Delis-Kaplan Executive Function System (D-KEFS) [65], and the Total recall part of Hopkins Verbal Learning Test-Revised (HVLT-R) [66].

Pressure pain threshold was assessed by gradually applying increasing pressure to six predefined areas (the third finger’s cuticles, the trapezius muscle and the supraspinatus muscle bilaterally), by using the force transducer Commander™ Algometer, which has a rubber tip of 0.5 cm2 (JTECH Medical, Midvale, USA) [54]. Participants were asked to indicate the first sensation of pain during increasing pressure. All sites were assessed in the same order for each patient, and the pressure stimuli were applied twice to each spot and then averaged. Values were reported in Newton (N).

Genotyping

Procedures for genotyping in the NorCAPITAL project have been described in detail elsewhere [29]. In short, genomic DNA was extracted from whole blood samples. Single Nucleotide Polymorphism (SNP) genotyping was carried out using custom TaqMan SNP genotyping assays (Applied Biosystems, Foster City, CA, USA). Approximately 10% of the samples were re-genotyped and the concordance rate was 100%. To determine the length of the polymorphic promoter region of the serotonin transporter (5-HTT)-gene (SLC6A4), the DNA sequence was first amplified by polymerase chain reaction (PCR) and then separated by gel electrophoresis. The PCR yielded a long (529 bp) and a shorter (486 bp) fragment [67]. After four hours separation at 100 V on a 2.5% agarose gel (MetaPhor Agarose, Lonza cologne GmbH, Cologne, Germany), GelRed dye was added and the fragments were visualized by UV light (Biotium Inc, California, USA). The PCR 100 bp low ladder (Sigma-Aldrich CO, St. Louis, Mo, USA) was used to determine the length of the fragments.

Questionnaires

A CFS symptom inventory for adults [68] has previously been used to develop an analogous inventory for adolescents [30]. A total of 24 common symptoms are evaluated in terms of frequency during the last month (five-point Likert scale ranging from never/rarer than once a month to present every day/almost every day, scored from 1 to 5). The questionnaire includes case defining symptoms of CFS according to the Canada as well as the Fukuda definition. As a general rule, all symptoms required in the definitions had to be present more than once a week (corresponding to a score of three or higher) for patients to be categorized as CFS [18].

In addition, validated inventories were used to assess symptoms and disabilities.

Fatigue was assessed by the Chalder Fatigue Questionnaire (CFQ), which encompasses 11 items scored on a 4-point (0 to 3) Likert scale [69]; total sum score is applied. Depressive symptoms were charted with the Mood and Feelings Questionnaire (MFQ) consisting of 34 items scored on a 3-point (0 to 2) Likert scale [70]; total sum score is applied. Quality of life was assessed with the Pediatric Quality of Life Inventory (PedsQL), consisting of 23 items scored on a 5-point (0-25-50-75-100) Likert scale [71]; mean score across all items is applied. Functional disability was assessed using the Functional Disability Inventory (FDI) encompassing 15 items scored on a 5-point (0-4) Likert scale [72]; total sum score is applied.

Finally, the symptom of post-exertional malaise (PEM) was charted by a single item: “How often do you experience more fatigue the day after an exertion?”, scored on a 5-point [1,2,3,4,5] Likert scale.

Daily physical activity

The activPAL accelerometer device (PAL Technologies Ltd, Glasgow, Scotland) was used to provide data on step number and cadence as well as time spent on walking, standing and sitting/lying during everyday activities [73]. A recording period of seven consecutive days was selected. For each participant, all recording epochs were carefully and independently reviewed, and the mean number of steps per day was calculated for all recording epochs. Details on the activity recording procedure have been reported elsewhere [30].

Cluster construction

A total of 69 different biomarkers was selected from the NorCAPITAL database for analyses in the present study; the selection was guided by expert knowledge of the CFS/ME scientific literature. The biomarkers were grouped into five domains: endocrine (n = 10), inflammatory (n = 30), cardiovascular (n = 18), pressure pain threshold (n = 3), and cognitions (n = 8) (Fig. 1). Thereafter, in order to reduce the number of variables, correlation analyses among variables under each domain were performed. When two or more variables were strongly correlated (correlation coefficient ≥ 0.7), interpretability, suitability regarding statistical analyses and the size of the correlation coefficient were evaluated. The variable in total considered most suitable was kept for further analyses. A final correlation analysis of all remaining variables from each domain were performed, resulting in a total of 37 variables which become the basis for subsequent cluster analyses (Fig. 1, Additional file 1).

Fig. 1
figure 1

Number of variables before and after correlation analyses. A total of 69 different biomarkers was selected from the NorCAPITAL database and grouped into five domains (upper row): Endocrine, inflammatory, cardiovascular, pressure pain threshold, and cognitions. Correlation analyses among variables within each domain were performed. When two or more variables were strongly correlated (correlation coefficient ≥ 0.7), the variable considered most suitable for further analyses was carried over. Then, a final correlation analysis of all remaining variables from each domain were performed, resulting in a total of 37 variables across each of the five groups (lower row) which become the basis for subsequent cluster analyses

Firstly, hierarchical clustering analyses were performed within each of the five domains separately, using Ward’s method, squared Euclidian distance and Z-score. Thereafter, 1-3 variables from each domain were used for a final cluster analysis across all domains [74]. Variables were selected due to their importance in the cluster formation under each domain. The final number of clusters was decided primarily by visual inspection of dendrograms, but a preliminary validation of the possible cluster-solutions was also performed to ensure that there were meaningful differences between the clusters.

Cluster validation

Associations between clusters and simple demographic variables, constitutional factors (including genetic markers) and adherence to CFS diagnostic criteria were explored (Table 3). Baseline values of CFQ, PEM, MFQ, Steps per day, PedsQL and FDI were used to investigate associations between clusters and markers of symptoms and function. Changes in markers of symptoms and function from baseline to week 30 were used to assess prognostic value of clusters.

Generally, differences across clusters were analyzed by Fisher’s exact test, one-way ANOVA or Kruskal–Wallis test as appropriate. All statistical analyses were carried out by SPSS statistical software. A p-value of < 0.05 was considered statistically significant. No correction for multiple testing was performed due to the exploratory nature of the analyses.

Results

Of the 120 CFS/ME patients included in the NorCAPITAL project, four were excluded from further analyses due to lack of valid data, leaving 116 for analyses in the present study. In this group, 28% were males, and mean age was 15.4 years (Table 1).

Table 1 Background characteristics

Separate cluster analyses within each domain of variables revealed substantial cluster overlap as well as few statistically significant associations with symptoms, functional abilities and prognosis. However, from each analysis it was possible to identify the most important variables driving the cluster formation, which in turn were carried over to the final cluster analysis across all domains (Additional file 2).

The final cluster solution revealed six clusters based on a total of 10 variables (Fig. 2, Table 2). Cluster 1 is characterized by high pressure pain threshold levels and high scores on cognitive function tests, and was labelled pain tolerant & good cognitions. Cluster 2 is characterized by high urine cortisol:creatinin ratio, which signalizes restored HPA dynamics. Cluster 3 is characterized by a strong tachycardia response and corresponding fall in stroke volume during orthostatic challenge, typical of orthostatic intolerance. Cluster 4 is characterized by high levels of interferon gamma (INFγ) and interferon gamma-induced protein 10 (IP-10), indicative of low-grade inflammation. Cluster 5 is characterized by low pressure pain threshold levels and low scores on cognitive function tests; this “mirror image” of cluster one is labelled pain intolerant & poor cognitions. Cluster 6 is characterized by strong power of heart rate variability within the high-frequency (HF) domain, reflecting high vagal (parasympathetic) activity. Few individuals (a total of 4 and 3, respectively) belonged to cluster 4 and cluster 6.

Fig. 2
figure 2

Final cluster solution and cluster characterizations. Dendrogram displaying the final cluster solution, revealing six clusters based on a total of 10 variables (each vertical bar on the x-axis corresponds to one individual). The clusters were labelled according to results presented in Table 2 (cf. manuscript for further details): The pain tolerant & good cognitions-cluster (C1) is characterized by high pressure pain threshold levels and high scores on cognitive function tests. The restored HPA dynamics-cluster (C2) is characterized by high urine cortisol:creatinin ratio. The orthostatic intolerance-cluster (C3) is characterized by a strong tachycardia response and corresponding fall in stroke volume during orthostatic challenge. The low-grade inflammation-cluster (C4) is characterized by high levels of interferon gamma (INFγ) and interferon gamma-induced protein 10 (IP-10). The pain intolerant & poor cognitions-cluster (C5) is characterized by low pressure pain threshold levels and low scores on cognitive function tests. The high vagal activity-cluster (C6) is characterized by high heart rate variability within the high-frequency (HF) domain

Table 2 Final cluster solution-contributing variables

There were no significant differences between clusters regarding demographic and constitutional variables, including candidate genetic markers (Table 3). However, individuals belonging to Cluster 5—Pain intolerant & poor cognitions—were significantly more prone to adhere to the Canada 2003 diagnostic criteria for CFS. Also, this cluster had significantly poorer scores on the FDI and PedsQL functional inventories as compared to clusters 1–3 (Table 4). Symptoms scores for fatigue (CFQ), post-exertional malaise and depressive thoughts (MFQ) did not differ significantly across clusters. As for changes in symptoms and function over a 30-week follow-up period, there were no significant differences between the clusters, but a non-significant tendency for stronger functional improvement among Cluster 5—Pain intolerant & poor cognitions (Table 5).

Table 3 Final cluster solution–demographic, genetic and CFS criteria adherence differences
Table 4 Final cluster solution–differences in symptoms and functional abilities
Table 5 Final cluster solution–differences in development of symptoms and functions over time (baseline to week 30 follow-up)

A scatterplot of the three most important variables for the final cluster formation (urine cortisol:creatinine ratio, Δ HR orthostatic response and digit span forward) revealed a substantial overlap between the clusters (Fig. 3).

Fig. 3
figure 3

Scatterplot of the three most important variables for the final cluster formation. Each colored dot represents one individual belonging to one of the six clusters from the final cluster solution. Even though the three most important variables driving the cluster formation are used as coordinates, there is substantial overlap between the clusters

Discussion

The most important finding of the present study is that within a widely defined cohort of adolescent CFS patients, clusters corresponding to certain pathophysiological characteristics could be delineated, but overlap between clusters were substantial and no distinct subgroups could be identified. Also, there were scarce associations between clusters and constitutional factors, subjective symptoms and prognosis.

Medical diagnoses remain the foundations for treatment, rehabilitation, and prognostic assumptions; hence, the importance of valid diagnostic entities can hardly be underestimated. Sadly, the lack of objective criteria for CFS has contributed to the co-existence of multiple sets of diagnostic criteria, all of which are based on subjective reporting of symptoms. It is frequently maintained that certain case definitions correspond to specific underlying disease mechanisms; for instance, the Canada 2003 definition put strong emphasis on a possible inflammatory pathophysiology [14]. Accordingly, it is often argued that studies based on a wide diagnostic definition of CFS (i.e., a definition that requires a minimum of accompanying symptoms), such as the Oxford criteria [11], are at risk of introducing substantial heterogeneity in the patient sample which in turn may obscure results that pertain to a specific subgroup only. Specifically, it is frequently maintained that the use of a wide diagnostic definition in clinical trials tend to select a large portion of patients suffering from mental distress who may benefit from psychological/behavioural interventions, whereas such interventions are claimed to be unhelpful (or even harmful) for the potential subgroup of patients suffering from another (such as inflammatory) disease mechanism [75].

This argument seems to rely upon an a priori-assumption of the existence of subgroups within a widely defined CFS cohort, and the related research efforts tend to focus on how such subgroups can be found from analysis of patients’ symptoms (or biomarkers yet to be discovered). However, a more fundamental scientific question, which has scarcely been addressed in previous research, is whether such subgroups exist at all. In the present study, while the cluster analyses did suggest some delineation corresponding to previously identified characteristics of CFS pathophysiology, such as low-grad inflammation, altered HPA dynamics, and orthostatic intolerance, the most striking finding is the absence of well-defined subgroups. Rather, the data seems to represent continuous variables in a multidimensional space. Accordingly, the clusters were not significantly associated with symptom scores nor prognosis. Taken together, the findings of the present paper favor a “lumping together” rather than “splitting apart” approach to CFS caseness, and question the clinical usefulness of searching for CFS subgroups as well as the validity of the most “narrow” CFS diagnostic criteria. If confirmed by future research, this finding may have important clinical implications. It would suggest, for instance, that well-documented rehabilitation strategies might be applicable to a wide range of CFS sufferers.

That said, the cluster analysis did reveal some interesting associations, such as the positive association between restored HPA axis and functional abilities, confirming findings from previous reports [43, 45]. Also, there was an association between low pain tolerance and cognitive functions, poor functional abilities and quality of life, and adherence to the Canada 2003 diagnostic definition of CFS [14]. The causality of these associations remains to be clarified; for instance, functional disability may have a negative impact on cognitive test performance, as well as the other way round. Anyway, a better characterization of this particularly vulnerable group of CFS patients may help to tailor clinical rehabilitation programs.

Interestingly, while low-grade inflammation is advocated as an important pathophysiological feature of CFS patients adhering to the Canada 2003 diagnostic criteria [14], results from the present study opposes these assumptions. The cluster characterized by low-grade inflammation was not associated with the Canada 2003 case definition for CFS, while the cluster characterized as Pain intolerant & poor cognitions, which actually was associated with the Canada 2003 case definition, had the lowest score on inflammation variables. This result corroborates previous finding from our group [18], and further questions the validity of the Canada 2003 case definition.

Study strengths and limitations

A strength of the present study is the detailed characterization of CFS pathophysiology within several domains. Limitations include the relatively low number of CFS patients, leaving some of the clusters with few participants, and the study should therefore be regarded exploratory. Also, the study included adolescent patients only, and it is unknown to what extent results can be generalized to adults. Further research should seek to validate the present findings in a larger cohort of adult CFS patients.

The question on how to measure fatigue is a controversy in the field of CFS. The present study assumed a priori that fatigue is best conceptualized as a subjective sensation [76]; accordingly, a validated instrument based on self report (the Chalder Fatigue Questionnaire) was selected to operationalize fatigue. We acknowledge, however, that other researchers maintain that fatigue should be measured by objective standards (e.g. activity recordings). Also, recent findings suggest that the symptom of post-exertional malaise (PEM) is even more central to the phenomenon of CFS than previously understood, and that it should be assessed with comprehensive, validated instruments [77]. Unfortunately, these instruments were not available when the present study was planned.

Conclusion

Within a widely defined cohort of adolescent CFS patients, clusters could be delineated based on biological markers, but no distinct subgroups could be identified. Associations between clusters and constitutional factors, subjective symptoms and prognosis were scarce. These results question the clinical usefulness of searching for CFS subgroups, as well as the validity of the most “narrow” CFS diagnostic criteria.