Background

Chronic obstructive pulmonary disease (COPD) is highly heterogeneous and complex hence, personalising assessments and treatments to this population across different settings and available resources imposes challenges and debate [1, 2].

Research efforts have been made to advance knowledge in this field, namely through the identification of homogeneous subgroups of patients with COPD [3, 4], the so-called clinical phenotypes or profiles, grouped by different type of personal characteristics (e.g., genetic, clinical, biochemical, radiological) for prognostic and therapeutic purposes [3,4,5,6,7,8,9,10,11,12]. More recently, a new approach, “treatable traits”, i.e., pulmonary, extra-pulmonary and behaviour/lifestyle characteristics of each person that are clinically relevant, identifiable and treatable, emerged [13,14,15]. Although studies on clinical profiles and treatable traits have been conducted, their cross-sectional nature, narrow eligibility criteria, main focus on physiological/pulmonary measures often not available across settings [5, 6], absence of decision trees and lack of validation with independent samples [3], limits our understanding of the heterogeneous manifestations of COPD and hinders their applicability in daily clinical practice.

Few studies have tried to identify profiles and their treatable traits using clinical and patient-reported outcomes beyond pulmonary measures and understand their behaviour over time [9, 14, 15]. Moreover, to be easily applied and useful in clinical practice, these profiles need predicting tools, such as decision trees [5]. Simple and validated decision trees are scarce but important as a first line approach, as they quickly enable the allocation of each individual into a profile [5, 10]. Such allocation can then aid the decision-making of the most suitable comprehensive assessments to identify each person’s treatable traits and guide tailored and multicomponent interventions that can alter the course/impact of the disease and its daily management [10, 13, 14].

Thus, we aimed to (i) identify and describe profiles and respective treatable traits in people with COPD based on simple and meaningful clinical measures possible to be collected with minimal resources; (ii) develop and validate a decision tree to quickly identify the profile of each person and (iii) assess the stability of the profiles during a six-months period.

Methods

Study design

An observational, prospective cohort study was conducted with data collected between 2017 and 2020 in GENIAL (PTDC/DTP-PIC/2284/2014) and PRIME (PTDC/SAU-SER/28806/2017) projects. PRIME builds on GENIAL project, i.e., settings, recruitment strategies and data collection protocols were similar. Data were first gathered from both projects and then analysed. The study was approved by five Ethics Committees (CHMA 09/2016-10/2018; ULS Matosinhos 10/CES/JAS 17/02/2017-73/CE/JAS 12/10/2018; CHBV 777,638–086,892; HDFF 1807/2017-27/05/2019; ARSCentro 64/2016-85/2018). This work is described according to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement [16].

Participants

People with COPD diagnosed according to the GOLD criteria [17], clinically stable in previous month (i.e., no hospital admissions, acute exacerbations or changes in medication) were included. Exclusion criteria comprised the presence of other respiratory diseases or any clinical condition that precluded participation in the assessment (i.e., signs of cognitive impairment or presence of a significant cardiovascular, neurological, musculoskeletal, immunological or infectious disease). Eligible participants were identified in hospitals and primary healthcare centres routine appointments by their clinicians, who explained the study. Interested participants were contacted by the researchers who obtained written informed consents and performed data collection.

Data collection

Sociodemographic (age, sex), anthropometric (height and weight to compute body mass index [BMI]) and general clinical (smoking habits, self-reported medication for COPD, use of long-term oxygen therapy and non-invasive ventilation, number of acute exacerbations and hospitalisations in the past year) data were collected. Severity of comorbid diseases was scored with the Charlson Comorbidity Index (CCI) and interpreted as: (i) mild, 1–2; (ii) moderate, 3–4; and (iii) severe, ≥ 5 [18]. Self-reported physical activity was assessed with the brief physical activity assessment tool (BPAAT) [19]. Scores range from 0 to 8 being interpreted as 0–3 ‘insufficiently active’ or ≥ 4 ‘sufficiently active’ [19].

Lung function was assessed with a spirometer (MicroLab 3535, CareFusion, Kent, UK) and values from the forced expiratory volume in one second (FEV1) percentage predicted were used to classify participants’ according to the GOLD grades (1, 2, 3 and 4) [17]. GOLD groups (A, B, C and D) were determined using the number of acute exacerbations and hospitalisations in the previous year and the COPD assessment Test (CAT) [17].

Impact of the disease was assessed with CAT [20]. Scores range from 0 to 40 and are interpreted as ≤ 10 low, 11–20 medium, 21–30 high and 31–40 very high impact [20]. Activity-related dyspnoea was assessed with the modified British Medical Research Council dyspnoea questionnaire (mMRC) [21]. Scores range from 0 (no trouble with breathlessness) to 4 (too breathless to leave the house) [21].

Symptoms of anxiety and depression were measured with the Hospital Anxiety and Depression Scale (HADS) [22]. Scores range from 0 (minimum symptom load) to 21 (maximum symptom load) [22].

The Saint George’s Respiratory Questionnaire (SGRQ) was used to measure health-related quality of life [23]. Scores range from 0 (no impairment) to 100 (worst possible health-related quality of life) [23].

Quadriceps muscle strength (QMS) was measured through a maximum voluntary isometric contraction with a handheld dynamometer [24] (microFET2, Hoggan Health, The best Salt Lake City, Utah). Measurements were taken at the dominant side [24], and the best of three measurements (less than 10% of variation) was used for analysis. Functional status was measured with the 1-min sit-to-stand test (1minSTS) twice and the best performance was used for analysis [25]. Percentage of predicted for QMS and 1minSTS values was calculated based on reference values [24, 26].

These measures were chosen for their simplicity, wide availability, quick application in space constrained settings and easy identification of evidence-based treatable traits possible to tackle with non-pharmacological interventions, since our population was already receiving standard pharmacological treatment.

Data were collected during an initial visit by trained physiotherapists with more than three years of experience in similar data collection protocols. A subsample of participants was followed-up monthly over six months. Participants of different GOLD groups and grades, living within 100 km of our research facilities and willing to receive a physiotherapist for a short-monthly assessment at their homes were invited to participate. Data collected over the follow-up period consisted of number of acute exacerbations, hospitalisations and changes in medication in the previous month, CAT, mMRC, QMS and 1minSTS.

Statistical analysis

All analyses were performed in R (version 3.6.1) with a level of significance set at p < 0.05. Normality of data distribution was tested with the Shapiro–Wilk test. Comparison between participants’ characteristics and between the total and follow-up sample were analysed using t-test or Mann Whitney U-test, for continuous variables; and chi-squared or Fisher’s test, for categorical variables. Differences among profile characteristics were analysed using univariate ANOVA followed by Tukey’s multiple comparison test or Kruskal–Wallis test followed by Dunn’s multiple comparison test, for continuous variables; and chi-squared or Fisher’s test, for categorical variables, according to their assumptions.

Profiles and treatable traits

A principal component analysis was conducted to avoid strongly correlated variables dominating the cluster assignment [27]. Scores associated to the principal components yield a data matrix, on which the clustering algorithm, k-medoids, was applied [28]. The optimal number of clusters, k, was identified using the gap statistic [29] (Additional file). Age, BMI, pack years, CCI, FEV1, forced vital capacity (FVC) percentage predicted, CAT, mMRC, HADS-A, HADS-D, SGRQ, QMS, and 1minSTS were explored.

The presence/absence of treatable traits was considered as described in Table 1.

Table 1 Treatable traits, measurement instrument and cut-off values used for interpretation in people with chronic obstructive pulmonary disease

Decision tree

Random Forest was used to rank the importance of the above-mentioned 13 variables used in the clustering procedure (Additional file). The top variables were selected to construct the decision tree.

The decision tree was developed and validated with two independent data sets randomly selected from the total sample; 70% was used for generation and 30% for validation, i.e., to determine its prediction ability in identifying the correct profile [30]. The accuracy, defined as the proportion of correct classifications, was used to quantify the predictive ability of the decision tree. The decision tree that maximised accuracy was chosen.

Agreement between the profile predicted by the decision tree and the profile defined by the clustering procedure was determined using Cohen’s Kappa and interpreted as: poor (k < 0), slight (0.00 ≤ k ≤ 0.20), fair (0.21 ≤ k ≤ 0.40), moderate (0.41 ≤ k ≤ 0.60), substantial (0.61 ≤ k ≤ 0.80) or almost perfect (k > 0.80) [31].

Stability of the profiles over time

The decision tree was applied in all timepoints for each participant. Stability of the profiles was assessed with a stability score defined as the ratio between the number of timepoints that a participant was classified in the same profile (most frequent profile for that participant) and the total number of timepoints (i.e., 6). Possible scores ranged from 1/3 (maximum instability—migrating across the 4 profiles) to 1 (complete stability—there was no change in the profile allocation over time).

Results

A total of 523 people with COPD were recruited however, 171 had incomplete data due to participants’ lack of time. Therefore, for establishing the profiles and developing the decision tree, data from 352 people with COPD (67.4 ± 9.9 years; 275[78.1%] male; FEV1 = 56.2 ± 20.6% predicted), were used. From these, 133 people were followed monthly for six months. In 43 participants, data were missing in at least one timepoint and therefore, they were excluded from the follow-up analysis. Hence, stability of the profiles was performed with data from 90 people with COPD (67.6 ± 8.9 years old; 77 [85.6%] male; FEV1 = 52.1 ± 19.9% predicted). Figure 1 shows the flow of participants in the study.

Fig. 1
figure 1

Flow chart of participants’ recruitment for establishing the profiles and developing and validating the decision tree (n = 352) as well as for studying the stability of the proposed profiles (n = 90) in people with chronic obstructive pulmonary disease (COPD)

Four profiles were identified (Fig. 2 and Additional file 1: Figure S1). Table 2 presents baseline characteristics and treatable traits for the total sample, per profile and for the longitudinal subsample. Each profile incorporated people with COPD from all GOLD grades and groups (Additional file 1: Figure S2). Profiles 1 and 3 were composed of older, mostly overweight/obese and comorbid people. Profiles 2 and 4 included younger and underweight/normal weight people.

Fig. 2
figure 2

Graphical representation of the four profiles of people with COPD (n = 352) identified. A colour grade system of red (worst), yellow and green (better) was used to facilitate the clinical interpretation of each variable across profiles. Percentage of people on non-invasive ventilation (NIV), long-term oxygen therapy (LTOT), with anxiety/depression symptoms or with impairment are indicated in blue. BMI body mass index, CAT COPD Assessment Test, FEV1 forced expiratory volume in 1 s, HADS-A and HADS-D The Hospital Anxiety and Depression Anxiety and Depression Subscales, LTOT Long-Term Oxygen Therapy, mMRC Modified British Medical Research Council questionnaire, NIV Non-Invasive Ventilation, QMS Quadriceps muscle strength, SGRQ Saint George’s Respiratory Questionnaire, 1minSTS 1-min Sit-to-Stand Test

Table 2 Characteristics of people with chronic obstructive pulmonary disease (COPD), for the total sample (n = 352), for each of the four profiles identified and for the longitudinal subsample (n = 90)

Profiles 1 and 2 were mostly composed of people with severe airflow obstruction, medium–high levels of dyspnoea, medium–high impact of the disease and low lower-limb muscle function, functional status and health-related quality of life. Profile 2 integrated the higher proportion of people on long-term oxygen therapy, with anxiety and depression symptoms, highest impact of the disease and poorest health-related quality of life.

Profiles 3 and 4 included mostly people with moderate airflow obstruction, less dyspnoeic, with low impact of the disease, high lower-limb muscle function and functional status, and better health-related quality of life. Profile 4 also presented the highest proportion of sufficiently active people. Further details are presented in Additional file 1: Figure S3.

Decision tree

The Random Forest identified CAT (cut-off 18 points), age (cut-off 65 years), and FEV1% predicted (cut-off 48% predicted) as the most informative variables for the decision tree (Fig. 3) of the four profiles (Additional file). The tree assigned correctly 71.7% of people with COPD to their actual profile, thus a substantial agreement (Cohen’s Kappa = 0.62, p < 0.001) was observed.

Fig. 3
figure 3

Decision tree to assign people with chronic obstructive pulmonary disease (COPD) to the identified profiles (1, 2, 3 and 4). Cut-off points were 18 for the COPD Assessment Test (CAT), 65 years for age and 48% of predicted for the forced expiratory volume in 1 s (FEV1). Pie charts represent the proportion of people with COPD correctly assigned to each profile using the decision tree (using the actual profile provided in Fig. 2 as the criterion)

Stability of the profiles

From the 90 participants followed-up, 26 (28.9%) were in profile 1, 24 (26.7%) in profile 2, 23 (25.5%) in profile 3 and 17 (18.9%) in profile 4 (Additional file 1: Table S1). 42% (11/26) of people in profile 1, 25% (6/24) in profile 2, 65% (15/23) in profile 3 and 71% (12/17) in profile 4 remained in the same profile over time. Therefore, profile 4 presented the higher stability score (0.94 ± 0.10), followed by profile 3 (0.88 ± 0.19), profile 1 (0.85 ± 0.14) and profile 2 (0.83 ± 0.13). Overall, 48.9% (n = 44) of people remained in the same profile during the six months (stability score = 1). The other participants (n = 43; 47.8%) remained predominantly in one of the profiles (i.e., for at least 3 months) but moved between two profiles. Three participants (3.3%) moved across three profiles over time. The overall percentage of stability was 86.8 ± 15%. Figure 4 shows the flow of participants across profiles during the follow-up.

Fig. 4
figure 4

Flow of people with chronic obstructive pulmonary disease (n = 90) across profiles (1, 2, 3 and 4) during the six-month follow-up period

Discussion

Four distinct profiles and respective treatable traits of people with COPD were identified and described using simple and widely available measures. Heterogeneity within each profile and overlap in some treatable traits existed. We proposed a simple decision tree (CAT < 18 or ≥ 18 points; age < 65 or ≥ 65 years; FEV1 < 48 or ≥ 48% predicted) that can aid healthcare professionals in clinical practice to allocate each person to a specific profile quickly, even with minimal resources. This is of most importance, as it can rapidly guide personalised assessments and decision-making regarding the best treatments for each treatable trait. We also showed that 51.1% of the sample changed their profile over a six-month period and that those more symptomatic, with moderate-high impact of the disease and lower limb muscle function, functional status and health-related quality of life (profiles 1 and 2) were the ones less stable in their profile. Our findings suggest that frequent assessments of people with COPD might be needed to ensure that the most adequate personalised interventions are offered.

Despite the enormous variability of published clinical phenotypes/profiles in COPD [3,4,5,6,7,8,9,10,11,12, 32], some common ground to our study seems to exist, i.e., profiles composed of older, overweight and dyspnoeic people and others with younger people with nutritional depletion [5,6,7, 11] and high as well as low impact of the disease, lower-limb muscle function, functional status and health-related quality of life may be present in people with similar airflow limitation [5, 11, 12]. Thus, our findings reinforce the previously acknowledged need to include comprehensive assessments in people with COPD, that go beyond pulmonary measures [9, 12, 15, 33]. Such assessments, even if applied with minimal resources, as demonstrated in our study, allow the easy identification of extrapulmonary treatable traits within each profile, which is of most importance for the daily management of COPD since they can be modified with person-centred interventions [14].

In our study, we were able to identify that people with COPD from profiles 1, 2 and 3 might benefit from pulmonary rehabilitation (exercise, education and psychosocial support) to target their activity-related dyspnoea, impact of disease, lower-limb muscle dysfunction and/or functional status impairment [34]. Profiles among people with COPD who could benefit from pulmonary rehabilitation and physical activity promotion to address their symptoms, exercise intolerance and functional status impairments have been previously reported [8, 10, 32, 35,36,37]. Those in profiles 1 and 3 might also benefit from dietary counselling and calorie restriction plus resistance exercise training [38, 39] and referral for comorbidities’ management [40], due to their high BMI and severity of comorbidities. Indeed, different body composition and comorbidity profiles which involve several body systems in people with COPD have been acknowledged and multidisciplinary care approaches advocated [7, 41]. People in profile 2 showed the worst emotional status. This was not surprising, as a specific emotionally dysfunctional cluster has been previously identified in people with COPD [9]. People in profile 2 might benefit from a specific psychological assessment and possibly cognitive behavioural therapy [42, 43]. Lastly, people in profile 4 exhibit much better health status than those in the other profiles, indicating that physical activity counselling [44] and self-management programmes [45] might be the most appropriate interventions as well as nutritional support to manage their underweight [46]. In fact, regular physical activities in the local community have been recommended for older people with COPD who integrate the profile of no or low disease burden and better health status [10]. This recommendation is based on the rationale that any additional increase in physical activity and decrease in sedentary behaviour is important to improve health outcomes in this population [47, 48].

The proposed and validated simple decision tree facilitates the identification of these profiles in clinical practice. Similar cut-offs have been previously proposed: for CAT when referring to exercise-based interventions[10] and to better differentiate symptom burden in COPD [2]; as well as for age [5] and for FEV1% predicted when predicting mortality [49], which suggests that these cut-offs might be the most adequate to differentiate health status of people with COPD. The quick allocation of each person to a profile can aid decision-making regarding prioritisation of assessments and identification of specific targets to improve patient-related outcomes [10], saving time and financial resources spent in first assessments. However, it should never preclude healthcare professionals from conducting individual comprehensive assessments [9, 12].

Our work also emphasised the need for regular assessments as we found that 51.1% of people with COPD assigned to one profile changed to another over time, and hence some of their treatable traits might have changed too. Migration across profiles within one [49] or two [50] years has been previously observed, mainly driven by changes in symptoms (CAT), lung function (FEV1), functional status (Six-minute walking test) [50] and physical activity levels [49]. Our findings show that this variability was especially evident in the more “deteriorated” profiles, mainly due to changes in the CAT score, and confirms that not all patients progress unfavourably [49, 50]. It is already known that dramatic changes can occur over time in variables that capture patients’ real life, such as symptoms (e.g., fatigue) and functional status [51], whilst FEV1 remains relatively stable [52]. In fact, the profile of patients with COPD with prolonged hospitalizations has been found not to be associated with disease severity assessed by lung function [53]. Therefore, identifying and tackling extrapulmonary treatable traits as early as possible seems fundamental to optimise outcomes in COPD, independently of the resources available. Reasons causing the shift among our profiles (e.g., change in lifestyle, acute exacerbation occurrence) would be of interest but it was beyond the scope of our study. The literature on the variability/stability of profiles and respective treatable traits over time is yet scarce and further investigation is warranted.

Strengths of our study include the integration of people with COPD from all GOLD grades and groups, living in the community, being a good representation of their daily lives. Our findings are prevenient from comprehensive assessments based on simple and widely available measures, and a simple decision tree to identify each profile and respective treatable traits is provided. Variability of each profile was evaluated over time, showing the need of personalising assessments and their frequency.

Limitations of this study include the relatively small number of participants included and proportion of females in our sample. All efforts were conducted to avoid missing data. To minimise participants’ transportation and discomfort, data were mostly collected during their routine appointments with their clinician (at the hospital or primary healthcare centre), where time, space and equipment constraints exist. This decision had, however, implications in our study as often participants were not available to complete the full assessments. Conducting the assessment in a separate occasion of the regular appointment and/or agreeing to collect data in two different moments, could minimise this constraint. Nevertheless, data gathered in this work represents approximately 5% of the COPD population in our country and were collected across different settings and geographic regions, representing real-world data. Similar samples with unbalance proportion of female/male have been widely reported in COPD profiles [5,6,7, 11] however, we acknowledge that this may limit the findings to all people with COPD. Our follow-up period was also relatively short, limiting our understanding about the impact of profile migration in the overall disease development and important outcomes (e.g., risk of exacerbations). Longer longitudinal studies, with the inclusion of a higher proportion of females, and exploring associations with clinically relevant variables are warranted to confirm and further develop our findings. We did not include people who dropped out from follow-up as we wanted to assess stability of the profiles and respective treatable traits, thus we are unable to infer if the profile of those lost to follow-up differed from those included. Although we performed a comprehensive assessment, many other important measures could be considered when profiling and identifying treatable traits in people with COPD (e.g., fatigue impairment, sleep disturbances, sedentary behaviour)[14]. Core treatable traits in COPD and efficacy of targeting treatments to them are yet unknown but should be addressed in future investigations.

Conclusion

Four profiles and their treatable traits were identified in people with COPD using minimal resources. A simple decision tree (CAT < 18 or ≥ 18 points; age < 65 or ≥ 65 years; FEV1 < 48 or ≥ 48% predicted) is now available to facilitate routine allocation of people to these profiles. Heterogeneity within each profile and overlap in treatable traits existed, as well as migration across profiles (51.1% of the sample) over time, especially in those more symptomatic and with worse functional status. Our findings provide additional evidence emphasising the need of frequent personalised assessments to identify treatable traits and ensure the most adequate person-centred intervention is offered to people with COPD. Nevertheless, future research namely on the validation of our decision tree and profiles in other samples would be beneficial.

Take home message

Profiles and treatable traits in COPD can be identified even with minimal resources. People with COPD may change profile and treatable traits over time hence, regular assessments to deliver goal-targeted personalised treatments are needed.