Background

Chronic obstructive pulmonary disease (COPD) is common in industrialized nations [1]. Within the next decade, COPD deaths are expected to increase further and become the third leading cause of death worldwide by 2030 [2]. A majority of these patients perceive their quality of life to be inferior to that of patients with cardiovascular disease or diabetes. The advanced stages of the disease are characterized by symptom burdens similar to those associated with malignancies [3, 4].

The progression of COPD may result in chronic hypercapnic respiratory failure caused by deteriorating respiratory mechanics associated with an increase in airway obstruction and progressive lung hyperinflation [5]. These patients benefit from non-invasive ventilation (NIV), significantly reducing mortality and improving quality of life [6, 7]. It is recommended that NIV be started in stable COPD patients with chronic hypercapnia at a partial pressure of carbon dioxide (PCO2) level greater than 52 millimeters of mercury (mmHg), with the objective of reducing PCO2 to physiological values [6]. However, frequently, an acute exacerbation of COPD (ECOPD) results in acute hypercapnia and respiratory acidosis [8], for which NIV is highly effective in avoiding the need for intubation with invasive mechanical ventilation [9]. For these patients, NIV should only be used long-term if hypercapnia persists for more than two weeks following the exacerbation [7].

COPD patients with chronic hypercapnia who do not exhibit acute symptoms usually start NIV later in their disease course. According to a data evaluation of COPD patients with GOLD disease stages III and IV, a significant number of subjects present with untreated chronic hypercapnia [10], a condition associated with an increased risk of mortality [11]. Identifying such patients as early as possible will allow prompt NIV introduction, probably improving their survival. However, in most cases, chronic hypercapnia is diagnosed by chance when patients present with clinical symptoms such as exertional dyspnea.

There is currently scarce information about the factors that predict chronic hypercapnia development in COPD [12, 13]. Sufficient early risk stratification for chronic hypercapnia lacks valid parameters that would permit a transition into justified, closer clinical monitoring to start NIV as early as possible. According to prior research, a low forced expiratory volume seems to be associated with chronic hypercapnic respiratory failure in such patients [14].

The present study aimed to evaluate pulmonary function parameters derived from spirometry, body plethysmography, and lung diffusion capacity for carbon monoxide as predictors of chronic hypercapnia in patients with advanced COPD.

Methods

This monocentric, retrospective, observational cohort study was conducted at the Schillerhoehe Lung Clinic (Robert-Bosch Hospital GmbH, Germany). Project approval was granted by the local institutional review board for human studies (Ethics Committee of the State Chamber of Physicians of Baden-Württemberg, Germany, file number F-2022-136) and was performed according to the Declaration of Helsinki principles. Because data were evaluated retrospectively and pseudonymously, the institutional review board waived informed consent requirements. We adhered to STROBE guidelines for reporting observational studies [15].

Patient selection

The recording period is five years, from January 2018 to December 2022. We assessed adult patients with a confirmed COPD diagnosis according to the GOLD definition [16] and with a complete set of pulmonary function tests (spirometry, body plethysmography), blood gas analysis, and carbon dioxide lung diffusion capacity (whenever available).

Exclusion criteria were as follows: History of ECOPD within six weeks of the examination (treated with systemic steroids or antibiotics), body mass index (BMI) > 35 kg/m2, the introduction of domiciliary NIV before 2018, any other severe lung diseases (e.g., combined emphysema and pulmonary fibrosis, severe pulmonary hypertension with a mean pulmonary artery pressure > 35 mmHg on right heart catheterization), lung cancer, or previous lung resection surgery.

Data collection

Data was collected from the hospital’s electronic medical records and charting systems (iMedOne, Telekom Healthcare Solutions, Bonn, Germany). We evaluated patients` baseline demographics, clinical characteristics, comorbidities, pulmonary function tests, blood gas analysis, respiratory support such as long-term oxygen therapy (LTOT) or NIV, and current inhaled medications (long-acting ß2-agonists, long-acting anticholinergics, and corticosteroids).

Classification of outcomes

Patients were divided into two groups according to their actual daytime PCO2 as determined by capillary or arterial blood gas analysis. Hypercapnia was defined based on the physiological threshold of > 45 mmHg, equal to 6.0 kilopascals (kPa). As the introduction of domiciliary NIV is typically triggered by higher values [6], additional analyses were performed based on the clinically relevant threshold of > 52 mmHg (6.9 kPa).

Candidate predictors of hypercapnia

We focused our analysis on lung function parameters derived from spirometry (FVC, FEV1), body plethysmography (RV, TLC), and lung diffusion capacity for carbon monoxide (DCO-SB, DCO-VA).

Forced vital capacity (FVC) refers to the lung volume that can be exhaled at maximum speed after maximum inspiration [17]. The FVC is primarily used to measure the loss in lung capacity associated with restrictive lung diseases such as pulmonary fibrosis. The forced expiratory volume (FEV1) refers to the amount of air the subject can forcefully expel within the first second following maximum deep inspiration [17]. FEV1 primarily serves to determine the severity of airflow limitation in obstructive pulmonary diseases such as COPD or asthma. Residual volume (RV) is the volume remaining in the lungs following maximum forceful expiration [17], preventing the alveoli from closing at end-expiration. Total lung capacity (TLC) represents the volume of gas in the lungs after maximal inhalation. FVC and FEV1 are measured via spirometry, whereas RV and TLC require additional body plethysmography.

A subject’s carbon monoxide lung diffusing capacity (single-breath DCO-SB versus alveolar volume-corrected DCO-VA, referring to the Krogh index) reflects the amount of gas transported through its alveoli by diffusion in a given time period, which is determined by the size of the alveolar space and the thickness of the alveolar membrane [17, 18]. Numerous factors, such as pulmonary fibrosis, emphysema, pulmonary hypertension, and anemia, can reduce diffusion capacity.

Pulmonary function testing was conducted following the standards of the American Thoracic Society [17, 19].

Statistical analysis

Descriptive and frequency statistics were employed to compare demographics and clinical characteristics between patients with and without hypercapnia. A Chi-square or Fisher’s exact test was used to compare categorical variables. Depending on the continuous variables` homogeneity of variance, determined by the Kolmogorov-Smirnov normality test, differences between groups were analyzed through Student’s t-test or Mann-Whitney U-test.

Receiver operating characteristic (ROC) curve analysis and 2-times repeated, 5-fold cross-validation were used to assess the parameters’ diagnostic accuracy and internal validity for predicting chronic hypercapnia and to determine the parameters’ thresholds (the criterion associated with the Youden index) [20] that best separate hypercapnic from non-hypercapnic individuals. The resulting performance of each parameter was expressed as area under the ROC curve (AUROC), sensitivity, specificity, positive/negative predictive value, accuracy, positive/negative likelihood ratio (PLR/NLR), diagnostic odds ratio (DOR), F1 score, and Matthews correlation coefficient (MCC) [21]. Correlations between lung function parameters and spontaneous breathing PCO2 were determined through Spearmans` coefficient of rank correlation (ρ). We performed binary logistic regression analysis to determine if pulmonary function tests were independently associated with chronic hypercapnia and to estimate the probability of hypercapnia. The multivariable model used forward selection and included variables deemed clinically significant a priori (age, gender, and obesity) and those lung function parameters with a P value of less than 0.2 in bivariate analysis. Hosmer & Lemeshow and Nagelkerke R2 were used to evaluate the model’s goodness of fit. Probabilities are reported as odds ratios (OR) with 95% confidence intervals (95%CI). Finally, we performed linear and multiple regression analysis to formulate equations for estimating spontaneous breathing PCO2 based on those lung function parameters found to be independently associated with chronic hypercapnia.

Sensitivity analyses were conducted for selected tests using the clinically relevant PCO2 threshold of > 52 mmHg. Since there were no comparable studies on prolonged ventilated lung transplant recipients to determine sample size, we recruited patients to the maximum extent possible. We performed two-tailed tests; statistical significance was indicated by P < 0.05. The analyses were conducted with MedCalc® software v20.305 (Ostend, Belgium).

Results

Baseline characteristics

The present study screened 1588 patients, of whom 424 were included in the analysis (Fig. 1). The median age of the group was 70 years [IQR 63–77 years]; 165 (39%) of the participants were female, 162 (38%) were hypercapnic (based on the physiological PCO2 threshold), and there were no significant differences between the two cohorts in terms of comorbid diseases. Gender-specific distributions were not apparent.

Fig. 1
figure 1

Patient flow diagram. Abbreviations: COPD chronic obstructive pulmonary disease, ECOPD exacerbations of COPD, NIV non-invasive ventilation, FEV1%FVC Tiffeneau index

In hypercapnic patients, 93% received LTOT, and 74% received NIV. However, only 47% of normocapnic patients were treated with LTOT. In contrast, 103 of 106 patients (97%) received NIV when PCO2 levels were higher than 52 mmHg (the threshold of clinical relevance) (Table 1).

Table 1 Clinical characteristics – comparison of patients with and without chronic hypercapnia

Lung function parameters

There was a significant difference in FVC% (49% [41–58%] versus 64% [55–78%], P < 0.01) and FEV1% (27% [21–33%] versus 37% [28–46%], P < 0.01) between hypercapnic and normocapnic individuals. RV differences were less pronounced, and TLC was comparable between groups. Patients with hypercapnia exhibited significantly lower lung diffusion capacity, as well as a lower pH and an increased bicarbonate level in their blood gas analyses (Table 2).

Table 2 Pulmonary function parameters and blood gas analysis – comparison of patients with and without chronic hypercapnia

Predictors of hypercapnia

ROC curve analysis and cross-validation demonstrated reasonable accuracy in discriminating hypercapnic and normocapnic individuals by FVC% (AUROC 0.77 [95%CI 0.72–0.81], DOR 7.9, MCC 0.41) and FEV1% (AUROC 0.75 [95%CI 0.70–0.79], DOR 6.8, MCC 0.34), while DCO-SB% performed poorly (AUROC 0.64 [95%CI 0.58–0.71], DOR 5.0, MCC 0.17) (Fig. 2; Table 3, Additional file 1: Table S1). Similar results were obtained when sensitivity analyses were conducted based on the 52 mmHg PCO2 threshold for defining clinically relevant hypercapnia (Additional file 1: Table S2).

Fig. 2
figure 2

Comparison of ROC curves for selected pulmonary function parameters analyzed to predict chronic hypercapnia. Abbreviations: FVC forced vital capacity, FEV1 forced expiratory volume in 1 s, RV residual volume, TLC total lung capacity, DCO-SB single-breath lung diffusion capacity for carbon monoxide, DCO-VA transfer coefficient for carbon monoxide (Krogh index)

Table 3 Cross-validated performance of pulmonary function parameters analyzed to predict chronic hypercapnia

Based on the results of cross-validation, FVC% and FEV1% thresholds that best separated hypercapnic from normocapnic subjects reached 56% and 33% of predicted values (Table 3).

Rank correlations

Regarding FVC, FEV1, RV, and DCO-SB, there was a significant correlation between these variables and spontaneous PCO2, which was particularly strong for FVC% (Spearmans` ρ = -0.51 [95%CI -0.58 – -0.44], P < 0.01) and FEV1% (ρ = -0.49 [-0.56 – -0.42], P < 0.01). In contrast, this was not observed with TLC or DCO-VA (Fig. 3, Additional file 1: Table S3).

Fig. 3
figure 3

Rank correlations of selected pulmonary function parameters with spontaneous breathing PCO2. The heat map of Spearman’s rank correlation coefficients (ρ) with the LOESS (Local Regression Smoothing) trendline. Abbreviations: ρ, Spearman’s correlation coefficient (with 95% confidence interval); FEV1, forced expiratory volume in 1 s; DCO-SB, single-breath lung diffusion capacity for carbon monoxide; DCO-VA, transfer coefficient for carbon monoxide (Krogh index)

Regression analysis results

In univariable binary logistic regression analysis, FVC%, FEV1%, and RV% were independently related to chronic hypercapnia. After adjusting for age, gender, and presence of obesity (BMI ≥ 30 kg/m2), only FVC% (OR 0.95 [95%CI 0.93–0.97], P < 0.01) and FEV1% (OR 0.97 [95%CI 0.94–0.99], P = 0.029) remained in the final multivariable model (Additional file 1: Table S4-S5). Moreover, both linear and multiple regression analyses showed a significant relationship between FVC% and FEV1% concerning spontaneous PCO2 (Fig. 4, Additional file 1: Table S6).

Fig. 4
figure 4

PCO2 as a function of FVC% and FEV1%: Linear regression analysis. Abbreviations: FVC forced vital capacity, FEV1 forced expiratory volume in 1 s

Discussion

This study aimed to identify pulmonary function tests that predict the likelihood that patients with COPD will develop chronic hypercapnia, which is the primary indicator of whether they should receive domiciliary NIV, a treatment proven to reduce overall mortality. Statistically significant correlations and independent relationships were found between spirometric parameters and actual PCO2 levels in patients without a history of acute exacerbations within the previous six weeks. Specifically, FVC% and FEV1% showed reasonable accuracy in identifying patients with and without hypercapnia, as defined by the physiological PCO2 threshold of 45 mmHg. Optimal cut-off values for hypercapnia prediction were 56% and 33% of predicted FVC% and FEV1%, respectively. The results of sensitivity analyses were comparable when using 52 mmHg PCO2 as the threshold for clinically relevant hypercapnia, which usually results in the introduction of NIV.

It is known that chronic hypercapnia in the context of respiratory insufficiency is associated with an increased mortality rate [22]. The development of hypoxemia or hypercapnia in COPD can be explained by structural changes to the airways, alveoli, and mechanisms of alveolar ventilation [5] in conjunction with a mismatch in ventilation and perfusion [23]. Hypercapnia is characterized by chronically overloaded inspiratory respiratory muscles, referred to as the respiratory pump. With COPD progression, the resistance of the airways increases, resulting in progressively hyperinflated lungs [5]. Due to this obstruction, mechanical stress (pressure) is placed on the respiratory pump. Accordingly, this implies that the lower the FEV1, the higher the airway resistance and, consequently, the respiratory pump’s work of breathing. Upon exhaustion, the alveolar ventilation necessary to maintain normocapnia can no longer be provided. In light of this, it is reasonable that decreasing FEV1 may indicate the development of hypercapnia. Indeed, previous studies have shown that the decrease in FEV1 correlates with the rise in PCO2 [14]. However, the present study is the first to evaluate the diagnostic accuracy of pulmonary function tests in predicting chronic hypercapnia, demonstrating that FEV1% and, even more interestingly, FVC% have predictive value for the development of hypercapnia. We defined hypercapnia as PCO2 above 45 mmHg (the physiological threshold); however, the introduction of NIV is typically performed when the PCO2 exceeds 52 mmHg. Nevertheless, the physiological threshold is a reasonable choice, as many patients with daytime borderline hypercapnia may experience significant increases in PCO2 during sleep, which may also result in the initiation of NIV [24].

Currently, the FVC serves primarily as a parameter for assessing restrictive lung diseases, in which a decline indicates the progression of interstitial lung disease [25] and is predictive of patient mortality [26]. However, FVC has not yet been described as a prognostic factor or an indicator of progression in COPD. Based on the present study’s findings, a reduction in ventilated lung volume, as measured by FVC, may also contribute to chronic respiratory failure due to obstructive lung disease. In this scenario, FVC reduction may be more than just an indicator of lung hyperinflation severity, given that RV, another measure of lung hyperinflation, was less accurate at predicting chronic hypercapnic respiratory failure. Theoretically, FVC may be a more accurate indicator of actual functional lung size than RV, with decreasing FVC – indicative of “shrinking” functional lung volume – causing a faster and shallower breathing pattern [12], thereby aggravating dead-space ventilation (which equals reduced ventilatory efficiency) and lung hyperinflation [5]. Evidence suggests that respiratory muscle dysfunction contributes to hypercapnia development [27], primarily due to unfavorable stretching of muscle fibers resulting from lung hyperinflation, adversely affecting their mechanical efficiency [5].

The present findings indicate that lung diffusion capacity is also involved in developing hypercapnia, even though it is not as accurate in diagnosing hypercapnia as FVC1% and FEV%. A decreased diffusion capacity is also associated with reduced ventilatory efficiency, resulting in a higher minute ventilation required to maintain the same PCO2 [28, 29]. Consequently, the respiratory pump is under more intense mechanical stress, probably causing exhaustion and chronic respiratory failure.

Several methods have been established for monitoring and following up on COPD patients. The BODE index combines various parameters (body mass index, airflow obstruction, dyspnea, and exercise capacity) and is more predictive of patient mortality than its components alone [30]. However, lung function parameters such as FVC% and FEV1% should be considered more carefully when evaluating chronic hypercapnic respiratory failure. Although spirometry is unlikely to be used as a screening tool [31], it is an essential follow-up test owing to its cost-effectiveness, precision, and objective nature. As per GOLD report 2023, there is no clear recommendation regarding spirometry monitoring and follow-up interval after detecting progressive declines in FEV1% [16]. Generally, one year is considered an appropriate time interval. In light of the potential prognostic value of pulmonary function tests, it may be reasonable to use a shorter interval of six months or even three months for those with a decline in FVC% or FEV1% below the thresholds determined in the present study to prevent missing the onset of clinically relevant hypercapnia. Moreover, there is evidence that FVC and FEV1 are predictive factors for mortality irrespective of smoking status [32, 33], suggesting that regular spirometry should be performed in patients with declining values below critical thresholds.

This study has limitations. Although it involves a large sample size of predominantly advanced COPD GOLD Stage III and IV, it is undoubtedly limited by its retrospective single-center design in that external validation is required to verify the conclusions.

Conclusions

To summarize, hypercapnia in COPD progression was accurately predicted by spirometric parameters collected during routine care of COPD patients. To detect clinically significant hypercapnia promptly, the control intervals for spirometry should be narrowed in patients with COPD stages 3 and 4 with declining FVC% or FEV1% below critical thresholds.