Introduction

The natural history of SARS-CoV-2 infection is extremely variable, ranging from asymptomatic or mild infection to a severe acute distress respiratory syndrome (ARDS), characterized by a typical hyperinflammatory response associated with a microangiopathy and a widespread thrombosis [1,2,3,4].

In the face of the rapidly spreading disease and the large number of infected people, there is an urgent need to cluster patients in risk categories by identifying reliable parameters related to COVID-19 disease progression to stratify high-risk patients, who will suffer rapid disease progression to severe complications and death.

The clinical respiratory symptoms of SARS-CoV-2 infection frequently do not correspond to the severity of lung damage; therefore, to correctly stratify COVID-19 patients, it is important to evaluate the acute lung injury based on well-accepted test for acute lung injury and ARDS, such as the partial pressure of arterial oxygen (PaO2) to fraction of inspired oxygen (FiO2) ratio (PaO2/FiO2) [5,6,7,8].

Several inflammatory biomarker abnormalities have been identified to stratify COVID-19 patients who will develop a severe disease [9,10,11]. High-sensitivity C-reactive protein (Hs-CRP) as well as neutrophil (NEU)-to-lymphocyte (LYM) ratio (NLR) and platelet (PLT)-to-lymphocyte (LYM) ratio (PLR) have been investigated in SARS-CoV-2 infection and have been indicated to be useful in clinical managements of infected patients [12, 13]. Moreover, lactate dehydrogenase (LDH), an intracellular enzyme that well correlates with lung damage and inflammation, has been associated with worse outcomes in patients with viral infections and has been identified as an independent indicator for predicting severity and mortality in patients with COVID-19 [14, 15].

The aim of our study was to assess the PaO2/FiO2 ratio as a reliable prognostic biomarker to forecast the progression of COVID-19 patients towards the most severe form, according to WHO criteria, comparing its predictive value with other canonical inflammatory biomarkers.

Materials and methods

Study design

We conducted a cross-sectional monocentric study, adhering to the principles of the Declaration of Helsinki. The study was approved by the institutional ethic committee (License Prot. n. 71726) and informed consent was obtained from participating patients at the San Salvatore Hospital, L’Aquila, Italy.

Discovery cohort

Inclusion criteria

During the first and second pandemic “wave” of SARS-CoV-2 epidemic, from April 15, 2020, to November 31, 2020, 153 patients were recruited at the Department of Infectious Disease and at the Department of Pneumology of the San Salvatore Hospital, L’Aquila, Italy. All patients showed positivity to RT-PCR assay from nasopharyngeal swab sample and clear futures to chest CT. At the arrival patients were consecutively enrolled and biochemical parameters were collected at three time points: (1) the day of hospitalization due to the respiratory symptoms onset (T0); (2) day 3 (T3) and day 7 (T7) after the admission. At the same time points, arterial blood gas was performed to evaluate the respiratory function based on partial pressure of arterial oxygen (PaO2) to fraction of inspired oxygen (FiO2) ratio (PaO2/FiO2 ratio).

Out of the 153 patients, three were drop-outs. Specifically, one of them refuses to keep on participating, and two were suspected to have a neoplastic disease. Of the remaining 150 patients, 94, who presented with mild COVID-related symptoms, were assigned to population “A” and 56 with severe COVID-19-related symptoms to population “B”, according to the global clinical status. In particular, patients were divided in two reference populations using the term “A” or “B”, if the evaluation of the WHO score at T7, was between 1 and 4 or between 5 and 8, respectively. The fundamental criterion to be part of the population “A” or “B” was the need or not of oxygen support by the infected patients.

In particular, the World Health Organization (WHO) was engaged to classify the severity of this infection for COVID-19 trial endpoints (http://www.who.int/blueprint/priority-diseases/key-action/novel-coronavirus/en/). The WHO ordered the severity of the infectious disease in “low” (WHO score 1–2) that correspond to the presence of mild symptoms, “moderate” for patients who no required supplemental oxygen (WHO scores 3–4), and “severe” for all patients who needed oxygen support (WHO scores 5–8 high-flow oxygen requirement continuous positive airway pressure (CPAP), non-invasive ventilation (NIV), multi-organ support and death).

Exclusion criteria

Patients were excluded if they were known to be pregnant, had a history of vasculitis or connective tissue disease and were subjected to long-term corticosteroids or chronically immunosuppressed, received antivirals, anti-interleukin (IL)-1, anti-IL-6 or anti-TNF therapy, or underwent dialysis for chronic kidney disease or had suspect/active neoplasia.

Predictive index test

As predictive index test was used the PaO2/FiO2 ratio, the most reliable diagnostic criteria for acute lung injury and ARDS [16].

Principal outcome

As principal outcome, we wondered to investigate if PaO2/FiO2 ratio was reliable for classifying SARS-CoV-2-infected patients who will show a severe disease.

Secondary outcome

We were interested in comparing the predictive value of PaO2/FiO2 ratio to inflammatory scores (hs-CRP, NLR, PLR and LDH), already known to be useful in predicting the outcome of SARS-CoV-2-infected patients.

Validation cohort

To validate PAO2/FiO2 ratio as a reliable COVID-19 risk biomarker for the identification of patients who will develop critical illness, according to the WHO criteria, we retrospectively analyzed the clinical and biochemical data of 170 patients (broken into Population “A” and “B”) collected by two hospitals of central Italy. The variables required for validating PaO2/FiO2 ratio were collected and cross-checked by two experienced physicians (Table S2).

The number of patients, both in the discovery and in the validation cohort, who were given oxygen support is shown in Table S3.

Statistical evaluation

Variables were not normally distributed and were expressed as median plus 1st and 3rd interquartile, after controlling by the Shapiro–Wilk test. Difference in medians was evaluated by the Mann–Whitney U test. Frequencies were analyzed by the chi-square.

As measure of association, we chose to study predictions that were carried out by two types of regression techniques.

Dealing with the binary dependent variable or dichotomous outcome, i.e., Population “A”/“B” at time point T7, and one independent variable the simple logistic regression was used, calculating the odds ratio, the Std. Err. the z, the P and 95% confidence intervals (CI).

In case of more than one independent variable, the multiple logistic regression was performed, always calculating the same afore mentioned parameters, by using the enter method alone. This choice was based on having strong hypotheses about which variables belong in the model. There were implanted three models, i.e., model A comprehending all the inflammatory parameters, model B adjusted for PaO2/FiO2 ratio and model C, further adjusted for age.

ROC analysis was used as a diagnostic decision-making. Indicatively, to measure the performance of the binary classification test (index test) was studied the best cut-off, coupled with the sensitivity, specificity, positive likelihood ratio (LR +) = sens/(1-spec) and the negative likelihood ratio (LR-) = (1-sens)/spec), pointing out that the more the LR + is > 1, the more likely the outcome. On the contrary, the more a likelihood ratio for a negative test is < 1, the less likely the outcome.

Furthermore, the correct classification percentage of Population “A”/“B” at the time point T7 and the area under the receiver operating characteristic (AUROC/AUC) were performed to evaluate the most appropriate models (the highest specificity and sensitivity), under the nonparametric assumption. The cut-off (cut-points) with the highest specificity and sensitivity was calculated by the means of Youden’s Index according to [17].

Results

Dynamic profile of SARS-CoV-2-infected patients

Of the whole cohort of patients 38% were female, the mean age was 61 (ranging from 51.75 to 70.25). Among all the parameters analyzed the following displayed statistically significant changes during the first week of hospitalization: PaO2/FiO2 ratio, HR, procalcitonin, D-dimer, INR, ALT, AST, gamma-GT, FiO2, hs-CRP, platelet and lymphocyte count (Table 1).

Table 1 Demographic variable symptoms and biochemical parameters in SARS-CoV-2 at T0–T3 and T7

ALT (T0 vs T3, P < 0.01 and T3 vs T7, P < 0.01), AST (T0 vs T3, P < 0.01) and gamma-GT (T0 vs T3, P < 0.05 and T3 vs T7, P < 0.05) were significantly increased in our cohort of patients during the progression of the infection.

Among the respiratory function parameters, FiO2 and PaO2/FiO2 ratio displayed significant negative trend from T0 to T3 (P < 0.001 and P < 0.001, respectively), that appeared to be significantly improved at day 7 (P < 0.001 and P < 0.05, respectively).

Platelet and lymphocyte count were significantly increased from T3 to T7 (P < 0.001 and P < 0.05, respectively).

Characteristics of the whole population and of both cohorts “A” and “B”

The demographics and clinical characteristics are shown in Table 2.

Table 2 Demographics variables symptoms and biochemical parameters in Population “A” and “B”

The rate of severe (population “B”) and moderate cases (population “A”) was 37.4% and 62.6%, respectively.

The population “B” was characterized by the presence of older patients respect to the population “A” (median 57.50 vs 68.00, P < 0.001), without any difference between cohorts concerning gender (chi square, P = 0.65). Fever (86%) and cough (71.3%) were the first and most common symptoms at admission, and, in this regard, no significant differences between the two populations were observed. RR was higher in the population “B” [21 (17;28) vs 17 (15;19), P < 0.05].

The length of stay of the whole cohort of patients was 12 days (7.25–19.00), population A and population B length stay were 8.5 (6.00–13.00) and 18 (11.25–26.00) days, respectively (Table 2).

Most severe cases (population “B”) exhibited higher rate of co-morbidities (58.93 vs 38.30%) without reaching significance. The rate of patients worsening to ARDS was 38.6% and mortality was 9.3%.

Procalcitonin, ferritin, fibrinogen, D‐dimer, international normalized ratio (INR) and alkaline phosphatase (ALP) were significantly higher in the population “B” respect to “A” (P < 0.001, P < 0.01, P < 0.05, P < 0.001, P < 0.01, P < 0.001, respectively), whereas the liver functional parameters were not significantly different between the two populations.

Lymphocytes [median 0.970 (0.660–1.335)], neutrophil [4.32 (2.83–6.56)] and platelet [196.0 (154.5–254.0)] were collected in whole patients and significant differences in neutrophil and platelet count were observed in the population “B” respect to “A” [median 227 (146.5–294.8) and 261 (194–309), P < 0.001, respectively].

PaO2/FiO2 ratio versus LDH, hs-CRP, NLR and PLR in the population “A” and “B”

Logistic regression analysis revealed a significant negative prediction of the clinical outcome at day 7 by PaO2/FiO2 ratio (P < 0.001).

Similarly, LDH, hs-CRP, and PLR were positively correlated with a poor outcome (P < 0.001, P < 0.001, P = 0.016, respectively) (Table S1).

To evaluate the predictive power of the PaO2/FiO2 ratio, we performed three models of multivariable regression analysis.

Among the three different models, the model A was performed to correlate the inflammatory parameters (hs-CRP, NLR, PLR and LDH) at the time of hospitalization with the clinical outcome at T7. Our analyses showed that the most representative variables were LDH and hs-CRP (P = 0.002 and P = 0.039, respectively) (Table 3). Surprisingly, when we adjusted the model for PaO2/FiO2 ratio (model B, Table 3), we observed a completely different scenario, letting us think that the predictive ability of the respiratory function parameter likely obscures the prognostic value of the inflammation parameters (Table 3). To confirm the result of model B we performed a multivariable regression analysis adjusted for age (model C, Table 3).

Table 3 Multivariable logistic regression models for Prediction of population “A” and “B” at days 7

Remarkably, when we analyzed age, at univariate analysis, as a single predictor, it resulted very significant (P < 0.001, Table S1).

Developing a clinical prediction model

Dealing with the diagnostic accuracy of an index test, we generate the AUC curves of all the studied parameters. AUC curve of PaO2/FiO2 ratio for differentiating the population “A” and “B”, which was 0.838, (95% CI 0.771–0.908). The PaO2/FiO2 appeared as the most reliable prognostic biomarker compared to hs-CRP, NLR, PLR and LDH (AUC curves of 0.725, 0.741, 0.632, 0.729, respectively), see Fig. 1.

Fig. 1
figure 1

AUC curves of PaO2/FiO2 ratio, hs-CRP, NLR, PLR and LDH of 150 COVID-19 positive patients

The optimum PaO2/FiO2 ratio cut-off value to separate population “A” from “B” was < 274 mmHg, with sensitivity and specificity of 71.79% and 85.25% (LR + 4.8661, LR−0.3309), respectively.

Model validation

To establish its reproducibility, we tested the reliability of the PaO2/FiO2 ratio in the validation cohort, including retrospectively 170 patients, 100 were female, with a median age of 68.00 years (Table S2). The AUC of the PaO2/FiO2 ratio resulted to be 0.826 (95% CI 0.760–0.891), Fig. 2. The cut-off neared that was found in the discovery cohort with sensitivity of 82.50% and specificity 71.74%, LR + 2.9192, LR−0.2439.

Fig. 2
figure 2

AUC curves of PaO2/FiO2 ratio, hs-CRP, NLR, PLR and LDH of the validation cohort

Discussion

Summarizing the key results, our research highlights that PaO2/FIO2 ratio could be considered a reliable prognostic biomarker for patients’ management of COVID-19 pneumonia, as demonstrated by the AUC curve that indicate as cut-off the value < 274 mmHg. The performance of this biomarker was more than satisfactory with a good accuracy based on AUCs, in both the discovery and validation cohorts, of 0.838 and 0.826, respectively.

It is known that COVID-19 prognosis is worse in older patients, but in an interesting way, age did not modify the prediction by PaO2/FiO2 ratio. Presenting the importance of the study, we think that one of the strengths of our work is that we tracked levels of biomarkers with the aim to establish trends for each patient towards the successive phases of the disease.

The likely contribution of this work to literature consists in evidencing that in our cohort of patients, among the variables analyzed (PaO2/FiO2 ratio, hs-CRP, NLR, PLR and LDH), PaO2/FiO2 ratio was the best independent prognostic biomarker for forecasting pneumonia progression toward ARDS in COVID-19 patients.

Describing similar data of literature, our findings were consistent with those of previous studies on LDH, hs-CRP, NLR, PLR [1, 8,9,10,11,12] that associated the afore mentioned prognostic markers with poor disease outcome. However, the reliability of PaO2/FiO2 ratio in comparison with other inflammatory tests to forecast the worst outcomes of COVID-19-positive patients has not yet been highlighted.

Various indices have been used to describe pneumonia progression towards ARDS, such as the arterial to alveolar O2 difference, the intrapulmonary shunt fraction, the oxygen index and the PaO2/FIO2 ratio [18]. Of these different indices, the PaO2/FIO2 ratio, because of its accuracy but also its simplicity, could be taken into serious account also by hospitalists in their every-day practice when dealing with OVID-19.

Concerning this specific pulmonary test, Grasselli et al. in 1591 patients, admitted to an intensive care unit (ICU), found a reduced median of 160 mmHg (IQR 114–220) [19]. According to our results, Santus et al. discovered that a moderate-to-severe impairment in PaO2/FiO2 ratio was independently associated with a threefold increase in risk of intra-hospital mortality, concluding that the severity of respiratory failure is useful to identify patients at higher risk of mortality [20].

Our study covered a time period during which advances and changes in treatment approaches may have differently reduced the levels of inflammatory biomarkers among patients hospitalized with COVID-19, but this aspect does not affect the importance of our index parameter, on the contrary it reinforces its reliability.

Conclusion

Could our results fill existing gaps in the field that had not been previously exposed? We aimed to give to clinicians a way to discriminate SARS-CoV-2-positive patients with different fate. We intentionally categorized our cohort in “low” (population “A”) or “high” (population “B”)-risk patients in such a way that clinicians could make decisions quickly based on local or regional conditions and opportune therapeutic approaches. In fact, we consider of great importance the access to good clinical and supportive care for providing a more or less aggressive care whereby the patient’s outcomes might be optimized.

Limitations

Possible limitations exist in this paper. The study was conducted with a not so large sample size and on data deriving only from the central Italy, which could limit the value of PaO2/FiO2 ratio as a prognostic biomarker representative of a general population.

Moreover, although the PaO2/FiO2 ratio is simple and easily available, it has a suboptimal power in categorizing patients with ARDS [21].

Future directions

As work in progress, we will follow up these cohorts of patients. Nowadays, everlasting effects of COVID-19 are still largely unknown. Long-term pulmonary and systemic complications have been reported in some patients after recovery, with residual fatigue or muscle weakness, chest CT abnormalities, cardiovascular and neurological complications [22, 23].