INTRODUCTION

It has become evident that most of the mortality due to this COVID-19 results from the so-called cytokine storm,1 an exaggerated and uncontrolled inflammatory response that frequently leads to death if not aborted. The 2 most commonly used drugs in the management of COVID-19 are corticosteroids (CS)2, 3 and tocilizumab (TCZ).317 While CS have been uniformly found helpful,2, 3 TCZ efficacy has been mixed.317 However, most studies have evaluated efficacy based on oxygenation/ventilation of the patients rather than inflammatory markers. Our study aims to evaluate the efficacy of these drugs based on the degree of inflammation.18

METHODS

Study Design, Patient Selection, and Data Collection

This is a retrospective cohort study with data on patients collected and followed up from March 1st, 2020, to May 1st, 2021, from the nationwide Spanish SEMI-COVID-19 Registry. The characteristics of the patients included in this registry have been extensively described previously.19 This is a multicenter, nationwide registry with over 150 hospitals. All included patients were diagnosed by polymerase chain reaction (PCR) test taken from a nasopharyngeal sample, sputum, or bronchoalveolar lavage. The collection of data from each patient in terms of laboratory data, treatments, and outcomes was verified by the principal investigator of each center through the review of clinical records.

All participating centers in the register received approval from the relevant Ethics Committees, including Bellvitge University Hospital (PR 128/20).

Inclusion Criteria

The group that received only CS was considered the standard of care (SOC) for hospitalized patients. We included patients whose CS use started within the first 72 h after hospital admission and before the onset of high-flow nasal cannula (HFNC), non-invasive mechanical ventilation (NIMV), invasive mechanical ventilation (IMV), or the requirement of intensive care unit (ICU) admission. The CS plus TCZ group included patients who received both drugs in the first 72 h after hospital admission and also before the onset of HFNC, NIMV, IMV, or ICU admission.

Exclusion Criteria

We excluded patients who did not receive CS, or received it more than 3 days after hospitalization, those with a nosocomial infection, and those who died within 24 h.

Treatments Prescribed and Definitions of Groups

We divided the cohort into 2 groups: patients who received solely CS, and patients who received both CS and TCZ. The usual dose of TCZ in our country was 4–8 mg/kg iv, generally in a single dose, although some additional doses are allowed at the discretion of the responsible physician.

Regarding antiviral treatment, the use of antivirals (lopinavir/ritonavir,20 remdesivir21), hydroxychloroquine,22 and azithromycin22 was allowed according to the recommendations of the Spanish Ministry of Health.

Degrees of Inflammation

We previously reported the 3 categories of risk: (low, intermediate, and high risk) based on the total lymphocyte count, and the C-reactive protein (CRP), lactate dehydrogenase (LDH), ferritin, and d-dimer values taken at the time of admission (Table S1).18 The high-risk category was defined based on only 1 of the 5 criteria described above the previously defined cutoff. In addition, for the present study, a very high–risk category was added, defined as the presence of 3 or more high-risk upon admission criteria (Table S1).

Outcome Definition

The primary outcome of our study was in-hospital mortality. Secondary outcomes included length of stay (LOS), and the requirement of HFNC, NIMV, IMV, and ICU admission.

Statistical Analysis

Categorical variables were expressed as absolute numbers and percentages. Continuous variables are expressed as mean plus standard deviation (SD) in the case of parametric distribution or median [IQR] in the case of non-parametric distribution. Differences among groups were assessed using the chi-square test for categorical variables and the t-test or Mann-Whitney test as appropriate for continuous variables. p values< 0.05 indicated statistical significance.

For the study of risk factors associated with in-hospital mortality, univariate and multivariate binary logistic regression was performed. For the latter, variables with p<0.10 in the univariate study plus age and gender were included. Differences in mortality were shown graphically using Kaplan-Meier curves with their log-rank test (event: death; censored data: hospital discharge). Missing data were treated with multiple imputations. To improve the comparability of the groups, propensity score matching (PSM) was performed. This included age, sex, body mass index (BMI), race, smoking behavior, days from onset to admission, all comorbidities, Charlson index, heart rate on admission, tachypnea on admission, PaO2/FiO2, lymphocyte count, CRP, LDH, ferritin, d-dimer, remdesivir treatment, and prescription of low-molecular-weight heparins (LMWH) during admission.

Statistical analysis was performed by IBM SPSS Statistics for Windows, Version 26.0. Armonk, NY, USA: IBM Corp.

RESULTS

General Data and Symptoms Between Groups

A total of 21,962 patients were included in the Registry by May 2021; 9430 were treated with CS or CS plus TCZ and 5940 met our inclusion criteria (5332 were treated with CS and 608 with CS plus TCZ) (Fig. 1). Table 1 shows the differences between the two groups. Those in the isolated CS group were older (72.4 vs. 63.6 years), with less male predominance (58.9% vs. 72.9%), higher prevalence of Caucasian population (90.8% vs. 82.1%), higher degree of dependency (21.8% vs. 4. 7%), hypertension (57.8% vs. 48.8%), chronic heart failure (42.3% vs. 38.5%), chronic renal failure (7.7% vs. 1.1%), dementia (11.3% vs. 1.8%), chronic obstructive pulmonary disease (COPD) (9.8% vs. 6.4%), and a higher Charlson index (1 vs. 0). The groups were similar after propensity score matching (Table 1).

Fig. 1
figure 1

Flow chart.

Table 1 General Data

The CS group presented less frequently (Table 2) with cough (69% vs. 79.8%), arthromyalgia (26.7% vs. 34.7%), sore throat (8% vs. 11.8%), headache (11.4% vs. 15.8%), fever (77.2% vs. 86), and diarrhea (23.1% vs. 29.8%). There were also differences in heart rate (87 vs. 91 bpm) and tachypnea (39.6% vs. 53.9%).

Table 2 Symptoms and Physical Examination Upon Admission

Lab Tests Between Groups

The CS group had higher PaO2/FiO2 (285.7 vs. 271), lymphocytes (900×106 vs. 820×106), and d-dimer (869.5 ng/ml vs. 717 ng/ml) at admission. Alternatively, they had lower CRP (84.1 mg/l vs. 131 mg/l), LDH (344 U/l vs. 382.5 U/l), and ferritin (857.4 mcg/l vs. 1120.7 mcg/l) (Table 3). These differences disappeared after PSM except for d-dimer, which remained higher in the CS group (842 ng/ml vs. 717 ng/ml).

Table 3 Lab Tests Upon Admission

Treatments Between Groups

The treatments received in both groups are shown in Table S2. The CS group less frequently received remdesivir (9.8% vs. 14.3%) as well as intermediate (12.2% vs. 22.5%) or full doses of LMWH (14.3% vs. 22.2%). These differences disappeared after PSM.

The CS regimen was not standard in all patients. There were significant differences between both groups in the maximum dose of prednisone or equivalent (75 mg vs. 100 mg), days of treatment (7 days vs. 8 days), and cumulative dose (400 mg vs. 600 mg).

Outcomes Between Groups

Compared to patients receiving CS alone, in-hospital mortality was significantly lower for combination of CS + TCZ for high risk (27.4% vs. 20%, p<0.001) and very high–risk patients (40.9% vs. 23.9%, p<0.001) (Figs. 2 and 3). There was no difference for those with low/intermediate risk (8.9% vs. 7.4%, p=1.000) (Table 4, Figs. 2 and 3). Our PSM analysis found significantly lower mortality only in the very high–risk category (31.9%, vs. 23.9%, p=0.049) (Table 5; Figure S1). After PSM, we found no differences in the low/intermediate-risk (1.5% vs. 7.4%, p=0.175) or the high-risk category (23.1% vs. 20%, p=0.223).

Fig. 2
figure 2

In-hospital mortality (%) between groups in the general cohort.

Fig. 3
figure 3

In-hospital mortality (%) between groups. Kaplan-Meier. Log-rank test=44.3; p<0.001.

Table 4 Outcomes in the General Cohort
Table 5 Outcomes in the Matched-Cohort

HFNC use was lower in CS patients in all 3 risk categories: 6.2% vs. 29.6% (p<0.001) for the low/intermediate-risk category, 10.4% vs. 28.3% (p<0.001) for the high-risk category, and 12.5% vs. 31.2% (p<0.001) for the very high–risk category (Table 4). After PSM, we found the same differences between groups in the 3 risk categories: 4.6% vs. 29.6% (p<0.001), 16.1% vs. 28.2% (p<0.001), and 19.5% vs. 31.1% (p=0.003) (Table 5).

NIMV use was also lower in CS patients in all 3 risk categories: 3.4% vs. 18.5% (p<0.001) for the low/intermediate-risk category, 6.1% vs. 18.6% (p<0.001) for the high-risk category, and 8.4% vs. 16.4% (p<0.001) for the very high–risk category (Table 4). After PSM, we found similar differences between groups in the 3 risk categories: 1.5% vs. 18.5% (p<0.001), 10.5% vs. 18.6% (p<0.001), and 13.2% vs. 16.4% (p=0.322).

The use of IMV was also lower in CS patients in all 3 risk categories: 3.2% vs. 16.7% (p<0.001) for the low/intermediate-risk category, 6.1% vs. 22.2% (p<0.001) for the high-risk category, and 9.7% vs. 28.6% (p<0.001) for the very high–risk category (Table 4). After PSM, we found similar differences between groups in the 3 risk categories: 3.1% vs. 16.7% (p<0.001), 13.8% vs. 22.2% (p<0.001), and 17.9% vs. 28.6% (p=0.005) (Table 5).

The need for ICU admission was lower in patients with CS in all 3 risk categories: 4.9% vs. 20.4% (p<0.001) for the low/intermediate-risk category, 8.6% vs. 30.3% (p<0.001) for the high-risk category, and 12.2% vs. 35.3% (p<0.001) for the very high–risk category (Table 4). After PSM, we found similar differences between groups in the 3 risk categories: 6.2% vs. 20.4% (p<0.001), 18.8% vs. 30.3% (p<0.001), and 23% vs. 35.3% (p=0.002) (Table 5).

Finally, median LOS was higher in patients with CS+TCZ in all 3 risk categories: 8 days vs. 12 (p<0.001) for the low/intermediate-risk category, 9 days vs. 14 (p<0.001) for the high-risk category, and 9 days vs. 15 (p<0.001) for the very high–risk category (Table 4). After PSM, we found similar differences between groups in the 3 risk categories: 8 days vs. 12 (p=0.001), 10 days vs. 14 (p<0.001), and 11 days vs. 15 (p<0.001) (Table 5).

Risk Factors for In-Hospital Mortality

The independent risk factors for mortality in the high-risk category were age, male sex, moderate/severe dependency, higher Charlson index, tachypnea on admission, and lower PaO2/FiO2 (Table 6). The use of TCZ showed a trend of benefit that did not reach statistical significance as an independent protective factor. The very high–risk category showed similar results (data not shown). The AUC of the final model was 0.792 (Figure S2).

Table 6 Risk Factors in the Matched-Cohort. High-Risk Category

DISCUSSION

We found that higher degrees of inflammation responded to combination therapy, consistent with COVID-19 as an inflammatory disease. Treatment should be risk-stratified based on inflammation. At present, the approach to the disease has been heterogeneous and often based on oxygenation/ventilation status. In order to evaluate the efficacy of immunosuppressive/anti-inflammatory treatments, we have to include the degree of inflammation in patients to judge efficacy. The degree of inflammation in most studies is difficult to assess and appears to include many patients with low degrees of inflammation. It is thus difficult to know the real efficacy of these drugs and explain differences in efficacy between observational studies and clinical trials [23].

Our results suggest that the greater the inflammation, the more effective these drugs will be. Our group previously described 3 categories of inflammation based on 5 parameters at admission (lymphopenia, CRP, LDH, ferritin, and d-dimer).18 Since the low-risk category rarely requires hospital admission, it is the least numerous in our national series.

Our study shows that the addition of TCZ does not provide benefit in the low/intermediate-risk category. While the combination reduced mortality in the high-risk group, we did not achieve statistical significance, due to inadequate power. Patients classified as very high risk (3–5 high-risk criteria) had statistically significant reduction in death.

Our secondary outcomes (use of HFNC, NIMV, and IMV, and admission to the ICU) suggest that the CS+TCZ group had more severe disease despite PSM. The sociodemographic, clinical, and analytical data included could not fully capture patient severity. Our patients were on oxygen therapy (not high-flow) at the time of treatment initiation (CS patients or CS+TCZ patients). However, we do not know the exact FiO2 they were receiving; it is possible that the combination of CS+TCZ was used in patients requiring higher amounts of oxygen.

Our study strengths include that it is large and nationally representative. In addition, the therapeutic approach based on degree of inflammation is a good approximation to clinical practice decision-making.

Our study also has some limitations. First, it is a retrospective study. Second, it comes from a multicenter registry, with the heterogeneity that this implies, though we used standardized definitions. Another limitation to be taken into account is the heterogeneity in CS dosage and administration time as well as lack of information of important variables that might trigger addition of TCZ, such as oxygen requirement.

In conclusion, the prescription of CS alone or in combination with TCZ should be based on the degrees of inflammation and reserve the CS plus TCZ combination for patients at high and especially very high risk.