Introduction

This study aimed to investigate clinical differences between the two predominant European health care systems in critically ill COVID-19 patients aged 70 years and older. The classification into national health care systems, social insurance systems, and private insurance systems is often used to compare the performance of different national organizational forms of health care systems1,2,3,4,5. In Europe a rough dichotomization can be made between two general health care systems, the tax-based healthcare system (THS) and the social health insurance system (SHI)2,3,6. The World Health Organization defines the THS as a system in which more than half of public health expenditures are financed by revenues other than earmarked payroll taxes (this in contrast to SHI) and in which access to publicly financed services is at least formally open to all citizens1,5. In an analysis of the two very old intensive care patients (VIP) studies, we recently highlighted differences in admission policies, as well as management and outcomes in critically ill elderly patients between the two predominant organizational forms of health care systems in Europe7. To the best of our knowledge, there are no studies that have examined the effects of THS versus SHI on elderly, critically ill patients with COVID-19, and this knowledge gap motivated us to conduct this analysis.

The coronavirus disease (COVID-19) pandemic has put healthcare systems worldwide to an involuntary stress test and pushed critical care capacities to their limits even in highly developed health systems. As a result, some countries have been forced to develop criteria for prioritizing patients who benefit most from intensive care and to restrict access, at least temporarily, to patients who are unlikely to benefit from these medical interventions in terms of mortality and morbidity8. With increasing age, the benefit of intensive care treatment has been the subject of scientific, clinical, and ethical debates even before COVID-199. Especially in older patients, who are not only acutely critically ill but also more vulnerable overall, the added prognostic value of critical care quoad vitam is unclear9,10,11,12. The assessment of frailty has been established for a structured assessment of the vulnerability of critically ill patients and can be quantified by means of the Clinical Frailty Scale (CFS). We have already confirmed the prognostic relevance of the CFS in critically ill patients aged 80 years and older in terms of survival in several prospective studies11,12,13.

Our primary objective was to examine differences in 30-day mortality among critically ill patients aged ≥ 70 years treated in intensive care units in a THS or SHI.

Methods

Setting

A retrospective cohort study of the COVIP study (COVID-19 in very old intensive care patients) which is part of the Very old Intensive care Patients (VIP) project (www.vipstudy.org) was performed. Baseline characteristics, treatment, and outcomes of patients, in intensive care units from countries classified as THS and SHI, respectively, were compared. The statistical methodology was performed similarly to a preliminary work from the VIP study14. The COVIP study was endorsed by the European Society of Intensive Care Medicine (ESICM) and registered on ClinicalTrials.gov (NCT04321265). The official study protocol is available online (https://vipstudy.org/covip-study/). The COVIP study complies with the European Union General Data Privacy Regulation (GDPR) directive, which applies in most participating countries. As in the previous VIP-studies, national coordinators recruited intensive care units (ICU), coordinated national and local ethical permissions, and supervised patient recruitment at the national level. A full list of all Collaborators is provided in Supplementary Table 1.

Participants

Critically ill patients aged 70 years and older admitted for COVID-19 to an ICU were included. The decision to admit to an ICU was made by the attending physicians. Patients were classified as having been treated in a THS (Denmark, Ireland, Italy, Norway, Spain, Portugal, United Kingdom) or SHI (Austria, Belgium, France, Germany, Netherlands, Poland, Romania).

Eligibility criteria

All patients admitted between 19th March 2020 and 4th February 2021 to one of the participating 129 ICUs were included. Only patients with complete data on age, sex, clinical frailty score (CFS), sequential organ failure assessment (SOFA) score and 30-day-mortality were included (Supplementary Fig. 1), leaving no missing values on these variables. Other missing values were handled by listwise deletion. Data collection started on ICU admission. The admission day was defined as day one, all consecutive days were numbered sequentially from the admission date. The methodology was performed similarly to a preliminary work from the VIP study14.

Outcomes, exposures, predictors and confounders

The primary endpoint of this study was 30-day-mortality, the secondary endpoints were ICU- and 90-day-mortality, and rates of treatment limitation (the frequency of decisions to withdraw or withhold treatment). The primary exposure was being in a SHI versus THS. We used the dichotomisation chosen by the World Bank and the Organisation for European Economic Co-operation (OEEC)2.

For each patient relevant baseline characteristics (including age, sex, main reason for admission, and frailty) and management strategies (including use of renal replacement therapy (RRT), mechanical ventilation (MV), non-invasive ventilation (NIV), and rates of vasoactive use) were documented. Also, any treatment limitations (treatment withheld, or treatment withdrawn) were documented. Frailty was assessed by the Clinical Frailty Score (CFS) and the respective visual and simple descriptions were used with permission15,16. The gross domestic product (GDP) per capita for 2019 in US$ was accessed from the International Money Fund (IMF), the human development index (HDI) from the United Nations Development Program (UNDP)17, and the total (compulsory, voluntary, out-of-pocket) amount of health spending per capita in US$ in 2019 from the Organisation for Economic Co-operation and Development (OECD)18. The methodology was performed similarly to a preliminary work from the VIP14.

Statistical analysis

The study size was defined by the retrospective dataset and no sample size calculation was performed. The data are likely to be clustered on an ICU level. Therefore, a multilevel regression approach was chosen to correct for potential confounders and bias. As the health economic data do not vary within a given cluster, generalised estimation equations (GEE) with robust standard errors to produce population average odds ratios were used. The survival analysis/GEE-based analyses were conducted using only robust estimators of the standard errors and not in the sense of robustness against violations of normality assumptions as for the robust methods (e.g., Mann–Whitney tests) used for the univariate analyses. Model-1 includes only the ICU as a panel. Model-2 includes patient-specific confounders (sex, age per year, SOFA score per point, frailty score per CFS point). Model-3 adds the health economic confounders (HDI). Adjusted odds ratios (aOR) and respective 95% confidence intervals (95%CI) were obtained. Sensitivity analyses stratifying 30-day-mortality for patient-specific characteristics, treatment strategies and health economic data and local 7-day-incidences were performed. Continuous data are given as median ± inter-quartile range (IQR) and compared using Mann’s Whitney U-Test or mean ± standard deviation (SD) and compared using Student’s T-Test accordingly. Categorical data are given as numbers (percentage) and compared using the chi-square test. All tests were two-sided, and a p-Value of < 0.05 was considered statistically significant. Stata/IC 17 was used for all statistical analyses. The statistical methodology was performed similarly to a preliminary work from the VIP study14.

Ethics approval

Each study site obtained approval from the institutional research ethics board (IRB). The original vote was obtained at the Ethics Committee of the Medical Faculty of Heinrich-Heine-University Düsseldorf (Vote Number #2020-892). In some participating countries, patients could be recruited without informed consent, whereas in other countries, informed consent had to be obtained because of the respective ethical approval procedures. In all countries where informed consent was deemed necessary by the respective ethics committee, informed consent was obtained from all participating individuals. We confirm that all methods were performed in accordance with the relevant guidelines and regulations.

Results

Baseline demographics in SHI versus THS

In total, we included 2406 patients in this analysis. Patients were defined as being in a THS (n = 886) or SHI (n = 1520) health care system. The mean age was relatively similar in both groups (76 vs. 75 years; p = 0.08), while in the THS group fewer patients were between 70 and 79 years old (80 vs. 84%) and more patients were between 80 and 89 years old (19 vs. 15%; p = 0.03). We found no differences regarding patient´s sex and BMI (see Table 1). In the THS group SOFA scores at baseline were significantly higher (6 vs. 5 pts.; p = 0.002), whereas no significant difference was observed regarding CFS and length of stay (see Table 1). More patients in the THS group suffered from ischemic heart disease (24 vs. 18%; p = 0.005), whereas no difference was shown for all other comorbidities (see Table 1). Table 2 provides an overview of the number of patients from each country enrolled in this study. A higher health spending per capita (median 5.376 vs. 3.616 US$; p < 0.001), HDI (median 0.904 vs. 0.901; p < 0.001) and GDP (median 41.897 vs. 29.993 US$; p < 0.001) were observed in the THS group. We found no differences in days in the hospital prior to ICU admission and days with symptom onset prior to hospital admission between THS and SHI (Table 1).

Table 1 Demographic and clinical characteristics of the cohort.
Table 2 Healthcare economical characteristics of the cohort.

Organ support and management

In the univariate analysis, the rates of vasopressor use (67% in both groups; p = 0.9) and invasive mechanical ventilation (68 vs. 70%; p = 0.33) were similar between both groups (see Table 3). These findings persisted after multivariable adjustment (see Table 4). Non-invasive ventilation was performed more frequently in the SHI group (32 vs. 21%; p < 0.001). However this association was not confirmed in the multivariable analysis (aOR 1.47 95%CI 0.89–2.42; p = 0.13). Yet, the rates of RRT were significantly higher in the SHI group (17 vs. 10%; p < 0.001) and remained so after multivariable adjustment (aOR 0.61, 95%CI 0.40–0.92; p = 0.02; Table 4).

Table 3 Univariate comparison of organ support, treatment limitations and mortality in THS versus SHI.
Table 4 Generalised estimation equations (GEE) producing population average odds ratios.

Mortality and treatment limitation analysis

ICU mortality, as well as mortality at 30 and 90 days was similar in both groups (ICU: 44 vs. 46%, p = 0.41; 30-day: 47 vs. 50%, p = 0.16; 90-day: 55 vs. 57%, p = 0.22). This finding persisted after multivariable adjustment across all models (ICU: aOR 1.08 95%CI 0.80–1.46, p = 0.62; 30-day: aOR 1.18 95%CI 0.87–1.61, p = 0.29; 90-day: aOR 1.20 95%CI 0.84–1.72, p = 0.31). These findings are shown in Tables 3 and 4, and in the Kaplan Meier Curve in Fig. 1. The rates of any treatment limitations were similar between THS and SHI (40% vs. 39%; p = 0.37) and remained so after multivariable adjustment (aOR 1.02; 95%CI 0.69–1.50; p = 0.93). No significant difference was observed for withholding treatment (THS 34 vs. SHI 31%; p = 0.09; aOR 0.91 95%CI 0.59–1.41, p = 0.68). There was a trend towards lower rates of treatment withdrawal in THS (19% vs. 23%; p = 0.01) in the univariate analysis, however we could not observe this difference after multivariable adjustment (aOR 1.42; 95%CI 0.95–2.12; p = 0.09).

Figure 1
figure 1

Kaplan–Meier curve depicting survival in THS (red) and SHI (blue) patients (with 95% CI). Abbreviations: CI: confidence interval; d: days; SHI: social health insurance system; THS: tax-based health system.

Sensitivity analyses

In the sensitivity analyses comparing 30-day-mortality between SHI and THS for patient-specific characteristics, treatment strategies and health economic data (see Forest Plot in Fig. 2), we found that only in the subgroup of patients treated in countries with a HDI above the median of 0.904 (aOR 1.56 95%CI 1.11–2.17) and GDP per capita above the median of 41,892 US$ (aOR 1.56 95%CI 1.11–2.17), treatment in a THS was associated with higher odds for non-survival at 30 days.

Figure 2
figure 2

Sensitivity analyses stratifying 30-day-mortality in subgroups for patient-specific characteristics using generalised estimation equations (GEE) producing population average odds ratios. The depicted aORs from model-1 include only the ICU as panel. Abbreviations: aOR: adjusted odds ration; GDP: gross domestic product; HDI: human development index; ICU: intensive care unit; LST: life-sustaining treatment; NIV: non-invasive ventilation; SHI: social health insurance system; SOFA: sequential organ failure assessment; THS: tax-based health system.

Discussion

This study explored differences in critically ill older patients suffering from COVID-19 between SHI and THS. We found no clinically relevant differences with regards to demographics and geriatric characteristics, such as frailty and comorbidities, as well as management strategies and outcomes.

A key point for the interpretation of this study is the categorization of health care systems. As discussed in our preliminary study, we are aware that the division of national health care systems into two distinct systems may be an over-simplification. In addition to this rough dichotomization of health care systems in Europe, there are also more sophisticated classification systems with higher granularity that can be used to subdivide the organization of health systems3,19,20. Some authors use a deductive approach and describe up to 27 different theoretically possible subtypes of health systems21,22,23. Ultimately, we consider all subclassifications to be variations of the original used subdivision here. These are, on the one hand, primarily tax-financed, municipally organized systems with a "single payer" (THS), or in a social insurance system (SHI) financed by predefined and earmarked premiums, mostly from payroll taxes, and organized rather pluralistically, where private and public service providers often coexist and compete. In addition to previously published scientific analyses, the World Bank and the Organization for European Economic Cooperation (OEEC) also use the pragmatic view of existing European health systems used in this study2,18. Apart from national health care systems, other organizing factors, such as equipment, management of ICUs may have influenced our results, although we do not have specific information on this. In the following, we will put the results of our study in context, and we believe that one of the strengths of our study is its relative uniqueness.

Baseline risk distribution in SHI versus THS

In our study, some statistically significant differences in baseline characteristics were found between the SHI and THS groups, but we consider these to be clinically irrelevant given the small absolute numeric differences. Triage for ICU admission is a difficult issue with wide variation in perception and acceptance, not only across countries10,24,25,26,27. However, when baseline characteristics are used as a possible surrogate parameter on different admission policies between the two systems, there is no evidence of such a difference based on our data.

These findings contrast with the results of our recent analysis in which we compared critically ill elderly (80 years and older) patients in SHI and THS with respect to admission characteristics, organ replacement procedures, and outcomes before the COVID-19 pandemic. In this study, we detected indirect evidence of a more liberal admission policy in SHI, indicated by higher age and frailty scores7. However, we believe that even in the absence of a pandemic, the differences between the two health systems across different European countries might be subtle in nature. Yet, because the COVID-19 pandemic has strained health systems of all types, we believe this has offset and compensated for systemic differences. Since we also included younger patients (70 years and older) compared to our last study, an exact comparison of the age structures between the two studies is impossible7. However, it is striking that the number of patients over 90 years of age in the COVIP study was very low. We interpret this as a possible indication of stricter admission criteria across all ICUs, countries and health systems.

Organ support and management in SHI versus THS

In this analysis, no differences between SHI and THS regarding invasive ventilation and the use of vasopressors was shown. However, it is interesting to note that, as in our earlier study, RRT was used significantly more frequent in SHI. This is of interest as this extra effort did not result in any survival benefit. Since the rate of patients with preexisting renal disease was similar between SHI and THS, and median SOFA scores were only slightly different, one could speculate here whether the more aggressive use of RRT in SHI was triggered by financial motivation, but also clinical training in health care settings or used more liberally due to less financial restrictions. Of note, the renal SOFA component was similar in SHI and THS (p = 0.52). These findings fit well with the STARRT-AKI study and a recent meta-analysis that showed no benefit for earlier use of RRT28,29.

Mortality analysis

In brief, we did not detect a mortality in critically ill patients over 70 years between SHI and THS. Mortality was similar at the time of ICU discharge, 30 days, and 90 days, even after multivariable adjustment for clinical and health economic data, as well as clinical management. Likewise, extensive sensitivity analyses did not reveal a clinically relevant difference between the two health care systems. Of note, also in patients above 80 years of age, reflecting the age group of the VIP studies, we found no differences. We also found no evidence of disparate mortality in the sensitivity analyses related to incidence in the population at the time of admission. There was a signal towards higher mortality in THS with higher GDP and HDI (GDP and HDI overlapped in this analysis). However, we interpret this lone signal for differential mortality primarily in the context of multiple testing and do not wish to generate a hypothesis based on it.

Limitations

This is a retrospective cohort study and is therefore subject to the limitations associated with this methodology. No sample size calculation was performed for this analysis, therefore, the absence of differences between groups could be due to an underpowered study to detect very subtle differences in outcomes between the study groups. Like our previous study, this study does not directly provide information on the triage process and prescreening prior to actual ICU admission. Also, this study is applicable only to patients aged between 70 and 90, as only very few patients above 90 years were included. Another weakness of this analysis is also that different important countries using a THS were not included in our database. In particular, Sweden, which has chosen a special path in the pandemic anyway, is missing. Local COVID-19 incidences play another factor of uncertainty. Although the average number of new infections and thus the burden of critically ill patients may be manageable in a given country, whereas the occurrence of local clusters and excessive strain on health care systems is typical for the course of COVID-19. Our analysis took into account the 7-day incidence at the country level in a sensitivity analysis—nevertheless, clusters at the local level may have influenced the results. However, given that the overall results of this study were very homogeneous, we do not believe that we have overlooked relevant epidemiological factors of the pandemic in this regard. Finally, it must be emphasized that other factors, including nationally distributed ones, may have influenced our results, such as the local organization of ICUs, cultural factors, or the availability of innovative therapies.

Conclusion

The aims of this study were to compare the admission characteristics, treatment strategies (frequency of organ replacement procedures and therapy target limitations) and mortality at three time points (time of ICU discharge, after 30 and after 90 days) of critically ill patients aged between 70 and 90 admitted to an ICU due to severe COVID-19 disease between Bismarck systems and Beveridge systems in Europe. We found no evidence of clinically relevant differences in admission characteristics between SHI systems and THS. With regard to intensive care management, there was evidence of more frequent use of renal replacement therapies in SHI systems compared to THS. There were no indications of differences between SHI systems and THS with regard to mortality—which may be considered the ultimate measure of performance, especially in intensive care medicine. It remains unknown to what extent the observed parity of intensive care for elderly patients with COVID-19 is a specific feature of critical care medicine or a sign of socioeconomic convergence of the two systems.