Introduction

In community-acquired pneumonia (CAP), the global occurrence ranges from 1.5 to 14 cases per 1000 person-years, making it the leading cause of death among infectious diseases [1, 2]. In the United States, the incidence is estimated to be between 106 and 164 per 10,000 inhabitants, while in Latin America, it reaches 294 cases per 10,000 inhabitants [1,2,3]. This incidence is particularly pronounced among individuals over 65 years old, those with comorbidities, and/or immunosuppressed individuals [1,2,3,4]. Observational studies report a mortality of CAP at 30 days, 6 months and 1 year after hospital discharge of 4.7% (95% CI 3.4–6.8), 8.6% (95% CI 8.3–11.9), and 14.9% (95% CI 12.4–18.2), respectively [5]. Regarding long-term outcomes, mortality rates from CAP requiring hospitalization were 45.7% in older adults within one year of follow-up [6].

Currently, validated scales are available to stratify pneumonia severity, predict prognosis or survival, and anticipate the need for Intensive Care Unit (ICU) admission [1, 4]. These tools provide valuable guidance for initiating early therapeutic strategies, with the goal of positively impacting outcomes, including mortality rates [1, 2, 7]. The implementation of risk scores (RS) is recommended by clinical practice guidelines and consensus from the American Thoracic Society/Infectious Diseases Society of America (ATS/IDSA) to evaluate prognosis and determine the optimal patient management site for CAP [8, 9]. The most recommended RS are the Pneumonia Severity Index (PSI) and confusion, uremia, respiratory rate, BP, and age ≥ 65 years (CURB-65) [10]. However, the clinical applicability of these scores primarily lies in estimating mortality within the initial 30 days, with insufficient evidence for their long-term predictive capacity [10, 11]. Therefore, efforts have been made to determine the long-term predictive capacity of clinical variables in widely studied CAP scores, some of these are estimated glomerular filtration rate, altered mental status, and pleural effusion on chest X-rays, all associated with long-term mortality [12,13,14].

E. Lubart et al. [15]. investigated 180 patients in a 1-year period, divided into two groups: those with CAP and acute kidney injury (34.4%) and those with CAP and no acute kidney injury (65.6%). They found a significant association between glomerular filtration rate and mortality (chi-square = 37.1, df = 3, p < 0.001). However, it was a single- center retrospective study with small sample size and enrolled only elderly patients (SD: 87.9). Also, Garcia-Vidal et al. [16]. analyzed a total of 2457 hospitalized patients with CAP in a prospective, longitudinal and observational study. They found that 41.1% of patients with altered mental status upon admission died within the first 48 h, and only 10.8% survived beyond 30 days. Utilizing RS that encompass intrinsic patient traits, clinical indicators and lab results could identify those at elevated risk of long-term mortality from lower respiratory tract infections [11, 13]. Yet, evidence concerning the utility of clinical variables and RS in CAP patients remains sparse, given the limitations of long-term mortality studies [11,12,13,14]. The most commonly applied long-term scale is the Community-Acquired Pneumonia Severity Index (CAPSI). This index developed a predictive model to estimate one-year mortality, with a weighted score that includes: age over 80 years (4 points), congestive heart failure (2 points), dementia (6 points), respiratory rate ≥ 30 breaths per minute (2 points), and blood urea nitrogen over 30 mg/dL (3 points) as predictors of higher risk, achieving a ROC curve of 0.76 [4]. Understanding the long-term performance of these scales is crucial due to the rise in one-year mortality and the lack of information on risk estimation for short-term to one-year variables. This aids in identify impactful scales for effective patient follow-up and mortality prevention, based on the performance in predicting mortality. Hence, this article aims to compare the effectiveness of 16 scores in forecasting mortality at three, six, and twelve months in adult CAP patients.

Methods

A retrospective cohort study on individuals diagnosed with CAP was conducted across two hospitals in Colombia. Patients were assessed and admitted to emergency room and ICU from January 2010 to January 2020.

Eligibility criteria

Participants of both sexes, aged 18 years or older, diagnosed with CAP who had spent at least 6 h in the emergency room, or whom had been admitted to the ICU due to this same condition, were recruited. Pneumonia was defined as pulmonary infiltrate on chest X-ray not seen previously, plus at least one symptom compatible with pneumonia such as cough, fever, dyspnea, and/or chest pain [4, 8, 9]. Patients with nosocomial infection, incomplete clinical history, absence of 12-month survival information, and survival of less than 30 days were excluded (thus evaluating exclusively the mortality of hospitalized patients with CAP who survived the first 30 days).

Skilled healthcare professionals, including doctors and specialized nurses, conducted the subject selection process upon entry into the study center. Initially, 7454 potentially eligible subjects were identified, of which 3688 were included in the analysis after excluding those with clinical presentations incompatible with pneumonia, nosocomial infection, lack of medical history, absence of data on 12-month survival, and survival of less than 30 days (Fig. 1).

Fig. 1
figure 1

Patient Admission Flowchart

Additionally, medical records needed to include sufficient information for the evaluation of CURB-65, CRB-65, Severe Community Acquired Pneumonia (SCAP), CORB, ADROP, National Early Warning Score (NEWS), Pneumonia Shock, Risk of Early Admission to ICU (REA-ICU), PSI, SMART-COP, SMRT-CO, SOAR, Quick Sequential Organ Failure Assessment (qSOFA), Systemic Inflammatory Response Syndrome (SIRS), CAPSI, and Charlson Comorbidity Index (CCI) scores (Supplementary Table 1.)

Variables

Sociodemographic variables (age and gender), comorbidities through the CCI, vital signs, consciousness status, chest X-ray findings such as multilobar involvement or pleural effusion reported by a radiologist, and laboratory tests including arterial gases, hematocrit, white blood cell count, blood urea nitrogen, serum sodium, albumin, and blood glucose were included. Additionally, the need for ICU, invasive mechanical ventilation (IMV), and/or vasopressor support were considered. Sepsis was defined as a life-threatening organ dysfunction resulting from an imbalanced host response to infection [17]. Patients with septic shock were identified based on their need for vasopressors to maintain a mean arterial pressure of > 65 mmHg in the clinical setting [18]. The dependent variable was mortality evaluated at 3, 6, and 12 months following CAP diagnosis.

To minimize possible errors in outcome classification, the research team collecting data from clinical records had medical expertise in diagnosing the studied pathology. To reduce typing bias, at least two team members reviewed the information, and in case of inconsistency, a third team member reviewed the information and made the final decision.

Sample size

Sample size calculation used data from Lim et al. [19], describing a sensitivity of 75% and specificity of 69% for CURB-65, and Fine et al. [20], reporting a sensitivity of 100% and specificity of 52.2% for PSI. Using the formula for paired diagnostic tests, with an expected mortality of 6.1%, 90% power, and statistical significance of 0.05, a minimum of 625 subjects was required.

Missing data

An imputation analysis addressed missing data, employing weighted mean imputation for quantitative variables and logistic regression for qualitative variables with a loss of less than 10% [21]. Variables with more than 10% data loss were excluded. A comparison between non-imputed and imputed results ensured that imputation did not introduce bias or significantly alter the original data.

Statistical analysis

Data were entered into REDCap (Research Electronic Data Capture) [22] for subsequent analysis using SPSS 25 software (IBM Corp. IBM SPSS Statistics for Windows, Version 25.0 licensed). Qualitative variables were reported in frequencies and percentages, while quantitative variables were summarized using mean and standard deviation for normally distributed ones and median and interquartile range for non-normally distributed ones. Bivariate analysis between questionnaires and the outcome (alive or dead) was performed using the chi-square test for qualitative variables and Student’s t-test or Mann-Whitney U test for quantitative variables [21]. The sociodemographic variables, clinical variables, laboratory test results, and diagnostic imaging findings obtained from patients’ medical records were used to calculate the scores for each of the RS included in the study (Supplementary Table 1). Scores obtained for each questionnaire were used to calculate a receiver operating characteristic (ROC) curve, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (LR+), and negative likelihood ratio (LR-), using the established cutoff point for each questionnaire (Supplementary Table 2). A comparison was made between the different ROC curve obtained, using the DeLong test [19], considering a value of p < 0.05 as significant. The ROC curve was interpreted as follows: 0.50, absence of discriminatory capacity; 0.51 to 0.60, almost null discriminatory capacity; 0.61 to 0.69, poor discriminatory ability; > 0.7 to 0.8, acceptable discrimination ability; > 0.8 to 0.9, excellent discriminatory capacity; and > 0.9, outstanding discriminatory capacity [21].

Results

Population characteristics

A total of 3688 were included in the final analysis (Fig. 1). Mortality at 3, 6, and 12 months was 5.2%, 8.3%, and 16.3%, respectively. The average age was 63.5 years (SD: 21.39), and 59.3% (2188/3688) of the patients were male. The most common symptoms in the overall population were cough in 82.6% (3045/3688), dyspnea in 67.4% (2486/3688), and fever in 47.6% (1756/3688) (Table 1). The most prevalent comorbidities were arterial hypertension in 46.1% (1699/3688), and COPD in 25.5% (941/3688).

Table 1 General characteristics of the population

Arterial gases and blood tests

The inspired fraction of oxygen in survivors was 28.1% (SD: 11.59) compared to 30.3% (SD: 14.98) in non-survivors. Blood urea nitrogen was 4.7 mg/dl lower in survivors compared to the deceased group (22.5 vs. 27.2). Laboratory test results are described in Supplementary Table 3.

Treatment during hospitalization

12.1% (73/602) of deceased patients had septic shock compared to 6.4% (198/3086) of surviving patients (Supplementary Table 4). The use of vasopressor support and systemic corticosteroids was 12.3% (78/602) and 31.6% (190/602) in deceased patients, respectively. The need for ICU was 7.6% higher in deceased patients compared to the survivor group (17.6 vs. 10).

Performance of RS for Mortality at 3, 6, and 12 months

At 3 months, PSI, CCI, and CRB-65 scores showed ROC curves of 0.74 (95% CI: 0.71–0.77), 0.71 (95% CI: 0.67–0.74), and 0.70 (95% CI: 0.66–0.74), (Table 2; Fig. 2). At 6 months, PSI and CCI scores showed performances of 0.74 (95% CI: 0.72–0.77) and 0.72 (95% CI: 0.69–0.74), respectively (Table 3; Fig. 3). At 12 months, all evaluated scores showed poor discriminatory capacity, including PSI, which decreased its capacity to poor with an ROC curve of 0.64 (95% CI: 0.61–0.66) (Table 4; Fig. 4). The score with the lowest performance in predicting mortality at 3, 6, and 12 months was SIRS with an ROC curve of 0.51 (95% CI: 0.47–0.55), 0.50 (95% CI: 0.47–0.54), and 0.50 (95% CI: 0.47–0.52), respectively.

Table 2 Performance of risk scores in Community Acquired Pneumonia for 3 months Mortality Prediction
Fig. 2
figure 2

Performance of risk scores at 3-month

Table 3 Performance of risk scores in Community Acquired Pneumonia for 6 months Mortality Prediction
Fig. 3
figure 3

Performance of risk scores at 6-month

Table 4 Performance of risk scores in Community Acquired Pneumonia for 12 months Mortality Prediction
Fig. 4
figure 4

Performance of risk scores at 12-month

Discussion

In this cohort study describing the performance of RS in predicting mortality in patients with CAP, it was observed that at 3 months, PSI, CCI, and CRB-65 showed acceptable predictive performance. At 6 months, only PSI and CCI maintained acceptable levels of accuracy. For the 12-month period, all evaluated scores exhibited very limited discriminatory ability, ranging from poor to nearly negligible. The use of RS may serve as a complementary tool for predicting long-term mortality in hospitalized CAP patients.

Several RS have been proposed to predict the prognosis of severe CAP patients and determine the need for ICU treatment. Anurag et al. [24] described that SCAP (ROC curve: 0.873) performed well in predicting CAP severity compared to PSI (ROC curve: 0.713) and CURB-65 (ROC curve: 0.643). Furthermore, SCAP showed excellent performance in predicting 14-day mortality, with a ROC curve of 0.96. These findings are consistent with those described by España et al. [25], who validated the SCAP score as acceptable for predicting 30-day mortality in CAP patients. However, although studies analyzing SCAP show good performance in predicting short-term mortality [], our study generated new results indicating poor performance of SCAP in predicting mortality at 3 and 6 months, and negligible at 12 months.

Alan et al. [26] described the long-term predictive performance of PSI and CURB-65 over a 6-year follow-up period in hospitalized CAP patients. Initial scores had prognostic accuracy with a ROC curve of 0.79 to 0.83 (p < 0.001) and 0.73 to 0.80 (p < 0.001) after two years of follow-up, respectively. According to our data, PSI showed the best performance in predicting mortality at 3, 6, and 12 months, while CURB-65 exhibited poor performance in these periods. Additionally, in our comprehensive evaluation of clinical variables and laboratory results, such as comorbidities, renal involvement, and age, CCI and NEWS scores demonstrated a strong negative predictive value independently for long-term mortality from CAP compared to other scores, such as CURB-65, CORB, SMART-COP, among others.

Kaplan et al. [27] assessed 158,960 hospitalized patients, comparing those with CAP to non-CAP controls. Their analysis was one of the first ones to revealed that the most closely associated comorbidities with a higher 1-year mortality rate were metastatic solid tumours (85.5%), renal disease (64.7%), and hepatic disease (56.2%). Currently, most scores for assessing CAP incorporate these comorbidities. However, it’s noteworthy that widely used scores such as CURB-65, CRB-65, SCAP, CORB, ADROP, NEWS, Pneumonia Shock, PSI, SMART-COP, SMRT-CO, SOAR, qSOFA, and SIRS are primarily designed to predict outcomes within a 30-day timeframe. There is a lack of consensus regarding the optimal tool for predicting long-term outcomes, leading to uncertainty when determining mortality risk estimates.

Uranga et al. [4] developed and validated a prognostic index specific for predicting one-year mortality in hospitalized CAP patients. The variables included in the RS construction were age ≥ 80 years with 4 points, chronic heart failure with 2 points, dementia with 6 points, respiratory rate ≥ 30 breaths/min with 2 points, and blood urea nitrogen ≥ 30 mg/dL with 3 points. It was observed that the risk of one-year mortality increased by 24% (HR: 1.24; 95% CI: 1.19–1.28) for each unit increase in the predictive model. The CAPSI showed predictive accuracy of 0.76 in the derivation cohort and 0.77 in the validation cohort, results superior to those reported in our study.

Long-term survival rates exhibit significant variability in the literature. According to Johnstone et al. [28], in their study involving 3284 patients diagnosed with CAP, 12% deceased within the first 30 days, 28% within the first year, and 53% within a 3.8-year follow-up period. The mean age of the deceased was 76.3 years (SD: 13.4), with 56% being male. Conversely, Koskela et al. [29]. described long-term mortality rates in a population with a history of CAP and diabetes, reporting mortality rates of 54% and 10% at 5 years and 11 months of follow-up, respectively, among patients with and without diabetes. Bruns et al. [30]. utilizing municipal records and death certificates in the Netherlands, calculated cumulative mortality rates at 1, 5, and 7 years of 17%, 43%, and 53%, respectively. In comparison with various previous observational studies [23,24,25,26,27,28,29,30], the observed mortality in our study is relatively low, possibly attributable to a younger population, variability in the severity of the illness, and/or the exclusion of subjects with short-term mortality.

Limitations

Among the limitations of our study is its observational nature, based on information obtained from clinical records, which may have data omissions [21]. It is important to note that the personnel responsible for data collection received adequate training to ensure accuracy in the transcription of the information obtained. Additionally, our study was conducted in two hospital, which may be considered a strength for generalizing the results. By excluding subjects who did not survive the first 30 days, we observed a 15% reduction in the population with CAP, which may influence the overall mortality rate. However, our primary objective is to evaluate the performance of these scores beyond this initial period. We also believe that the sample size included was sufficient to meet the objectives [21].

It is relevant to note that information on causes of death at 12 months could not be obtained, which would have provided important additional data for analysis. Therefore, we suggest that future studies address this issue for a more comprehensive understanding of the results. We consider it pertinent to conduct additional research that can corroborate our findings, as well as externally validate established RS to predict mortality at 12 months, such as those described by Uranga et al. [4], which would help strengthen the evidence and improve accuracy in predicting long-term outcomes in CAP patients.

Conclusion

In predicting mortality in patients with CAP, it was observed that at 3 months, PSI, CCI, and CRB-65 showed acceptable predictive performances. At 6 months, only PSI and CCI maintained acceptable levels of accuracy. For the 12-month period, all evaluated scores exhibited very limited discriminatory ability, ranging from poor to almost negligible. The use of RS may serve as a complementary tool for predicting long-term mortality in hospitalized CAP patients.