Background

Community-acquired pneumonia (CAP) is a major public health challenge worldwide, with 2.6 million deaths in 2019 [1,2,3,4]. Multiple respiratory pathogens are known to cause CAP. Influenza virus (IFV), human parainfluenza viruses (HPIVs), respiratory syncytial virus (RSV), human enterovirus (EV)/rhinovirus (HRV), human adenonvirus (AdV), human coronavirus (HCoV) and human metapeumovirus (HMPV) are the major viruses in CAP; while Streptococcus pneumoniae (S. pneumoniae), Haemophilus influenzae (H. influenzae), Mycoplasma pneumoniae (M. pneumoniae), and Legionella pneumophila (L. pneumophila) are common respiratory bacteria in CAP [5,6,7,8]. Moreover, the aetiology of CAP varies geographically due to the impact of social, economic, environmental and demographic factors [7,8,9].

Due to the complexity of CAP aetiology, the implementation of multiplex molecular detection to figure out the incidence of major pathogens has significantly improved our understanding of CAP aetiology [3, 5, 6]. However, routine screening of respiratory viruses other than IFVs is only recommended in patients with severe CAP and immunodeficiency [10]. More sufficient laboratory-based aetiological evidence is essential to improve the understanding of the pathogenesis of CAP and to develop effective guidelines for aetiological diagnosis and anti-infective therapies to CAP.

The aetiology of CAP in adults and its association with adverse outcomes in patients in China, one of the most populous countries in the world that is undergoing rapid industrialization, urbanization and ageing, have not been well defined based on a large-scale comprehensive aetiological study of both viral and bacterial pathogens by multiplex detection. Here, we conducted a multicentre prospective study to determine the aetiology of CAP in adults in China. We also evaluated the association between respiratory pathogens and disease severity and identified priority pathogens for screening.

Methods

Study design and participants

From 1 to 2014 to 31 December 2019, hospitalized CAP patients aged ≥ 14 years were recruited from 10 hospitals in nine cities located in different geographical regions in mainland China. CAP was diagnosed according to the 2007 guidelines of the American Thoracic Society [11]. Detailed inclusion and exclusion criteria are provided in the supplementary materials (Additional file 1: Supplementary Methods). Demographic and clinical information were obtained from clinical records.

Specimen collection and respiratory pathogen detection

For each patient, sputum or bronchoalveolar lavage fluid (BALF) and whole blood samples were collected at enrolment. For a few patients, pleural effusion and endotracheal aspirate samples were collected. A total of 200 µl samples were used for nucleic acid extraction as previously reported by using easyMAG (bioMerieux, Marcy l’Etoile, France) [9, 12, 13]. Thirty-three respiratory pathogens were detected by using the multiplex real-time PCR method (Fast Track Diagnostic, Junglinster, Luxembourg), including IFVs (A, B and C) and the 2009 subtype of IFVA/H1N1, HPIV 1–4, HCoVs (NL63, 229E, OC43 and HKU1), HMPV A and B, EVs (including HRVs), RSV A and B, Adv, human parechovirus (HPeV), human bocavirus (HBoV), Pneumocystis jirovecii (P. jirovecii), M. pneumoniae, Chlamydophila pneumoniae (C. pneumoniae), S. pneumoniae, H. influenzae, H. influenzae type b (Hib), Staphylococcus aureus (S. aureus), Moraxella catarrhalis (M. catarrhalis), Bordetella spp. (except for Bordetella parapertussis), Klebsiella pneumoniae (K. pneumoniae), L. pneumophila and Salmonella spp. In addition, bacterial culture testing was performed on some specimens. Positive results were considered in further analysis.

Statistical analysis

The Wilcoxon rank sum test or t test was used to analyse continuous variables. The chi-square test or Fisher’s exact test was used to analyse categorical variables. Multivariate logistic regression was used to evaluate the association between aetiological factors and severe CAP adjusted by age, sex, season, duration of illness, previous antibiotic exposure (defined as antibiotic use within 5 days prior to admission) and underlying diseases. A two-sided p < 0.05 was considered statistically significant. All statistical analyses were conducted by using IBM SPSS (v.19.0; IBM Corp., Armonk, NY, USA) and the R package (version 3.6.1) [14].

The association of the detected pathogens with the risk of severe pneumonia was calculated using the adjusted odds ratio (OR). The aetiological estimation fraction was determined based on the detection rate of pathogens and their risk associations with severe pneumonia. The ten major pathogens were identified by the aetiological estimated fraction. The calculation form of the aetiological estimation fraction is as follows:

$$\begin{array}{c}{\rm{Aetiological}}\,{\rm{estimated}}\,{\rm{fraction}}\,(\% )\,{\rm{ = }}\,\\{\rm{Detection}}\_{\rm{rate}}\,(\%)\,\times\,{\rm{Severe}}\_{\rm{OR}}\end{array}$$

Results

Patient characteristics

A total of 3,403 hospitalized adults with CAP were enrolled in this study. Of which 462 (13.58%) had severe pneumonia, 317 (9.32%) required intensive care, and 66 (1.94%) died in the hospital (Table 1). The median age of the patients was 58 (interquartile range, 40–70) years. There were 954 (6.61%) patients with underlying diseases, and the most common clinical symptoms were cough (3,005, 88.30%), sputum (2,552, 74.99%), dyspnoea (442, 12.99%) and chest pain (437, 12.84%) (Table 1). The number of cases from each site ranged from 104 to 599 (Additional file 2: Table S1).

Table 1 Demographic and clinical characteristics of enrolled patients with community-acquired pneumonia requiring hospitalization

Among the 3,123 patients with available antibiotic exposure data before admission, 720 (22.99%) patients had antibiotic exposure before admission. The most common antibiotics were β-lactams (351, 11.21%), followed by quinolones (232, 7.41%), macrolides (39, 1.25%) and other drugs (207, 6.61%). A higher proportion of antibiotic exposure before admission was found in patients with severe CAP than in those with non-severe CAP (37.12% vs. 20.94%, p < 0.001) (Additional file 3: Table S2).

Detection of respiratory pathogens

A total of 3,213 eligible sputum, 190 BALF, 656 blood, 13 pleural effusion and 12 endotracheal aspirate samples were collected in the study. Molecular test results were available for all 3,403 respiratory specimens. In parallel, bacterial culture testing was performed on 1,001 (54, 5.39%) sputum, 63 (2, 3.17%) BALF, 260 (9, 3.46%) blood, 11 (0, 0%) pleural effusion and 9 (2, 22.22%) endotracheal aspirate samples. All positive results were included in further aetiological evaluation. At least one pathogen was detected in 2,054 (60.36%) patients; specifically, a single pathogen was detected in 1,329 (39.05%) patients, and multiple pathogens were detected in 725 (21.30%) patients.

M. pneumoniae (376, 11.05%), H. influenzae (363, 10.67%), K. pneumoniae (355, 10.43%), IFVA (323, 9.49%), HRVs (307, 9.02%), S. pneumoniae (253, 7.43%), S. aureus (153, 4.50%) and Adv (100, 2.94%) were the most frequently detected pathogens, accounting for more than 80% of all positive results (Fig. 1, Additional file 4: Table S3). The detection rate of remnant respiratory pathogens was less than 3.00%. Regardless of the viral subtype, IFVs (A, B and C) (398, 11.70%) were the most frequently detected pathogens (Additional file 5: Figure S1). The overall pathogen detection rate varied from 37.23 to 79.09% among the nine sites during the study period (Additional file 6: Table S4).

Fig. 1
figure 1

Pathogen detection among Chinese adults with community-acquired pneumonia requiring hospitalization, 2014–2019. The dark shade represents single detection, while the light shade represents co-detection. IFVs, influenza viruses (A, B and C); EVs, enteroviruses; HRVs, human rhinoviruses; HCoVs, human coronaviruses; HPIVs, parainfluenza viruses; Adv, adenovirus; RSV, respiratory syncytial virus; HMPV, human metapneumovirus; HBoV, human bocavirus; HPeV, human parechovirus; M. pneumoniae, Mycoplasma pneumoniae; K. pneumoniae, Klebsiella pneumoniae; H. influenzae, Haemophilus influenzae; S. pneumoniae, Streptococcus pneumoniae; S. aureus, Staphylococcus aureus; M. catarrhalis, Moraxella catarrhalis; P. jirovecii, Pneumocystis jirovecii; C. pneumonia, Chlamydia pneumoniae; L. pneumophila, Legionella pneumophila/Legionella longbeachae; Bordetella spp., Bordetella species (except for Bordetella parapertussis); Salmonella spp., Salmonella species

Compared with bacterium-positive patients (n = 948) and pathogen-negative patients (n = 1,349), virus-positive patients (n = 654) had a higher rate of chills (p = 0.006, p = 0.046), severe CAP (p = 0.002, p < 0.001), ICU admission (p = 0.024, p < 0.001), and noninvasive ventilation (p = 0.003, p = 0.007). When underlying disease was taken into account, virus-positive patients had a higher rate of congestive heart failure than patients positive for bacteria and those with negative detection results (p = 0.001, p = 0.012, respectively) (Table 1).

Age distribution of patients with respiratory infections

To identify the age distribution of patients with infections, we categorized four age groups as 14–24, 25–44, 45–64 and ≥ 65 years to compare the frequency of positive detection. The highest positive rates of detection of total pathogens (69.26%) and bacteria (56.28%) were found in the 14–24 age group (chi-square test, p < 0.001). While the virus-positive rate was highest in the elderly aged ≥ 65 years old (34.87%, chi-square test, p = 0.017) (Fig. 2A). For each pathogen, higher frequencies of M. pneumoniae and C. pneumoniae were found in the 14–24 year-old group (p < 0.001), IFVA (p = 0.005) and RSV (p = 0.034) in the 45–64 year-old group, and HPIV3 (p = 0.044) and K. pneumoniae (p < 0.001) in the ≥ 65 year-old group (chi-square test) (Fig. 2B and C, Additional file 7: Table S5). The detection rate of the remaining screened pathogens showed no significant difference among age groups.

Fig. 2
figure 2

Age distribution of community-acquired pneumonia patients infected with respiratory pathogens. (A) The detection rate of viruses and bacteria in different age groups. (B) Positive detection of bacteria in different age groups. (C) Positive detection of viruses in different age groups. The green bar indicates single bacterium detection, the orange bar indicates single virus detection, and the dark green bar indicates co-detection of bacteria and viruses

Temporal distribution of respiratory pathogens

The pathogen detection rates varied during the study period, with the highest rate in 2014 (72.48%) and the lowest in 2016 (45.53%) (Additional file 8: Table S6). Although the detection rate of each pathogen fluctuated slightly, the most commonly detected pathogens were relatively consistent, with eight pathogens ranking in the top 10 every year. Seasonality was analysed based on the peak detection rate of each pathogen, and 10 viruses and 4 bacteria were found to have significant seasonality, for example, K. pneumoniae peaks in summer (June to August), while IFVA and RSV in winter (December to February) (Fig. 3, Additional file 9: Table S7).

Fig. 3
figure 3

Temporal distribution of community-acquired pneumonia in patients with respiratory pathogen infections. The area under the curves represents monthly detection rates. The horizontal axis shows the variation in pathogen detection between seasons. The seasons were defined as spring (March to May), summer (June to August), autumn (September to November), and winter (December to February). Data between years (2014–2019) are shown in different colours

Co-detection of respiratory pathogens

Two or more (termed “multiple” hereafter) pathogens were co-detected in 725 (21.30%) patients, namely, multiple viruses in 78 (2.29%) patients, multiple bacteria in 195 (5.73%), and viruses with bacteria in 452 (13.28%) patients (Additional file 4: Table S3). Two pathogens were found in 542 (74.76%) patients, three in 145 (19.28%), and four or more in 38 (5.24%). The most common co-detected pathogens were IFVA with H. influenzae (n = 53), S. pneumoniae with H. influenzae (n = 51) and K. pneumoniae with H. influenzae (n = 42) (Additional file 10: Figure S2).

Associations of pathogens with severe CAP

The overall positive detection rate was significantly higher in patients with severe (67.32%) versus non-severe CAP (59.27%) (chi-square test, p = 0.001). K. pneumoniae (OR 1.599, 95% confidence interval [CI] 1.099–2.327, p = 0.014), S. aureus (OR 1.883, 95% CI 1.032–3.434, p = 0.039), L. pneumophila (OR 4.086, 95% CI 1.946–8.582, p < 0.001), IFVA (OR 2.771, 95% CI 1.954–3.930, p < 0.001) and RSV (OR 2.315, 95% CI 1.343–3.992, p = 0.003) were related to severe CAP after adjusting for the confounding factors of age, sex, season, days post illness onset, prior antibiotic exposure and underlying diseases (Fig. 4A). We also found that the co-detection of IFVA with S. aureus was more frequent in patients with severe CAP than single S. aureus detection (chi-square test, p = 0.015) and that co-detection of K. pneumoniae with S. aureus was more frequent in patients with severe CAP than single K. pneumoniae detection (chi-square test, p = 0.048) (Fig. 4B). Such findings further indicate the important role of pathogens, for example, IFVA and S. aureus, in severe CAP.

Fig. 4
figure 4

Associations of respiratory pathogens with severe community-acquired pneumonia (CAP). (A) The detection rate of respiratory pathogens and adjusted odds ratios (ORs) in patients with severe CAP and non-severe CAP. ORs adjusted for age (years), sex, season, duration of illness (days), previous antibiotic exposure and underlying diseases. (B) Comparison of the effects of multiple detections and single detection on severe disease

Considering the overall positive detection rate and the associations with severe infections, we proposed that the ten major pathogens according to the aetiological estimation fraction, including the top eight most frequently detected pathogens, RSV and L. pneumophila, accounting for 76.06–92.52% of all positive detection results across sites, should be given priority in screening (Fig. 5, Additional file 11: Table S8).

Fig. 5
figure 5

Cumulative proportion of the ten major pathogens at each sentinel site in community-acquired pneumonia patients. The line chart shows the cumulative positive rates of the ten major site-specific pathogens calculated by taking into account the detection rate and the risk of severe illness according to the site-combined data. The number in the box indicates the ranking of the pathogen detected at that site

Discussion

In this study, we identified the respiratory pathogens spectra in adults with CAP in China via a multicentre prospective study. Our data showed that the pathogen spectra were consistent across different geographical regions over years. We further demonstrated that 10 major pathogens account for 76.06–92.52% of all positive detections across sites. K. pneumoniae, S. aureus, IFVA and RSV were risk factors related to severe CAP. These findings update our understanding of the aetiology of CAP in China, which may largely inform the development clinical pathogen diagnosis, anti-infective intervention, and even vaccination.

Molecular testing method has been shown to improve pathogen diagnosis efficiency and feasibility according to several large-scale studies on the aetiology of CAP [5, 15,16,17,18]. By using the RT‒PCR method, the overall positive detection rate of common respiratory pathogens in our study was 60.36%, which is comparable with previous reports, for example, in Africa (59.6%), Europe and the United States (60.6%) [16, 17], demonstrating the reliability of molecular test methods. IFVs, S. pneumoniae, H. influenzae, M. pneumoniae and HRVs were considered the major detected CAP pathogens [8, 9, 17,18,19,20]. According to our results, the viral spectrum was similar to that in previous studies, but the dominant nonviral agents varied compared with the data from other countries. For example, S. pneumoniae and M. pneumoniae were found at high frequencies in adults with CAP in USA, Finland and Australia, and S. pneumoniae and H. influenzae were the most frequently detected bacteria in adults with CAP in Sweden, Japan and Chile [5, 19,20,21,22,23], while M. pneumoniae and K. pneumoniae were found at high frequencies in our study. In addition, K. pneumoniae was also identified as the fourth most commonly detected pathogen in patients with acute respiratory infection in China [24], reflecting the regional distribution of the pathogens. The frequency of detected pathogens may be affected by multiple factors, including medical, social, economic, environmental, geographical, and demographic factors. The difference in the implementation of national immunization programs and the use of antibiotics may also influence the aetiology of respiratory infections.

Elucidating the roles of different pathogens in severe CAP is critical for identifying the risk factors for severe infections to improve disease management. However, confounding factors might influence this determination [25]. In this study, after adjusting for confounding factors, including age, sex, season, days post symptom onset, underlying diseases, and antibiotic exposure before admission, IFVA, K. pneumoniae, S. aureus, and RSV were found to be strongly associated with severe infections, suggesting the important aetiological roles of these pathogens in severe CAP. K. pneumonia was found to be correlated with adverse outcomes and had a high detection rate in our study. These results indicated that K. pneumoniae should be seriously considered as a priority screening pathogen in adults with CAP. Further investigations should be performed to determine the pathogenicity and virulence characteristics of K. pneumoniae.

Aetiological studies on respiratory infections have defined a broad range of pathogens. However, the prevalence of these dozens of pathogens including viruses, bacteria, fungi and parasites varies largely, indicating the differential role of the pathogens play in respiratory infections. It is hard and unnecessary to cover all the respiratory pathogens in clinic panel for pathogen diagnosis for benefit/cost and technical reasons. Therefore, precise definition is required to design a panel which can cover the majority of respiratory pathogens with appropriate cost based on studies on respiratory pathogen prevalence. Our data showed that the ten major pathogens in CAP, including M. pneumoniae (376, 11.05%), H. influenzae (363, 10.67%), K. pneumoniae (355, 10.43%), IFVA (323, 9.49%), HRVs (307, 9.02%), S. pneumoniae (253, 7.43%), S. aureus (153, 4.50%), Adv (100, 2.94%), RSV (80, 2.35%), and L. pneumophila (35, 1.03%), account for 76.06–92.52% of all positive detection results across sampling sites. These findings can inform the design of priority pathogen screening in China, which may increase the efficacy of common pathogen screening and decrease the unnecessary expenditure for aetiological diagnosis. Our data showed that S. pneumoniae and H. influenzae are among the top 10 aetiological agents detected in adults with CAP in our study. As the two vaccines have not been included in the national immunization program in China, our data strongly indicate the necessity to prioritize the inclusion of pneumococcal conjugate vaccines (PCVs) and Hib vaccines to mitigate the burden of the corresponding infections [26, 27].

Normally, the detection of P. jirovecii is mainly reported in immunocompromised hosts. However, serological tests support the possibility of subclinical infection, exposure, or fixation [28]. We detected P. jirovecii by using a molecular method, which might increase the positive detection rate. Nearly 60% (32 of 54) of patients were positive for multiple pathogens. We considered the positive detection of P. jirovecii to be reliable, while the clinical significance of the positive detection of P. jirovecii needs intensive investigation.

Our study had several limitations. Firstly, although the presence of pathogens determined by molecular testing method has been accepted, more comparative studies involving bacterial culture and other traditional methods used in the clinic are still needed to improve the strategies used for pathogen detection [29]. Especially, some respiratory samples were found to be positive on L. pneumophila in our study, but the clinical significance of the results still needs to be investigated intensively. However, very limited cases were tested by using urinary antigen detection for L. pneumophila in clinic. Secondly, the process for sampling sputum or BALF might introduce commensal bacterial contamination from the upper respiratory tract, which might influence the detection results. In addition, self-administration of antibiotics before admission was an unavoidable confounding factor in aetiology studies. Finally, our study was designed and performed before the COVID-19 pandemic, and our findings should be further compared with the aetiology of CAP post COVID-19.

Conclusions

In conclusion, we clarified the pathogen spectrum in adults with CAP in China and characterized the pathogens associated with severe infection. On this basis, we propose to include 10 major pathogens as priorities for clinical pathogen screening in adults with CAP.