Background

Pneumonia is the main cause of hospitalization and the leading cause of death in children aged < 5 years worldwide [1]. Early diagnosis and timely treatment are important for reducing morbidity and mortality [2]. The symptoms of pneumonia are non-specific in children, and there is no single test with a high sensitivity and specificity for diagnosing pneumonia. Clinicians diagnose pneumonia in children in resource-limited settings using the World Health Organization criteria; however, the sensitivity and specificity are low, which results in misdiagnosis and overtreatment [3, 4]. Chest computed tomography is regarded as the gold standard for detecting pneumonia; however, its routine use is restricted by cost, accessibility, and radiation exposure [5].

In clinical practice, chest radiography (CXR) is a widely used imaging modality for diagnosing pneumonia [6]. However, the routine use of CXR is restricted by some diagnostic and technical limitations, including the absence of definitive diagnostic criteria and intra- and inter-observer variations [7,8,9]. Moreover, exposure to ionizing radiation in children could increase the risk of cancer later in life [6, 10, 11]. Lung ultrasound (LUS) is radiation-free, portable, and inexpensive, which can be conducted at the point of care. Furthermore, the portable ultrasonography machines was easier obtained, which raises the potential of LUS for diagnostic methods in remote settings. It could identify complications of pneumonia and is widely used for the diagnosis and management of pneumonia in children [12, 13]. However, whether the diagnostic performance of LUS and CXR for pneumonia in children is comparable remains unclear. Therefore, the current systematic review and meta-analysis was performed to compare the diagnostic performance of LUS with that of CXR in detecting pneumonia in children.

Methods

Data collection

This study was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis Statement [14]. The study protocol was registered at the INPLASY register (INPLASY202340071). We searched for studies that presented the diagnostic value of LUS with CXR for diagnosing pneumonia in children, and no restrictions were placed on publication language and status. We systematically searched PubMed, EmBase, and the Cochrane Library to screen eligible studies throughout April 2023, and used ((“pneumonia” [MeSH Terms] OR “pneumonia” [All Fields]) AND (“ultrasound” [MeSH Terms] OR (“ultrasound” [All Fields]) as search terms. The search terms were restricted to “Child: birth-18 years.” We also manually reviewed relevant reference lists, citation searches, and systematic reviews to identify any new eligible studies.

The processes of literature search and study selection were independently performed by two reviewers, and any disagreement between reviewers was resolved by discussion with an additional reviewer. Study was included if they met: (1) participants: all of individuals aged < 18.0 years, and suspected for pneumonia; (2) diagnostic tools: the study had to applied both LUS and CXR as diagnostic tools; (3) gold standard: the gold standard for diagnosing pneumonia should clear report; (4) outcomes: studies reported true positive, false positive, false negative, true negative, or data could be transformed into such; and (5) study design: no restrictions placed on study design, including prospective and retrospective design.

Data collection and quality assessment

The following variables were independently collected by two reviewers: first author’s name, publication year, country, study design, sample size, number of boys/girls, mean age, setting, pneumonia diagnosis, diagnostic tool, true positive, false positive, false negative, and true negative data. Then, the methodological quality was assessed by the quality assessment of diagnostic accuracy studies-2 (QUADAS-2), which was based on patient selection, index tests, reference standard, and flow and timing; the categories low risk, high risk, and unclear were assigned to each study [15]. Inconsistent results regarding data collection and quality assessment between reviewers were resolved by a third reviewer.

Statistical analysis

The diagnostic parameters of LUS and CXR were analyzed using true positive, false positive, false negative, and true negative data with a bivariate generalized linear mixed model and a the random-effects model. The calculated outcomes included sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR), and area under the receiver operating characteristic curves (AUC) [16, 17]. The heterogeneity among studies was evaluated using the I2 and Q statistics, and I2 ≥ 50.0% or P < 0.10 was defined as significant heterogeneity [18, 19]. Then, the ratio of sensitivity, specificity, PLR, NLR, DOR, and AUC between LUS and CXR were compared using the random-effects model [16, 17, 20]. Subsequently, subgroup analyses were performed based on country, study design, mean age, and gold standards. A funnel plot with Deeks’ asymmetry test was applied to assess potential publication bias [21]. All reported P were 2-sided, and the inspection level for pooled conclusions was 0.05. STATA software (version 12.0 StataCorp, Texas, USA) was used to perform all statistical analyses.

Results

Literature search

An initial electronic search yielded 1,315 records, and 943 studies were retained after removing duplicate studies. After the title and abstract were reviewed for relevance, 871 studies were removed. The remaining 72 studies were retrieved for detailed evaluations, and 46 studies were excluded because of other diseases (n = 31), no CXR data (n = 12), and no desirable data (n = 3). A total of seven articles were identified by manually reviewing the reference lists of relevant articles, and all of these studies were removed owing to duplicate articles. Subsequently, 26 studies were selected for quantitative meta-analysis [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47]. The literature search and study selection process are shown in Fig. 1.

Fig. 1
figure 1

The processes of literature search and study selection

Study characteristics

Table 1 summarizes the baseline characteristics of the included studies and patients. Of the included studies, 22 studies were prospective, and four studies were retrospective. These studies involved 3,401 children, and the sample size ranged from 28 to 641. The mean age of the included children ranged from newborn to 6.5 years. Twenty-one studies were performed in Western countries, and five studies were conducted in Eastern countries. Sixteen studies used clinical criteria to diagnose pneumonia, and the remaining 10 studies used CXR to diagnose pneumonia. The methodological quality of the included studies is shown in Table S1, and the overall quality of the included studies was moderate to high.

Table 1 The baseline characteristics of included studies

Sensitivity and specificity

The summary sensitivity and specificity of LUS for detecting pneumonia in children were 0.95 (95% CI: 0.93–0.97), and 0.92 (95% CI: 0.81–0.97), while the sensitivity and specificity of CXR were 0.92 (95% CI: 0.90–0.93), and 0.93 (95% CI: 0.91–0.95), respectively (Fig. 2). We noted that the sensitivity of LUS was higher than that of CXR for detecting pneumonia in children (ratio: 1.03; 95% CI: 1.01–1.06; P = 0.018), whereas there was no significant difference between LUS and CXR for specificity (ratio: 0.99; 95% CI: 0.90–1.09; P = 0.819). Subgroup analyses found that LUS was associated with a higher sensitivity than CXR in most subgroups, whereas no significant difference was observed between LUS and CXR for sensitivity if pooled studies were conducted in Eastern countries, had a mean age < 5.0 years, and used CXR diagnosed pneumonia (Table 2). Moreover, there were no significant differences in specificity between LUS and CXR in all subgroups (Table 2).

Fig. 2
figure 2

The summary sensitivity and specificity of LUS for detecting pneumonia

Table 2 Subgroup analyses for diagnostic performance of US and chest radiography

PLR and NLR

The summary PLR and NLR of LUS for detecting pneumonia were 12.31 (95% CI: 4.70-32.23), and 0.05 (95% CI: 0.03–0.08), while the PLR and NLR of CXR for diagnosing pneumonia were 24.63 (95% CI: 8.63–70.26), and 0.08 (95% CI: 0.05–0.12), respectively (Figure S1). There were no significant differences between LUS and CXR for PLR (ratio: 0.50; 95% CI: 0.12–2.07; P = 0.340) and NLR (ratio: 0.63; 95% CI: 0.32–1.21; P = 0.161). Subgroup analyses found that LUS was associated with a lower PLR than CXR if pooled studies used CXR as the gold standard. Moreover, LUS was associated with a lower NLR than CXR if the mean age of the children was ≥ 5.0 years (Table 2).

DOR

We noted that the summary DOR of LUS for detecting pneumonia was 108.53 (95% CI: 51.30-229.61), while the DOR of CXR for diagnosing pneumonia was 488.54 (95% CI: 160.82-1484.16) (Figure S2). The comparison results indicated that the DOR of LUS for detecting pneumonia was lower than that of CXR (ratio: 0.22; 95% CI: 0.06–0.85; P = 0.028). Subgroup analyses indicated that LUS was associated with a lower DOR as compared with CXR when pooled prospective studies, the mean age of children was < 5.0 years, and CXR was used as the gold standard to diagnose pneumonia (Table 2).

AUC

The AUC of LUS for detecting pneumonia in children was 0.98 (95% CI: 0.96–0.99), while the AUC of CXR for diagnosing pneumonia in children was 0.99 (95% CI: 0.98-1.00) (Fig. 3). There was no significant difference between LUS and CXR for AUC (ratio: 0.99; 95% CI: 0.97–1.01; P = 0.280). Subgroup analyses found that LUS was associated with a lower AUC than CXR when the mean age of children was < 5.0 years, and CXR was applied as the gold standard to diagnose pneumonia (Table 2).

Fig. 3
figure 3

The area under the receiver operating characteristic curves of LUS for detecting pneumonia

Publication bias

The publication bias of LUS for detecting pneumonia in children is shown in Figure S3, and the Deeks’ asymmetry test suggested no significant publication bias (P = 0.78).

Discussion

Our study found that the diagnostic values of LUS and CXR were relatively good for detecting pneumonia in children. Moreover, we noted that LUS was associated with a higher sensitivity and lower DOR for detecting pneumonia than CXR. However, we did not find any differences between LUS and CXR for specificity, PLR, NLR, and AUC. Finally, the diagnostic performance between LUS and CXR could be affected by study design, mean age of children, and gold standard for diagnosing pneumonia.

The diagnostic performance of LUS has been investigated in several systematic reviews and meta-analyses [13, 48,49,50,51]. Orso et al. identified 17 studies and found that the diagnostic performance of LUS was relatively higher, although these results were restricted by reliable reference standard [48]. Tsou et al. identified 25 studies and found that LUS could accurately detect pneumonia in children, and the performance of LUS could be affected by experienced sonographers [49]. Pereda et al. identified five studies and found that LUS could be considered an imaging alternative for detecting pneumonia in children; however, this conclusion was restricted by unstable results [13]. Xin et al. identified eight studies and supports using LUS for detecting pneumonia in children, and the most common clinical signs of LUS were pulmonary consolidation, positive air bronchogram, abnormal pleural line, and pleural effusion [50]. However, these studies only provided a summary of the diagnostic performance of LUS for detecting pneumonia in children, and the diagnostic value between LUS and CXR was not directly compared [13, 48,49,50]. Most recently, a meta-analysis conducted by Yan et al. identified 22 studies and suggested that LUS could be regarded as a reliable, valuable, and alternative diagnostic tool to CXR for detecting pneumonia in children [51]. However, this study had several shortcomings, including mistakes on data abstraction, an absence of direct comparison results, and no investigation on the diagnostic performance of LUS versus CXR in study or children with specific characteristics.

Our study found that the diagnostic performance of LUS was relatively high for detecting pneumonia in children, which was consistent with prior meta-analyses [13, 48,49,50,51]. We also noted that the diagnostic performance of LUS and CXR for detecting pneumonia in children was comparable. Furthermore, the sensitivity of LUS was higher than that of CXR, which suggests that LUS could differentiate more pneumonia cases, and the prognosis of pneumonia in children could improve. Although CXR is inexpensive and quick, it has a poor ability to distinguish alveolar and interstitial pneumonia. Additional shortcomings of CXR include ionizing radiation and inter-observer agreement [52,53,54]. The use of LUS can monitor disease progression without exposure to ionizing radiation. Studies have already demonstrated that the use of LUS could shorten emergency department stays, lower financial costs, and reduce complications related to invasive procedures [55,56,57].

Subgroup analyses found that the diagnostic performance of LUS and CXR for detecting pneumonia in children could be affected by study design, mean age of children, and the gold standard used for diagnosing pneumonia. Several reasons could explain these results: (1) the study design is significantly related to intrinsic biases, and inevitable limitations for retrospective studies include selection and recall biases. Moreover, most included studies were designed as prospective; thus, the pooled conclusions based on retrospective studies were not stable; (2) the diagnostic performance of LUS in children was higher than that in adults for detecting pneumonia [50, 58]. Our study found that LUS was superior to CXR for children aged 5.0 years or older, while the diagnostic performance of LUS was lower than CXR for children aged less than 5.0 years; and (3) numerous included studies applied CXR as the gold standard for detecting pneumonia, and the diagnostic value of CXR may have been overestimated.

This study had some limitations. First, the analysis was based on prospective and retrospective studies, and the pooled conclusions could be affected by uncontrolled selection, recall, and confounding biases. Second, the sonographer’s experience could have affected the diagnostic performance of LUS. Third, the gold standard for diagnosing pneumonia varies across the included studies, which could affect the diagnostic value of LUS and CXR. Fourth, the severity of pneumonia differed across the included studies, which could have affected the complexity of detecting pneumonia in children. Finally, the inherent limitations of meta-analyses based on published data include inevitable publication bias and restricted detailed analyses.

Conclusions

Both LUS and CXR showed high diagnostic performance in detecting pneumonia in children, and the diagnostic parameters were comparable in terms of specificity, PLR, NLR, and AUC. Moreover, we noted that LUS was associated with higher sensitivity and lower DOR for detecting pneumonia in children than CXR. Exploratory analyses found the diagnostic value of LUS were lower than CXR for detecting pneumonia in children less than 5.0 years. Thus, the LUS should be recommended for detecting pneumonia in older children. Further large-scale prospective studies should be performed to compare the diagnostic value of LUS with CXR for detecting pneumonia in children with specific characteristics.