Background

Osteosarcoma is the most frequent type of primary bone malignancy in childhood and adolescence, which originates from primitive mesenchymal stem cells that improperly form osteoblasts that then deposit malignant osteoid [1]. The combination of high-dose chemotherapy and limb-salvage surgery has been shown to prolong the overall survival of localized osteosarcoma to 65~70% [2]. However, the prognosis of patients with radiographically discernable distant metastases is still unfavorable due to a large majority of occult metastases present in lung, and the minority in bone, lymph node and other parts of the body [3]. Moreover, after limb-salvage operations, approximately 13.5% of patients have a local recurrence of the sarcoma [4]. The outcome of local recurrent osteosarcoma is even worse than for patients with metastases alone [5]. Therefore, accuracy and early detection of local recurrence and distant metastases formation have a crucial role in the risk stratification and the treatment of osteosarcoma.

Several traditional imaging modalities have been used for the diagnosis, staging and treatment monitoring of osteosarcoma, such as plain radiographs, computed tomography (CT) and magnetic resonance imaging (MRI). 18F-fluorodeoxyglucosepositron emission tomography (18F-FDG PET), through detecting the high uptake of 18F-FDG, a radioactive analogue of glucose, could identify sites with increased metabolic activity of various malignant tumors. More recently, PET/CT, which combines metabolic data from PET and imaging data from conventional CT, seems to be far more reliable in diagnosing malignancies.

A previous clinical study [6] had demonstrated that osteosarcoma was 18F-FDG-avid. Uptake of 18F-FDG was applied for the diagnosis, chemotherapy response assessment, prognosis prediction and guidance of biopsies of osteosarcoma [7]. Compared with bone scintigraphy, 18F-FDG PET and PET/CT could identify bone, lung and other metastatic lesions. Subsequently, 18F-FDG PET and PET/CT have an advantage in assessing local recurrence as they are not affected by imaging artifacts.

Multiple studies have attempted to assess the diagnostic accuracy of 18F-FDG PET and PET/CT in osteosarcoma. However, there seems to be considerable methodological variability, including the methods for evaluating the FDG uptake and the standardized uptake value (SUV) at determining whether the lesions are positive. Results remain inconclusive. Recently, two systematic review coupled meta-analyses [8,9,10] tried to further clarify this issue, but none of the included studies specifically aimed at osteosarcoma and did not statistically analyze the retrieved data. 18F-FDG PET or PET/CT have not been considered as the standardized components of the diagnostic algorithm for osteosarcoma. To reach a more precise result on this topic, the present study sought to systematically collected previously published data from literatures and performed a statistically evaluation using meta-analysis to see if 18F-FDG PET/CT is far more efficient at diagnosing osteosarcoma then present detection paradigms.

Methods

Search strategy, inclusion and exclusion criteria

Two investigators, blinded, independently and repeatedly, performed a systematic computerized article search using three databases (Medline/PubMed, Embase and the Cochrane Library) with combinations of following key words: “positron emission tomography” [all field] OR “PET” [all field] AND “osteosarcoma” [all field] OR “bone sarcoma” [all field]. No language limitations were imposed. This search process was completed on March 1, 2018 with no language and search limitations. Additionally, bibliographies of included studies were also searched by hand to explore any potentially eligible trials.

The targeted studies in the meta-analysis had to fulfill all following criteria: a) clinical trials assessing the usefulness of 18F-FDG PET or PET/CT in the diagnosis, staging, restaging and recurrence monitoring of osteosarcoma; b) patients with osteosarcoma clinically diagnosed by histopathology, follow-up or other reference methods; c) sufficient data were provided to calculate the number of true-positive (TP), false-positive (FP), false-negative (FN) and true negative (TN) cases; d) if more than one article contained overlapping data, the most comprehensive or recent article was included; and e) 18F-FDG was intravenously administered as an inducer.

Exclusion criteria for this meta-analysis were: a) in vitro or animal studies; b) trials with fewer than five participants with osteosarcoma; c) posters presented at conferences/congresses (due to the lack of data and methodology description); d) not original research (reviews, editorials, meta-analyses, letters and comments).

Data extraction and quality assessment

The following main information were extracted from original articles: first author’s surname, year of publication, source of studies, basic characteristics of the participants (numbers, age and gender), study design, inclusion interval, technical details (image devices, injection dose and methods of image analysis) and the time between injection and image acquisition). Additionally, the cases of TP, FP, TN and FN were extracted directly or recalculated through data presented in original articles based on different lesions such as primary, recurrence and metastases of lung and bone.

The methodological quality of included studies was appraised utilizing the QUADAS tool [11], which is composed of 14 items. Study following at least nine items of these scores was deemed as high quality and was included in this investigation.

Statistical analysis

This systematic review and meta-analysis confirmed the standardized items described by “the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA)” statement [12]. To assess the accuracy of 18F-FDG PET and PET/CT on the diagnosis, staging and recurrence monitoring of osteosarcoma, the pooled estimates included the sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odd ratio (DOR) and the summary receiver operating characteristic curve (sROC) and AUC. Following recommendations of Cochrance Handbook (www.cochrance.org/trainig/cochrance-handbook), study-heterogeneity was evaluated using Chi-squared and I-square statistic algorithms. Low heterogeneity was defined as I-square < 50% and with P> 0.1. To obtain a reliable result, all analyses were performed using the random-effect model. The DOR is an indicator of the test accuracy, ranging from zero to infinity. A higher DOR indicates that the test is more accurate. The areas under the curve (AUC) and Q*-index are two important statistics that reflect the diagnostic value. The AUC is defined as the area under sROC, and Q* is the point where the sensitivity is equal to the specificity. The statistical analyses for the detection of primary lesions, recurrence, lung metastases, bone metastases and all distant metastases were performed separately.

Results

Study selection

At primary retrieval, a total of 1952 articles were identified, based on the search criteria, of which 1892 were subsequently deemed non-viable after inclusion and exclusion criteria were applied. Out of the 84 articles resulting from preliminary screening, 37 articles were referred to irrelevant studies, 5 articles were duplicate publications and 21 articles lacked sufficient data to calculate evaluation indicators and were excluded from the meta-analysis. The remaining 26 studies [13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38] deemed viable, published from 1998 to 2017, were included in the meta-analysis. Although five of the trials [14,15,16,17,18] showed signs of having utilized the same patients, these were included in a different subgroup analyses. The selection process and reasons other articles were excluded are described in Fig. 1.

Fig. 1
figure 1

Selection process for studies included in the meta-analysis

Characteristics of included studies

The basic characteristics of the 26 included studies are summarized in Table 1. The sizes of patients in the studies ranged from 5 to 206, and a total of 798 osteosarcoma patients were included. A total of 17 studies were retrospective design types whereas 8 studies were prospective. Based on the QUADAS score, only one study [22] had a score of 9, nine studies [14, 16,17,18, 25, 30, 31, 34, 35] had a score of 11, five studies [15, 21, 23, 24, 32] a score of 12 with all remaining studies [13, 19, 20, 26,27,28,29, 33, 36,37,38] possessing a score of 13. The major information of PET or PET/CT imaging of each study was summarized in Table 2. Among them, sixteen studies presented that image acquisition were performed approximately 60 min after FDG injection. The results of whole-body 18F-FDG PET or PET-CT were analyzed visually by experienced radiologists in 7 studies [13, 20, 22, 30, 31, 34, 37] and by standardized uptake values in 19 studies [14,15,16,17,18,19, 21, 23,24,25,26,27,28,29, 32, 33, 35, 36, 38]. The diagnostic data (TPs, FPs, TNs and FNs) were extracted directly or calculated through data provided in the tables or bodies of each original article (Table 3).

Table 1 Main characteristics of the included studies
Table 2 Technical aspects of the included studies
Table 3 Diagnosis accuracy data on each examination– or lesion–based analysis

Primary lesion

The diagnosis of the primary lesion was analyzed on a patient-based level. Fourteen studies [13, 15, 19, 21,22,23,24,25,26,27, 30, 32, 33, 36] with 243 patients, with known osteosarcoma, were available. There were no threshold effects in this data (p-value = 1.000). All primary lesions were FDG-avid. The combined sensitivity of 18F-FDG PET and PET/CT in the detection of osteosarcoma was 100%, while no specificity could be obtained. There was no statistically significant heterogeneity in these estimates of sensitivity (I-square = 0.0%).

Recurrence

Results assessing the diagnostic performance of 18F-FDG PET and PET/CT for detecting recurrence of osteosarcoma as generated from the 7 studies [18, 24, 28, 29, 35, 37, 38], showed that the pooled sensitivity of 91% (95% CI of 81–96%), a specificity of 93% (95% CI of 87–97%), a PLR of 7.36 (95% CI of 3.54–15.30), a NLR of 0.14 (95% CI of 0.07–0.29), a DOR of 63.98 (95% CI of 19.29–212.18), the Q*-index of 0.8842 and AUC of 0.9452 (Fig. 2). No significance of heterogeneity between study for the pooled sensitivity (I-square = 0.0%) and the threshold effect (p-value = 0.269) were presented in this analysis.

Fig. 2
figure 2

Diagnostic performance of 18F-FDG PET and PET/CT for the recurrence of osteosarcoma: (1) Pooled sensitivity (2) Pooled specificity (3) Pooled diagnostic odds ratio (4) Summary receiver operating characteristic curve (sROC) with the Q*-index

Lung metastases

In total, 8 studies [14, 17, 28,29,30, 33, 34, 38] addressed the diagnosis of lung metastases. The threshold effects was no found in the data (p-value = 0.233). The combined sensitivity of 18F-FDG PET and PET/CT in detecting lung metastases of osteosarcoma was 81% (95% CI of 72–88%), specificity 94% (95% CI of 89–97%), PLR 8.13 (95% CI of 4.19–15.77) and NLR 0.26 (95% of CI 0.17–0.40). The pooled DOR was 48.85 (95% CI of 18.92–126.14), whereas the Q*-index was 0.8614 with an AUC of 0.9268 (Fig. 3).

Fig. 3
figure 3

Diagnostic performance of 18F-FDG PET and PET/CT for osteosarcoma lung metastasis: (1) Pooled sensitivity (2) Pooled specificity (3) Pooled diagnostic odds ratio (4) Summary receiver operating characteristic curve (sROC) with the Q*-index

Bone metastases

Results assessing the diagnostic performance of 18F-FDG PET and PET/CT for detecting bone metastasis of osteosarcoma as generated from the 6 studies [22, 29, 31, 33, 34, 36], demonstrated that the pooled sensitivity of 93% (95% CI of 87–97%), a specificity of 97% (95% CI of 96–98%), a PLR of 9.81 (95% CI of 2.73–35.29), a NLR of 0.08 (95% CI of 0.04–0.18), a DOR of 174.19 (95% CI of 38.37–790.76), the Q*-index of 0.9397 and the AUC of 0.9813 (Fig. 4). No significance of heterogeneity between study for the pooled sensitivity (I-square = 33.8%) and the threshold effect (p-value = 0.8279) were presented in this analysis.

Fig. 4
figure 4

Diagnostic performance of 18F-FDG PET and PET/CT for osteosarcoma bone metastasis: (1) Pooled sensitivity (2) Pooled specificity (3) Pooled diagnostic odds ratio (4) Summary receiver operating characteristic curve (sROC) with the Q*-index

All distant metastases

In total, 11 studies [14, 17, 22, 28,29,30,31, 33, 34, 36, 38] addressed the diagnosis of all distant metastases. There were no threshold effects in this data (p-value = 0.647). The combined sensitivity of 18F-FDG PET and PET/CT in detecting all distant metastases of osteosarcoma was 90% (95% CI of 86–93%), specificity 96% (95% CI of 95–97%), PLR 13.81 (95% CI of 5.77–33.06) and NLR 0.13 (95% CI of 0.07–0.23). Statistically heterogeneity was no found in the estimates of the DOR (I-square = 49.8%). The pooled DOR was 125.67 (95% CI of 48.54–325.35). The Q*-index was 0.9103 with an AUC of 0.9639 (Fig. 5). To explore whether the injected dose of 18F-FDG could affect the diagnostic accuracy of all distant metastases, we performed a subgroup analysis which demostrated that 18F-FDG PET/CT, using a injected dose above 5 MBq/kg, is similar to that using a lower dose (sensitivity of 88% [95% CI 76–95%] vs. 88% [95% CI 78–95%]; specificity of 93% [95% CI 97–99%] vs. 98% [95% CI 97–99%]).

Fig. 5
figure 5

Diagnostic performance of 18F-FDG PET and PET/CT for all distant metastases of osteosarcoma: (1) Pooled sensitivity (2) Pooled specificity (3) Pooled diagnostic odds ratio (4) Summary receiver operating characteristic curve (sROC) with the Q*-index

Discussion

Osteogenic sarcoma has an elevated rate of glycolysis and, consequently, a high uptake of 18F-FDG in malignant cells [39] . A previous meta-analysis [40] revealed that the standardized uptake values before (SUV1) and after (SUV2) chemotherapy in osteosarcoma were associated with the histological response. SUV2:1< 0.5 or SUV2< 2.5 had predictive significance for tumor necrosis. Multiple studies investigated the diagnostic value of 18F-FDG PET and PET/CT for osteosarcoma, but no definitive results were obtained. To settle these questions scientifically, in the present study, a rigorous inclusion criterion was predesigned to collect published articles as comprehensive as possible and subgroup analyses were conducted to pool outcome estimates from individual studies only utilizing the random-effect modeling. Importantly, this is the first meta-analysis evaluating the diagnostic utility of 18F-FDG PET/CT in osteosarcoma. By systemically analyzing the retrieved data, investigations demonstrated that 18F-FDG PET/CT had an excellent accuracy in the diagnosis, staging and restaging of osteosarcoma.

The value of PET in the differential diagnosis of bone tumor and tumor-like lesions was first described by Schulte et al.(2000) [15]. Using a tumor-to-background ratio of 3.0 as the cutoff for determining malignant bone lesions, the sensitivity, specificity and accuracy were 0.93, 0.667 and 0.817, respectively. Although there were several false negative cases, there were none for patients with osteosarcoma (n = 44). A subsequent meta-analysis [41] also reported the outstanding ability of 18F-FDG PET to distinguish between benign and malignant bone and soft tissue tumors. In studies included in the present meta-analysis, all primary osteosarcoma lesions (n = 243) were 18F-FDG-avid with a good pooled sensitivity of 100%. However, it must be noted, that as a nonspecific manifestation on 18F-FDG PET, osteosarcoma is not distinctly discernible from other highly malignant bone sarcomas, such as Ewing sarcoma [42].

Detecting recurrent or residual osteosarcoma has been a challenge for clinicians due to the post-therapeutic changes and imaging artifacts caused by metallic endoprothesis [43]. In 1996, Garcia and co-workers reported the diagnostic value of 18F-FDG PET in 48 suspected recurrent musculoskeletal sarcomas (including 18 osteosarcoma patients), indicating a good sensitivity and specificity (98 and 90%, respectively) [44]. 18F-FDG PET has an advantage in assessing local recurrence because it is not affected by the imaging artifact. Nevertheless, elevated 18F-FDG uptake could be affected by post-treatment changes, irrespective of local recurrence [45, 46]. In this investigation, a good accuracy was observed of 18F-FDG PET in detecting recurrent osteosarcoma (local relapses or distant metastases), which are similar to the conclusions for other recurrent malignant tumors [47, 48].

Although osteosarcoma has a tendency to metastasize early, which would modify the outcome of osteosarcoma with an unfavorable survival, the prognosis of patients with resectable metastatic lesions is relatively good [49,50,51]. 18F-FDG PET and PET/CT are useful tools for detecting possible malignant lesions. Therefore, a subgroup analysis was performed on the metastasis-based data in the presented study.

Lung metastasis ranks as the leading cause of osteosarcoma-related death [2]. Schulte et al. (1999) [14] first reported the diagnostic performance of PET for detecting osteosarcoma lung metastasis. In the presented investigation a total of 27 scans included 4 true positive and 23 true negative cases. However, Iagaru et al. (2006) [20] of 106 bone and soft tissue sarcomas suggested that PET-false positive cases were significantly increased in sub-centimeter lung metastases. Cistaro et al. (2012) [28] published data on 18 participants (11 osteosarcomas), who underwent 31 PET/CT scans where it was demonstrated that there was no significant advantage of the SUVmax or SUVratio in the evaluation of lung nodules of <6 mm. This may explain the lower sensitivity of 18F-FDG PET and PET/CT in the diagnosis of lung metastases compared to bone metastases in the present meta-analysis. Compared to those who only have lung metastasis, the survival for patients with osteosarcoma who have bone metastasis is even poorer [5]. Because there are many possible metastatic bones in osteosarcoma, whole-body screening is indispensable. In the diagnosis of bone metastases, our investigations revealed a remarkable sensitivity, specificity and Q*-index (0.93, 0.97 and 0.9397, respectively) for 18F-FDG PET and PET/CT. Bone scintigraphy is another commonly used functional modality, while previous studies have shown that 18F-FDG PET and PET/CT are superior to bone scintigraphy in detecting bone metastases of osseous sarcoma, including osteosarcoma [34]. Other than metastasis in the lungs and bones, osteosarcoma could metastasize to other locations such as the lymph nodes and soft tissue. Because these cases are rare, only distant metastases as a whole were analyzed. The present analysis further demonstrated a diagnostic accuracy of 18F-FDG uptake in metastatic lesions with an AUC of 0.9639 and *Q of 0.9103.

Misdiagnoses of 18F-FDG PET and PET/CT are attributable to multiple reasons. Three included studies [34,35,36] showed higher FN and FP than the others because one study [35] detected local recurrence in patients with extremity osteosarcoma treated with surgical resection and endoprosthetic replacement and another two studies [34, 36] involved newly diagnosed with high grade osteosarcoma or initial staging osteosarcoma patients. Increased 18F-FDG uptake is not pathognomonic for malignancy. Some benign lesions, such as giant cell tumors of the bones, inflammatory disease and fractures are presented with a high level of tracer accumulation [6]. Among these, inflammatory disease is the most commonly encountered causes of a false positive. Therefore, the findings of 18F-FDG PET/CT for malignant lesions should be confirmed with a histopathology examination or follow-up. Meanwhile, some false negative cases are unavoidable. One cause is the nonspecific 18F-FDG uptake in malignant diseases and asymmetric 18F-FDG distribution. Second, because of the limited spatial resolution of 18F-FDG PET, some occult or sub-centimeter lesions could not be identified. Currently, the high cost of 18F-FDG PET and PET/CT discourages their use in developing countries [52,53,54]. However, considering their satisfactory performance, as demonstrated by the present investigation, and the poor prognosis of osteosarcoma, especially for those with recurrence or distant metastases, patients may benefit from evaluation with 18F-FDG PET or PET/CT.

Some limitations of this meta-analysis merit considerations. First, the present analysis was a meta-analysis and systemic review; therefore, we were only able to analyze questions on the study level instead of on the patient level. Second, as a result of a lack of publications assessing the effectiveness of 18F-FDG PET and PET/CT for detecting osteosarcoma, several sub-group analyses in our investigation were performed on a small number of studies. Third, there was methodological variability in the included studies, such as in the SUV, reference standards tests and duration of follow-up. Furthermore, some studies were assessed using 18F-FDG PET, while others were assessed with PET/CT. Finally, some information was not available in the included studies.

Conclusions

In summary, our comprehensive meta-analysis of publications demonstrated that 18F-FDG PET/CT have an excellent accuracy in the diagnosis, staging and recurrence monitoring of osteosarcoma. 18F-FDG-avid lesions should be further examined in osteosarcoma, especially for suspicious lung lesions. To support the current conclusions, larger-scale trials should now be conducted.