Abstract
Nuclear imaging is used in the evaluation of painful arthroplasties for diagnosing loosening or periprosthetic joint infection (PJI). The purpose of this systematic review is to depict the reproducibility of the various nuclear imaging modalities used in the assessment of painful hip and knee arthroplasties. A systematic review of the literature was conducted with a comprehensive search of MEDLINE to identify clinical studies investigating the intra-and inter-observer agreement of nuclear imaging modalities in diagnosing PJI and mechanical loosening. A total of 3.000 studies, published between 2004 and 2020, were reviewed and 16 studies met the inclusion criteria. Quality assessment identified concerns with regard to the internal validity in approximately 40% of the included studies. A meta-analysis could not be performed because of insufficient available clinical data. The intra-observer agreement was poorly investigated. The included studies reported an inter-observer agreement of “slight” to “excellent” for bone scintigraphy, “moderate” to “substantial” for leukocyte scintigraphy, “substantial” to “almost perfect” for combined leukocyte and bone marrow scintigraphy, and “fair” to “substantial” for anti-granulocyte scintigraphy. Hybrid imaging with SPECT/CT and FDG-PET/CT demonstrated “substantial” and “almost perfect” inter-observer agreement for symptomatic hip prostheses, respectively. The reproducibility of nuclear imaging is underreported in clinical studies investigating painful knee and hip arthroplasties. Moreover, the included studies demonstrated methodological concerns with a high risk of bias. The available evidence demonstrated a wide range of inter-observer agreement using scintigraphy. Hybrid imaging with SPECT/CT and FDG-PET/CT may improve the accuracy of interpretation and reproducibility. However, literature provides limited data to support this assumption.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
The incidence of complications after total hip and knee replacements is low, but due to the increased frequency of implantations, complications are commonly encountered and require an appropriate diagnostic algorithm. Among the variety of complications (including heterotopic ossification, pseudotumor, inflammatory disorders, hardware failure, and foreign body granulomatosis causing osteolysis), the two major complications are mechanical (aseptic) loosening and periprosthetic joint infection (PJI). PJI is considered one of the most devastating prostheses-related complications. However, reliable differentiation between PJI and aseptic loosening is often difficult since both may be accompanied by similar symptoms.
Currently, there is no diagnostic algorithm and despite multiple diagnostic tests, diagnosing PJI and differentiation from mechanical loosening remains challenging. Although current guidelines do not recommend the routine use of nuclear imaging [1, 2], a variety of radionuclide imaging modalities can contribute to the evaluation of suspected PJI or loosening. These modalities include bone scintigraphy (BS), leukocyte scintigraphy (LS), anti-granulocyte scintigraphy (AGS), fluorodeoxyglucose positron emission tomography (FDG-PET), single-photon emission computed tomography (SPECT), and combined techniques [3,4,5]. In the assessment of PJI of the hip, a meta-analysis reported pooled sensitivity and specificity of 80–69%, respectively, for BS, 88–92% for LS, 69–96% for combined LS-bone marrow scintigraphy (LS-BMS), 84–75% for AGS, and 86–93% for FDG-PET/CT [4]. For PJI of the knee, the reported pooled sensitivity and specificity were 93–56%, respectively, for BS, 88–77% for LS, 80–93% for LS-BMS, 90–95% for AGS, and 70–84% for FDG-PET/CT [5]. For aseptic loosening of hip arthroplasties, the reported pooled sensitivity and specificity were 85–72%, respectively, for BS [6]. In one clinical study which investigated symptomatic hip arthroplasties with SPECT/CT, a sensitivity of 94% and a specificity of 100% were reported [7]. In the evaluation of aseptic loosening of knee arthroplasties, the reported range of sensitivity and specificity was 76–100% and 33–100%, respectively, for BS, 56–100% and 56–100% for FDG-PET/CT, and 86–100% and 63–93% for SPECT/CT [8].
The reported diagnostic accuracies of the nuclear imaging modalities in the assessment of symptomatic arthroplasties reflect the agreement of the interpretation of the scan (index test) with the reference standard However, understanding the variables which are of influence to the interpretation of a scan is important for the outcome of (false) positive or (false) negative results. These variables are related to several domains such as the index test, the reference standard, and observer-related factors. Important variables related to the subject (the painful arthroplasty) are the time interval between imaging and surgery, location and type of implant, and severity of the infection. For example, the interpretation of a low grade PJI could be more challenging for the observer and lead to more false negative results compared with diagnosing an acute PJI. The type of imaging modality (index test), the performed imaging protocol, and the used diagnostic criteria could also affect the interpretation and outcome. For example, a variety of diagnostic criteria was reported for various nuclear imaging modalities which could lead to different diagnostic accuracy outcomes regarding the investigated imaging modality [4, 5]. Finally, in the interpretation of the scans, the background of the observer such as the level of expertise and experience may also affect the diagnostic outcome. These factors not only could affect the reported diagnostic accuracy but also may introduce variations in intra- and inter-observer agreement. These variations are important and potential compromising factors for the reproducibility and therefore the clinical application of the imaging modalities in the assessment of symptomatic arthroplasties.
The (statistical) reliability of an imaging modality is defined by the degree of agreement when different observers use the same imaging technique, classification, and procedure to assess the same subject. Each of the nuclear imaging modalities is characterized by specific benefits and drawbacks in diagnosing PJI or aseptic loosening. For example, scintigraphy is potentially hampered by non-specific tracer uptake which could lead to unsatisfactory results of inter-observer reliability. In literature, it has been suggested that the interpretation of nuclear imaging could be improved by more recently introduced hybrid techniques as SPECT/CT and PET/CT [9, 10]. For other pathologies, including infectious bone diseases, the improved diagnostic accuracy and reproducibility of hybrid imaging have been well established [11]. However, in the assessment of symptomatic arthroplasties, this improvement in terms of reproducibility has not been well defined in literature.
The level of reproducibility and variables that could influence the individual interpretation of images is not always clear and therefore a suitable subject for systematically literature investigation. To the best of our knowledge, no systematic review that investigated this topic has been published before. The objective of this study is to systematically review the reproducibility of currently used nuclear imaging modalities in the evaluation of symptomatic hip and knee arthroplasties.
Materials and Methods
Search Criteria
The imaging modalities that were reviewed for the assessment of painful arthroplasties were planar scintigraphy, PET, SPECT, and combined techniques.
Search Strategy
A computer-aided search of the PubMed databases was conducted in June 2019 and updated in April 2020. The search was restricted to primary studies that were written in English. For each database, a specific search strategy was developed (Fig. 1) with an informatics specialist. Reference lists of the identified studies and relevant reviews were hand-searched for supplementary eligible studies. The search was performed according to the PRISMA guideline (Preferred Reporting Items for Systematic reviews and Meta-Analyses) Statement [12].
Study Selection
The following inclusion criteria were used to identify eligible, clinical studies: (1) planar scintigraphy, SPECT, or PET had been used to identify suspected PJI or loosening of hip and/or knee arthroplasties. (2) The study used a valid reference standard for PJI (criteria by the Musculoskeletal Infection Society and/or the criteria of Infectious Diseases Society of America) [1, 13] or aseptic loosening with preoperative findings (3), a clinical follow-up interval of at least 6 months when surgery was not performed. (3) The study reported the calculated intra- and inter-observer agreement with multiple observers. Exclusion criteria were (1) nonhuman studies, (2) non-English language studies, or (3) studies in which differentiation between osteomyelitis without arthroplasties and PJI was not possible. There was no restriction regarding the date of publication.
The titles were screened for eligibility by 1 reviewer (S.V.) and were then processed for abstract assessment. The titles and abstracts were independently screened and assessed for eligibility in an unblinded standardized manner by 2 reviewers (S.V. and J.K.) Studies considered to be of dubious eligibility were rejected. The final decision on inclusion was made on the basis of the full text of the article.
Methodological Quality Assessment
The criteria list of QUADAS2 for evaluating internal and external validity of diagnostic studies recommended by the Cochrane Methods Group on Systematic Review of Screening and Diagnostic Tests was used for grading the methodological quality of the selected studies [14]. Evaluation was performed by the two reviewers (S.V. and J.K.) independently. The internal and external validity were evaluated and the criteria of QUADAS2 were used for the determination of the methodological limitations for descriptive purposes. All studies that met the inclusion criteria were included and, a priori, no studies were excluded from the systematic review on the basis of quality. A quantitative analysis could not be performed due to insufficient available data.
Results
Included Studies
The search strategy identified 2.994 studies from MEDLINE. There was no overlap found between the retrieved studies. No additional studies were identified through other sources. Of the initial 2,994 studies, 2,761 were excluded after analyzing the information provided in the title and abstract. The full articles of the remaining 233 studies were reviewed for eligibility. Six studies were extracted from the reference lists of these studies (n = 239). There was no disagreement between the reviewers regarding the definitive inclusion of the studies. The main reason for exclusion was the absence of intra- and inter-observer agreement calculation in the use of imaging for diagnosing PJI or aseptic loosening (n = 179). Other reasons for exclusion were the absence of a valid reference standard (n = 5), no differentiation between osteomyelitis and periprosthetic complications (n = 3), or absence of symptomatic arthroplasties inclusion (n = 36). Meta-analysis was not performed due to insufficient available clinical data. In general, no differentiation a priori was made for the type of implant, the interpretation criteria used for the index test, or the time period between surgery and imaging (Table 1).
Description of Study Characteristics
A total of 16 studies, published between 2004 and 2020, were included for systematic review. Two studies reported the inter-observer agreement for nuclear imaging in the assessment of aseptic loosening of hip and knee arthroplasties [17, 18]. In 4 studies, the results of reproducibility were reported without differentiation between PJI and aseptic loosening using bone scintigraphy [7, 19,20,21]. Of those studies, one reported the reproducibility of bone scintigraphy combined with SPECT/CT [7]. The intra- or inter-observer agreement of nuclear imaging in the assessment of PJI was investigated by 10 studies. Of those studies, 2 studies used leukocyte scintigraphy [22, 23], 1 study used combined leukocyte and bone scintigraphy [24], 3 studies used combined leukocyte and bone marrow scintigraphy [24,25,26], 3 studies used anti-granulocyte scintigraphy [15, 16, 27], 1 study used combined anti-granulocyte and bone marrow scintigraphy [27], and 3 studies used FDG-PET/CT [19, 28, 29]. The characteristics of the studies are detailed in Table 2.
Qualitative Analysis
The external validity showed high concerns with regard to applicability to clinical practice in approximately 40% of the included studies for the domain “patient selection.” The internal validity of the included studies showed more concerns with regard to the risk of bias. Approximately 40% of the included studies did not provide sufficient information regarding the domains “index test” and “patient selection” (Fig. 2) More than 70% of the studies did not provide detailed data regarding the domain “flow and timing.” The details of the internal and external validity are shown in Fig. 3.
Bone Scintigraphy and Combined Techniques
Of the 6 studies that investigated the use of bone scintigraphy, one study reported “excellent” (ICC = 0.89) intra-observer agreement of two observers in the assessment of the symptomatic hip [21]. In the assessment of aseptic loosening of THA, a “fair” (ICC = 0.43) and “good” (ICC = 0.67) inter-observer agreement was reported for the acetabular and femoral component, respectively [17, 18]. No significant differences between the cemented and uncemented prostheses was found. In 4 studies, the reproducibility was reported without differentiation between PJI and aseptic loosening. For the symptomatic hip, two studies reported a kappa value for inter-observer agreement of 0.11 (slight) [19] and an ICC of 0.81 (excellent) [21]. One study did not differentiate between hip and knee replacements and reported a kappa of 0.92, 0.81, and 0.80 for the blood flow, blood pool, and bone phase, respectively (overall kappa was not reported) [20]. One study combined bone scintigraphy with SPECT and reported a “fair” (κ = 0.27) and “moderate” (κ = 0.60) inter-observer agreement in the assessment of THA and TKA respectively [7]. The same study found a “moderate” (κ = 0.58) and “substantial” (κ = 0.80) inter-observer agreement when bone scintigraphy was combined with SPECT/CT in the assessment of THA and TKA respectively.
Leukocyte Scintigraphy and Combined Techniques
One study [23] reported the inter-observer agreement of leukocyte scintigraphy in diagnosing PJI and reported “moderate” to “substantial” as a result of interpretation by three observers (κ = 0.55, 0.60, and 0.74). There was no significant difference of concordance in 24% (n = 55) and 33% (n = 40) of the hip and knee group, respectively. Combined leukocyte and bone marrow scintigraphy demonstrated in one study “almost perfect” inter-observer agreement of three pairs of observers in the assessment of TKA and THA together [24]. For the assessment of hip arthroplasty, a kappa value of 0.74 (substantial) was reported by another study [26]. None of the 5 studies that investigated the use of leukocyte scintigraphy reported the intra-observer agreement.
Anti-granulocyte Scintigraphy and Combined Techniques
The 3 studies [15, 16, 27] that investigated the use of anti-granulocyte scintigraphy in the evaluation of PJI reported “excellent” inter-observer agreement for TKA/UKA (κ = > 0.75), “fair to moderate” for THA/TKA together (κ = 0.31–0.52), and “fair to excellent” agreement depending on the grade of infection (mild to severe). One study [27] reported the anti-granulocyte scintigraphy combined with bone-marrow scintigraphy and reported kappa values of 0.55–0.72 without differentiation between THA and TKA. None of the 3 studies reported the intra-observer agreement of anti-granulocyte scintigraphy in the assessment of PJI.
FDG-Pet/Ct
Of the 3 studies that investigated the use of PET/CT in the evaluation of PJI, two studies reported the inter-observer agreement of the tracer 18F-FDG. Stumpe et al. reported “moderate” inter-observer agreement with (κ = 0.47) [19]. Verberne et al. reported for the inter-observer agreement for 4 different criteria for PJI with kappa values ranging from 0.77–0.85 [29]. One study reported “substantial” (κ = 0.67) inter-observer agreement when 18F-PET/CT was combined with 18Fluoride. None of the studies reported the intra-observer agreement for the use of PET/CT in the assessment of PJI.
Discussion
Various nuclear imaging modalities can contribute in the challenging diagnosis and differentiation between PJI and mechanical loosening of hip and knee replacements. The reproducibility of these imaging modalities is an important factor for the clinical usability and could concern a major limitation in the evaluation of symptomatic joint replacements and will be discussed in the section below.
Bone Scintigraphy and SPECT/CT
Bone scintigraphy is known as a highly sensitive imaging modality for diagnosing aseptic loosening. This modality is widely available and can exclude any form of loosening. 99mTc-labeled bisphosphonates are the most commonly used radiopharmaceuticals to depict osteoblastic activity. In patients with mechanical loosening of the hip or knee replacement, negative perfusion and blood pool but positive osseous phases are often interpreted as loosening. Throughout literature, one investigation group [17, 18] reported the diagnostic accuracy of bone scintigraphy for the acetabulum and femoral component of THA and reported “fair” and “good” inter-observer agreement, respectively. No substantial differences between cemented and uncemented prostheses were found. However, the results of these studies have not yet been confirmed by other studies. None of the included studies reported the reproducibility of bone scintigraphy for diagnosing mechanical loosening of TKA.
In the assessment of PJI, the increased osteoblastic activity on bone scintigraphy results in a highly sensitive modality. However, in the first years after implantation, specificity is dramatically lower and differentiation between infection and mechanical loosening (1–2 years for hip prostheses, 4–5 years after knee prostheses) is not accurate [4, 5]. A bone scan is defined as positive for infection when there is uptake in all three phases (positive perfusion, blood pool, and osseous phase), and this criterion was used by all four studies that investigated bone scintigraphy in the assessment of painful prostheses. However, a wide variety in the level of reproducibility was reported (“slight” to “almost perfect” inter-observer agreement) [19,20,21] This wide range of observer agreement may be explained by the low-resolution images of scintigraphy which could lead to unsatisfactorily results of inter-observer reliability.
The introduction of hybrid imaging with SPECT/CT could improve diagnostic accuracy and reproducibility of the low-resolution scintigraphy. Adding only SPECT may not improve the reproducibility of bone scintigraphy alone [7]. This may be the result of the missing CT portion, which results in less anatomical relation and therefore less accurate interpretation of the images. For other pathologies, it has been reported that adding SPECT/CT to planar images results in higher diagnostic accuracy because of better resolution, direct (anatomic) correlation of functional and morphological abnormalities, and better distinguishing of bone infection from soft tissue infections through improved assessment of the extent of the infection [11]. A previous study showed that when SPECT/CT was added, inter-observer agreement regarding the location of infectious foci increased from κ = 0.68 to κ = 1.0 in in the assessment of osteomyelitis [11]. In agreement with the first clinical studies, 4 studies demonstrated “excellent” intra- and inter-observer agreement when SPECT/CT was added in the interpretation of symptomatic hip and knee arthroplasties [30,31,32,33]. However, these studies did not correlate the nuclear imaging results with surgical findings or clinical follow-up whether the prosthesis was loose or not. Because of this major methodological concern, no conclusions can be drawn regarding the clinical applicability and were therefore excluded from this review. In contrary to these studies, the one study that did correlate the SPECT/CT findings with a valid reference standard reported “moderate” to “substantial” agreement for the hip (κ = 0.58) and knee (κ = 0.80) [7]. Unfortunately, these findings are not yet confirmed by other studies. Although it is plausible that adding SPECT/CT improves the reliability of the interpretation of scintigraphy in the assessment of symptomatic joint replacement, there is limited clinical data in literature to support this assumption. The clinical impact of SPECT/CT and its improvement in terms of reproducibility would be a suitable subject for further investigation in the field of symptomatic arthroplasties.
Leukocyte Scintigraphy and Combined Techniques
Leukocyte scintigraphy has a long history in the evaluation of PJI as it can be used as a more specific imaging modality in order to differentiate between aseptic loosening and PJI. When the appropriate criteria are used, leukocyte scintigraphy may be a specific and accurate imaging modality for diagnosing PJI [4, 5]. Only one study reported “moderate” to “substantial” inter-observer agreement of three different pairs of readers (κ = 0.55–0.74). In 26 (27.4%) of the 95 investigated knee and hip arthroplasties, there was no concordance between the three observers. There were differences in concordance between the hip (76%) and knee (67%) group, however, not significant. The diagnostic accuracy of leukocyte scintigraphy is potentially hampered by the fact that labeled leukocytes not only accumulate in infections but also accumulate physiologically in the bone marrow. In order to reduce the number of false-positive results, leukocyte scintigraphy is often combined with bone marrow scintigraphy, which has been proposed as the imaging modality of choice for diagnosing PJI [34]. Besides the improved sensitivity and specificity compared with leukocyte scintigraphy alone, this combined technique demonstrated “substantial” to “almost perfect” inter-observer agreement in two studies [24, 26]. An explanation for the good reproducibility of LS-BMS scans may lie in the fact that comparison of images improve interpretation, which was also found by Sousa et al. [27] It is important to note that different diagnostic accuracies were found between knee and hip arthroplasties in previous studies [4, 5]. The one study that investigated the reproducibility of THA alone reported “moderate” inter-observer agreement [26], while “almost perfect” concordance was found in another study with a combined hip and knee group [24]. For further studies, it is important to separately analyze the reproducibility of both arthroplasties.
Anti-granulocyte Scintigraphy and Combined Techniques
Anti-granulocyte scintigraphy was introduced as an alternative for leukocyte scintigraphy with the advantage of in vivo labeling of leukocytes and was proposed as a promising diagnostic tool for the detection of PJI. Image acquisition and interpretation are the same as for leukocyte scintigraphy. In recent meta-analyses, less satisfying diagnostic accuracies of anti-granulocyte scintigraphy in the assessment of THA compared with TKA were found [4, 5]. The included studies in this review did not report inter-observer agreement in the assessment of THA alone. For TKA, a wide variety of kappa values between “fair” to “substantial” agreement (κ = 0.31 to > 0.75) was reported. Interestingly, for the combined knee and hip group [27], only “fair” to “moderate” inter-observer agreement was found, while another study reported “excellent” reproducibility (κ = > 0.75) for TKA alone [16]. Hence, one may suggest that variation in diagnostic accuracy outcomes between the knee and hip arthroplasties also applies to the level of reproducibility since interpretation is less accurate for THA. However, there is no available evidence to support this assumption because symptomatic THA has not been separately analyzed. Another important result was found in a study that reported important differences between the inter-observer agreement of mild PJI (κ = 0.25) versus severe PJI (κ = > 0.75) [15]. This could implicate that the degree of infection directly influences the interpretation and reproducibility of the anti-granulocyte scintigraphy scan in the assessment of PJI of the knee. Unfortunately, none of the other included studies differentiated in the degree of infection. In accordance with combined leukocyte and bone marrow scintigraphy, one study found higher levels of inter-observer agreement for combined anti-granulocyte scintigraphy and bone marrow scintigraphy compared with anti-granulocyte scintigraphy alone [27]. This supports the conclusion that the combination of these imaging modalities may improve interpretation and reproducibility.
FDG-PET/CT
FDG-PET has assumed considerable potential in the assessment of periprosthetic infection. Important advantages of FDG-PET/CT, compared with planar scintigraphy, are its increased resolution, anatomic correlation with coregistered CT, and availability of simple visual interpretation criteria of images. In the assessment of PJI of the hip, in contrary to PJI of the knee [5], FDG-PET/CT is considered a promising imaging modality with assumed simple visual interpretation of uptake patterns. However, the visual interpretation of uptake patterns could potentially be hampered by inter-observer variation in the interpretation of imaging. One of the first studies on clinical application of FDG-PET/CT stated that the simplicity of the diagnostic criteria should reduce potential inter-observer variation and increase the accuracy of the scan [10]. Throughout literature, however, this has not been investigated in literature. Moreover, a variety of diagnostic criteria has been used. Based on the best available evidence, it is assumed that extended uptake at the middle portion of the femoral bone-prosthesis interface is the most accurate criterion for PJI [29]. This criterion demonstrated “almost perfect” inter-observer agreement (κ = 0.85) in one study [29]. An alternative criterion, increased uptake along the bone-prothesis interface compared with the physiological uptake in the bladder, demonstrated only “moderate” inter-observer agreement [19]. Unfortunately, the literature search revealed no additional clinical data to support these findings and more clinical investigation is needed in order to validate these results.
Limitations and Concerns
This systematic review depicts an overview of the currently available literature on the reproducibility of the various nuclear imaging modalities in the assessment of painful arthroplasties. This review demonstrated that the level of evidence is hampered by serious limitations.
There are several concerns regarding the methodological quality of the included studies. Most studies investigated the diagnostic accuracy and intra- and inter-observer agreement was not the main research question. Ideally, the degree of agreement is investigated between multiple and different observers using the same imaging technique, classification, and procedure to assess the same subject. An important factor of influence to the reproducibility of the images is the level of experience of the observers. Interpretation of images by nuclear medicine physicians with or without experience in musculoskeletal imaging and the level of experience may directly influence the diagnostic accuracy. However, the level of experience and expertise was poorly or not described by 12 of the 16 included studies. Moreover, the number of observers is important to calculate the inter-observer agreement properly and avoid the possibility of agreement occurring by chance. Only 5 of the 16 included studies reported the results of more than 2 readers (Tables 3, 4, 5, 6, 7, 8, 9, and 10).
Recent meta-analyses demonstrated different diagnostic accuracies of nuclear imaging between hip and knee arthroplasties. It is assumed that the inter-observer agreement could vary between the location of the prostheses, due to different uptake patterns and physiology [4, 5, 21]. In the interpretation of nuclear images, the type of implant (knee or hip) could be of influence to reproducibility, as demonstrated by several studies [7, 16, 27]. Therefore, the reported reproducibility concerning both arthroplasties without differentiation of type and location should be interpreted with caution. Unfortunately, 5 of the included studies did not differentiate between hip and knee arthroplasties. The degree of infection (chronic versus acute PJI) and the time between the implantation (last surgery) and performed imaging are also important factors in the interpretation of scans and were not described in most studies.
In the field of nuclear orthopedic imaging for painful prosthesis, concerns regarding the methodological quality and level of evidence have been previously been described [35]. This systematic review depicts that the interpretation and reproducibility of nuclear imaging in the assessment of symptomatic joint replacements, an important aspect of clinical usability, have thus far been underreported. Moreover, it is important to note in general that most studies reported the consensus results between two readers, and therefore, analyzing the intra- and inter-observer agreement was not possible. Further investigation should report implant characteristics, time interval between imaging and surgery, and degree of infection, and separately analyze hip and knee arthroplasties.
Key Points of the Best Available Evidence According to This Systematic Review
Literature provides insufficient data regarding for intra-observer agreement when using nuclear imaging in the assessment of symptomatic joint replacements. Bone scintigraphy demonstrated a wide range of inter-observer agreement. The comparison of images, as for combined leukocyte and bone marrow scintigraphy, could improve the level of inter-observer agreement. The infection grade (mild versus severe) could directly influence the accuracy of interpretation. The interpretation and level of reproducibility for nuclear imaging could be different between hip and knee arthroplasties. Although it is plausible that hybrid imaging improves interpretation for symptomatic joint replacements and could improve the level of inter-observer agreement, there is limited data to support this assumption. The results of this systematic review should be interpreted with caution because of methodological concerns of the included studies with high risk of bias (Tables 11 and 12). Important variables that may influence the level of reproducibility as type of implant, time interval between imaging and surgery, level of expertise of the observers, and degree of infection were underreported. Further studies investigating the diagnostic performances of nuclear imaging in symptomatic orthopedic implants should report intra- and inter-observer agreement and separately analyze hip and knee arthroplasties.
References
Osmon DR, Berbari EF, Berendt AR, et al. Diagnosis and management of prosthetic joint infection: clinical practice guidelines by the Infectious Diseases Society of America. Clin Infect Dis. 2013;56(1):e1–e25. https://doi.org/10.1093/cid/cis803.
Parvizi J, Tan TL, Goswami K, et al. The 2018 definition of periprosthetic hip and knee infection: an evidence-based and validated criteria. J Arthroplast. 2018;33(5):1309–1314.e2. (In eng). https://doi.org/10.1016/j.arth.2018.02.078.
Verberne S, Temmerman O. Imaging of prosthetic joint infections. Management of Periprosthetic Joint Infections (PJIs) 2016:259.
Verberne SJ, Raijmakers PG, Temmerman OP. The accuracy of imaging techniques in the assessment of periprosthetic hip infection: a systematic review and meta-analysis. J Bone Joint Surg Am. 2016;98(19):1638–45. (In eng). https://doi.org/10.2106/jbjs.15.00898.
Verberne SJ, Sonnega RJ, Temmerman OP, Raijmakers PG. What is the accuracy of nuclear imaging in the assessment of periprosthetic knee infection? A Meta-analysis Clin Orthop Relat Res. 2017;475(5):1395–410. (In eng). https://doi.org/10.1007/s11999-016-5218-0.
Temmerman OP, Raijmakers PG, Berkhof J, Hoekstra OS, Teule GJ, Heyligers IC. Accuracy of diagnostic imaging techniques in the diagnosis of aseptic loosening of the femoral component of a hip prosthesis: a meta-analysis. J Bone Joint Surg (Br). 2005;87(6):781–5. https://doi.org/10.1302/0301-620X.87B6.15625.
Arican P, Okudan Tekin B, Sefizade R, Naldoken S, Bastug A, Ozkurt B. The role of bone SPECT/CT in the evaluation of painful joint prostheses. Nucl Med Commun. 2015;36(9):931–40. (In eng). https://doi.org/10.1097/mnm.0000000000000348.
Barnsley L, Barnsley L. Detection of aseptic loosening in total knee replacements: a systematic review and meta-analysis. Skelet Radiol. 2019;48(10):1565–72. https://doi.org/10.1007/s00256-019-03215-y.
Basu S, Kwee TC, Saboury B, Garino JP, Nelson CL, Zhuang H, et al. FDG PET for diagnosing infection in hip and knee prostheses: prospective study in 221 prostheses and subgroup comparison with combined (111)In-labeled leukocyte/(99m)Tc-sulfur colloid bone marrow imaging in 88 prostheses. Clin Nucl Med. 2014;39(7):609–15. https://doi.org/10.1097/RLU.0000000000000464.
Zhuang H, Duarte PS, Pourdehnad M, Maes A, van Acker F, Shnier D, et al. The promising role of 18F-FDG PET in detecting infected lower limb prosthesis implants. J Nucl Med. 2001;42(1):44–8 (In eng).
Horger M, Eschmann SM, Pfannenberg C, Storek D, Dammann F, Vonthein R, et al. The value of SPET/CT in chronic osteomyelitis. Eur J Nucl Med Mol Imaging. 2003;30(12):1665–73. (In eng). https://doi.org/10.1007/s00259-003-1321-z.
Moher D, Liberati A, Tetzlaff J, Altman DG. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med. 2009;6(7):e1000097. https://doi.org/10.1371/journal.pmed.1000097.
Parvizi J, Zmistowski B, Berbari EF, Bauer TW, Springer BD, Della Valle CJ, et al. New definition for periprosthetic joint infection: from the Workgroup of the Musculoskeletal Infection Society. Clin Orthop Relat Res. 2011;469(11):2992–4. (In Eng). https://doi.org/10.1007/s11999-011-2102-9.
Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36. https://doi.org/10.7326/0003-4819-155-8-201110180-00009.
Gratz S, Behr TM, Reize P, Pfestroff A, Kampen WU, Hoffken H. (99m)Tc-Fab' fragments (sulesomab) for imaging septically loosened total knee arthroplasty. The Journal of international medical research. 2009;37(1):54–67. (In eng). https://doi.org/10.1177/147323000903700107.
Gratz S, Reize P, Pfestroff A, Hoffken H. Intact versus fragmented 99mTc-monoclonal antibody imaging of infection in patients with septically loosened total knee arthroplasty. The Journal of international medical research. 2012;40(4):1335–42. (In eng). https://doi.org/10.1177/147323001204000412.
Temmerman OP, Raijmakers PG, David EF, et al. A comparison of radiographic and scintigraphic techniques to assess aseptic loosening of the acetabular component in a total hip replacement. J Bone Joint Surg Am. 2004;86-a(11):2456–63 (In eng).
Temmerman OP, Raijmakers PG, Berkhof J, et al. Diagnostic accuracy and interobserver variability of plain radiography, subtraction arthrography, nuclear arthrography, and bone scintigraphy in the assessment of aseptic femoral component loosening. Arch Orthop Trauma Surg. 2006;126(5):316–23. (In eng). https://doi.org/10.1007/s00402-006-0120-y.
Stumpe KD, Notzli HP, Zanetti M, et al. FDG PET for differentiation of infection and aseptic loosening in total hip replacements: comparison with conventional radiography and three-phase bone scintigraphy. Radiology. 2004;231(2):333–41. https://doi.org/10.1148/radiol.2312021596.
Granados U, Fuster D, Soriano A, et al. Screening with angiographic images prior to (99m)Tc-HMPAO labelled leukocyte scintigraphy in the diagnosis of periprosthetic infection. Rev Esp Med Nucl Imagen Mol. 2015;34(4):219–24. https://doi.org/10.1016/j.remn.2014.10.003.
Yoldas B, Cankaya D, Andic K, et al. Higher reliability of triple-phase bone scintigraphy in cementless total hip arthroplasty compared to cementless bipolar hemiarthroplasty. Ann Med Surg (2012). 2016;10:27–31. (In eng). https://doi.org/10.1016/j.amsu.2016.07.022.
Erba PA, Glaudemans AW, Veltman NC, et al. Image acquisition and interpretation criteria for 99mTc-HMPAO-labelled white blood cell scintigraphy: results of a multicentre study. Eur J Nucl Med Mol Imaging. 2014;41(4):615–23. (In eng). https://doi.org/10.1007/s00259-013-2631-4.
Pelosi E, Baiocco C, Pennone M, Migliaretti G, Varetto T, Maiello A, et al. 99mTc-HMPAO-leukocyte scintigraphy in patients with symptomatic total hip or knee arthroplasty: improved diagnostic accuracy by means of semiquantitative evaluation. J Nucl Med. 2004;45(3):438–44 (In eng).
El Espera I, Blondet C, Moullart V, et al. The usefulness of 99mTc sulfur colloid bone marrow scintigraphy combined with 111In leucocyte scintigraphy in prosthetic joint infection. Nucl Med Commun. 2004;25(2):171–5 (In eng).
Fuster D, Soriano A, Garcia S, Piera C, Suades J, Rodríguez D, et al. Usefulness of 99mTc-ciprofloxacin scintigraphy in the diagnosis of prosthetic joint infections. Nucl Med Commun. 2011;32(1):44–51. (In eng). https://doi.org/10.1097/MNM.0b013e328340e6fb.
Fuster D, Duch J, Soriano A, García S, Setoain X, Bori G, et al. Potential use of bone marrow scintigraphy in suspected prosthetic hip infection evaluated with 99mTc-HMPAO-leukocytes. Revista espanola de medicina nuclear. 2008;27(6):430–5 (In spa).
Sousa R, Massada M, Pereira A, Fontes F, Amorim I, Oliveira A. Diagnostic accuracy of combined 99mTc-sulesomab and 99mTc-nanocolloid bone marrow imaging in detecting prosthetic joint infection. Nucl Med Commun. 2011;32(9):834–9. (In eng). https://doi.org/10.1097/MNM.0b013e3283496695.
Kumar R, Kumar R, Kumar V, Malhotra R. Potential clinical implication of (18) F-FDG PET/CT in diagnosis of periprosthetic infection and its comparison with (18) F-fluoride PET/CT. Journal of medical imaging and radiation oncology. 2016;60(3):315–22. (In eng). https://doi.org/10.1111/1754-9485.12444.
Verberne SJ, Temmerman OPP, Vuong BH, Raijmakers PG. Fluorodeoxyglucose positron emission tomography imaging for diagnosing periprosthetic hip infection: the importance of diagnostic criteria. Int Orthop. 2018;42(9):2025–34. (In eng). https://doi.org/10.1007/s00264-018-3931-x.
Barthassat E, Afifi F, Konala P, Rasch H, Hirschmann MT. Evaluation of patients with painful total hip arthroplasty using combined single photon emission tomography and conventional computerized tomography (SPECT/CT) - a comparison of semi-quantitative versus 3D volumetric quantitative measurements. BMC Med Imaging. 2017;17(1):31. (In eng). https://doi.org/10.1186/s12880-017-0204-x.
Hirschmann MT, Iranpour F, Konala P, Kerner A, Rasch H, Cobb JP, et al. A novel standardized algorithm for evaluating patients with painful total knee arthroplasty using combined single photon emission tomography and conventional computerized tomography. Knee surgery, sports traumatology, arthroscopy : official journal of the ESSKA. 2010;18(7):939–44. (In eng). https://doi.org/10.1007/s00167-010-1070-z.
Rasch H, Falkowski AL, Forrer F, Henckel J, Hirschmann MT. 4D-SPECT/CT in orthopaedics: a new method of combined quantitative volumetric 3D analysis of SPECT/CT tracer uptake and component position measurements in patients after total knee arthroplasty. Skelet Radiol. 2013;42(9):1215–23. (In eng). https://doi.org/10.1007/s00256-013-1643-2.
Suter B, Testa E, Stampfli P, et al. A novel standardized algorithm using SPECT/CT evaluating unhappy patients after unicondylar knee arthroplasty--a combined analysis of tracer uptake distribution and component position. BMC Med Imaging. 2015;15:11. (In eng). https://doi.org/10.1186/s12880-015-0053-4.
Palestro CJ, Love C, Tronco GG, Tomas MB, Rini JN. Combined labeled leukocyte and technetium 99m sulfur colloid bone marrow imaging for diagnosing musculoskeletal infection. Radiographics. 2006;26(3):859–70. https://doi.org/10.1148/rg.263055139.
Glaudemans AW, Jutte PC, Petrosillo N, Erba PA, Lazzeri E, Signore A. Comment on: “diagnosis of periprosthetic joint infection: the role of nuclear medicine may be overestimated” by Claudio Diaz-Ledezma, Courtney Lamberton, Paul Lichtstein and Javad Parvizi. J Arthroplast. 2016;31(2):551–2. (In eng). https://doi.org/10.1016/j.arth.2015.07.002.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Verberne, S.J., Kuiper, J.W.P., Broos, W.A.M. et al. Systematic Review on Reproducibility of Nuclear Imaging in the Assessment of Painful Hip and Knee Replacements. SN Compr. Clin. Med. 2, 1491–1504 (2020). https://doi.org/10.1007/s42399-020-00450-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42399-020-00450-6