Introduction

Malignant bone tumors account for approximately 6% of all cancer cases under the age of 20 years [1, 2]. The most common types of malignant bone neoplasms affecting young patients are osteosarcomas and Ewing sarcomas, with relative frequencies of 50 % and 40 %, respectively. Chondrosarcomas can occur at any age, but 70 % of the affected patients are older than 40 years. While radical resection and amputation were the only available treatment options for these tumors until the 1970s, multimodal therapy, including (neo-)adjuvant chemotherapy, radiation therapy, and surgery, has become the reference standard today [3]. However, neoadjuvant chemotherapy has been found not to impact imaging results or resectability in a favorable manner and resection of the adjacent joint remains inevitable in most cases [1, 4]. The surgical approach is determined by the joint invasion (JI) status: whereas unaffected joints may be resected intraarticularly, the presence of JI necessitates extra-articular resection to prevent local recurrence, which is associated with a poor prognosis [5, 6]. However, extra-articular resection is technically demanding and may lead to unfavorable functional results [7, 8].

Magnetic resonance (MR) imaging is the standard imaging method for the local staging of malignant bone tumors [9, 10]. Providing excellent soft tissue contrast and three-dimensional anatomic information, extraosseous tumor growth and the proximity to critical structures, like vessels or nerves, can be assessed precisely [11]. Previous studies have also investigated the feasibility of JI assessment on MR images [10, 12,13,14]. Schima et al investigated epiphyseal involvement, presence of intrasynovial tumor tissue, invasion of the joint capsule, intraarticular destruction of cortical bone or cartilage, invasion of the cruciate ligaments, and joint effusion as MR criteria for tumor invasion of the knee joint [12]. Ozaki et al defined cartilage disruption, acetabular/epiphyseal signal alterations, and marked joint effusion as signs of JI at the hip [13]. However, information on the performance of these imaging features is scarce and reported specificities for MR-based JI assessment differ dramatically [10, 12, 13]. Moreover, previous studies did not analyze the reproducibility of the individual features assessed on MR imaging.

The aim of this study was thus to evaluate the performance and reproducibility of previously described and new MR imaging features in the diagnosis of JI in patients with malignant bone tumors.

Materials and methods

Cohort

Approval of the Institutional Review Board has been obtained prior to this study and written informed consent was waived. Pre-treatment MR examinations of patients with malignant bone neoplasms, who had undergone joint resection at our musculoskeletal tumor center between 01/2009 and 12/2019, were obtained from the imaging database. Examinations obtained after chemotherapy were not included for two reasons: First, apparent tumor size reduction under neoadjuvant treatment should not be mistaken as reversal of JI, as intraarticular spread may already have occurred. The planning of tumor resection is therefore almost always based on preoperative images with regard to the initial tumor extent. Second, neoadjuvant treatment may induce signal alterations that may be mistaken for tumor tissue and thus be subject to false-positive assessments [4]. To limit the variability of MR protocols, included exams were required to be performed for primary bone tumor screening purposes. Tumor joint adjacency was defined as tumor invasion of either the metaphysis of the joint-forming long bone, the acetabulum, or the glenoid. Cases not presenting one of these features were excluded. Availability of pre-therapeutic scans was obligatory, as imaging features have been shown to be influenced by neoadjuvant therapy [15,16,17]. Definitive statements on JI status were obtained from pathologic reports of the local Institute of Pathology. Specimens with unclear reports were reviewed by two pathologists (K.S. and C.B.) and JI status was determined.

Image acquisition and evaluation

MR images were acquired in at least two planes: short tau inversion recovery/fat-saturated intermediate-weighted and pre- and post-contrast T1-weighted sequences were oriented along the longitudinal axis of the articulating long bone(s), whereas T2-weighed and fat-saturated post-contrast T1-weighed sequences were oriented along the short axis. Detailed information on sequence parameters is available as supplementary material. Images were independently reviewed by three radiologists (K.W., A.A.F., and M.R., with 26, 10, and 8 years of experience in musculoskeletal radiology, respectively) using picture archiving and communication system (PACS) workstations. Readers were blinded regarding clinical information, surgical outcomes, and histopathological findings. Images were evaluated using a standardized scoring sheet, assessing the following indirect (1.–4.) and direct (5.–7.) signs of JI:

  1. 1.

    Epiphyseal/Acetabular signal abnormalities:

  • 1.1 Bone marrow replacement (iso-/hypointense to muscle tissue on T1-weighted images)

  • 1.2 Edema-like signal intensity (hyperintense to muscle on T1-weighted images and markedly hyperintense on STIR-/intermediate-weighted/T2-weighted images).

    1. 2.

      Signal abnormalities affecting the transarticular bone (transarticular to primary tumor site):

  • 2.1 Bone marrow replacement (see 1.1.)

  • 2.2 Edema-like signal intensity (see 1.2.)

    1. 3.

      Synovial contrast enhancement

    2. 4.

      Joint effusion

    3. 5.

      Direct visualization of ITT (tumor tissue definitively crossing the synovial membrane toward the joint space, having direct contact to synovial fluid)

    4. 6.

      Intraarticular destruction of:

  • 6.1 Bone (discontinuity of intraarticular cortical bone)

  • 6.2 Cartilage (discontinuity of articular cartilage)

    1. 7.

      Invasion of:

  • 7.1 Cruciate ligament(s) (knee joint)

  • 7.2 Ligamentum teres (hip joint)

  • 7.3 Capsular insertions (all joints)

Readers further measured the largest tumor diameter in centimeter and tumors were divided into two groups (larger than average, smaller than average), for each reader. The overall imaging diagnoses (JI: yes/no), and the subjective certainty of each reader’s diagnosis (5-point Likert-scale, 1: definitely present; 2: likely present; 3: uncertain; 4: likely not present; 5: definitely not present), were documented. Image quality was assessed by one reader using a 4-point ordinal scale (poor, moderate, good, excellent). Reader 1 also documented the MR sequence and image orientation most relevant for the diagnosis. Readings were repeated for intra-reader reproducibility measurements (K.W., 90 days between readings).

Combinations of imaging features

Following the primary analysis of individual imaging features, three dummy variables were created for Reader 1, based on the following criteria: MaxSens was defined as positive if at least one of all direct imaging features with a good inter-reader reproducibility (Fleiss’ κ ≥ 0.6) and a good sensitivity (≥ 80 %, in each reader) was achieved. MaxSens was negative if all of the variable-defining imaging features were negative. MaxSpec was positive if at least one direct imaging feature with a good inter-reader reproducibility (Fleiss’ κ ≥ 0.6) and a good specificity (≥ 80 %, in each reader) was positive, and negative if all of the defined imaging features were negative. AllDirect was positive if at least one direct sign of JI was positive, and negative if all direct signs of JI were negative.

Statistical analysis

Data analysis was performed by B.J.S. and J.B. using STATA version 15 software (StataCorp LP). All statistical tests were performed with a two-sided level of significance (α) of 0.05. Inter-group differences in demographics were compared using Pearson’s chi-square test for categorical data (gender, side, site, tumor type) and an independent samples t test for continuous data (age at diagnosis). In a primary analysis, exact logistic regression models were used to determine associations between the observation of individual imaging features and JI status. Moreover, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated using crosstabs for each imaging feature and each reader, respectively. Infinity as upper limit of the 95% CI was noted as “+inf.”

In the secondary analysis, sensitivity, specificity, PPV, and NPV of the aforementioned dummy variables (MaxSens, MaxSpec, and AllDirect) were calculated.

Fleiss’ κ was used to determine inter-reader reliability for repeated image analysis in each imaging feature between readers. Intra-reader reproducibility for repeated measures was calculated using Cohen’s κ.

Results

Cohort characteristics

Of 434 patients with malignant bone tumors treated at our institution, 313 were excluded as metaphyseal invasion was absent and/or patients did not undergo resection (Figure 1). Further 68 patients were excluded, because pre-therapeutic MR imaging was unavailable, and five patients were excluded, because the pathologic report lacked a definitive statement on JI and retrospective assessment of the specimens was impossible. We finally included 48 patients (age in years ± SD: 28.7 ± 21.4 years; sex: 26 men, 22 women) with joint adjacent malignant bone tumors in this study (Table 1). JI was more common in older patients and men were affected more frequently (age, no JI: 16.5 ± 12.0 vs. JI 40.8 ± 22.5, p < 0.001; sex, no JI: 14 women, 10 men; JI: 8 women, 16 men, p = 0.08). Most tumors were located at the knee (n = 28), exceeding both tumors at hips (n = 13) and shoulders (n = 7) combined. However, histopathologic findings revealed a higher prevalence of JI in hips (Figure 2) (11/13, 85 %) compared to knees (10/28, 36 %) and shoulders (3/7, 43 %) (p = 0.006). Tumor entities comprised osteosarcomas (n = 30), chondrosarcomas (n = 11), Ewing sarcomas (n = 4), two secondary malignancies, and one pleomorphic sarcoma. JI was most frequently diagnosed in chondrosarcomas, secondary malignancies, and pleomorphic sarcoma (osteosarcoma 43%; chondrosarcoma 73 %; Ewing sarcoma 0 %; others 100 %, p = 0.006).

Fig. 1
figure 1

Flow diagram of study population and patient recruitment

Table 1 Cohort demographics
Fig. 2
figure 2

Chondrosarcoma of the pelvis invading the right hip joint. a Coronal T1-weighted TSE image demonstrates bone marrow replacement at the acetabulum and destruction of intraarticular bone (arrow) caused by bone tumor extending into the pelvis (star). b Axial T2-weighted TSE image shows tumor invasion of the ligamentum teres (arrow) as well as direct visualization of intraarticular tumor tissue in the acetabular fossa (arrowhead). Note the absence of relevant joint effusion and typical lobulated appearance and high signal intensity of cartilaginous mass

Image analysis

Reader 1 rated image quality as good or excellent in 42 of 48 cases and as moderate in the remaining six cases. T2-TSE was regarded as the most valuable sequence for determining JI status in most cases (35/48, 72.9%) and the conclusion certainty was “uncertain” in none of the cases. Concerning the overall imaging diagnoses, sensitivity and specificity were excellent in all readers (sensitivity: 92–100%; specificity: 88–100%) (Table 2).

Table 2 Diagnosis of Joint Invasion: Overall imaging diagnoses

Indirect signs of joint invasion

JI was strongly associated with epi-/apophyseal signal alterations reflecting cellular infiltration and findings were highly reproducible (odds ratio (OR) = 15.6–18.3, p = 0.002–0.004; Fleiss κ [95% CI] = 0.96 [0.80 – 1.00]; Table 3). This imaging feature was moreover found to be highly sensitive for JI and had a high NPV in each reader (sensitivity: 96%, NPV: 91–92%; Table 4). In comparison, edema-like signal alterations at the same location were of lesser diagnostic value (OR = 0.6–4.8, p = 0.03–0.56; sensitivity: 33–83%; NPV: 45–75%). Transarticular bone marrow signal alterations for both cellular infiltration and edema-like signal alterations had an excellent PPV (100%). However, both of these indirect signs were observed rarely, considerably affecting odds ratios and sensitivities (OR = 1.0–2.5, p = 0.49–1.00; sensitivity = 1 to 2/24, 4–8%). Joint effusion was associated with increased odds for JI and also showed a substantial inter-reader agreement, but was neither markedly sensitive nor specific and the NPV was limited (OR = 4.8–25.3, p = < 0.001–0.02; Fleiss κ = 0.70; sensitivity: 54–63%; specificity: 75–96%; NPV: 67–69%). Although odds for JI were comparably higher in cases with synovial contrast enhancement, reproducibility of this imaging feature was only moderate and its specificity limited (OR = 7.5 – 8.7, p = 0.007–0.06; Fleiss κ = 0.51; sensitivity: 87–96%; specificity: 29–54%). In our cohort, a maximum tumor diameter greater than 10.5 cm was not associated with increased odds for JI and this criterion was neither sensitive nor specific (p ≥ 0.56; sensitivity: 29–50%; specificity: 63–79%).

Table 3 Associations of imaging features and JI
Table 4 Performance of the analyzed imaging features in diagnosing JI

Direct signs of joint invasion

All direct signs besides tumor invasion of the ligamentum teres were associated with increased odds for JI (Table 3). Direct visualization of ITT was strongly associated with the histopathological diagnosis of JI (OR [95 % CI], Reader 1 229 [29.8, + inf], p < 0.001; Reader 2 229 [29.8, + inf], p < 0.001; Reader 3 186 [17.8, 10848], p < 0.001) and associated values for sensitivity, specificity, PPV, and NPV were ≥ 92 % for all readers (Table 4; Figures 3 and 4). Moreover, inter- and intra-reader reproducibility were excellent (Fleiss’ κ [95% CI] = 0.83 [0.67, 1]; Cohen’s κ [95% CI] = 0.95 [0.86, 1.00]). Odds for JI were high in patients with intraarticular bone destruction (p < 0.001, respectively), and sensitivity of this sign as well as its NPV both ranged from 96 to 100%, while specificity (71–96%) and PPV (77–96%) were reader-dependent (Fleiss’ κ [95 % CI]; 0.72 [0.55, 0.88]). Destruction of cartilage showed elevated odds for JI in all readers (Reader 1 12.5 [1.7, +inf], 0.009; Readers 2+3 37.0 [6.2, 430], p < 0.001) and a specificity of 92 to 100%. With regard to intraarticular ligaments, analyses demonstrated increased odds for JI in cases affecting the cruciate ligaments, but not the ligamentum teres (p ≤ 0.02 and p ≥ 0.26, respectively). Capsular invasion predicted the histopathologic diagnosis of JI in all readers (p ≤ 0.007, respectively) and specificity was ≥ 83%, for each reader (Figure 5).

Fig. 3
figure 3

Osteosarcoma of the left distal femur invading the knee joint. a Sagittal T1-weighted TSE image shows metaphyseal mass (star) with epiphyseal extension and posterior soft tissue component. b Corresponding post-contrast image reveals inhomogeneous enhancement of the bone lesion as well as synovial enhancement at the posterior joint recess (arrows). c Sagittal intermediate-weighted TSE image with fat suppression and d axial T2-weighted TSE image show JI via the posterior capsular insertion and directly visualize intraarticular tumor tissue in contact with joint fluid and cartilage surface of the lateral femoral condyle (arrows)

Fig. 4
figure 4

Chondrosarcoma of the right distal femur invading the knee joint at two different sites. a Sagittal T1-weighted and b corresponding intermediate-weighted TSE images with fat suppression demonstrate bone marrow replacement in the metaphysis and epiphysis of the distal femur (stars), destruction of intraarticular bone at the roof of the intercondylar notch (arrows), and tumor invasion of the anterior cruciate ligament (arrowheads). c, d Consecutive axial T2-weighted images show destruction of bone and articular cartilage at the femoral trochlea (arrows) as well as tumor nodules in contact with joint fluid and cartilage surface of the patella (arrowheads) indicating invasion of the femoro-patellar joint compartment

Fig. 5
figure 5

Extra-articular osteosarcoma of the right distal femur. a Sagittal T2-weighted TSE and b corresponding intermediate-weighted TSE images with fat suppression show inhomogeneous mass with large soft tissue component at the distal femoral diaphysis and metaphysis (stars). The tumor reaches the origin of the anterior cruciate ligament posteriorly (short arrows) but does not invade the ligament or cross the synovial membrane at the posterior capsule (large arrowhead in b). Anteriorly, the mass invades the prefemoral fat (long arrows) but does not reach the suprapatellar joint recess (small arrowheads in b)

Combinations of imaging features

Direct visualization of ITT and intraarticular destruction of bone met the criteria to be included in the MaxSens variable and MaxSpec comprised direct visualization of ITT, invasion of the capsular insertion, and cruciate ligaments or ligamentum teres. AllDirect had a sensitivity, specificity, PPV, and NPV of 96 % with one false-positive and one false-negative case (95 % CI; sensitivity, specificity: 79, 100; PPV, NPV: 77, 99). While results for MaxSens were the same, MaxSpec had a specificity and PPV of 100 % [86, 100], a sensitivity of 96 % [79, 100], and a NPV of 96 % [78, 99], with only one false-negative case.

Discussion

This study evaluated the diagnostic performance and reproducibility of MR imaging features in the detection of JI by malignant bone tumors. Similar to previous population-based studies, osteosarcomas were the most frequent entity and the knee was the most common location for malignant bone neoplasms in our cohort [1]. Therefore, despite the relatively low number of included cases, we regard the external validity of our cohort as substantial.

Correct assessment of JI status in preoperative MR examinations is of great importance to orthopedic surgeons, as false-positive diagnoses lead to unnecessary extensive resections with high complication rates and worse functional outcomes, while false-negative diagnoses may increase the risk of local recurrence due to intra-operative spread of malignant cells across the surgical plane [5, 7, 8, 11, 18,19,20]. However, although previous studies found that the tumor diameter may precisely be measured using T1-weighted sequences, they also reported up to 50 % false-positive assessments, when JI was diagnosed with MR imaging [10, 12,13,14, 21]. In contrast, our readers were able to confidently exclude JI in 88–100 % of the true-negative cases. These differences may be explained by the improved quality of MR images compared to former years and the structured analysis of imaging signs in this study. This may also apply to the two most valuable and reproducible signs in this study—direct visualization of ITT and intraarticular destruction of bone—which were previously reported to be far less specific [12, 13, 22].

Previous studies have shown mixed results for the value of joint effusion and synovial contrast enhancement in diagnosing JI [12, 14]. Our results indicate that these indirect signs are either nonspecific or insensitive. Such reactive changes may occur due to hyperemia, capsular irritation by the inflammatory tumor microenvironment, or mechanical stress, if the tumor is abutting the joint, but not invading it [23]. Furthermore, the absence of joint effusion showed a much lower negative predictive value for JI than in the study by Schima et al (11).

Our readers were able to detect tumor invasion of capsular insertions and to differentiate this finding from mere capsular contact or displacement, in particular with use of T2-weighted images, whereas this task was described as challenging by Schima et al, and the usefulness of T2-weighted MR sequences for diagnosing JI has been discussed controversially [12, 24]. Tumor invasion of intraarticular ligaments was a highly specific sign and showed a high PPV. For the cruciate ligaments, these results match those of previous studies, while assessment of tumor invasion of the ligamentum teres has been reported to be more difficult, most likely due to technical limitations in visualizing the ligament [13]. Notably, invasion of the hip joint via the ligamentum teres occurred most often in patients with chondrosarcomas: Out of six cases, in which invasion of the hip joint was present, chondrosarcomas accounted for four.

Similar to previous reports, epiphyseal bone marrow replacement seen on T1-weighted MR images had an excellent sensitivity and NPV, but a low specificity for the diagnosis of JI in our study [12]. Edema-like epiphyseal signal alterations were however relatively insensitive and nonspecific and may thus only be of very limited value when diagnosing JI.

The combination of highly specific direct imaging features (direct visualization of ITT, invasion of the capsular insertion, or intraarticular ligaments) led to a further increase of sensitivity (96 %), while maintaining an excellent specificity (100 %). Consequently, we propose that a combination of direct signs should be considered when diagnosing JI.

This study has limitations: most importantly, a reader bias is pertinent to this study, as due to the design of the study, readers most likely expected a high number of cases with JI, possibly influencing the diagnosis. To minimize this bias, readers were blinded to clinical and histopathological diagnoses and surgical outcome [25]. Second, the analyzed examinations were obtained in clinical routine on two MR scanners with different field strengths using pulse sequences with different in-plane resolutions. On the other hand, this also demonstrates the feasibility of assessing the discussed imaging features in routine clinical scans. Third, the statistical power of our analyses is limited by the small cohort size. It may be difficult to overcome this limitation as the incidence of joint adjacent malignant bone tumors, particularly with JI, is relatively low.

In summary, our results demonstrate the feasibility to accurately determine JI status of malignant bone tumors on pre-therapeutic MR images. Direct signs, in particular visualization of intrasynovial tumor tissue and destruction of intraarticular bone, were found to be most valuable when assessing JI, owing to their excellent specificity, and were highly reproducible. In contrast to previous studies, indirect signs were found to be much less reliable. Providing this information is essential for orthopedic surgeons, as correctly diagnosing JI prior to surgery may prevent inadequate resections, postoperative complications, and local recurrence.