Clinical significance of metabolic tumor volume by PET/CT in stages II and III of diffuse large B cell lymphoma without extranodal site involvement
- First Online:
- Cite this article as:
- Song, MK., Chung, JS., Shin, HJ. et al. Ann Hematol (2012) 91: 697. doi:10.1007/s00277-011-1357-2
- 821 Downloads
The objective of this study was to investigate whether metabolic tumor volume (MTV) by positron emission tomography (PET) can be a potential prognostic tool when compared with Ann Arbor stage, in stages II and III nodal diffuse large B cell lymphoma (DLBCL). We evaluated 169 patients with nodal stages II and III DLBCL who underwent measurements with PET prior to rituximab combined with cyclophosphamide, adriamycin, vincristine, and prednisone (R-CHOP). Cutoff point of MTV was measured using the receiver operating characteristic (ROC) curve. During a median period of 36 months, stage II was 59.2% and III was 40.8%. Using the ROC curve, the MTV of 220 cm3 was the cutoff value. The low MTV group (<220 cm3) had longer progression-free survival (PFS) and overall survival (OS), compared with the high MTV group (≥220 cm3) (p < 0.001, p < 0.001). Stage II patients had longer survival than those in stage III (PFS, p = 0.011; OS, p = 0.001). The high MTV group had lower PFS and OS patterns, regardless of stage, compared with the low MTV group (p < 0.001, p < 0.001). Multivariate analysis revealed an association of the high MTV group with lower PFS and OS (PFS, hazard ratio (HR) = 5.300, p < 0.001; OS, HR = 7.009, p < 0.001), but not stage III (PFS, p = 0.187; OS, p = 0.054). Assessment of MTV by PET had more potential predictive power than Ann Arbor stage in the patients that received R-CHOP.
KeywordsDiffuse large B cell lymphoma Positron emission tomography Rituximab
Introduction of rituximab combined with cyclophosphamide, adriamycin, vincristine, and prednisone (R-CHOP) has resulted in improved survival outcomes in patients with diffuse large B cell lymphoma (DLBCL) [1, 2, 3, 4, 5, 6, 7, 8]. DLBCL is a heterogenous group of B cell non-Hodgkin lymphoma (NHL), rather than a single clinicopathologic entity . Multiple histologic subtypes were recognized and several molecular and genetic abnormalities were variably present. In recent years, most studies have focused on identifying molecular markers in order to define new prognostic factors. However, no relevant prognostic molecular markers have been validated, and an agreement on prognostic models has not yet been reached .
Aggressive NHL, including DLBCL, has been staged according to the Ann Arbor staging system, which was originally designed for Hodgkin lymphoma (HL). The International Prognostic Index (IPI) is the primary clinical tool used to predict the outcome for patients with aggressive NHL based on the number of negative prognostic factors at the time of diagnosis, including Ann Arbor stage III/IV and other factors (age ≥ 60 years, elevated lactate dehydrogenase level, Eastern Cooperative Oncology Group performance status ≥2, more than one extranodal site) . However, due to the higher heterogeneity and hematogenous spread pattern of dissemination in NHL relative to contiguous lymphatic spread with HL, Ann Arbor staging system has limited value in the context of assessing accurate tumor burden in NHL. For instance, despite the presence of a high tumor burden in stage II disease, the IPI score can be “zero point,” whereas the score can be “one point” in stage III disease even if the tumor burden is low.
New imaging techniques such as 18F-fluorodeoxyglucose (18F-FDG) positron emission tomography (PET) have been used as prognostic tools in NHL [12, 13]. As indicated by several positive sets of data, imaging techniques have become an important tool in clinical decisions on therapeutic strategies for treating aggressive NHL, including DLBCL [14, 15, 16]. The objective of the present study was to investigate whether metabolic tumor volume (MTV) by PET can be used as a potential prognostic tool, compared with the Ann Arbor stage, in patients with stages II and III nodal DLBCL.
Materials and methods
The baseline characteristics and comparison between stages II and III nodal DLBCL patients
Total (n = 169) (%)
Stage II (n = 100) (%)
Stage III (n = 69) (%)
Age, median, years (range)
Age ≥ 60 years
ECOG at least 2
Bulky LN (≥5 cm)
MTV, median, cm3 (range)
IPI score, n (%)
Response by revised IWC
Inclusion and exclusion criteria
Patients were included if they had primary nodal localization as a de novo DLBCL histotype, and the stage was II or III according to Ann Arbor staging and had been available for clinical follow-up. Patients were excluded if they presented any extranodal involved site, DLBCL secondary to low-grade NHL, or other treatment, including radiotherapy after R-CHOP therapy or autologous stem cell transplantation and if there was a discrepancy in the LNs between PET and conventional computed tomography (CT). In addition, patients were excluded if they had uncontrolled diabetes mellitus, evidence of infection at the time of diagnosis, especially active tuberculosis, or antibodies against human immunodeficiency virus.
Measurement of MTV by PET/CT
Dual-modality PET/CT tomography was performed on a biograph (Siemens Medical Solution, Hoffman Estates, IL, USA), based on a dual-slice helical CT and a full-ring PET tomography. FDG-PET images were evaluated for regions of focally increased tracer uptake. In the target lesions of FDG tracer uptake, SUV of ≥2.5 as contouring border was considered to represent lymphoma, as suggested by Freudenberg et al. . The CT images were used for PET attenuation correction. Imaging reconstruction of corrected emission data was performed after Fourier transform with AWOSEM software (two iterations, eight subsets, 5 mm Gaussian filter). The CT criterion for pathologic LN was that the LN exceeded 1.0 cm in all regions, except the groin. PET image also evaluated the area of focal tracer uptake; thus, the SUV of ≥2.5 was considered as pathologic LN, and the MTV was measured after CT attenuation correction. CT images were acquired with 130 mAs, 130 kV, and slice width (or 5 min and table feed) of 8 mm per rotation. Intravenous or oral contrast agents were used in all patients, and a standardized breathing protocol was applied. PET images were interpreted by nuclear physicians at each institution. Data were then reviewed by two nuclear medicine experts at Pusan National University Hospital.
Pretreatment and response evaluation
Pretreatment staging and response evaluation after six or eight cycles of R-CHOP therapy were based on clinical examination, CT scan of the neck, chest, abdomen, and pelvis, bone marrow biopsy, and PET/CT. Response was assessed according to revised International Workshop Criteria . Criteria were as follows: complete response (CR) is defined as (a) no signs or symptoms of disease, (b) negative PET and regression to normal size on CT, and (c) normal bone marrow. Partial response (PR) is defined as 50% decrease in tumor size but with a positive PET at the prior disease site. Stable disease (SD) is defined as (a) positive PET at the prior sites of disease and no new sites on CT or PET and (b) PET negative and no change in size of previous lesions on CT. Progressive disease is defined as (a) appearance of a new lesion >1.5 cm in any axis, ≥50% increase in the sum of the product of the diameters of more than one node, or ≥50% increase in the longest diameter of a previously identified node>1 cm in short axis and (b) lesions PET positive if FDG-avid lymphoma or PET positive prior to therapy.
The Mann–Whitney U test was used for assessment of differences in the frequency-independent prognostic factors of stages II and III groups. Progression-free survival (PFS) was calculated from the date of diagnosis to documented disease progression; observations were censored on the date the patient was last known to be alive or, for patients dying as a result of causes unrelated to lymphoma or treatment, the date of death. Overall survival (OS) was calculated from the date of diagnosis until death as a result of any cause or the date last known to be alive. PFS and OS were estimated by the Kaplan–Meier method, and the difference was compared using a log-rank test. Receiver operating characteristic (ROC) curve was performed for estimation of the accuracy in prediction of ideal cutoff value of MTV. Estimation of sensitivity and specificity was based on the cutoff value of MTV. SPSS software for Macintosh (SPSS 15.0; Chicago, IL, USA) was used for statistical data processing. A probability value <0.05 was considered statistically significant.
One hundred sixty-nine patients with stage II/III nodal DLBCL were treated with R-CHOP from 2004 to 2008, and the baseline characteristics were summarized in Table 1. Differences of several independent prognostic factors between stages II and III groups were not observed. However, MTVs in the stage III group were larger when compared with the stage II group (p < 0.001). CR and PR of the treatment response were comparable between the two groups (p = 0.786, p = 0.236, respectively), whereas SD was higher in the stage III group when compared with the stage II group (p = 0.036, Table 1).
Measurement of cutoff value of MTV in patients at diagnosis
Clinical outcome according to stage or MTV
Clinical outcome according to stage combined with MTV
Further analysis was performed to determine whether tumor burden was of clinical importance between stages II and III nodal DLBCL patients. Outcomes were compared among the four subgroups based on tumor burden and stage II or III (stage II group with low MTV, stage II group with high MTV, stage III group with low MTV, and stage III group with high MTV). The high MTV group, regardless of stage, had lower PFS and OS patterns, compared with the low MTV group (PFS and OS in stage II with low MTV, 90.5% and 95.2%; in stage III with low MTV, 88.0% and 88.0% versus in stage II with high MTV, 60.5% and 65.8%; in stage III with high MTV, 51.2% and 51.2%; p < 0.001, p < 0.001), whereas the prognostic impact of stage in the same MTV group was absent (in the low MTV group, difference of PFS and OS according to stage, p = 0.703, p = 0.268; in the high MTV, p = 0.347, p = 0.175, Fig. 2c, d).
Univariate and multivariate analysis
Univariate analysis for prognostic factors in patients
HR (95% CI)
HR (95% CI)
Multivariate analysis for prognostic factors in patients
HR (95% CI)
HR (95% CI)
Since its publication in 1971, Ann Arbor staging has been used as the staging system for both HL and NHL lymphomas . However, the number of involved nodal sites was not considered in this optional staging system. For these reasons, the Ann Arbor scheme was revised, and modifications in the staging procedures were recommended within the framework of the Ann Arbor Classification by a committee meeting in the Cotswolds under the auspices of the Cancer Research Campaign and Imperial Cancer Research Fund . However, accurate tumor burden was also not considered in the staging system. Interestingly, one previous study demonstrated that tumor burdens could be discriminated (i.e., low tumor burden vs. high burden) based on the number of extensive nodal areas and extranodal sites . The author of this study suggested that tumor burden measured by their method was an excellent prognostic factor in CHOP era. However, this method only involved a simple arithmetic system, and no imaging techniques were utilized. Because accurate tumor surveillance is a fundamental precondition for assessment of prognosis and therapeutic options in patients with NHL, a more accurate staging system model should be developed, especially in the era of rituximab.
A recent meta-analysis study, based on data from three large clinical trials, suggested that treatment with rituximab resulted in significant improvement of the treatment outcome within each of the four IPI factors, including Ann Arbor stage . Of particular interest, the study revealed data from the MabThera International Trial, where the advanced stage (III/IV) was no longer an independent factor of OS in multivariate analysis. In this study, data from the MegaCHOEP Trial also demonstrated that the advanced stage showed borderline correlations with PFS and was not associated with OS. In addition, data from the RICOVER-60 trial shown in this study demonstrated that the advanced stage was not an independent factor of PFS and OS. These findings indicate that treatment with rituximab resulted in improved outcome of advanced stage patients and diminished gap of survival between the limited stage and advanced stage according to the Ann Arbor staging system. For these reasons, we do not believe that the advanced stage itself would be a true poor prognostic factor in the era of rituximab. According to the results of meta-analysis in the above clinical trial, the authors demonstrated that four IPI factors were independent factors. However, it did not address several discrepancies, including patient characteristics, using regimens other than CHOP and a different treatment schedule in each clinical trial.
Development of imaging techniques such as 18F-FDG-PET has resulted in increased diagnostic accuracy and allowed clinicians to distinguish primary malignant lesions from benign areas. Thus, 18F-FDG-PET has been reported to provide superior information on staging of NHL when compared with conventional CT scans. Interestingly, two recent studies showed that tumor burden measured by PET could be used to measure the actual tumor burn of lymphoma [20, 21]. One of these studies showed that active tumor burden based on PET might be a prognostic indicator of volumetric response . Volume assessment in these studies was based on percent reduction in SUVmax. However, in the present study, we used a volume measurement process that included a cutoff value of absolute SUV volume measurement, as described by Freudenberg et al. .
PET using the tracer 18F-FDG incorporates metabolic tumor function with anatomic localization. Tumor volumes by PET in solid tumors have been associated with clinical outcome in several studies [22, 23]. However, to the best of our knowledge, clinical application of tumor burden by PET as a new staging tool has not yet been reported in DLBCL patients treated with R-CHOP therapy.
The findings reported in the present study suggest that total tumor burden of lymphoma is a more important prognostic parameter than Ann Arbor stage for assessing DLBCL. In the multivariate analysis, a high MTV had a greater clinical significance than stage III in survival. This result demonstrates that the Ann Arbor staging system has limited use in assessing DLBCL due to the heterogenous spread pattern of NHL in contrast to HL. Therefore, overall assessment of tumor burden of lymphoma may be needed before treatment strategies can be developed. In addition, the clinical outcome was not different for the patients in the same MTV state; however, the stage was different. These results suggest that a simple classification for prognosis according to diaphragm would not be wise, at least for DLBCL, in the era of rituximab. The present study was conducted to analyze the clinical importance of MTV between only nodal stages II and III DLBCL patients. Therefore, a further well-designed study including all nodal stages and extranodal sites is needed.
In conclusion, quantitative assessment of metabolic tumor volume using PET may potentially be more useful in the prediction of clinical outcome than the Ann Arbor staging system in stages II and III of exclusively nodal involved DLBCL patients treated with R-CHOP.
This study was supported by a grant from the national R&D Program for Cancer Control, Ministry for Health, Welfare and Family affairs, Republic of Korea (0920050) and supported by a grant of the Korean Health Technology R&D Project, Ministry of Health and Welfare, Republic of Korea (A070001).
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.