Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma

Gu, Bingxin; Meng, Mingyuan; Xu, Mingzhen; Feng, David Dagan; Bi, Lei; Kim, Jinman; Song, Shaoli

doi:10.1007/s00259-023-06399-7

Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma

Original Article
Open access
Published: 19 August 2023

Volume 50, pages 3996–4009, (2023)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Nuclear Medicine and Molecular Imaging Aims and scope Submit manuscript

Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma

Download PDF

Bingxin Gu^1,2,3,4,5^na1,
Mingyuan Meng⁶^na1,
Mingzhen Xu^1,2,3,4,5^na1,
David Dagan Feng⁶,
Lei Bi⁷,
Jinman Kim⁶ &
…
Shaoli Song ORCID: orcid.org/0000-0003-2544-7522^1,2,3,4,5

3401 Accesses
7 Altmetric
1 Mention
Explore all metrics

Abstract

Purpose

Prognostic prediction is crucial to guide individual treatment for locoregionally advanced nasopharyngeal carcinoma (LA-NPC) patients. Recently, multi-task deep learning was explored for joint prognostic prediction and tumor segmentation in various cancers, resulting in promising performance. This study aims to evaluate the clinical value of multi-task deep learning for prognostic prediction in LA-NPC patients.

Methods

A total of 886 LA-NPC patients acquired from two medical centers were enrolled including clinical data, [¹⁸F]FDG PET/CT images, and follow-up of progression-free survival (PFS). We adopted a deep multi-task survival model (DeepMTS) to jointly perform prognostic prediction (DeepMTS-Score) and tumor segmentation from FDG-PET/CT images. The DeepMTS-derived segmentation masks were leveraged to extract handcrafted radiomics features, which were also used for prognostic prediction (AutoRadio-Score). Finally, we developed a multi-task deep learning-based radiomic (MTDLR) nomogram by integrating DeepMTS-Score, AutoRadio-Score, and clinical data. Harrell's concordance indices (C-index) and time-independent receiver operating characteristic (ROC) analysis were used to evaluate the discriminative ability of the proposed MTDLR nomogram. For patient stratification, the PFS rates of high- and low-risk patients were calculated using Kaplan–Meier method and compared with the observed PFS probability.

Results

Our MTDLR nomogram achieved C-index of 0.818 (95% confidence interval (CI): 0.785–0.851), 0.752 (95% CI: 0.638–0.865), and 0.717 (95% CI: 0.641–0.793) and area under curve (AUC) of 0.859 (95% CI: 0.822–0.895), 0.769 (95% CI: 0.642–0.896), and 0.730 (95% CI: 0.634–0.826) in the training, internal validation, and external validation cohorts, which showed a statistically significant improvement over conventional radiomic nomograms. Our nomogram also divided patients into significantly different high- and low-risk groups.

Conclusion

Our study demonstrated that MTDLR nomogram can perform reliable and accurate prognostic prediction in LA-NPC patients, and also enabled better patient stratification, which could facilitate personalized treatment planning.

Computed tomography-based deep-learning prediction of induction chemotherapy treatment response in locally advanced nasopharyngeal carcinoma

Article 24 November 2021

Deep Learning and Radiomics Based PET/CT Image Feature Extraction from Auto Segmented Tumor Volumes for Recurrence-Free Survival Prediction in Oropharyngeal Cancer Patients

Deep learning signatures reveal multiscale intratumor heterogeneity associated with biological functions and survival in recurrent nasopharyngeal carcinoma

Article 26 April 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Nasopharyngeal carcinoma (NPC) is an epithelial malignancy arising from the nasopharyngeal mucosal lining [1], with high prevalence rates in east and southeast Asia [2]. About 70%-80% of NPC patients are categorized as locoregionally advanced NPC (LA-NPC) (Tumor-Node-Metastasis (TNM) stage III or IVa) according to the 8th edition of American Joint Committee on Cancer (AJCC)/Union for International Cancer Control (UICC) staging system [3]. The primary therapeutic regimen for NPC is radiation therapy (RT) with or without chemotherapy due to its radiosensitivity [4]. However, despite the improvement in treatment, due to locoregional recurrences and distant metastasis, the 5-year survival rates of LA-NPC patients is still a persistent problem, usually ranging from 10 to 40% [5]. Under this circumstance, pretreatment prognosis is a major concern for LA-NPC patients, which is conducive to guide the individualized therapeutic regimen. Specifically, based on the pretreatment prognosis, patients could be stratified into different risk groups with different therapeutic regimens applied, and this has been reported to potentially improve the patients’ overall survival outcomes [6].

TNM staging system is widely used for prognostic prediction and patient stratification [7,8,9]. However, despite the fact that patients with the same TNM stage receive the same treatment, large variations in prognosis exists due to the heterogeneous nature of tumor microenvironment [10]. Image-derived biomarkers, such as the standardized uptake value (SUV) and metabolic tumor volume (MTV) derived from [¹⁸F]-fluorodeoxyglucose ([¹⁸F]FDG) positron emission tomography/computed tomography (PET/CT), can provide promising prognostic information for NPC [11, 12]. Nevertheless, these factors are limited in clinical practice as they are arduous to represent intra-tumor information such as tumor texture, intensity, heterogeneity, and morphology. Therefore, a reliable and accurate prognostic prediction model is needed to predict their progression-free survival (PFS), and to distinguish high-risk from low-risk patients. Such prediction will ultimately facilitate the formulation of therapeutic regimens and improve patients’ overall survival outcomes.

Radiomics is a widely recognized computational method for prognostic prediction, which extracts high-dimensional handcrafted features from medical images to characterize intra-tumor information and then models the relevance between the features and prognostic outcomes through statistical methods [13, 14]. Radiomics has been widely used for prognostic prediction in various cancers including NPC [15,16,17]. However, the extraction of radiomics features requires tumor segmentation masks as the guidance, which inevitably brings an additional segmentation step into the radiomics pipeline. In addition, radiomics features are extracted from the segmented regions, which are usually limited to primary and metastatic lesions [5, 18]. This suggests that the extracted radiomics features may have difficulties in representing the prognostic information outside of malignant lesions (e.g., adjacent tissue invasion). There have been attempts at leveraging lymph node segmentation for radiomics analysis [19,20,21]. However, lymph node segmentation is intractable and the adjacent tissue invasion has not been considered yet. This limitation is more critical for LA-NPC patients, as many vital tissues and organs adjacent to the nasopharynx (e.g., brain, ethmoidal sinus, and orbit) might have already been invaded by LA-NPC [22].

Deep learning is an alternative approach to prognostic prediction and is becoming popular in the literature [15, 23, 24]. Deep survival models based on deep learning usually adopt convolutional neural networks (CNNs) to extract image features and then perform end-to-end prediction from medical images, where tumor segmentation masks are often not required [25]. Without tumor masks as constraints, deep survival models may potentially leverage the prognostic information existing within the entire images. Deep survival models have demonstrated the potential to outperform conventional radiomics-based prognostic prediction models [26,27,28]. However, performing end-to-end prediction without using tumor masks introduces interference from non-relevant background information and incurs difficulties in extracting tumor-specific information. Recently, multi-task deep survival models were explored to perform prognostic prediction jointly with tumor segmentation [29,30,31], which implicitly guided the model to extract tumor-related information while not discarding out-of-tumor information. However, the value of multi-task deep learning for prognostic prediction in LA-NPC has not been validated with large patient cohorts. In addition, deep survival models are limited by the ‘block box’ nature [32], which undermines their generalizability in clinical practice.

Nomograms serve as a common tool for guiding individualized treatments as they can simplify complicated prognostic models to numerical estimate of survival probability and provide a clear visual illustration of the factors leading to the prediction [33, 34]. Zhang et al. [5] developed a multiparametric magnetic resonance imaging (MRI)-based radiomic nomogram, which provides an illustrative example of precision medicine and prognostic prediction. Peng et al. [15] developed a deep learning FDG-PET/CT-based nomogram that may act as an individual chemotherapy (IC) indicator in advanced NPC. Pan et al. [3] developed a radiomic nomogram with better prognostic performance than the 8th edition of AJCC/UICC staging system. Nevertheless, it has been reported with an external validation cohort that Pan et al.’s nomogram underestimated the 5-year overall survival (OS) of LA-NPC patients [35]. Therefore, a more reliable and accurate prognostic nomogram is still needed for LA-NPC patients.

In this study, we aim to evaluate the value of multi-task deep learning for prognostic prediction in LA-NPC patients with a large database acquired from two medical centers. We adopted the state-of-the-art deep multi-task survival model (DeepMTS) [29] for joint prognostic prediction and tumor segmentation from pretreatment FDG-PET/CT images, which predicted a survival risk score (DeepMTS-Score) and a tumor segmentation mask for individual LA-NPC patient. The DeepMTS-Score can be directly used for prognostic prediction, while the predicted tumor masks were leveraged for prognostic prediction through radiomics analysis (AutoRadio-Score). We further developed a multi-task deep learning-based radiomic (MTDLR) nomogram by integrating DeepMTS-Score, AutoRadio-Score, and clinical data, so as to improve the accuracy and interpretability of prognostic prediction. Compared with conventional radiomic nomograms, our MTDLR nomogram achieved better prognostic performance and enabled better patient stratification, which demonstrated the potential to facilitate personalized treatment planning.

Materials and methods

Patients

Between May 2009 and May 2019, the medical records of 903 NPC patients were collected from Fudan University Shanghai Cancer Center (FUSCC) and Shanghai Proton and Heavy Ion Center (SPHIC). The inclusion criteria are as follows: (1) histologically confirmed LA-NPC (TNM stage III or IVa); (2) received concomitant systemic treatment with intensity modulated radiotherapy (IMRT); (3) underwent pretreatment FDG-PET/CT scans; and (4) available clinical data and FDG-PET/CT images. Patients with previous chemotherapy/radiotherapy or other malignant tumors were excluded. Finally, 652 patients from FUSCC and 234 patients from SPHIC were enrolled in this study. Patients from FUSCC were randomly divided into a training cohort (n = 522) and an internal validation cohort (n = 130) with a 4:1 ratio, while patients from SPHIC (n = 234) were used as an external validation cohort and used merely for evaluation purpose.

After completion of initial treatment, each patient was followed up for every 3 months in the first 2 years, then every 6 months in the third to fifth year, and annually thereafter. The follow-up endpoint of this study is PFS, defined as the time from randomization to the date of disease progression or death from any cause. The median follow-up time is 50 months (ranging from 44 to 120 months) for FUSCC and 49 months (ranging from 44 to 97 months) for SPHIC. FUSCC and SPHIC Ethical Committee approved this retrospective study with informed consent obtained from all enrolled patients.

PET/CT imaging

FDG-PET/CT images were obtained on a Siemens biograph 16HR PET/CT scanner (Knoxville, Tennessee, USA). FDG-PET/CT data acquisition procedure was detailed in Online Resource.

For quantitative analysis, maximum or mean of standardized uptake value (SUV) normalized to body weight and metabolic tumor volume (MTV) were manually computed for tumor lesions by drawing a 3-dimensional volume of interest (VOI). Meanwhile, total lesion glucose (TLG) was calculated according to the formula: TLG = SUV_mean × MTV, where the SUV_mean and MTV were recorded at the SUV threshold of 2.5.

Multi-task deep learning-based radiomics analysis

The workflow of multi-task deep learning-based radiomics analysis is illustrated in Fig. 1, which presents a three-step pipeline including multi-task deep learning model construction, automatic radiomics analysis, and nomogram construction.

We adopted a deep multi-task survival model (DeepMTS) [29] for joint prognostic prediction and tumor segmentation from FDG-PET/CT images. We preprocessed FDG-PET/CT images with resampling, SUV conversion (for PET only), affine registration, Regions-of-Interest (ROIs) cropping, and intensity normalization (detailed in Online Resource). The preprocessed PET and CT images were concatenated and fed into the DeepMTS as input, while the manual segmentation masks of primary tumors were used as ground truth labels for training only. The DeepMTS is a CNN consisting of a Unet-based segmentation backbone [36] and a DenseNet-based cascaded survival network (CSN) [37]. The Unet is a U-shape encoder-decoder CNN with skip connections between its contracting encoder and expanding decoder [36]. The DenseNet is a CNN consisting of multiple dense blocks with dense connections between layers, which enables feature reuse to enhance the capacity to generalize to unseen data [37]. The segmentation backbone is hard-shared by prognostic prediction and tumor segmentation tasks, which implicitly guides the model to extract features related to tumor regions. The outputs of the segmentation backbone are fed into the CSN as a supplementary input (together with FDG-PET/CT images), which further leverages the global tumor information (e.g., tumor size, shape, and locations) for prognostic prediction. Deep features derived from both segmentation backbone and CSN are used for prognostic prediction via two fully-connected layers. After training, DeepMTS can predict the survival risk scores of patients (DeepMTS-Score) and the segmentation masks of tumor regions. The DeepMTS-Score is relevant to PFS and can be directly used for prognostic prediction, while the predicted tumor masks were further leveraged in the following automatic radiomics analysis. The architecture of DeepMTS is detailed in [29] and its implementation code is publicly available at https://github.com/MungoMeng/Survival-DeepMTS. We also provide more training details in Online Resource. For comparison, we also built a single-task deep survival model for prognostic prediction, following Qiang et al.’s study [38], and its output scores are denoted by SingleTask-Score.

With the tumor masks predicted by DeepMTS, we extracted 1456 handcrafted radiomics features from FDG-PET/CT images via Pyradiomics [39], including 720 PET features, 720 CT features, and 16 shape features based on 3D shape of tumors (detailed in Online Resource). The extracted features were analyzed by a Lasso-Cox model [40], whose output scores are denoted by AutoRadio-Score. We refer to this radiomics process as automatic radiomics, which differentiates it from conventional radiomics based on manual segmentation. For comparison, we also performed the same radiomics analysis based on manual segmentation masks and refer to the output scores as ManualRadio-Score.

After the DeepMTS-Score and AutoRadio-Score are derived, we developed a multi-task deep learning-based radiomic (MTDLR) nomogram by combining the DeepMTS-Score, AutoRadio-Score, and clinical data. Univariate and multivariate analyses were performed for all clinical data and prediction scores via Cox proportional hazards regression, so as to screen out the prognostic indicators with significant relevance to PFS and build the nomogram. For comparison, we also built a conventional radiomic nomogram and a single-task deep learning-based radiomic nomogram by combining the ManualRadio-Score and SingleTask-Score with clinical data.

Statistical analysis

Continuous parameters were described using median or mean with range, while categorical variables were described using frequency with percentage. Differences among the training, internal validation, and external validation cohorts were analyzed using the Mann–Whitney test, χ² test, or Fisher’s exact test.

Univariate and multivariate Cox analyses were performed using SPSS (version 26.0; IBM Inc., New York, NY, USA). All radiomic nomograms were developed based on the multivariate analyses. Calibration curves with the Hosmer–Lemeshow goodness-of-fit test were applied to evaluate the consistence between the observed PFS proportion and the predicted survival probability.

The prognostic performance of nomograms was evaluated using Harrell's concordance indices (C-index), time-independent receiver operating characteristic (ROC) curve, and area under curve (AUC). The statistical significance between AUCs was tested via DeLong’s method using R packages (version 3.6.3, http://www.R-project.org). Survival analyses based on Kaplan–Meier method were performed for risk group stratification. Patients with score higher/lower than the cutoff value calculated by ROC were stratified into high/low-risk groups, and then a two-sided log-rank test was applied for comparisons. All tests were two-sided for statistical significance, and P value < 0.05 was considered to indicate statistically significant differences.

Results

Patient characteristics

The demographic and clinical characteristics of patients are presented in Table 1. The median age was 45 years (range 15–83 years), 48 years (range 14–79 years) and 48 years (range 14–74 years) for the training cohort, internal validation cohort, and external validation cohort, respectively. Among these three cohorts, no statistically significant difference was observed in age, gender, EBV DNA, T stage, N stage, and TNM stage, whereas BMI, LDH, histology, and PET parameters were statistically significantly different. At the end of the follow-up, the PFS ratio was 75.67% (395/522), 81.54% (106/130), and 80.77% (189/234) in the training, internal validation, and external validation cohorts, and there was no significant difference of PFS distribution among these cohorts (P = 0.163).

Table 1 Demographic and clinical characteristics of patients

Full size table

Establishment of MTDLR nomogram

Among the clinical and conventional PET parameters, only TNM stage was significantly associated with PFS in univariate analysis for the training cohort (P = 0.031, Table 2). However, none of these parameters showed a significant correlation with PFS in the internal and external validation cohorts. Notably, all the DeepMTS-Score, SingleTask-Score, AutoRadio-Score, and ManualRadio-Score were significantly associated with PFS in univariate analysis for the training, internal and external validation cohorts. For multivariate analysis, the DeepMTS-Score and AutoRadio-Score could serve as independent factors for predicting disease progression in all three cohorts (Table 3).

Table 2 Univariate Cox proportional hazard regression analysis for PFS on the training, internal validation, and external validation cohorts

Full size table

Table 3 Multivariate Cox proportional hazard regression analysis for PFS on the training, internal validation, and external validation cohorts

Full size table

Based on the multivariate analysis, we built the MTDLR nomogram with TNM stage, AutoRadio-Score, and DeepMTS-Score (Fig. 2a). The C-index of the nomogram was 0.818 (95% confidence interval (CI): 0.785–0.851, P < 0.001), 0.752 (95% CI: 0.638–0.865, P < 0.001), and 0.717 (95% CI: 0.641–0.793, P < 0.001) in the training, internal validation, and external validation cohort. Furthermore, the calibration curves showed that the predicted 3-year and 5-year PFS probability of the nomogram was highly consistent with the observed PFS probability (Hosmer–Lemeshow test: P > 0.05, Fig. 2b and c).

Performance of radiomic nomograms

To evaluate the prognostic performance of our MTDLR nomogram, the conventional radiomic nomogram (ManualRadio-Score + TNM) and the single-task deep learning-based radiomic nomogram (SingleTask-Score + TNM) were compared (Online Resource Fig. 1 and Table 1). Table 4 shows that our DeepMTS-Score exhibits better prognostic performance than the SingleTask-Score in the training (C-index and AUC: 0.780 and 0.819; Fig. 3a), internal validation (0.731 and 0.750; Fig. 3b), and external validation cohorts (0.695 and 0.702; Fig. 3c). Furthermore, the AutoRadio-Score also shows better prognostic performance than the ManualRadio-Score in these three cohorts (C-index: 0.728, 0.702, and 0.669; AUC: 0.751, 0.706, and 0.704). Moreover, the MTDLR nomogram combining TNM stage, DeepMTS-Score, and AutoRadio-Score achieved the best prognostic performance among all prognostic scores and nomograms in all three cohorts (C-index: 0.818, 0.752, and 0.717; AUC: 0.859, 0.769, and 0.730).

Table 4 C-index and AUC of different clinical, conventional, and deep learning-based radiomic scores/nomograms evaluated on the training, internal validation, and external validation cohorts

Full size table

Survival analysis for risk group stratification

The conventional radiomic nomogram (ManualRadio-Score + TNM), single-task deep learning-based radiomic nomogram (SingleTask-Score + TNM), and our MTDLR nomogram were used to stratify patients into high- and low-risk groups by cutoff values calculated with ROC curves. The Kaplan–Meier curves of the high- and low-risk patient groups were showed in Fig. 4. For comparison, the commonly-used TNM stage was also adopted to stratify patients according to stage III or IVa, where the patients with stage IVa had significantly poorer prognosis than the patients with stage III in the training cohort (Hazard rate (HR): 1.541, 95% CI: 0.991–2.397, P = 0.029). However, the TNM stage failed to stratify patients into significantly different groups in the internal and external validation cohorts (HR: 1.457, 95% CI: 0.582–3.647, P = 0.381 and HR: 1.839, 95% CI: 0.861–3.928, P = 0.059, respectively). Figure 4 also show that all three nomograms stratify patients into significantly different groups in all three cohorts (P < 0.001). Nevertheless, our MTDLR nomogram differentiated the high- and low-risk groups with the highest HR value among these three nomograms (HR: 10.250, 95% CI: 6.853–15.340, in the training cohort; HR: 7.519, 95% CI: 2.339–24.170, in the internal validation cohort; and HR: 4.812, 95% CI: 2.291–10.100, in the external validation cohort). In addition, the Kaplan–Meier curves of the patient groups stratified by ManualRadio-Score, SingleTask-Score, DeepMTS-Score, and AutoRadio-Score were presented in Online Resource Fig. 2.

Discussion

In this study, we constructed a multi-task deep learning-based radiomic (MTDLR) nomogram to predict the PFS of LA-NPC patients. The prognostic prediction and risk stratification performance of the MTDLR nomogram was superior to the conventional radiomic nomogram and single-task deep learning-based radiomic nomogram. LA-NPC patients can be stratified into low- and high-risk groups, where the high-risk group was characterized by worse PFS rates than the low-risk group.

The TNM staging system, focusing on anatomical and locational information, has been widely used in clinical studies [7,8,9] but, unfortunately, was not an independent prognostic factor in our study (Table 3). Nevertheless, we identified that combining TNM stage with other prognostic scores still improved the prognostic performance, which is consistent with the findings reported in previous studies [8, 28, 35]. FDG-PET/CT images, given the capabilities in providing tumors’ metabolic and anatomical information, have also been widely used for prognostic prediction [41,42,43]. However, the conventional FDG-PET/CT-derived parameters (SUV, MTV, and TLG) cannot serve as effective prognostic indicators in our univariate analysis (Table 2). To further leverage the prognostic information in FDG-PET/CT images, radiomics or deep learning were adopted and showed superiority over conventional parameters [28, 44]. Nevertheless, the prognostic performance varied with different radiomics or deep learning models, which suggests that the prognostic information in FDG-PET/CT image cannot be easily accessed and should be carefully leveraged with well-developed models.

Currently, there is a dilemma for extracting prognostic information from medical images. As discussed, conventional radiomics can well characterize the intra-tumor information while it is limited to the segmented tumor regions. Deep learning can access the prognostic information in the entire images. However, it has difficulties in extracting tumor-specific information. In this study, we adopted a deep multi-task survival model (DeepMTS) [29] to address this dilemma. It has been demonstrated that, through jointly learning tumor segmentation task with a hybrid multi-task architecture, DeepMTS can effectively extract prognostic information from tumor regions while also capturing the out-of-tumor prognostic information, which enables DeepMTS to outperform existing radiomics- or deep learning-based prognostic prediction models [29]. Nevertheless, we noticed that the segmentation output of DeepMTS was not fully leveraged for prognostic prediction and the prognostic information within tumor regions could be further explored. Therefore, we used the DeepMTS-segmented tumor masks for automatic radiomics analysis, which further explored the intra-tumor prognostic information and removed the reliance of conventional radiomics on manual segmentation. For tumor segmentation, the DeepMTS achieved a Dice Similarity Coefficient (DSC) of 0.826, 0.775, and 0.765 on the training, internal validation, and external validation cohorts, which demonstrates great consistency with the manually delineated segmentation masks. It has been reported that automatic segmentations improved the objectiveness [45] and resulted in significantly better prognostic prediction performance than manual segmentation [46], which potentially enables better radiomics analysis and facilitates the final prognostic prediction [47, 48].

The prognostic scores from DeepMTS and automatic radiomics were combined with clinical data to build the MTDLR nomogram, which leveraged both FDG-PET/CT and clinical information and also improved the interpretability for prediction. Our MTDLR nomogram achieved the best prognostic performance among all comparison prognostic scores and nomograms (Table 4), which could be attributed to three facts. First, the DeepMTS produced more discriminative prognostic scores (DeepMTS-Score) than the commonly used single-task deep survival model (SingleTask-Score). Second, the automatic radiomics also produced more discriminative prognostic scores (AutoRadio-Score) than conventional radiomics (ManualRadio-Score). Finally, the DeepMTS-Score and AutoRadio-Score were combined together to achieve better prognostic prediction. The strategy of combining multi-task deep learning and radiomics has been adopted for prognostic prediction in head and neck cancer [48] and achieved one of the top prognostic performance in HEad and neCK TumOR segmentation and outcome prediction (HECKTOR 2022) challenge [49]. Our study further validated this strategy with a large database of NPC patients.

We divided patients based on our MTDLR nomogram and found that the MTDLR nomogram effectively stratified LA-NPC patients into significantly different risk groups, which is potentially beneficial for individualized treatment regimens. Induction chemotherapy (IC) plus concurrent chemoradiotherapy (CCRT) is recommended as 2A-level evidence according to the National Comprehensive Cancer Network (NCCN) guidelines [4]. However, it’s still a controversy as a portion of LA-NPC patients do not benefit from IC. Qiang et al. [38] developed a prognostic system to explore whether high-risk or low-risk patients can benefit from IC + CCRT than CCRT only. Zhong et al. [16] developed a deep learning-based radiomic nomogram to predict the prognosis of NPC patients with different regimens and accordingly recommend an optimal treatment regimen. These studies demonstrated the necessity of stratifying LA-NPC patients into different risk groups so as to optimize treatment regimens.

There exist several inevitable limitations with our study. First, the completeness and homogeneity of our data had deficiencies due to its retrospective nature. EBV status was missing for about 15% of patients, which might limit the accuracy of statistical analysis. Second, our study was conducted in endemic areas and thus only included patients with TNM stage III and IVa. Therefore, the MTDLR nomogram could be further validated with more extensive databases in future studies. However, it should be noted that we have validated our MTDLR nomogram in a large database (886 patients) with two validation cohorts, which can support the effectiveness of MTDLR nomogram in LA-NPC.

Conclusion

In this study, we evaluated the value of multi-task learning for prognostic prediction in LA-NPC patients. To achieve this, we adopted a deep multi-task survival model (DeepMTS) and developed a multi-task deep learning-based radiomic (MTDLR) nomogram that combines TNM stage, DeepMTS-Score, and AutoRadio-Score. Compared to the conventional and single-task deep learning-based radiomic nomograms, the MTDLR nomogram extracted more heterogeneous and prognostic information to better predict the prognosis of LA-NPC patients. We validated our MTDLR nomogram with a large LA-NPC databased with two (internal/external) validation cohorts, which support the effectiveness of MTDLR nomogram and its potential contributions to clinical decision making.

Data availability

Data generated or analyzed during the study are available from the corresponding author by request.

References

Chen Y-P, Chan ATC, Le Q-T, Blanchard P, Sun Y, Ma J. Nasopharyngeal carcinoma. Lancet. 2019;394(10192):64–80. https://doi.org/10.1016/s0140-6736(19)30956-0.
Article PubMed Google Scholar
de Martel C, Georges D, Bray F, Ferlay J, Clifford GM. Global burden of cancer attributable to infections in 2018: A worldwide incidence analysis. Lancet Glob Health. 2020;8(2):e180–90. https://doi.org/10.1016/s2214-109x(19)30488-7.
Article PubMed Google Scholar
Pan JJ, Ng WT, Zong JF, Lee SW, Choi HC, Chan LL, et al. Prognostic nomogram for refining the prognostication of the proposed 8th edition of the AJCC UICC staging system for nasopharyngeal cancer in the era of intensity-modulated radiotherapy. Cancer. 2016;122(21):3307–15. https://doi.org/10.1002/cncr.30198.
Article CAS PubMed Google Scholar
Pfister DG, Spencer S, Adelstein D, Adkins D, Anzai Y, Brizel DM, et al. Head and neck cancers, version 2.2020, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw. 2020;18(7):873–98. https://doi.org/10.6004/jnccn.2020.0031.
Article PubMed Google Scholar
Zhang B, Tian J, Dong D, Gu D, Dong Y, Zhang L, et al. Radiomics features of multiparametric MRI as novel prognostic factors in advanced nasopharyngeal carcinoma. Clin Cancer Res. 2017;23(15):4259–69. https://doi.org/10.1158/1078-0432.Ccr-16-2910.
Article PubMed Google Scholar
He YQ, Wang TM, Ji M, Mai ZM, Tang M, Wang R, et al. A polygenic risk score for nasopharyngeal carcinoma shows potential for risk stratification and personalized screening. Nat Commun. 2022;13(1):1966. https://doi.org/10.1038/s41467-022-29570-4.
Article CAS PubMed PubMed Central Google Scholar
Huang SH, O’Sullivan B. Overview of the 8th edition TNM classification for head and neck cancer. Curr Treat Options Oncol. 2017;18(7):40. https://doi.org/10.1007/s11864-017-0484-y.
Article PubMed Google Scholar
Hui EP, Li WF, Ma BB, Lam WKJ, Chan KCA, Mo F, et al. Integrating postradiotherapy plasma epstein-barr virus DNA and TNM stage for risk stratification of nasopharyngeal carcinoma to adjuvant therapy. Ann Oncol. 2020;31(6):769–79. https://doi.org/10.1016/j.annonc.2020.03.289.
Article CAS PubMed Google Scholar
Demirjian NL, Varghese BA, Cen SY, Hwang DH, Aron M, Siddiqui I, et al. CT-based radiomics stratification of tumor grade and TNM stage of clear cell renal cell carcinoma. Eur Radiol. 2022;32(4):2552–63. https://doi.org/10.1007/s00330-021-08344-4.
Article PubMed Google Scholar
Liu Y, He S, Wang XL, Peng W, Chen QY, Chi DM, et al. Tumour heterogeneity and intercellular networks of nasopharyngeal carcinoma at single cell resolution. Nat Commun. 2021;12(1):741. https://doi.org/10.1038/s41467-021-21043-4.
Article CAS PubMed PubMed Central Google Scholar
Gihbid A, Cherkaoui Salhi G, El Alami I, Belgadir H, Tawfiq N, Bendahou K, et al. Pretreatment [(18)F]FDG PET/CT and MRI in the prognosis of nasopharyngeal carcinoma. Ann Nucl Med. 2022;36(10):876–86. https://doi.org/10.1007/s12149-022-01770-4.
Article CAS PubMed Google Scholar
Wong WL. PET-CT for staging and detection of recurrence of head and neck cancer. Semin Nucl Med. 2021;51(1):13–25. https://doi.org/10.1053/j.semnuclmed.2020.09.004.
Article PubMed Google Scholar
Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6. https://doi.org/10.1016/j.ejca.2011.11.036.
Article PubMed PubMed Central Google Scholar
Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. 2014;5:4006. https://doi.org/10.1038/ncomms5006.
Article CAS PubMed Google Scholar
Peng H, Dong D, Fang MJ, Li L, Tang LL, Chen L, et al. Prognostic value of deep learning PET/CT-based radiomics: Potential role for future individual induction chemotherapy in advanced nasopharyngeal carcinoma. Clin Cancer Res. 2019;25(14):4271–9. https://doi.org/10.1158/1078-0432.Ccr-18-3065.
Article CAS PubMed Google Scholar
Zhong L, Dong D, Fang X, Zhang F, Zhang N, Zhang L, et al. A deep learning-based radiomic nomogram for prognosis and treatment decision in advanced nasopharyngeal carcinoma: A multicentre study. EBioMedicine. 2021;70:103522. https://doi.org/10.1016/j.ebiom.2021.103522.
Article PubMed PubMed Central Google Scholar
Bao D, Zhao Y, Li L, Lin M, Zhu Z, Yuan M, et al. A MRI-based radiomics model predicting radiation-induced temporal lobe injury in nasopharyngeal carcinoma. Eur Radiol. 2022;32(10):6910–21. https://doi.org/10.1007/s00330-022-08853-w.
Article PubMed Google Scholar
Lv W, Yuan Q, Wang Q, Ma J, Feng Q, Chen W, et al. Radiomics analysis of PET and CT components of PET/CT imaging integrated with clinical parameters: Application to prognosis for nasopharyngeal carcinoma. Mol Imaging Biol. 2019;21(5):954–64. https://doi.org/10.1007/s11307-018-01304-3.
Article CAS PubMed Google Scholar
Bologna M, Corino V, Calareso G, Tenconi C, Alfieri S, Iacovelli NA, et al. Baseline MRI-radiomics can predict overall survival in non-endemic EBV-related nasopharyngeal carcinoma patients. Cancers (Basel). 2020;12(10):2958. https://doi.org/10.3390/cancers12102958.
Article CAS PubMed PubMed Central Google Scholar
Wang Y, Li C, Yin G, Wang J, Li J, Wang P, et al. Extraction parameter optimized radiomics for neoadjuvant chemotherapy response prognosis in advanced nasopharyngeal carcinoma. Clin Transl Radiat Oncol. 2022;33:37–44. https://doi.org/10.1016/j.ctro.2021.12.005.
Article CAS PubMed Google Scholar
Lin M, Tang X, Cao L, Liao Y, Zhang Y, Zhou J. Using ultrasound radiomics analysis to diagnose cervical lymph node metastasis in patients with nasopharyngeal carcinoma. Eur Radiol. 2023;33(2):774–83. https://doi.org/10.1007/s00330-022-09122-6.
Article PubMed Google Scholar
Zhang LL, Li YY, Hu J, Zhou GQ, Chen L, Li WF, et al. Proposal of a pretreatment nomogram for predicting local recurrence after intensity-modulated radiation therapy in T4 nasopharyngeal carcinoma: A retrospective review of 415 chinese patients. Cancer Res Treat. 2018;50(4):1084–95. https://doi.org/10.4143/crt.2017.359.
Article CAS PubMed Google Scholar
Li C, Jing B, Ke L, Li B, Xia W, He C, et al. Development and validation of an endoscopic images-based deep learning model for detection with nasopharyngeal malignancies. Cancer Commun (Lond). 2018;38(1):59. https://doi.org/10.1186/s40880-018-0325-9.
Article PubMed PubMed Central Google Scholar
Chen ZH, Lin L, Wu CF, Li CF, Xu RH, Sun Y. Artificial intelligence for assisting cancer diagnosis and treatment in the era of precision medicine. Cancer Commun (Lond). 2021;41(11):1100–15. https://doi.org/10.1002/cac2.12215.
Article PubMed PubMed Central Google Scholar
Deepa P, Gunavathi C. A systematic review on machine learning and deep learning techniques in cancer survival prediction. Prog Biophys Mol Biol. 2022;174:62–71. https://doi.org/10.1016/j.pbiomolbio.2022.07.004.
Article Google Scholar
Sollini M, Antunovic L, Chiti A, Kirienko M. Towards clinical application of image mining: A systematic review on artificial intelligence and radiomics. Eur J Nucl Med Mol Imaging. 2019;46(13):2656–72. https://doi.org/10.1007/s00259-019-04372-x.
Article PubMed PubMed Central Google Scholar
Cheng NM, Yao J, Cai J, Ye X, Zhao S, Zhao K, et al. Deep learning for fully automated prediction of overall survival in patients with oropharyngeal cancer using FDG-PET imaging. Clin Cancer Res. 2021;27(14):3948–59. https://doi.org/10.1158/1078-0432.Ccr-20-4935.
Article CAS PubMed Google Scholar
Gu B, Meng M, Bi L, Kim J, Feng DD, Song S. Prediction of 5-year progression-free survival in advanced nasopharyngeal carcinoma with pretreatment PET/CT using multi-modality deep learning-based radiomics. Front Oncol. 2022;12:899351. https://doi.org/10.3389/fonc.2022.899351.
Article CAS PubMed PubMed Central Google Scholar
Meng M, Gu B, Bi L, Song S, Feng DD, Kim J. DeepMTS: Deep multi-task learning for survival prediction in patients with advanced nasopharyngeal carcinoma using pretreatment PET/CT. IEEE J Biomed Health Inform. 2022;26(9):4497–507. https://doi.org/10.1109/jbhi.2022.3181791.
Article PubMed Google Scholar
Andrearczyk V, Fontaine P, Oreiller V, Castelli J, Jreige M, Prior JO, et al. Multi-task deep segmentation and radiomics for automatic prognosis in head and neck cancer. Predictive Intelligence in Medicine; PRIME 2021. https://doi.org/10.1007/978-3-030-87602-9_14
Saeed N, Sobirov I, Al Majzoub R, Yaqub M. TMSS: An end-to-end transformer-based multimodal network for segmentation and survival prediction. Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. https://doi.org/10.1007/978-3-031-16449-1_31
Reardon S. Rise of robot radiologists. Nature. 2019;576(7787):S54-s58. https://doi.org/10.1038/d41586-019-03847-z.
Article CAS PubMed Google Scholar
Iasonos A, Schrag D, Raj GV, Panageas KS. How to build and interpret a nomogram for cancer prognosis. J Clin Oncol. 2008;26(8):1364–70. https://doi.org/10.1200/JCO.2007.12.9791.
Article PubMed Google Scholar
Tang LQ, Li CF, Li J, Chen WH, Chen QY, Yuan LX, et al. Establishment and validation of prognostic nomograms for endemic nasopharyngeal carcinoma. J Natl Cancer Inst. 2016;108(1):291. https://doi.org/10.1093/jnci/djv291.
Article CAS Google Scholar
OuYang PY, You KY, Zhang LN, Xiao Y, Zhang XM, Xie FY. External validity of a prognostic nomogram for locoregionally advanced nasopharyngeal carcinoma based on the 8th edition of the AJCC/UICC staging system: A retrospective cohort study. Cancer Commun (Lond). 2018;38(1):55. https://doi.org/10.1186/s40880-018-0324-x.
Article PubMed Google Scholar
Çiçek Ö, Abdulkadir A, Lienkamp SS, Brox T, Ronneberger O. 3D U-Net: Learning dense volumetric segmentation from sparse annotation. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016. https://doi.org/10.1007/978-3-319-46723-8_49
Gao Huang ZL, Laurens van der Maaten, Kilian Q. Weinberger. Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2017; pp. 4700–4708.
Qiang M, Li C, Sun Y, Sun Y, Ke L, Xie C, et al. A prognostic predictive system based on deep learning for locoregionally advanced nasopharyngeal carcinoma. J Natl Cancer Inst. 2021;113(5):606–15. https://doi.org/10.1093/jnci/djaa149.
Article PubMed Google Scholar
van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77(21):e104–7. https://doi.org/10.1158/0008-5472.Can-17-0339.
Article PubMed PubMed Central Google Scholar
Tibshirani R. The lasso method for variable selection in the cox model. Stat Med. 1997;16(4):385–95. https://doi.org/10.1002/(sici)1097-0258(19970228)16:4%3c385::aid-sim380%3e3.0.co;2-3.
Article CAS PubMed Google Scholar
Chen YH, Chang KP, Chu SC, Yen TC, Wang LY, Chang JT, et al. Value of early evaluation of treatment response using (18)F-FDG PET/CT parameters and the epstein-barr virus DNA load for prediction of outcome in patients with primary nasopharyngeal carcinoma. Eur J Nucl Med Mol Imaging. 2019;46(3):650–60. https://doi.org/10.1007/s00259-018-4172-3.
Article CAS PubMed Google Scholar
Chan SC, Yeh CH, Chang JT, Chang KP, Wang JH, Ng SH. Combing MRI perfusion and (18)F-FDG PET/CT metabolic biomarkers helps predict survival in advanced nasopharyngeal carcinoma: A prospective multimodal imaging study. Cancers (Basel). 2021;13(7):1550. https://doi.org/10.3390/cancers13071550.
Article CAS PubMed PubMed Central Google Scholar
Fei Z, Xu T, Hong H, Xu Y, Chen J, Qiu X, et al. PET/CT standardized uptake value and EGFR expression predicts treatment failure in nasopharyngeal carcinoma. Radiat Oncol. 2023;18(1):33. https://doi.org/10.1186/s13014-023-02231-6.
Article CAS PubMed PubMed Central Google Scholar
Dmytriw AA, Ortega C, Anconina R, Metser U, Liu ZA, Liu Z, et al. Nasopharyngeal carcinoma radiomic evaluation with serial PET/CT: Exploring features predictive of survival in patients with long-term follow-up. Cancers (Basel). 2022;14(13):3105. https://doi.org/10.3390/cancers14133105.
Article CAS PubMed Google Scholar
Skrede OJ, De Raedt S, Kleppe A, Hveem TS, Liestøl K, Maddison J, et al. Deep learning for prediction of colorectal cancer outcome: A discovery and validation study. Lancet. 2020;395(10221):350–60. https://doi.org/10.1016/s0140-6736(19)32998-8.
Article CAS PubMed Google Scholar
Defeudis A, Mazzetti S, Panic J, Micilotta M, Vassallo L, Giannetto G, et al. MRI-based radiomics to predict response in locally advanced rectal cancer: Comparison of manual and automatic segmentation on external validation in a multicentre study. Eur Radiol Exp. 2022;6(1):19. https://doi.org/10.1186/s41747-022-00272-2.
Article PubMed PubMed Central Google Scholar
Lin YC, Lin CH, Lu HY, Chiang HJ, Wang HK, Huang YT, et al. Deep learning for fully automated tumor segmentation and extraction of magnetic resonance radiomics features in cervical cancer. Eur Radiol. 2020;30(3):1297–305. https://doi.org/10.1007/s00330-019-06467-3.
Article PubMed Google Scholar
Meng M, Bi L, Feng D, Kim J. Radiomics-enhanced deep multi-task learning for outcome prediction in head and neck cancer. Head and Neck Tumor Segmentation and Outcome Prediction; 2023. https://doi.org/10.1007/978-3-031-27420-6_14
Andrearczyk V, Oreiller V, Abobakr M, Akhavanallaf A, Balermpas P, Boughdad S, et al. Overview of the HECKTOR challenge at MICCAI 2022: Automatic head and neck tumor segmentation and outcome prediction in PET/CT. Head and Neck Tumor Segmentation and Outcome Prediction; 2023. https://doi.org/10.1007/978-3-031-27420-6_1

Download references

Funding

This work was funded by National Natural Science Foundation of China (Grant number 82272035), Special Clinical Research Project of Health Industry of Shanghai Municipal Health Commission (Grant number 20224Y0238), and Australian Research Council (Grant number DP200103748).

Author information

Bingxin Gu, Mingyuan Meng and Mingzhen Xu contributed equally to this work.

Authors and Affiliations

Department of Nuclear Medicine, Fudan University Shanghai Cancer Center, Shanghai, People’s Republic of China
Bingxin Gu, Mingzhen Xu & Shaoli Song
Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, People’s Republic of China
Bingxin Gu, Mingzhen Xu & Shaoli Song
Center for Biomedical Imaging, Fudan University, Shanghai, People’s Republic of China
Bingxin Gu, Mingzhen Xu & Shaoli Song
Shanghai Engineering Research Center of Molecular Imaging Probes, Shanghai, People’s Republic of China
Bingxin Gu, Mingzhen Xu & Shaoli Song
Key Laboratory of Nuclear Physics and Ion-Beam Application (MOE), Fudan University, Shanghai, People’s Republic of China
Bingxin Gu, Mingzhen Xu & Shaoli Song
School of Computer Science, the University of Sydney, Sydney, Australia
Mingyuan Meng, David Dagan Feng & Jinman Kim
Institute of Translational Medicine, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
Lei Bi

Authors

Bingxin Gu
View author publications
You can also search for this author in PubMed Google Scholar
Mingyuan Meng
View author publications
You can also search for this author in PubMed Google Scholar
Mingzhen Xu
View author publications
You can also search for this author in PubMed Google Scholar
David Dagan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Lei Bi
View author publications
You can also search for this author in PubMed Google Scholar
Jinman Kim
View author publications
You can also search for this author in PubMed Google Scholar
Shaoli Song
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shaoli Song.

Ethics declarations

Ethics approval

All procedures involving human participants were carried out in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. This article does not contain any experiments with animals.

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 883 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gu, B., Meng, M., Xu, M. et al. Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma. Eur J Nucl Med Mol Imaging 50, 3996–4009 (2023). https://doi.org/10.1007/s00259-023-06399-7

Download citation

Received: 01 June 2023
Accepted: 11 August 2023
Published: 19 August 2023
Issue Date: November 2023
DOI: https://doi.org/10.1007/s00259-023-06399-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Multi-task deep learning-based radiomic nomogram for prognostic prediction in locoregionally advanced nasopharyngeal carcinoma

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Computed tomography-based deep-learning prediction of induction chemotherapy treatment response in locally advanced nasopharyngeal carcinoma

Deep Learning and Radiomics Based PET/CT Image Feature Extraction from Auto Segmented Tumor Volumes for Recurrence-Free Survival Prediction in Oropharyngeal Cancer Patients

Deep learning signatures reveal multiscale intratumor heterogeneity associated with biological functions and survival in recurrent nasopharyngeal carcinoma

Introduction

Materials and methods

Patients

PET/CT imaging

Multi-task deep learning-based radiomics analysis

Statistical analysis

Results

Patient characteristics

Establishment of MTDLR nomogram

Performance of radiomic nomograms

Survival analysis for risk group stratification

Discussion

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Ethics approval

Consent to participate

Conflict of interest

Additional information

Publisher's note

Supplementary Information

Supplementary file1 (DOCX 883 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation