Oropharyngeal squamous cell carcinoma: radiomic machine-learning classifiers from multiparametric MR images for determination of HPV infection status

Suh, Chong Hyun; Lee, Kyung Hwa; Choi, Young Jun; Chung, Sae Rom; Baek, Jung Hwan; Lee, Jeong Hyun; Yun, Jihye; Ham, Sungwon; Kim, Namkug

doi:10.1038/s41598-020-74479-x

Article
Open access
Published: 16 October 2020

Volume 10, article number 17525, (2020)

Scientific Reports

Abstract

We investigated the ability of machine-learning classifiers on radiomics from pre-treatment multiparametric magnetic resonance imaging (MRI) to accurately predict human papillomavirus (HPV) status in patients with oropharyngeal squamous cell carcinoma (OPSCC). This retrospective study collected data of 60 patients (48 HPV-positive and 12 HPV-negative) with newly diagnosed histopathologically proved OPSCC, who underwent head and neck MRI consisting of axial T1WI, T2WI, CE-T1WI, and apparent diffusion coefficient (ADC) maps from diffusion-weighted imaging (DWI). The median age was 59 years (range: 35 to 85 years), and 83.3% of patients were male. The imaging data were randomised into a training set (32 HPV-positive and 8 HPV-negative OPSCC) and a test set (16 HPV-positive and 4 HPV-negative OPSCC) in each fold. In each sequence, 1618 quantitative features were extracted from manually delineated regions-of-interest of the primary tumour and one definite lymph node. After feature selection using the least absolute shrinkage and selection operator (LASSO), three different machine-learning classifiers (logistic regression, random forest, and XG boost) were trained and compared across various combinations of the four sequences. The highest diagnostic accuracies were achieved when using all sequences, and the difference was significant only when the combination did not include the ADC map. Using all sequences, logistic regression and the random forest classifier yielded higher accuracy than the XG boost classifier, with mean area under the curve (AUC) values of 0.77, 0.76, and 0.71, respectively. A machine-learning classifier built on a non-invasive, quantitative radiomics signature could guide the classification of HPV status.

Introduction

Human papillomavirus (HPV) status is a dependable and independent prognostic factor in patients with oropharyngeal squamous cell carcinoma (OPSCC). Patients with HPV-positive OPSCC have better survival rates than patients with HPV-negative OPSCC¹. Because of differences in oncogenesis, epidemiology, and prognosis, the eighth edition of the American Joint Committee on Cancer (AJCC) tumour-node-metastasis staging system classifies OPSCC into HPV-positive and HPV-negative tumours². Therefore, the preoperative differentiation between HPV-positive and HPV-negative OPSCC is critical for patient management as well as prognosis³.

The distinct oncogenesis of HPV-positive OPSCC results in characteristic histopathology^4,5, perfusion, and diffusion parameters, which are related to the angiogenesis and cellularity of the tumour. Several studies have reported diagnosis of the HPV status in patients with OPSCC using preoperative computed tomography (CT) or magnetic resonance (MR) imaging^6,7,8. HPV-positive OPSCC tends to exhibit cystic cervical lymph node metastasis^6,7,8 and primary tumours with well-defined borders and an exophytic appearance⁷. Recent studies reported that diffusion-weighted imaging (DWI) may help predict HPV status in patients with OPSCC, as HPV-positive OPSCC shows a lower mean apparent diffusion coefficient (ADC) than HPV-negative OPSCC^9,10,11. Furthermore, a histogram analysis based on dynamic contrast-enhanced MR imaging showed significantly higher K_ep kurtosis values and lower V_e min values in patients with p16-positive OPSCC¹². Several recent studies have addressed the prediction of HPV status using a CT-based radiomics approach; however, their diagnostic performance was moderate (area under the curve [AUC], 0.75–0.80)^13,14,15. To date, no studies have reported on the application of radiomic machine-learning classifiers to multiparametric MR images to predict HPV status in patients with OPSCC. Therefore, we hypothesised that pre-treatment multiparametric MR imaging combined with DWI could accurately predict HPV status in patients with OPSCC using radiomic machine-learning classifiers.

Results

Study population and imaging dataset

Of the 70 consecutive patients with OPSCC, 10 were excluded owing to unknown HPV status (n = 4), post-treatment MR images (n = 4), and loss of MR image data (n = 2). Finally, 60 consecutive patients with OPSCC were enrolled in this study (Table 1). Forty-eight patients (80%) were HPV-positive, and 12 patients (20%) had HPV-negative OPSCCs. The median age was 59 years (range: 35 to 85 years), and 83.3% of the patients were male. The imaging data were randomised into a training set (40 MR images containing 32 HPV-positive and 8 HPV-negative OPSCC) and a test set (20 MR images containing 16 HPV-positive and 4 HPV-negative OPSCC) in each fold.

Table 1 Baseline characteristics of the included patients.

Selected features

The study design is shown in Fig. 1. Linear regression with the least absolute shrinkage and selection operator (LASSO) penalty was performed in each cross-validation fold. The average number of selected features with the best classification performance was 221, using four MR sequences, namely, the axial T1-weighted imaging (T1WI), fat-suppressed T2-weighted imaging (T2WI), axial fat-suppressed contrast-enhanced T1-weighted imaging (CE-T1WI), and ADC maps from DWI. Table 2 shows the seven top-performing features, which were sorted based on the frequency of selection in the 60 experiments multiplied by the sum of the LASSO coefficients (weights) in each validation. Of these seven features, six were extracted from ADC maps and one from the T1WI sequence. Four of these features were wavelet-transformed features. Supplementary Figure 1 illustrates the different ranges of the seven features for HPV-positive and HPV-negative cases in the whole dataset. Six out of seven features exhibited statistically significant differences between the two groups. Figure 2 shows an example of the original ADC map and its wavelet-transformed images of ‘LLL’ and ‘HLH’, where the features with the highest values of the sum of LASSO coefficients are found. The top five selected features from each sequence and from their various combinations are listed in Supplementary Table 1. In the additional experiment comparing features extracted from primary tumour (T) and nodal (N) volumes delineated on ADC maps, four out of the five top-performing features from T volumes exhibited significant differences between the HPV-positive and HPV-negative groups, whereas the features extracted from N volumes did not exhibit significant differences (Supplementary Figure 2).
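
The ranking rule described above (selection frequency multiplied by the summed LASSO weights) can be sketched as follows. This is only an illustration under stated assumptions: the function name `rank_features`, the feature names, and the coefficient values are hypothetical, and summing coefficient magnitudes (rather than signed values) is an assumption, since the text does not specify it.

```python
from collections import defaultdict

def rank_features(fold_coefficients):
    """Rank features by (selection count) x (summed |coefficient|).

    fold_coefficients: one dict per cross-validation experiment, mapping
    feature name -> LASSO coefficient. A coefficient of 0 means the feature
    was not selected in that experiment.
    """
    counts = defaultdict(int)
    weight_sums = defaultdict(float)
    for coefs in fold_coefficients:
        for name, coef in coefs.items():
            if coef != 0.0:
                counts[name] += 1
                weight_sums[name] += abs(coef)  # assumption: magnitudes are summed
    scores = {name: counts[name] * weight_sums[name] for name in counts}
    # Highest score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical coefficients from three experiments (not study data).
example = [
    {"ADC_LLL_mean": 0.5, "T1_entropy": 0.25, "T2_energy": 0.0},
    {"ADC_LLL_mean": 0.25, "T1_entropy": 0.0, "T2_energy": 0.0},
    {"ADC_LLL_mean": 0.25, "T1_entropy": -0.25, "T2_energy": 0.125},
]
ranking = rank_features(example)
```

In this toy input, the feature selected in every experiment with the largest weights ranks first, which mirrors how frequently selected ADC features would rise to the top of such a list.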

Table 2 Top 7 features from four MR sequences.

Comparing accuracies between sequences

The overall accuracy increased when another MR sequence was added, regardless of the type of classifier. Table 3 lists the mean AUCs with standard deviations of each sequence and their combinations obtained by three different classifiers. The highest accuracy was achieved using four MR sequences. When each combination was compared against all sequences as the reference for each classifier, the inclusion of all sequences yielded significantly superior performance to that obtained using three or fewer sequences only when the combination did not include the ADC map. With the random forest and XG boost classifiers, there were no significant differences between using three or fewer sequences including the ADC map and using all sequences.

Table 3 Classification accuracies between various combinations of sequences.

Comparing accuracies between machine-learning classifiers

The mean AUCs of logistic regression, random forest, and XG boost classifier were 0.77 ± 0.12 (95% confidence interval [CI] 0.50 to 0.96), 0.76 ± 0.12 (95% CI 0.47 to 0.97), and 0.71 ± 0.12 (95% CI 0.50 to 0.93), respectively, when using selected features from all sequences (Fig. 3). The logistic regression classifier yielded the highest value of the mean AUC, which was not significantly superior to that exhibited by the random forest classifier (P value = 0.338), while demonstrating performance superior to that of the XG boost classifier (P value = 0.009). The average sensitivity and specificity were 0.71 (95% CI 0.31 to 0.97) and 0.72 (95% CI 0.50 to 1.00) in the logistic regression classifier, 0.70 (95% CI 0.33 to 0.93) and 0.72 (95% CI 0.50 to 1.00) in the random forest classifier, and 0.62 (95% CI 0.21 to 0.90) and 0.65 (95% CI 0.25 to 1.00) in the XG boost classifier, respectively, as shown in Table 4.

Table 4 Results of the ROC curve analysis of 3 models.

Discussion

In the present study, we extracted quantitative image features from multiparametric MR sequences in OPSCC patients and developed machine-learning classifiers following a feature reduction to identify the HPV infection status. Our results show that the logistic regression classifier (0.77 ± 0.12) and the random forest classifier (0.76 ± 0.12) demonstrate higher values of the mean AUC compared with those exhibited by the XG boost classifiers (0.71 ± 0.12). The average sensitivity and specificity in the logistic regression classifier were 0.71 and 0.72, respectively. This radiomic signature of HPV status can be used to develop non-invasive tools for discriminating OPSCC patients.

Increasing evidence suggests that radiomics, a method that non-invasively extracts quantitative information from medical images, can be used to characterize intra-tumoral heterogeneity^16,17,18,19. Previous exploratory studies indicate a correlation between the HPV infection status and CT-based radiomic signature in head and neck squamous cell carcinoma (HNSCC)^13,14,15,20. These studies reported AUC values that ranged from 0.70 to 0.86. Although most radiomics studies for classifying the HPV status are based on CT, Ravanelli et al. investigated the correlation between MR imaging texture features and HPV status in OPSCC⁹. The authors developed a simple predictive model based on mean ADC values and smoking status that yielded an AUC of 0.944. In the present study, we developed a tool for classifying the HPV status using radiomic features from multiparametric MR images and machine-learning classifiers with an AUC of 0.77.

Recent studies have addressed whether the ADC-histogram analysis can be used to identify different histopathological features in HNSCC^9,21,22. According to de Perrot et al., diffusion phenotypes based on the histogram analysis of ADC values reflect distinct degrees of tumour heterogeneity in HPV-positive and HPV-negative HNSCCs²¹. It has been shown that the mean and median ADCs are significantly lower, whereas excess kurtosis and skewness are significantly higher in HPV-positive tumours than in HPV-negative tumours. In their study, HPV-positive tumours exhibit leptokurtic right-skewed histograms, which correspond to homogeneous tumours with densely packed cells, a scant stromal component, and scattered comedonecrosis. Meanwhile, HPV-negative tumours exhibit symmetric normally distributed ADC histograms, which correspond to heterogeneous tumours with variable cellularity, a high stromal component, keratin pearls, and necrosis. Meyer et al. investigated the correlation of ADC values with prognostically relevant histopathologic parameters, including the expression of Hif1-alpha, VEGF, EGFR, p53, p16, and Her 2²². They found that ADC histogram reflects different histopathological features in HNSCC, and associations between ADC histogram parameters and histopathology depend on the p16 status. In this study, features extracted from ADC maps were attributed the highest weight after LASSO regression, and they were mostly included in the top-performing features.

Recent studies found that the radiomics signature from multiparametric MR images achieved higher prognostic accuracies compared with a single MR sequence^23,24,25,26. In the present study, using four MR sequences yielded the highest classification accuracy. However, the difference between using four sequences and three or fewer sequences was significant only in cases not including ADC maps. The selected features after LASSO regression from four MR sequences included features from all MR sequences, whereas features from the ADC map comprised a large proportion of the top-performing features. Considering the small sample size and the imbalance of HPV status in this study, further studies might be needed to confirm whether combining multiple MR sequences enables the detection of more detailed differences between HPV-positive and HPV-negative tumours.

Machine-learning models have rapidly improved in the past few years. Radiomics is an emerging field for machine learning that allows the conversion of radiologic images into mineable high-dimensional data^24,27,28,29,30. Only a few studies have investigated the effect of different feature selection and machine-learning classification methods on radiomic features^27,30. In these studies, the random forest classifier had the highest performance for differentiating cancers from benign tumours. Further, Parmar et al. observed that a generalised linear model exhibits a high prognostic performance in HNSCC and non-small-cell lung cancer types, whereas it shows low stability for HNSCC²⁷. The present study compared three machine-learning classifiers: the logistic regression, random forest, and XG boost models. The logistic regression classifier and random forest classifier demonstrated performance superior to that of the XG boost classifier. The most plausible reason is that the final selected features are highly discriminative in their classification of HPV status, which proves to be most suitable for the logistic regression classifier. However, considering that logistic regression models generally perform better for smaller data sets, compared with tree induction models, and are prone to overfitting^31,32, further validation with large samples might be needed.

Our study has several limitations. First, it is a retrospective study performed on a relatively small sample with a highly imbalanced dataset for machine-learning (n = 60). Repeated cross-validation and feature selection using the LASSO regression were applied to mitigate the risk of overfitting in this situation. Second, it remains to be validated whether our radiomics signature can be applied to different MR systems, imaging protocols, and software platforms. Therefore, multi-centre studies with large samples and a prospective study design are required to evaluate the true predictive value of the radiomics signature. Third, the regions-of-interest (ROIs) in the tumours were manually delineated based on ADC maps, which tend to be affected by movement artefacts such as breathing and swallowing, along with frequent susceptibility artefacts from the air-tissue interface. Furthermore, the stability analysis, i.e., assessing the robustness of the features, was not properly conducted. To achieve optimal feature selection, the slightly better performing feature can be selected from various kinds of similar features via the wavelet transform, which could lead to low reproducibility of wavelet features. Therefore, the stability and reproducibility of selected features must be investigated in further studies.

In conclusion, the present study developed radiomic machine-learning classifiers from multiparametric MR images for the determination of the HPV status in patients with OPSCC. Our results show that the logistic regression and random forest classifiers, applied after LASSO-based feature selection from MR images including T1WI, T2WI, CE-T1WI, and ADC maps, exhibit the highest classification accuracy; furthermore, features selected from the ADC map were crucial in classifying the HPV status. This method explores the integration of anatomical and multiparametric MRI radiomics into clinical models, which might have a significant impact on MR-guided radiotherapy for head and neck cancers.

Materials and methods

This study was approved by the institutional review board of Asan Medical Center (tertiary referral center). The local ethics committee, the institutional review board of Asan Medical Center, waived the requirement for written informed consent owing to the retrospective nature of the study. We reported our results according to the standards for reporting of diagnostic accuracy studies (STARD) 2015 guidelines³³ and the strengthening the reporting of observational studies in epidemiology (STROBE) guidelines³⁴.

Study population

We enrolled consecutive patients with newly diagnosed histopathologically proved OPSCC, who were examined by head and neck MR imaging between April 2012 and November 2017. The eligibility criteria were as follows: (a) patients diagnosed by histopathology with a pre-treatment OPSCC, (b) patients with known HPV status, (c) patients who were examined by head and neck MR imaging including DWI, and (d) patients that were > 20 years old. Patients who had received chemotherapy, radiation therapy, or excisional biopsy prior to the MR imaging were excluded.

Analysis of HPV status

All analyses of the HPV status were performed by the pathology division of our institution without prior knowledge of the MR imaging results. P16 immunohistochemistry or HPV DNA detection by polymerase chain reaction (PCR) was used as the reference standard^35,36. P16 immunohistochemistry was performed using CINtec p16 histology (anti-p16^INK4a mouse monoclonal antibody and immunohistochemical detection kit; Roche MTM Laboratories, Heidelberg, Germany) and HPV DNA detection was performed by PCR/DNA chip scanning (high-risk subtypes of 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 68, 73, 82, and other lower or undetermined risk subtypes)³⁷. HPV-positive OPSCC was diagnosed based on the positive results of either p16 or HPV DNA PCR³⁸.

MR acquisition protocol

Head and neck MR imaging was conducted using a 3-T scanner with a 64-channel coil (Skyra, Siemens Healthcare). The MR imaging protocol was as follows. To obtain CE-T1WI, an intravenous dose of 0.1 mmol/kg of the contrast agent gadoterate meglumine (Dotarem; Guerbet, Paris, France) was injected into the patient. DWI was conducted using multi-shot read-out-segmented echo-planar imaging in the axial plane before the injection. The detailed DWI sequence parameters were as follows: repetition time/echo time, 5450/62 ms; b values of 0 and 1000 s/mm²; section thickness of 4 mm; no gap; field of view of 192 × 192 mm²; and acquisition time of approximately 5 min. The ADC maps were obtained automatically within the manufacturer console. Imaging data were de-identified in accordance with the Health Insurance Portability and Accountability Act privacy rule.
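
The console's ADC computation is not described in the text. As a hedged illustration only, the standard two-point mono-exponential model, S(b) = S0 · exp(−b · ADC), gives the per-voxel ADC from the two acquired b values (0 and 1000 s/mm²) as sketched below. The function name `adc_two_point` and the signal values are hypothetical.

```python
import math

def adc_two_point(s_low, s_high, b_low=0.0, b_high=1000.0):
    """Return ADC in mm^2/s from two signals, assuming S(b) = S0 * exp(-b * ADC).

    s_low and s_high are the voxel signals at b_low and b_high (in s/mm^2).
    """
    if s_low <= 0 or s_high <= 0:
        raise ValueError("signals must be positive")
    return math.log(s_low / s_high) / (b_high - b_low)

# Hypothetical voxel whose signal falls to exp(-1) of its b = 0 value at b = 1000 s/mm^2.
adc = adc_two_point(1000.0, 1000.0 * math.exp(-1.0))
```

For this hypothetical voxel the result is approximately 1.0 × 10⁻³ mm²/s; lower values indicate more restricted diffusion.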

Image segmentation and pre-processing

Figure 1 depicts the overall workflow. First, 3D ROIs for contrast-enhanced portions were manually segmented by two neuroradiologists (with 6 and 13 years of experience in neuroradiology) on ADC maps for the primary tumour, while also considering T2WI and CE-T1WI MR sequences during the segmentation. One definite pathologically proven malignant lymph node was manually segmented on the T2WI sequence, while also considering CE-T1WI MR sequences. We employed the medical imaging interaction toolkit (MITK) software platform (https://www.mitk.org, German Cancer Research Center, Heidelberg, Germany)³⁹. Both the primary tumour and lymph node volumes belonged to the same patient. T1WI, T2WI, CE-T1WI, and ADC maps were co-registered with SPM software (https://www.fil.ion.ucl.ac.uk/spm/), using affine transformation with normalized mutual information as a cost function, with 12 degrees of freedom and tri-linear interpolation⁴⁰. The original ROIs were co-registered on the T1WI, T2WI, and CE-T1WI for the tumour and on the T1WI, CE-T1WI, and ADC maps for the lymph node, then manually adjusted to suit each sequence. All MR images were resampled into isotropic voxels of size 1 × 1 × 1 mm³ as input data. Field inhomogeneity of MR images was corrected using the N4ITK algorithm⁴¹. To ensure a fair comparison of the extracted features across all patients, intensity normalization was conducted for T1WI, T2WI, and CE-T1WI sequences.

Radiomic feature extraction

From the segmented mask, a total of 1618 radiomic features were extracted using MATLAB R2015a (MathWorks Inc., Natick, MA), following an approach similar to that of a previous study by Yun et al.⁴² at the same institution. For the texture features, the range of mean ± 3 standard deviations of the intensities was quantized into 32 density bin levels. The features included seven shape and volume features, 17 first-order features, 162 texture features, and 1432 wavelet features (Supplementary Table 2). First-order features were derived from the intensity histograms using first-order statistics, including the intensity range, energy, entropy, kurtosis, maximum, mean, median, uniformity, and variance. Texture features were obtained from a grey-level co-occurrence matrix (GLCM) and a grey-level run-length matrix (GLRLM) using the segmented mask in 13 directions in 3D space⁴³. For the GLCM analyses, texture features were computed for varying distances of 1, 2, and 3 voxels in 13 directions. Then, a single-level directional discrete wavelet transformation was applied with a high-pass and a low-pass filter⁴⁴. In total, eight wavelet-decomposition images were generated from each MR sequence input: LLL, HLL, LHL, HHL, LLH, HLH, LHH, and HHH images, where ‘L’ denotes the low-pass filter and ‘H’ denotes the high-pass filter. The first-order and texture features were then computed on each wavelet-transformed image: (17 first-order features + 162 texture features) × 8 images, yielding 1432 wavelet features.
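
The bookkeeping in this paragraph can be checked with a short sketch. It enumerates the eight wavelet sub-bands, reproduces the feature counts, lists the 13 unique 3D directions, and shows one possible reading of the 32-level quantization. The original extraction was done in MATLAB; everything below is illustrative Python, and the `quantise` helper (including its clipping of values outside mean ± 3 SD and its use of the population SD) is an assumption rather than the authors' implementation.

```python
import statistics
from itertools import product

# Eight sub-bands of a single-level 3D wavelet decomposition:
# one low-pass (L) or high-pass (H) filter per axis.
subbands = ["".join(p) for p in product("LH", repeat=3)]

# Feature counts as reported in the text.
n_shape, n_first_order, n_texture = 7, 17, 162
n_wavelet = (n_first_order + n_texture) * len(subbands)
n_total = n_shape + n_first_order + n_texture + n_wavelet

# 13 unique directions in 3D: a voxel has 26 neighbours, and each direction
# and its opposite describe the same axis, so one of each pair is kept.
offsets = [d for d in product((-1, 0, 1), repeat=3) if d > (0, 0, 0)]

def quantise(values, n_bins=32):
    """Map intensities to integer bins 0..n_bins-1 spanning mean +/- 3 SD.

    Assumptions: population SD is used, and values outside the range are
    clipped into the first or last bin.
    """
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    low, high = mean - 3 * sd, mean + 3 * sd
    width = (high - low) / n_bins
    if width == 0:
        return [0] * len(values)
    return [min(n_bins - 1, max(0, int((v - low) / width))) for v in values]

# Hypothetical intensities (not study data).
bins = quantise([float(v) for v in range(100)])
```

The totals agree with the text: 179 features per sub-band times 8 sub-bands gives 1432 wavelet features, and adding the 186 non-wavelet features gives 1618.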

Feature selection and classification

The extracted features may be noisy or highly correlated with each other; therefore, feature selection is required to increase the prediction accuracy and minimise computational cost⁴⁵. To reduce over-fitting or any type of bias in our radiomics model, LASSO-penalized linear regression was applied to the training data. Before feature selection, all radiomics features were z-score transformed, i.e., centred and scaled to a mean of zero and a standard deviation of one. A model consisting of a linear combination of the selected features, weighted by their respective coefficients, was used to estimate the HPV status. LASSO regression was implemented using Python (Python Software Foundation, version 3.5.2) with the Scikit-learn package (https://github.com/scikit-learn/scikit-learn)⁴⁶. Features with larger contributions to the model were selected.
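
A minimal sketch of the z-score step is given below, written so that the mean and standard deviation are estimated on the training rows only and then reused for held-out rows, in line with the statement that the procedures were executed on the training data within each fold. The helper names `fit_zscore` and `apply_zscore` and the toy values are hypothetical; the use of the population standard deviation and the mapping of constant features to zero are assumptions.

```python
import statistics

def fit_zscore(train_rows):
    """Return per-feature (means, sds) computed from the training rows only."""
    columns = list(zip(*train_rows))
    means = [statistics.fmean(c) for c in columns]
    sds = [statistics.pstdev(c) for c in columns]  # assumption: population SD
    return means, sds

def apply_zscore(rows, means, sds):
    """Standardise rows with previously fitted statistics.

    Constant features (sd == 0) are mapped to 0.0 to avoid division by zero.
    """
    return [
        [(x - m) / s if s > 0 else 0.0 for x, m, s in zip(row, means, sds)]
        for row in rows
    ]

# Hypothetical feature matrix: rows are patients, columns are features.
train = [[1.0, 10.0], [2.0, 10.0], [3.0, 10.0]]
test = [[4.0, 12.0]]

means, sds = fit_zscore(train)
train_z = apply_zscore(train, means, sds)
test_z = apply_zscore(test, means, sds)  # test rows never influence the statistics
```

Fitting the statistics on the training rows alone keeps information from the held-out fold out of both the scaling and the subsequent feature selection.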

Three different machine-learning classifiers were applied: logistic regression, random forest⁴⁷ using the Scikit-learn package, and XG boost⁴⁸ using the Xgboost package (https://github.com/dmlc/xgboost). The algorithms were selected based on their high performance and readiness for application. The three models were computed and compared to determine the best combination for determining the HPV status in the data set. The models were developed separately for each of the T1WI, T2WI, CE-T1WI, and ADC maps, as well as various combinations of these sequences. The feature selector and each classifier were trained with a stratified threefold cross-validation procedure repeated 20 times, which yields up to 60 repetitions of the experiment for each model. All possible combinations of hyperparameters were investigated by grid search using GridSearchCV in the Scikit-learn package (Supplementary Table 3). The procedures, namely z-normalization of the extracted features, feature reduction using LASSO regression, and machine-learning classification, were executed separately on the training data during each cross-validation fold.
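
The splitting scheme can be illustrated with a dependency-free sketch. With 48 HPV-positive and 12 HPV-negative cases, stratified threefold splitting gives test folds of 16 + 4 cases and training folds of 32 + 8 cases, and 20 repeats give 60 experiments. The study itself used the Scikit-learn package; the generator `repeated_stratified_kfold` below is a hypothetical stand-in written for illustration, and the seed is arbitrary.

```python
import random

def repeated_stratified_kfold(labels, n_splits=3, n_repeats=20, seed=0):
    """Yield (train_idx, test_idx) pairs with each class spread evenly over folds."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    for _ in range(n_repeats):
        folds = [[] for _ in range(n_splits)]
        for indices in by_class.values():
            shuffled = indices[:]
            rng.shuffle(shuffled)
            # Deal the shuffled cases of this class round-robin into the folds.
            for position, idx in enumerate(shuffled):
                folds[position % n_splits].append(idx)
        for k in range(n_splits):
            test_idx = sorted(folds[k])
            train_idx = sorted(
                i for j, fold in enumerate(folds) if j != k for i in fold
            )
            yield train_idx, test_idx

# Class sizes from the study: 48 HPV-positive (1) and 12 HPV-negative (0).
labels = [1] * 48 + [0] * 12
splits = list(repeated_stratified_kfold(labels))
```

Within each of these 60 splits, scaling, feature selection, and hyperparameter search would be fitted on `train_idx` only and then evaluated on `test_idx`.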

Statistical analysis

The Mann–Whitney U test was used to estimate the relationship between selected radiomic signatures and HPV status, and to compare accuracies between various combinations of MR sequences in a pairwise manner⁴⁹. AUCs were used to determine the diagnostic performance, with optimal thresholds determined by maximizing the Youden index, i.e., sensitivity + specificity − 1.
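
As a hedged illustration of these two quantities, the sketch below computes the AUC through its known equivalence to the Mann–Whitney U statistic divided by the product of the group sizes, and selects the threshold that maximises the Youden index J = sensitivity + specificity − 1. The function names, the tie-breaking rule (the lowest maximising threshold is kept), and the scores are hypothetical and are not study data.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a win; this equals U / (n_pos * n_neg).
    """
    wins = sum((p > n) + 0.5 * (p == n) for p in scores_pos for n in scores_neg)
    return wins / (len(scores_pos) * len(scores_neg))

def youden_threshold(scores_pos, scores_neg):
    """Return (threshold, sensitivity, specificity) maximising J = sens + spec - 1.

    A case is called positive when its score is >= threshold.
    """
    best = None
    for t in sorted(set(scores_pos) | set(scores_neg)):
        sens = sum(s >= t for s in scores_pos) / len(scores_pos)
        spec = sum(s < t for s in scores_neg) / len(scores_neg)
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, t, sens, spec)
    return best[1:]

# Hypothetical classifier outputs for four positive and four negative cases.
pos = [0.9, 0.8, 0.7, 0.4]
neg = [0.6, 0.3, 0.2, 0.1]

auc = roc_auc(pos, neg)
threshold, sensitivity, specificity = youden_threshold(pos, neg)
```

With only four negative cases per test fold, as in this study's splits, specificity can take just five distinct values, which helps explain the wide confidence intervals reported for it.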

Data availability

The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.

References

Ang, K. K. et al. Human papillomavirus and survival of patients with oropharyngeal cancer. N. Engl. J. Med. 363, 24–35. https://doi.org/10.1056/NEJMoa0912217 (2010).
Amin, M. B. et al. AJCC Cancer Staging Manual 8th edn. (Springer, New York, 2017).
National Comprehensive Cancer Network. Clinical practice guidelines in oncology for head and neck cancers V.3.2019. 2019. https://www.nccn.org. Accessed 28 Jan 2020.
Troy, J. D. et al. Expression of EGFR, VEGF, and NOTCH1 suggest differences in tumor angiogenesis in HPV-positive and HPV-negative head and neck squamous cell carcinoma. Head Neck Pathol. 7, 344–355. https://doi.org/10.1007/s12105-013-0447-y (2013).
Mungai, F. et al. CT assessment of tumor heterogeneity and the potential for the prediction of human papillomavirus status in oropharyngeal squamous cell carcinoma. Radiol. Med. 124, 804–811. https://doi.org/10.1007/s11547-019-01028-6 (2019).
Goldenberg, D. et al. Cystic lymph node metastasis in patients with head and neck cancer: An HPV-associated phenomenon. Head Neck 30, 898–903. https://doi.org/10.1002/hed.20796 (2008).
Chan, M. W. et al. Morphologic and topographic radiologic features of human papillomavirus-related and -unrelated oropharyngeal carcinoma. Head Neck 39, 1524–1534. https://doi.org/10.1002/hed.24764 (2017).
Huang, Y. H. et al. Cystic nodal metastasis in patients with oropharyngeal squamous cell carcinoma receiving chemoradiotherapy: Relationship with human papillomavirus status and failure patterns. PLoS ONE 12, e0180779. https://doi.org/10.1371/journal.pone.0180779 (2017).
Ravanelli, M. et al. Correlation between human papillomavirus status and quantitative MR imaging parameters including diffusion-weighted imaging and texture features in oropharyngeal carcinoma. AJNR Am. J. Neuroradiol. 39, 1878–1883. https://doi.org/10.3174/ajnr.A5792 (2018).
Chan, M. W. et al. Radiologic differences between human papillomavirus-related and human papillomavirus-unrelated oropharyngeal carcinoma on diffusion-weighted imaging. ORL J. Oto-rhino-laryngol. Relat. Specialties 78, 344–352. https://doi.org/10.1159/000458446 (2016).
Payabvash, S., Chan, A., Jabehdar Maralani, P. & Malhotra, A. Quantitative diffusion magnetic resonance imaging for prediction of human papillomavirus status in head and neck squamous-cell carcinoma: A systematic review and meta-analysis. Neuroradiol. J. 32, 232–240. https://doi.org/10.1177/1971400919849808 (2019).
Meyer, H. J., Leifels, L., Hamerla, G., Hohn, A. K. & Surov, A. Associations between histogram analysis parameters derived from DCE-MRI and histopathological features including expression of EGFR, p16, VEGF, Hif1-alpha, and p53 in HNSCC. Contrast Media Mol. Imaging 2019, 5081909. https://doi.org/10.1155/2019/5081909 (2019).
Bogowicz, M. et al. Computed tomography radiomics predicts HPV status and local tumor control after definitive radiochemotherapy in head and neck squamous cell carcinoma. Int. J. Radiat. Oncol. Biol. Phys. 99, 921–928. https://doi.org/10.1016/j.ijrobp.2017.06.002 (2017).
Yu, K. et al. Radiomic analysis in prediction of human papilloma virus status. Clin. Transl. Radiat. Oncol. 7, 49–54. https://doi.org/10.1016/j.ctro.2017.10.001 (2017).
Leijenaar, R. T. et al. Development and validation of a radiomic signature to predict HPV (p16) status from standard CT imaging: A multicenter study. Br. J. Radiol. 91, 20170498. https://doi.org/10.1259/bjr.20170498 (2018).
Parmar, C. et al. Radiomic machine-learning classifiers for prognostic biomarkers of head and neck cancer. Front. Oncol. 5, 272. https://doi.org/10.3389/fonc.2015.00272 (2015).
Wu, X. et al. Differentiation of diffuse large B-cell lymphoma from follicular lymphoma using texture analysis on conventional MR images at 3.0 Tesla. Acad. Radiol. 23, 696–703. https://doi.org/10.1016/j.acra.2016.01.012 (2016).
Zhou, Y. et al. CT-based radiomics signature: A potential biomarker for preoperative prediction of early recurrence in hepatocellular carcinoma. Abdom. Radiol. 42, 1695–1704. https://doi.org/10.1007/s00261-017-1072-0 (2017).
Wang, G. et al. Pretreatment MR imaging radiomics signatures for response prediction to induction chemotherapy in patients with nasopharyngeal carcinoma. Eur. J. Radiol. 98, 100–106. https://doi.org/10.1016/j.ejrad.2017.11.007 (2018).
Buch, K. et al. Using texture analysis to determine human papillomavirus status of oropharyngeal squamous cell carcinomas on CT. AJNR Am. J. Neuroradiol. 36, 1343–1348. https://doi.org/10.3174/ajnr.A4285 (2015).
de Perrot, T. et al. Apparent diffusion coefficient histograms of human papillomavirus-positive and human papillomavirus-negative head and neck squamous cell carcinoma: Assessment of tumor heterogeneity and comparison with histopathology. AJNR Am. J. Neuroradiol. 38, 2153–2160. https://doi.org/10.3174/ajnr.A5370 (2017).
Meyer, H. J., Leifels, L., Hamerla, G., Hohn, A. K. & Surov, A. ADC-histogram analysis in head and neck squamous cell carcinoma. Associations with different histopathological features including expression of EGFR, VEGF, HIF-1alpha, Her 2 and p53. A preliminary study. Magn. Reson. Imaging 54, 214–217. https://doi.org/10.1016/j.mri.2018.07.013 (2018).
Dang, M. et al. MRI texture analysis predicts p53 status in head and neck squamous cell carcinoma. AJNR Am. J. Neuroradiol. 36, 166–170. https://doi.org/10.3174/ajnr.A4110 (2015).
Parekh, V. S. & Jacobs, M. A. Integrated radiomic framework for breast cancer and tumor biology using advanced machine learning and multiparametric MRI. NPJ. Breast Cancer 3, 43. https://doi.org/10.1038/s41523-017-0045-3 (2017).
Ren, J. et al. Magnetic resonance imaging based radiomics signature for the preoperative discrimination of stage I-II and III-IV head and neck squamous cell carcinoma. Eur. J. Radiol. 106, 1–6. https://doi.org/10.1016/j.ejrad.2018.07.002 (2018).
Liu, Z. et al. Radiomics of multiparametric MRI for pretreatment prediction of pathologic complete response to neoadjuvant chemotherapy in breast cancer: A multicenter study. Clin. Cancer Res. 25, 3538–3547. https://doi.org/10.1158/1078-0432.CCR-18-3190 (2019).
Parmar, C., Grossmann, P., Bussink, J., Lambin, P. & Aerts, H. Machine learning methods for quantitative radiomic biomarkers. Sci. Rep. 5, 13087. https://doi.org/10.1038/srep13087 (2015).
Choy, G. et al. Current applications and future impact of machine learning in radiology. Radiology 288, 318–328. https://doi.org/10.1148/radiol.2018171820 (2018).
Giger, M. L. Machine learning in medical imaging. J. Am. Coll. Radiol. 15, 512–520. https://doi.org/10.1016/j.jacr.2017.12.028 (2018).
Giraud, P. et al. Radiomics and machine learning for radiotherapy in head and neck cancers. Front. Oncol. 9, 174. https://doi.org/10.3389/fonc.2019.00174 (2019).
Perlich, C., Provost, F. & Simonoff, J. Tree induction vs. logistic regression: A learning-curve analysis. J. Mach. Learn. Res. 4, 211–255. https://doi.org/10.1162/153244304322972694 (2003).
Garcia-Magarinos, M., Lopez-de-Ullibarri, I., Cao, R. & Salas, A. Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction. Ann. Hum. Genet. 73, 360–369. https://doi.org/10.1111/j.1469-1809.2009.00511.x (2009).
Bossuyt, P. M. et al. STARD 2015: An updated list of essential items for reporting diagnostic accuracy studies. Radiology 277, 826–832. https://doi.org/10.1148/radiol.2015151516 (2015).
Vandenbroucke, J. P. et al. Strengthening the reporting of observational studies in epidemiology (STROBE): Explanation and elaboration. PLoS Med. 4, e297. https://doi.org/10.1371/journal.pmed.0040297 (2007).
Jordan, R. C. et al. Validation of methods for oropharyngeal cancer HPV status determination in US cooperative group trials. Am. J. Surg. Pathol. 36, 945–954. https://doi.org/10.1097/PAS.0b013e318253a2d1 (2012).
Cantley, R. L. et al. Ancillary studies in determining human papillomavirus status of squamous cell carcinoma of the oropharynx: A review. Pathol. Res. Int. 2011, 138469. https://doi.org/10.4061/2011/138469 (2011).
Lee, B. et al. Prognostic value of radiologic extranodal extension in human papillomavirus-related oropharyngeal squamous cell carcinoma. Korean J. Radiol. 20, 1266–1274. https://doi.org/10.3348/kjr.2018.0742 (2019).
Lee, S. et al. Refining prognostic stratification of human papillomavirus-related oropharyngeal squamous cell carcinoma: Different prognosis between T1 and T2. Radiat. Oncol. J. 35, 233–240. https://doi.org/10.3857/roj.2017.00465 (2017).
Nolden, M. et al. The Medical Imaging Interaction Toolkit: Challenges and advances: 10 years of open-source development. Int. J. Comput. Assist. Radiol. Surg. 8, 607–620. https://doi.org/10.1007/s11548-013-0840-8 (2013).
Maes, F., Collignon, A., Vandermeulen, D., Marchal, G. & Suetens, P. Multimodality image registration by maximization of mutual information. IEEE Trans. Med. Imaging 16, 187–198. https://doi.org/10.1109/42.563664 (1997).
Tustison, N. J. et al. N4ITK: Improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320. https://doi.org/10.1109/TMI.2010.2046908 (2010).
Yun, J. et al. Radiomic features and multilayer perceptron network classifier: A robust MRI classification strategy for distinguishing glioblastoma from primary central nervous system lymphoma. Sci. Rep. 9, 5746. https://doi.org/10.1038/s41598-019-42276-w (2019).
Materka, A. & Strzelecki, M. Texture Analysis Methods – A Review. COST B11 report (1998).
Wang, J. Z. Wavelets and imaging informatics: A review of the literature. J. Biomed. Inform. 34, 129–141. https://doi.org/10.1006/jbin.2001.1010 (2001).
Zhang, Y., Oikonomou, A., Wong, A., Haider, M. A. & Khalvati, F. Radiomics-based prognosis analysis for non-small cell lung cancer. Sci. Rep. 7, 46349. https://doi.org/10.1038/srep46349 (2017).
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Sheridan, R. P., Wang, M., Liaw, A., Ma, J. & Gifford, E. Correction to extreme gradient boosting as a method for quantitative structure–activity relationships. J. Chem. Inf. Model. https://doi.org/10.1021/acs.jcim.0c00029 (2020).
Mann-Whitney U Test. The Corsini Encyclopedia of Psychology, 1–1

Acknowledgements

This study was supported by grant no. 2018-094 from the Asan Institute for Life Sciences, Asan Medical Center, Seoul, Korea, and by the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI18C2383). The illustration in Fig. 1 was drawn by Minkyeong Kim.

Author information

These authors contributed equally: Chong Hyun Suh and Kyung Hwa Lee.

Authors and Affiliations

Department of Radiology and Research Institute of Radiology, University of Ulsan College of Medicine, Asan Medical Center, 86 Asanbyeongwon-Gil, Songpa-Gu, Seoul, 05505, Republic of Korea
Chong Hyun Suh, Young Jun Choi, Sae Rom Chung, Jung Hwan Baek, Jeong Hyun Lee, Jihye Yun & Namkug Kim
Department of Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
Kyung Hwa Lee
Department of Biomedical Engineering, Asan Medical Institute of Convergence Science and Technology, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
Kyung Hwa Lee & Sungwon Ham
Department of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, 86 Asanbyeongwon-Gil, Songpa-Gu, Seoul, 05505, Republic of Korea
Namkug Kim

Authors

Chong Hyun Suh
Kyung Hwa Lee
Young Jun Choi
Sae Rom Chung
Jung Hwan Baek
Jeong Hyun Lee
Jihye Yun
Sungwon Ham
Namkug Kim

Contributions

All listed co-authors met the following criteria: (1) substantial contributions to the conception or design of the work, or to the acquisition, analysis, or interpretation of data for the work; (2) drafting the work or revising it critically for important intellectual content; (3) final approval of the version to be published; and (4) agreement to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. In addition to these criteria, individual contributions to study and manuscript design, execution, and interpretation were as follows. K.H.L.: manuscript writing, image preprocessing, radiomic feature extraction and classification, and statistical analysis. C.H.S.: manuscript writing, clinical data collection and curation, and image segmentation. J.Y. and S.H.: supervision of image preprocessing, radiomic feature extraction and classification. S.R.C., J.H.B., and J.H.L.: database construction and conceptual feedback. N.K. and Y.J.C. (corresponding authors): manuscript editing, coordination of study design and activities, conceptual feedback, and project integrity.

Corresponding authors

Correspondence to Young Jun Choi or Namkug Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Suh, C.H., Lee, K.H., Choi, Y.J. et al. Oropharyngeal squamous cell carcinoma: radiomic machine-learning classifiers from multiparametric MR images for determination of HPV infection status. Sci Rep 10, 17525 (2020). https://doi.org/10.1038/s41598-020-74479-x

Received: 09 March 2020
Accepted: 23 August 2020
Published: 16 October 2020
DOI: https://doi.org/10.1038/s41598-020-74479-x
Springer Nature Limited

This article is cited by

Machine learning-based MRI radiomics for assessing the level of tumor infiltrating lymphocytes in oral tongue squamous cell carcinoma: a pilot study
- Jiliang Ren
- Gongxin Yang
- Ying Yuan
BMC Medical Imaging (2024)
Using radiomics for predicting the HPV status of oropharyngeal tumors
- Kubra Sarac
- Albert Guvenis
Journal of Engineering and Applied Science (2024)
Explainable prediction model for the human papillomavirus status in patients with oropharyngeal squamous cell carcinoma using CNN on CT images
- Annarita Fanizzi
- Maria Colomba Comes
- Raffaella Massafra
Scientific Reports (2024)
Data-centric artificial intelligence in oncology: a systematic review assessing data quality in machine learning models for head and neck cancer
- John Adeoye
- Liuling Hui
- Yu-Xiong Su
Journal of Big Data (2023)
Multiparametric MRI–based radiomics model for predicting human papillomavirus status in oropharyngeal squamous cell carcinoma: optimization using oversampling and machine learning techniques
- Yongsik Sim
- Minjae Kim
- Beomseok Sohn
European Radiology (2023)
