Introduction

Oropharyngeal cancer constitutes 11.3% of all head and neck malignancies [1]. Apart from the most prevalent risk factors of alcohol and tobacco consumption [2], a geographical divergent increasing subset of oropharyngeal squamous cell carcinomas (OPSCC) is characterized by human papillomavirus (HPV) infections [3,4,5,6,7,8]. Most commonly they are caused by the HPV-genotype 16 [9] while non-HPV16 HPV-genotypes were also recognized with a high oncogenic potential [10].

In the course of HPV-infection, integration of HPV into the host genome results in stabilization and upregulation of the viral oncoproteins E6 and E7, E7 binds and leads to an inactivation and degradation of tumor suppressive retinoblastoma protein, presumably resulting in an upregulation of the protein p16 [11,12,13,14,15]. The immunohistochemical detection of p16-overexpression was associated with a high sensitivity to identify samples with oncogenic HPV infections [16]. Therefore, it was previously used as a surrogate marker for identification of HPV-associated OPSCCs [17, 18]. Nevertheless, previous studies suggest that the combination of both markers, p16- and HPV-status, define prognostic divergent subgroups [19,20,21,22,23,24]. This needs to be considered, especially since treatment de-intensification trials investigated altered therapeutic regimes for HPV-associated OPSCCs [17, 25]. Therefore, the aim of the study was to examine the prognostic impact of additional HPV diagnostic in a well-characterized and p16-stratified homogenously treated cohort of patients with advanced stage OPSCCs.

Patients, material and methods

Patients

A retrospective analysis of patients with advanced stage OPSCCs (III-IVa) treatable by transoral CO2-laser microsurgery (TLM) with curative intent was performed. The study was conducted at an academic tertiary referral center. Time span of inclusion was 01/2000 and 10/2015 with systematic follow-up until 10/2017. Cases were identified and data were obtained from the hospital’s cancer database and the original medical records. This study was approved by the institutional review board (“Ethikkommission der Universitätsmedizin Göttingen”; 10/2/16 An) in accordance with the national regulation and with the Helsinki Declaration of 1975, as revised in 1983. Prior to treatment, all patients gave their written informed consent for anonymized analysis of their individual data.

Exclusion criteria comprised non-OPSCC, previous or simultaneous secondary malignancies, T4b primaries, N3 metastasis, simultaneous distant metastases, previous treatment of the OPSCC in a different institution, treatment of the primary tumor by conventional surgery, flap reconstruction, primary (chemo-)radiotherapy and palliative treatment intent. Of 161 identified patients, 125 had sufficient paraffin embedded tissue material available for p16-immunohistochemistry as well as HPV-genotyping. Thereof 23 patients were diagnosed with early stage I–II OPSCCs and therefore omitted from analysis, resulting a final cohort of 102 patients with advanced disease.

Staging and treatment strategy

Preoperative staging procedures were performed as described previously [26] according to the 7th edition of classification of the Union International Contre le Cancer (UICC) [27] and the American Joint Committee on Cancer (AJCC) [28].

The therapeutic strategy was TLM-resection of the primary tumor with/without neck dissection (ND), with/without postoperative (chemo-)radiotherapy ((C)RT) as described previously [26]. The therapeutic approach was not based upon the diagnosis of HPV- or p16-status. In all cases histopathological assessment confirmed margin-negative (R0) resection.

Follow-up

The postoperative examination and follow-up was performed as previously described [26]. After 5 years without disease manifestations a patient was considered cured; however, follow-up continued in most cases.

Molecular diagnostics—p16 and HPV

p16 immunohistochemistry was performed with monoclonal p16 antibodies (Santa Cruz Biotechnology, Inc., Dallas, TX, USA; Cat# sc-56330) (1:50; pH 9) in combination with diaminobenzidine (Dako, Agilent Technologies, Inc., Santa Clara, California), and the EnVision Flex + System (Dako, Agilent Technologies, Inc.). A strong diffuse (≥ 75%) nuclear and cytoplasmic p16 immunostaining was categorized as p16-positive. No, weak, or moderate staining was considered as p16-negative [18, 29].

For multiplex fluorescent-PCR based detection of DNA from different HPV genotypes a commercial assay developed for diagnostic purposes was applied. Formalin-fixed paraffin-embedded (FFPE) tissue samples of the oropharyngeal primaries were used. A 5 µm FFPE slice was enriched for tumorous tissue by microdissection and DNA was isolated with the innuPREP FFPE DNA Kit—IPC16 (Analytik Jena AG, Jena, Germany) in combination with the InnuPure® C16 touch (Analytik Jena AG, Jena, Germany). Quantification and purity of the isolated DNA was measured by a NanoDrop™ 2000 Spectrophotometer (Thermo Fischer Scientific Inc., Waltham, MA, USA). DNA was amplified using the f-HPV-typing™ Multiplex Fluorescent-PCR Kit according to manufacturer’s protocol (Genomed LTD., Harrow, Middlesex, UK). The PCR reagent contains 16 fluorescently labelled primers designed to recognize E6 and E7 regions of HPV types 6, 11, 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, and 68 that are most likely present after viral integration. The f-HPV-typing™ also contains a human short tandem repeat, which serves as an internal control [30]. HPV amplicons were separated and measured by the Applied Biosystems® 3500 Genetic Analyzer and analyzed via GeneMapper™ Software 5 (Thermo Fisher Scientific Inc., Waltham, MA, USA).

Alcohol and tobacco consumption

Information about alcohol and tobacco consumption was extracted from the original medical records. Both variables were categorized dichotomously. Patients self-reporting former/current versus no tobacco consumption were assigned to the respective groups. Heavy drinkers were defined as patients who had reported at least weekly strong or daily other alcohol consumption and/or who had a disease related to heavy alcohol consumption (e.g. Wernicke–Korsakow syndrome). Patients self-reporting occasional, limited or no alcohol consumption were assigned to the group of never/social drinking patients. Cases without information on alcohol and/or tobacco consumption were omitted from the subanalysis.

Statistics

Descriptive analysis was stated by the respective mean value, corresponding standard deviation (SD), median and/or absolute and relative frequencies. Analysis of frequency distributions was conducted with the Pearson’s Chi-squared test. Five-year overall survival (OS), disease-specific survival (DSS), recurrence-free survival (RFS), and the local control rate (LCR) were calculated by the Kaplan–Meier method [31]. Date of primary surgery was defined as the starting point for these calculations. Concerning OS, death for any reason was considered as an event, whereas patients alive at last follow-up were included as censored observations. Calculating DSS, only death related to the primary tumor was considered as event, and all other causes of death were counted as censored observations as well as alive at last follow-up. For RFS, events were local and/or regional recurrence, occurrence of distant metastasis or death related to primary disease, whereas death due to other causes, and patients alive without manifestation of the primary tumor at last follow-up were considered as censored. Regarding LCR, exclusively local recurrences were determined as events. The log-rank test was applied to assess statistically significant survival differences between groups. Kaplan–Meier curves were applied to illustrate survival differences. The level of significance was defined at 5%. Statistical analysis and creation of Kaplan–Meier curves were performed with Statistica, Version 13.3 (Dell Inc., Round Rock, TX, USA). For adjustment of potential cofounding factors, Cox multivariate regression analysis (forward stepwise) was conducted with IBM SPSS® Statistics, Version 26.0 (IBM Corp, Armonk, NY, USA). p16- and HPV-status as separate covariates or the combination of both markers was analyzed together with gender, age (continuous), T-categorization (T1-2 vs. T3-T4), lymph node metastases (N0 vs. N+), extracapsular spread (N0/N+ ECS- vs. N+ ECS+), staging (III vs. IVa), histopathologic differentiation (high/moderate vs. poor), treatment (TLM +/-ND vs. TLM+ND+(C)RT). Additionally, within the cohort with available information about alcohol and tobacco consumption, these variables were also considered. For final editing Adobe Illustrator® CC, Version 18.1 (Adobe Systems Inc., San José, CA, USA) was applied.

Results

Patients

102 patients with advanced stage OPSCC (pT3-4a and/or pN + ; stage III/IVa) were included in the study. The site of primary origin of oropharyngeal tumors was 40.2% (n = 41) tonsils, 25.5% (n = 26) tongue base, 3.9% (n = 4) soft palate, 12.7% (n = 13) vallecula, 3.9% (n = 4) posterior wall, 3.9% (n = 4) lateral wall, 5.9% (n = 6) glossotonsillar sulcus and palatal arch, 2.9% (n = 3) uvula, and 1% (n = 1) tonsillar fossa. All patients were treated primarily by TLM. It was completed with neck dissection in 96.1% (n = 98) and postoperative (chemo)radiotherapy in 73.5% (n = 75).

p16 expression and HPV-typing results

p16-overexpression was seen in 49.0% (n = 50) of all tumors. HPV-DNA was detected in 40.2% (n = 41) of all OPSCCs. Distribution of p16-status and HPV-typing results are depicted in Table 1. In most of these cases, HPV-genotype 16 was detected. With regard to all HPV-DNA positive specimens, HPV16 was found in 80.5% (n = 33), HPV18 in 7.3%, (n = 3), HPV33 in 26.8% (n = 11), HPV51 in 2.4% (n = 1) and HPV59 in 19.5% (n = 8). In most of the tumors one single HPV-type was identified (n = 31; 75.6%). Most commonly it was HPV16 (n = 23; 56.1%). The majority of these OPSCCs with selective HPV16 infection showed p16-overexpression (n = 20; 87.0%). Multiple HPV infections were detected in 24.4% (n = 10) of all HPV-positive specimens. Worthy of note, all of them also exhibited HPV16, while only 30.0% (n = 3) of them showed a p16-overexpression.

Table 1 Results of HPV-typing stratified by p16-status

Concordance between p16-immunostatus and presence of HPV-DNA was observed in 63.7% (n = 65), presenting either as p16-positive and HPV-DNA-positive (n = 27; 26.5%) or p16-negative and HPV-DNA-negative (n = 38; 37.3%). Contradictory results were found in 36.3% (n = 37). p16-positive with no presence of HPV-DNA (p16-positive/HPV-negative) were 22.5% (n = 23) of the OPSCCs. Vice versa 13.7% (n = 14) were p16-negative and positive for HPV-DNA (p16-negative/HPV-positive). Among p16-positive OPSCCs the presence of HPV-DNA and, therefore, concordance of p16- and HPV-positivity was 54.0% (n = 27 of 50).

Disease characteristics

Patients’ and disease characteristics, treatment details and follow-up data of the whole study group as well as stratified by p16-status, HPV-status and the resulting subgroups with both markers combined are depicted in Table 2. As shown, distribution analysis of characteristics between patients stratified by either (1) p16-immunohistochemistry, or (2) HPV-typing, or (3) both markers combined, revealed no significant difference in reference to gender, N-category, presence of lymph node metastases with extracapsular spread, prognostic stage and treatment (Pearson’s Chi-squared test P > 0.1 respectively). However, T-categories were significantly differently distributed between p16-positive and p16-negative OPSCCs (P < 0.01). p16-negative tumors were more often diagnosed in a higher T-category (pT3/pT4: n = 40 of 52; 76.9%) compared to p16-positive tumors (n = 24 of 50; 48.0%). A significant different distribution of T-categories was also observed, when taking both markers combined into account (P = 0.02). Concordant p16-negative/HPV-negative cases were more often locally advanced tumors (pT3/pT4a: n = 29 of 38, 76.3%) and concordant p16-positive/HPV-positive cases presented more often as locally circumscribed tumors (pT1/pT2: n = 16 of 27, 59.3%). Furthermore, histopathological differentiation was significantly differently distributed between p16-positive and p16-negative tumors (high/moderate vs. poor: P = 0.01). Most of p16-negative OPSCCs had a high or moderate differentiation (n = 46 of 52; 88.5%). Most of the poorly differentiated tumors were constituted by p16-positive tumors (n = 16 of 22; 72.7%).

Table 2 Patients’, disease characteristics and follow-up data stratified by p16-status and HPV-status

Tobacco and alcohol consumption were significantly differently distributed between patients with p16-positive and p16-negative disease, respectively (smoking P < 0.01; alcohol P < 0.01). Never smoking patients had more often p16-positive tumors (n = 21 of 26; 80.8%), whereas former/current smokers had more often p16-negative ones (n = 43 of 62; 69.4%). Furthermore, no/social drinking patients had predominantly p16-positive disease (n = 34 of 53; 64.2%), whereas those with heavy alcohol consumption had more p16-negative cancers (n = 28 of 33; 84.8%). With regard to the HPV-status, no significant different distribution was observed for smoking (P = 0.49) or alcohol consumption (P = 0.11).

Oncological results

Treatment failures

Local and/or regional treatment failures were diagnosed in 18.6% (n = 19) of all patients. The occurrence of treatment failures was not differently distributed between patients with p16-positive versus those with p16-negative (P = 0.24) or HPV-positive versus HPV-negative (P = 0.48) OPSCCs. No significant difference was observed including both markers combined (P = 0.08).

Details of the treatment failures, occurrence of distant metastases and secondary primaries stratified by p16- and HPV-status and the marker-combined subgroups are provided in Table 3.

Table 3 Treatment failures stratified by p16- and HPV-status

Survival estimates

All patients included had advanced stage oropharyngeal cancer (stage III–IVa). The 5-year estimates of the complete cohort were as followed: OS 73.3%, DSS 78.3%, RFS 69.8% and LCR 85.6%.

Comparing the survival of patients with p16-positive or negative tumors showed superior estimates for p16-positive cases [26]. Within the present study, in addition, the HPV-status of those patients was investigated. Comparing the survival of cases with HPV-positive vs. HPV-negative tumors, no trends towards a significant difference were observed for any of the endpoints: OS 64.9% vs. 78.7% (P = 0.46), DSS 69.3% vs. 84.0% (P = 0.19), RFS 68.2% vs. 71.5% (P = 0.60) and LCR 88.8% vs. 84.0% (P = 0.65) for HPV-DNA-positive vs. HPV-DNA-negative tumors, respectively (Online Resource 1). Summarized, prognostic impact has been demonstrated for the p16-status, whereas stratification based on the presence of HPV-DNA did not result in significant discrepancies in survival rates.

For the following analysis, the whole study group was stratified into four groups based on p16-overexpression and HPV-DNA presence. This resulted in the group of p16-positive/HPV-positive tumors (n = 27, 26.5%), p16-positive/HPV-negative (n = 23, 22.5%), p16-negative/HPV-positive (n = 14, 13.7%), and p16-negative/HPV-negative (n = 38, 37.3%). Among patients with p16-positive OPSCCs, no differences between the estimates of HPV-DNA positive or negative tumors were observed (OS 78.1% vs. 85.6%, P = 0.73; DSS 81.1% vs. 85.6%, P = 0.97; RFS 82.9% vs. 72.6%, P = 0.48; LCR 94.7% vs. 85.7%, P = 0.35). Within patients with p16-negative tumors, those with p16-negative/HPV-negative disease demonstrated significant higher estimates for both, DSS and RFS, compared to the group of patients with p16-negative/HPV-positive disease (DSS 82.6% vs. 47.6%, P = 0.02; RFS 71.2% vs. 42.9%, P = 0.04). The OS was 73.6% vs. 40.8% (P = 0.15), and the LCR was 83.9% vs. 78.6% (P = 0.50), for the group of p16-negative/HPV-negative vs. p16-negative/HPV-positive tumors, respectively. Considering all four subgroups, OS and DSS were the highest in the group with discordant p16-positive/HPV-negative (OS 85.6%; DSS 85.6%) tumors and the lowest for those with a p16-negative/HPV-positive disease (OS 40.8%; DSS 47.6%; Fig. 1a, b). With regard to RFS and LCR the highest estimates were calculated for the concordant p16-positive/HPV-positive group (RFS 82.9%; LCR 94.7%), and still lowest for the p16-negative but HPV-positive one (RFS 42.9%; LCR 78.6%; Fig. 1c, d). Altogether, p16-positive subgroups (p16-positive/HPV-positive, p16-positive/HPV-negative) always constituted the groups with the highest survival estimates. The group of p16-negative/HPV-negative tumors demonstrated comparable survival. The group with the lowest survival for all endpoints was p16-negative/HPV-positive (Table 4, Fig. 1).

Fig. 1
figure 1

Five-year Kaplan–Meier estimates of overall survival (a), disease-specific survival (b), recurrence-free survival (c) and local control rate (d) stratified by p16- and HPV-status combined. Patients at risk are shown below the diagram. P values are calculated by log-rank test

Table 4 Overview of studies evaluating five-year oncological outcomes for patients with OPSCC with known p16- and HPV-status treated with curative intent

Subgroup analysis of alcohol and tobacco consumption

Information about tobacco and alcohol consumption was documented for 83.3% (n = 85) of all patients. These patients were considered for a further subset analysis.

In reference to tobacco consumption, the majority of patients (n = 59 of 85; 69.4%) had reported either current or former smoking, whereas 30.6% (n = 26 of 85) had negated any smoking history. Most of the never smoking patients had p16-positive tumors (21 of 26, 80.8%), whereas the group of former/current smokers exhibited predominantly a p16-negative disease (41 of 59, 69.5%; P < 0.01). HPV-DNA status was not differently distributed between never smoking patients (13 of 26; 50.0%) and those with a positive history for tobacco consumption (25 of 59, 42.4%; P = 0.51).

With regard to alcohol consumption, 37.6% (n = 32) of the patients were categorized as heavy drinkers, whereas 62.4% (n = 53) had reported no or occasional alcohol consumption and therefore were assigned to the group of never/social drinking patients. This group predominantly consisted of patients with p16-positive tumors (34 of 53, 64.2%), whereas heavy alcohol consumption was predominantly reported by patients with a p16-negative disease (27 of 32, 84.4%; P < 0.01). Again, HPV-DNA status was not differently distributed between never/social drinking patients (27 of 53; 50.9%) and those with heavy alcohol consumption (11 of 32, 34.4%; P = 0.14).

Considering both factors combined, 71.8% (61 of 85) had a positive history of smoking and/or heavy drinking. Most of these patients had p16-negative OPSCCs (41 of 61, 67.2%), whereas the group of never/social drinking patients without a history of tobacco consumption had predominantly p16-positive tumors (19 of 24, 79.2%; P < 0.01). Again, HPV-DNA detection was not differently distributed between never smoking and never/social drinking patients (12 of 24; 50.0%) and smokers and/or heavy drinkers (26 of 61, 42.6%; P = 0.54). In summary, the distribution of tobacco and/or heavy alcohol consumption differed only significantly between patients with p16-positive and p16-negative disease.

Survival estimates by subgroups of alcohol and tobacco consumption

In a subset of 85 patients with documented history of tobacco and alcohol consumption the estimates for OS, DSS, RFS and LCR accounted for 69.8%, 75.7%, 69.1% and 88.7%, respectively.

The group of never-smoking patients revealed significant superior estimates for OS and DSS compared to the group of former/current smokers (OS 91.7% vs. 59.3%, P < 0.01; DSS 91.7% vs. 67.5%, P < 0.01). The estimates of RFS and LCR for never smoking patients and those with a smoking history showed no significant difference (RFS 74.4% vs. 66.8%, P = 0.28; LCR 90.5% vs. 88.7%, P = 0.51). Further stratification based on p16-status or HPV-status within the two groups of never or former/current smoking patients demonstrated that never smoking patients with p16-positive tumors had favorable OS, DSS and RFS compared to former/current smokers with a p16-negative disease (OS 90.0% vs 57.4%; DSS 90.0% vs 67.8%; RFS 84.2% vs 66.0%). In contrast, the group of p16-positive cancer patients that shared the history of former/current smoking demonstrated comparable low survival (OS 63.0%, DSS 66.7%, RFS 68.2%; Fig. 2a–c). Regarding LCR, estimates were less divergent between these groups (94.1% vs. 86.5% vs. 93.8%; Fig. 2d).

Fig. 2
figure 2

Five-year Kaplan–Meier estimates of overall survival (a), disease-specific survival (b), recurrence-free survival (c) and local control rate (d) stratified by tobacco consumption and p16-status combined. Patients at risk are shown below the diagram. P values are calculated by log-rank test

Within the two groups of never or former/current smoking patients, stratification upon HPV-status did not demonstrate prognostic differences (Online Resource 2).

Summarized, never smoking patients with p16-positive disease showed the highest survival, whereas former/current smokers demonstrated a comparable inferior survival, despite the p16-status.

With regard to drinking habits, there was no statistical survival difference between never/social drinking patients and heavy drinkers (OS: 71.5% vs. 67.2%, P = 0.741; DSS: 79.0% vs. 70.7%, P = 0.242; RFS: 66.5% vs. 73.8%, P = 0.768; LCR: 88.0% vs. 89.9%, P = 0.953). Therefore, this factor was not considered for further subgroup analysis stratified on p16- or HPV-status.

Analysis of potential cofounders and cox multivariate regression

In order to identify potential prognostic relevant cofounders, the association between prognostic endpoints and further clinicopathological characteristics were analyzed. The results of these univariate analyses are shown in Online Resource 3.

For adjustment of potential cofounding factors, also a Cox multivariate regression analysis (forward stepwise) was conducted. Regarding the complete cohort, in the final equation of OS the p16-status (HR = 0.41, 95% CI, 0.19–0.887, P = 0.023) and extracapsular spread (HR = 2.1, 95% CI 1.009–4.372, P = 0.047) were included. For DSS, extracapsular spread was included in the final model when considering p16-status and HPV-status separately (HR = 2.768, 95% CI 1.088–7.038, P=0.033). For RFS and LCR no significant covariate was identified, respectively.

Considering the combination of p16 and HPV, results of the multivariate analysis demonstrated that the combined p16/HPV-status and ECS had an impact on OS (ECS: HR=2.254, 95% CI 1.067-4.765, P=0.033; combined p16/HPV-status: p16-negative/HPV-negative (reference), P=0.036, p16-positive/HPV-positive: HR=0.525, 95% CI 0.182-1.516, P=0.234, p16-positive/HPV-negative: HR=0.534, 95% CI 0.183-1.559, P=0.251, p16 negative/HPV positive: HR=2.151, 95% CI 0.854-5.417, P=0.104) and likewise DSS (ECS: HR=2.951, 95% CI 1.147-7.593, P=0.025; combined p16/HPV-status: p16-negative/HPV-negative (reference), P=0.017; p16-positive/HPV-negative: HR=0.657, 95% CI 0.157-2.757, P=0.566; p16 negative/HPV positive: HR=0.837, 95% CI 0.198-3.529, P=0.808; p16-negative/HPV-positive: HR=3.923, 95% CI 1.239-12.416, P=0.02). For RFS, T-categorization was in addition to the combined p16-HPV-status included in the final equation (T-categorization: HR=0.416, 95% CI 0.184-0.943, P=0.036; combined p16/HPV-status: p16-negative/HPV-negative (reference), P=0.011 p16-positive/HPV-positive: HR=0.365, 95% CI 0.105-1.26, P=0.111; p16-positive/HPV-negative: HR=0.695, 95% CI 0.239-2.021, P=0.504; p16-negative/HPV-positive: HR=2.676, 95% CI 1.03-6.949, P=0.043). For LCR no significant covariate was identified.       

Considering the cohort with information about alcohol and tobacco consumption neither p16-nor HPV-status or both combined were associated with one of the prognostic endpoints in the univariate analyses (Online Resource 3). Consistent with this, in the multivariate Cox regression analysis only extracapsular spread as well as tobacco consumption were identified to effect OS (ECS: HR = 2.323, 95% CI 1.066–5.06, P = 0.034; tobacco consumption: HR = 4.403, 95% CI 1.317–14.717, P = 0.016) as well as DSS (ECS: HR = 3.197, 95% CI 1.154–8.855, P = 0.025; tobacco consumption: HR = 8.348, 95% CI 1.1–63.329, P=0.04). For RFS and LCR no significant covariate was identified.

Complementary analyses for patients with OPSCCs of sublocations, with a potential carcinogenic role of HPV (tonsils, base of tongue)

The classification algorithms of the current (8th) edition of the AJCC/UICC cancer staging manual differentiate between p16-positive and p16-negative OPSCCs but not between tonsillar/tongue base OPSCC vs. other OPSCC sublocations [5, 18]. Nevertheless, studies indicated an etiological link especially between HPV presence and OPSCCs of the tonsillar region and base of tongue [32,33,34,35]. Therefore 74 patients with tumors, that originated from tonsillar regions (ICD-10-GM Version 2020 C09.-, n=48) or base of tongue (ICD-10-GM Version 2020 C01, n=26) were identified for complementary subset analyses. The detailed results are provided in the Online Resources 4–6 (Tables) and 7–8 (Figures).

p16 positive tumors were significantly more often localized in the tonsillar region or base of tongue (Pearson's chi-squared test, P < 0.01). In contrast the HPV was not significantly differentially distributed between tonsillar/tongue base OPSCC compared OPSCCs of other sublocations (Pearson's chi-squared test, P = 0.308). There was no significant prognostic difference between patients with tonsillar/tongue base OPSCC compared to patients with OPSCCs of the other sublocations (OS, P = 0.506; DSS, P=0.929; RFS, P = 0.806; LCR, P = 0.138). Consistent with results the complete cohort, exclusively the p16-status (OS, P = 0.014; DSS, P = 0.021; RFS, P = 0.025; LCR, P = 0.028), but not the HPV-status (OS, P=0.974; DSS, P = 0.837; RFS, P = 0.844; LCR, P = 0.690) was associated with significant prognostic discrepancies in the univariate analyses of the cohort of tonsillar/tongue base OPSCC. Combining both markers, the p16-positive (p16-positive/HPV-positive, p16-positive/HPV-negative) and p16-negative/HPV-negative groups demonstrated comparable superior estimates (OS 81.5% vs. 84.9% vs. 72.3%; DSS 84.8% vs. 84.9% vs. 76.7%; RFS 85.7% vs. 71.3% vs. 67.0%; LCR 94.4% vs. 85.0% vs. 77.4%). Lowest survival was observed for patients with p16-negative/HPV-positive OPSCCs (OS 42.9%; DSS 42.9%; RFS 28.6%; LCR 57.1%; Online Resource 7).

Never smoking patients with p16-positive tonsillar/tongue base OPSCC demonstrated the best survival, whereas within former/current smokers with p16-positive and p16-negative disease OS and DSS were comparable low (OS 88.9% vs. 64.9% vs. 57.8%; DSS 88.9% vs. 69.2% vs. 61.0%; Online Resource 8).

Discussion

This study presents a retrospective analysis of the oncological outcome of patients with advanced stage p16-positive or p16-negative OPSCCs with further examination for HPV-DNA. All patients received a TLM-based treatment strategy with/without neck dissection, with/without postoperative (chemo-)radiotherapy between 2000 and 2015 as described previously [25]. Throughout this time period, knowledge about the role of p16-overexpression and association of HPV-infections in head and neck cancer evolved [35, 36]. Recently, the p16-status has been introduced in the classification algorithm of the 8th edition of the AJCC/UICC cancer staging manual [5, 18] and trials investigated altered therapeutic regimes for “HPV-associated” OPSCCs [17, 34]. Nevertheless, standard treatment recommendations should not yet be altered based on p16- and/or HPV-status outside clinical trials [25].

p16-status and HPV genotype

Drawing conclusions based on research about HPV-associated tumors is still challenging, since various and inconsistent definitions for a HPV-positive tumor are used [16]. For instance, in the RTOG 1016 trial investigating radiotherapy combined with cetuximab or cisplatin in HPV-positive oropharyngeal cancer patients, HPV-status was determined by p16-overexpression [17]. Other investigators classified only tumors tested positive for HPV-DNA as well as HPV-RNA as HPV-positive [22]. Alternatively, an algorithm of first p16-immunohistochemistry and for positive samples followed by HPV-DNA PCR was applied to define “HPV-positivity” [23, 38]. p16-overexpression is considered as a surrogate marker for oncogenic HPV-infection based on the high correlation found between both [16, 29].

Within the present study for the distinction between p16-positive and p16-negative tumors the criteria suggested for the application of the classification algorithms of UICC/AJCC 8th edition was applied [18]. Even though concordance between p16- and HPV-status was evident in the majority of cases (p16-positive/HPV-positive or p16-negative/HPV-negative in 64%), more than one-third/36% of the OPSCCs showed discordant results of either p16-positive/HPV-negative or p16-negative/HPV-positive characterization. Discordance rates between p16-status and HPV-status were also described by others with, e.g. 12.7% and 45% [22, 39]. For HPV-detection we applied a commercial assay developed for diagnostic purposes that is reported with a sensitivity of 97.5% and specificity of 93% [30]. Besides the possibility of false-positive or false-negative results in HPV-typing, also several other factors might have influenced the discordancy detected in our cohort: It was described that OPSCCs with a p16-negative/HPV-positive status demonstrated low amounts of HPV-DNA [40]. Thus, it appears possible that p16-positivity distinguishes samples with different amounts of viral DNA, which, in combination with a high sensitivity of the multiplex-PCR technique used in the present study, might be resulting in the relatively high incidence of p16-negative/HPV-positive samples. In addition, the herein applied PCR uses primers designed for the E6 and E7 loci of the HPV genome, which might function as a further influencing factor [30]. Those regions are most likely to sustain after viral integration [30], probably enabling the detection of not only integrated, but also episomal or mixed HPVs. Yamashita et al. examined HPV16 infections and found a different distribution between integrated/mixed vs. episomal HPV within p16-positive and p16-negative OPSCCs. In nearly every of the p16-positive cases HPV were completely integrated or mixed, while in p16-negative OPSCCs HPV was episomal in 55.5% [20]. This indicates that p16-positivity may distinguish cases with episomal versus integrated/mixed HPVs or, in other words, that the p16-negative/HPV-positive samples may be tumors with episomal HPV, while p16-positive/HPV-positive specimens could be the ones with integrated HPVs. Furthermore, reasons for the absence of p16-overexpression despite a positive HPV-status (p16-negative/HPV-positive) could be genetic (e.g. deletions of the p16Ink4a gene [41]) or due to epigenetic alterations of the p16-gene (e.g. promotor hypermethylation [42]). p16-overexpression without the evidence of HPV (p16-positive/HPV-negative) was less likely observed in association with mutations or amplifications of the p16-gene (CDKN2A), but was found with a higher frequency in case of inactivating mutations in the histone H3 lysine 36 methyltransferase, encoded by the gene NSD1 [43]. Thus, other factors were already linked to p16-overexpression in HPV-negative head and neck squamous cell carcinomas (HNSCC). Summarized, the molecular basis especially of the discordant subgroups requires further research.

Oncological results

Survival based on p16 or HPV status alone

As previously shown, a positive p16-status was associated with a superior prognosis of patients with advanced stage OPSCCs [26]. Based on these results, the aim of the current study was to examine if there were also survival differences by stratification based on HPV as a single marker, as well as both markers combined. In the current study no significant differences with regard to the oncological outcome between patients with HPV-positive or negative advanced stage OPSCCs was observed. This is in line with the results of Park et al. who selectively included patients with advanced stage OPSCCs, as well, and described an association between OS and p16-status, but non between OS and HPV-status [21]. Evaluating patients with stage I-IV disease, Yamashita et al. reported at least a trend towards superior OS for patients with HPV-positive disease compared to HPV-negative [20]. Hoffmann et al. observed a significant higher OS along with a trend towards higher progression-free survival for patients with HPV-DNA positive vs. HPV-DNA negative disease. Their cohort included tonsillar SCC alone with 57.1% T1-2 tumors and 30.2% N0 disease; information about the distribution of prognostic stages was not provided [22]. Moreover, after further analysis for HPV-RNA, and stratifying their patients into different treatment groups, they observed a significant superior OS only between HPV-RNA as well as DNA-positive (their definition of HPV-positivity)  and HPV-negative disease among patients that had received surgery alone or additional postoperative chemoradiotherapy, respectively. In contrast, among patients treated with postoperative radiotherapy without chemotherapy, they found no differences with regard to the OS between”HPV-positive “(HPV-DNA/RNA-positive) vs.” HPV-negative “(non-HPV-DNA/RNA-positive) disease [22]. Thus, following the observations of Park et al. [21] and the current study investigating patients with advanced disease, the marker HPV indicates a limited prognostic impact. Furthermore, those results of Hoffmann et al. considered also tumor’s HPV-RNA status for definition of “HPV-positivity” indicated that the survival superiority of HPV-positivity may be restricted to patients having received different treatment regimes [22].

Notwithstanding, several additional factors could be responsible for this heterogenous results of the prognostic impact of HPV-presence leading to an aggravated comparability. On the one hand, methodical differences in HPV-detection, definition of “HPV-positivity” and HPV-genotype distribution could have an impact. For instance, a lower survival for non-HPV16-genotype positive tumors compared to HPV16-positivity ones was described [44, 45]. This indicates that a prognostic benefit of HPV-positivity may be restricted to HPV16 positive tumors. Regarding the present study, the presence of 43.9% non-HPV16-genotypes (including multiple infections containing HPV16, as well) among HPV positive tumors could be a factor that influenced the observed differences in prognostic relevance of HPV-presence. Since only 19.5% of the included OPSCCs exhibited selectively non-HPV16 genotypes, no further subset analysis was performed. In addition, differences in the (risk) constitution (e.g. prognostic stages, tobacco and alcohol consumption) of the study populations could be relevant for differing results of HPV-positivity on prognosis of patients with OPSCCs.

Survival based on p16 and HPV-status combined

Previous studies already indicated that different constellations of the p16- and HPV-status can define further prognostic divergent subgroups within OPSCCs [19,20,21,22,23,24]. Nevertheless, studies reporting long-term oncologic results of subgroups defined by p16- and HPV-status combined are rare [19,20,21,22,23,24], even though a meta-analysis by Albers et al. included 24 studies that investigated the survival of patients with HNSCC and evaluated p16- and HPV-status [24]. A comparison of our study with this meta-analysis seems limited due to the following reasons: only 13 of the included studies selectively focused on OPSCCs, and only two of them provided the 5-year survival estimates of patients treated with curative intent, with no cases of distant metastases included [21, 38]. Both studies do not provide the exact survival estimates for all four possible subgroups defined by p16- and HPV-status [21, 38]. Nevertheless, four more recently published studies focused on this topic [19, 20, 22, 23]. Results are summarized in Table 4.

In the current study, survival estimates (OS, DSS, RFS, LCR) of the two p16-positive groups that were either HPV-positive or negative showed no significant difference. This indicates the same prognostic level of patients with p16-positive OPSCCs despite the presence of HPV-DNA. This is in line with the results of Yamamoto et al. for a group treated with definitive chemoradiotherapy, even though for their group treated with definitive radiotherapy alone, they observed a significantly superior OS and PFS for patients with p16-positive/HPV-positive disease compared to those with p16-positive/HPV-negative tumors [19].

In the present study no significant differences among patients with p16-negative tumors with or without positive HPV-status were observed with regard to OS and LCR. The p16-negative/HPV-positive subgroup presented the lowest estimates. This is in line with the results of Yamashita et al. that also reported higher OS for the group with p16-negative/HPV-negative status compared to the one with p16-negative/HPV-positive disease, even though the difference was not statistically significant [20]. In the current study the group of patients with p16-negative/HPV-positive tumors demonstrated also lower estimates for DSS and RFS compared to patients with concordant negative results. This difference reached significance. Other studies did not consider these endpoints [19,20,21,22,23].

In contrast to the current study and the one by Yamashita et al. that observed the lowest outcome for the subgroup of p16-negative/HPV-positive tumors [20], Hoffmann et al. reported the worst survival for the concordant negative (p16-negative/HPV-negative) subgroup [22]. Furthermore, Nauta et al. reported that patients with p16-positive/HPV-positive disease showed a significant superior survival compared to the group with p16-positive/HPV-negative tumors, whose survival differences did not significantly differ from patients with p16-negative carcinomas [23], even though this study did not further examine p16-negative tumors with regard to the HPV-status. To our knowledge, Hoffmann et al. and the present study are the only studies to date providing data regarding all four subgroups and including 5-year survival estimates of patients with OPSCCs treated with curative intent (Table 4).

Tobacco consumption

Patients with HPV/p16-associated HNSCCs are described to have different risk profiles compared to those with HPV/p16-negative disease [46]. In particular, tobacco consumption is one factor that is reported to potentially conflict the positive impact of HPV/p16-positivity on survival [22]. Therefore, distribution as well as the potential prognostic differences within groups with a divergent history of tobacco consumption stratified by p16- or HPV-status was examined.

In the current study, most of never smoking patients had p16-positive disease, whereas the majority of former/current smokers exhibited p16-negative tumors. This is in accordance with other studies demonstrating a significant differing frequency of tobacco history between patients with a p16-positive and p16-negative disease [20, 47]. HPV-positivity was not differently distributed between never smoking patients and those with a positive history for tobacco consumption in the present study. This is in contrast to the study of Hoffmann et al., which reports a significant higher proportion of > 10 py tobacco consumption among HPV-negative patients [22].

With regard to survival rates, never smoking patients demonstrated superior estimates compared to former/current smokers. Considering the tumor’s p16-status, the highest survival estimates were observed in never smoking patients with p16-positive tumors, whereas patients with p16-positive disease and a history of smoking had a comparable inferior prognosis like p16-negative tumors of former or current smokers. This indicates that the superior prognosis of p16-positive tumors is abrogated in tobacco consumers. In the present study stratification based on HPV-status exhibited no significant differences between the oncological results of never and former/current smoking patients. This appears to be conflicting with the results of Hoffmann et al., demonstrating a negative impact of smoking in patients with HPV-positive OPSCC and > 10 py on overall survival. A comparable analysis based on p16-status was not mentioned in this study [22].

Strengths, limitations and outlook

The relevance of the present study is emphasized by the lack of a standardized definition of HPV-association in OPSCCs as well as a lack of detailed high-evidence data about the implications on the prognosis of the different p16- and HPV-status constellations. This is reflected by the paucity of studies, addressing this topic with a focus on long-term oncological outcome of patients with primary OPSCC treated with curative intent [19,20,21,22,23]. The limit in data quality and a high risk of bias of such retrospective studies also needs to be considered in the present one. Nevertheless, we aimed to reduce this limitation by implementing strict inclusion criteria and by providing detailed clinicopathological data obtained from original patients’ charts, surgical and pathological reports as well as an additional pathological assessment. This enabled us to analyze the prognostic implications of the p16- and HPV-status separately as well as combined in a well-characterized cohort of patients with homogenous stage III/IV OPSCC treated exclusively in a surgery-based concept. Therefore, the study provides data that will potentially influence the inclusion criteria and analytical scope of future, e.g. deintensification trials for HPV-positive disease.

The impact of different HPV-genotypes on prognosis and their association with p16-overexpression was not further evaluated due to their limited frequency in the present study. This topic should be investigated by multicentered studies performed prospectively or at least by pooling the already existing data of well-described retrospective cohorts.

Conclusion

The current data indicate that p16- and HPV-status should not be considered as equivalent markers for a better prognosis in OPSCC. Furthermore, they should not outweigh patient-associated prognostic factors like smoking. Thus, further studies focusing on therapeutic regimes and prognosis of OPSCC should consider both markers and provide detailed information on tobacco consumption.