Introduction

Head and neck cancers (HNC), also known as head and neck squamous cell carcinomas (HNSCC), consist of various malignancies impacting the mucosal surfaces of the upper aerodigestive tract, in areas like the nasopharynx, oral cavity, larynx, oropharynx, hypopharynx, and paranasal sinuses [1]. Annually, HNSCC is responsible for over 650,000 new cancer cases and causes more than 350,000 deaths worldwide [2,3,4]. Traditional primary risk factors for these cancers have been alcohol and tobacco use. However, in recent years, the human papillomavirus (HPV) has been recognized as a significant emerging risk factor, particularly for oropharyngeal squamous cell carcinoma (OPSCC). This has led to the identification of a distinct subtype of HPV-related tumors, which differ from those not associated with HPV. The prevalence of HNSCC tends to vary based on the specific anatomical location and the geographic area [5].

The carcinogenic nature of HPV was definitively established in 1983 when Durst et al. successfully cloned HPV type 16 from cervical carcinoma tissue [6]. It is now widely acknowledged that high-risk HPV types are accountable for almost all cases of cervical cancer. While most HPV infections are asymptomatic and tend to resolve spontaneously, persistent HPV infection in the basal cells of the cervix can lead to the development of cervical cancer [6, 7].

Numerous research articles indicate that epithelial cells from the oral cavity and tonsils can undergo immortalization through the influence of the full-length HPV-16 or its E6/E7 oncogenes [8,9,10,11,12]. Furthermore, studies using transgenic mice have shown that the E6/E7 genes of HPV 16 significantly heighten the risk of developing oral and oropharyngeal cancers [13]. It was found that the E7 gene, in particular, is more effective in triggering these cancers, yet there is a noticeable synergistic effect between E6 and E7 in the genesis of HNSCC [14].

In 2020, India accounted for 7% of the total cancer cases worldwide and a significant 24% of the global incidence of HPV-related cancers [15]. Additionally, it was reported that 80% of cervical cancer cases in India were attributed to HPV subtypes 16 and 18 [16, 17]. As part of its cancer control strategy, India has started implementing opportunistic screening programs for common cancers, including those of the cervix and oral cavity, in its healthcare facilities [18]. A notable increase in HNC occurrences was observed in the population-based cancer registries (PBCRs) for cities such as Aurangabad, Delhi, Chennai, and Bhopal among men, and in Nagpur among women, in India [19]. The age-adjusted incidence rate of HNC stood at approximately 25.9 (95% CI 25.7–26.1) and 8.0 (95% CI 7.9–8.1) per 100,000 people for men and women, respectively [20]. HNC represented around 26% of all cancer diagnoses in men and 8% in women. The likelihood of being diagnosed with HNC was 1 in 33 for men and 1 in 107 for women [20].

Despite the growing body of literature on HPV’s role in HNC, there remains a lack of consensus on its prevalence and impact. This variability is partly due to differences in study designs, population demographics, and detection methods used across various research studies. A systematic review and meta-analysis of the existing studies can provide a more comprehensive understanding of HPV’s prevalence in HNC, offering insights into regional variations.

Method

This systematic review was conducted in accordance with PRISMA guidelines [21] (Table S1) and has been registered in PROSPERO.

Literature search

An electronic literature search was conducted in various databases, including PubMed, Embase, and Web of Science from inception up to November 10, 2023. Keywords and MeSH terms related to HPV and HNC were used to devise the search strategy. No restrictions have been placed on the type of article, year of publication, or language in the search. Table S2 shows the detailed search strategy.

Inclusion criteria

Original research that reports the number of HPV-positive cases among HNC patients is included in this study. We are considering only studies conducted in India. Excluded from this study are case reports, case series, or studies lacking quantitative data. The same exclusion criteria apply to animal studies, commentaries, in vitro studies, and reviews. Studies from any location within India, whether conducted in a hospital or community setting, are eligible. There are no restrictions on the type of test used to detect HPV. A detailed inclusion criterion is specified in Table S3.

Screening and study selection

Two independent reviewers performed the screening of the articles. Nested-Knowledge software was employed for this process. The screening involved a primary review of titles and abstracts, followed by a full-text reading to assess the eligibility of studies for inclusion. An independent third reviewer was consulted to resolve any discrepancies.

Data extraction and quality assessment

Data extraction was performed using the Tag function of the Nested-Knowledge software. Three reviewers carried out the data extraction process. A fourth reviewer cross-checked and validated the extracted data. The data extracted covered the author’s name, year of publication, state/location of the study, study design, age and percentage of males in the sample, total number of HNC patients in the study, number of HPV-positive samples, and the type of test employed for detecting HPV. Quality assessment was done using a modified Newcastle-Ottawa Scale (NOS) version [22, 23].

Statistical analysis

A pooled prevalence of HPV is determined by performing a meta-analysis. A random-effects model is employed to conduct the meta-analysis. The variability in study outcomes was measured using the I2 statistic, which quantifies heterogeneity on a scale from 0 to 100%, where higher values indicate greater heterogeneity [24]. The heterogeneity was further assessed using the 95% prediction interval. We calculated the tau-squared value using maximum likelihood estimation to gain additional insights into heterogeneity [25,26,27,28]. Subgroup analysis was conducted based on the location of the study. We used a funnel plot and the Egger test to detect any potential publication bias. A p-value below 0.05 was typically regarded as statistically significant. All statistical analyses were performed using R software, version 4.3 [23, 29, 30].

Results

Literature search

In the literature search, 4,972 articles were identified from multiple databases. Among these, 521 were duplicates. After removing duplicates, 4,451 articles were subjected to screening, of which 3,981 were excluded. The remaining 470 articles were screened by full text for eligibility, resulting in the exclusion of 440, leaving 30 for inclusion. Additionally, 3 studies were added from a citation search. Finally, 33 studies were included in the review. Figure 1 depicts the PRISMA flow chart of the selection and screening process.

Fig. 1
figure 1

PRISMA flow diagram showing the screening and selection process

Characteristics of included studies

The studies included in this review examining the prevalence of HPV in HNC cases in India are characterized by a diverse methodology and geographical spread. Table 1 summarises the characteristics of included studies. These investigations employ various study designs, including prospective, retrospective, case-control, cohort, and cross-sectional studies, providing a thorough exploration of the subject. The studies are conducted across multiple states and regions of India, representing a broad geographic distribution, and include participants across different age groups, offering insights into age-related variations in HPV prevalence. The proportion of male participants in these studies varies, suggesting potential gender differences in HPV prevalence. The primary diagnostic methods used in these studies are Polymerase Chain Reaction (PCR), genotyping, PCR and P16 Immunohistochemistry (IHC), and Chromogenic In Situ Hybridization (ISH), noted for their high sensitivity. The size of the study populations ranges from small cohorts to large groups, affecting the statistical power of the findings. Rates of HPV positivity exhibit considerable variability, underlining the complexity of understanding HPV prevalence in HNC in this context. These studies span a wide time frame, allowing for the analysis of temporal trends in HPV prevalence. New Delhi contributed 4 studies to the research on HPV prevalence in HNC [31,32,33], while Uttar Pradesh was represented by 5 studies [34,35,36,37,38]. Maharashtra was the focus of 4 studies [39,40,41,42]. Multiple studies were conducted in South India [43, 44], Haryana [45,46,47], and Tamil Nadu [48, 49]. Single studies were conducted in several other locations: Andhra Pradesh [50], Kerala [51], Kolkata [52], Mumbai [53, 54], Karnataka [55, 56], Chandigarh [57, 58], and Assam [59, 60]. Additionally, one study was a multi-regional study encompassing several states [42]. The included studies were overall of moderate to high quality in modified NOS (Table S4).

Table 1 Characteristics of studies

Meta-analysis

From 33 studies encompassing a total of 4654 patients with HNC, the pooled prevalence of HPV infection was determined to be 33% (95% CI: 25–42%). Notably, substantial heterogeneity was observed among these studies (I² = 96%). A prediction interval ranging from 6.2 to 79% was also observed. Figure 2 illustrates the forest plot depicting the pooled prevalence.

Fig. 2
figure 2

Forest plot depicting the overall HPV pooled prevalence in head and neck cancer patients

Subgroup analysis

Subgroup analysis was performed on the basis of the location of the study (Fig. 3). The North region is represented by 14 studies encompassing a total of 2236 patients, revealing a pooled HPV prevalence of 27% (95% CI: 17.4–40.4%), and a high degree of heterogeneity (I² = 95%). The South region, with 7 studies and 335 patients, has a pooled prevalence of 46.3% (95% CI: 15.6–80%), also accompanied by significant heterogeneity (I² = 87%). In the West Middle region, 5 studies totalling 865 patients show a pooled prevalence of 19.8% (95% CI: 10.8–33.4%) with substantial heterogeneity (I² = 88%). A single study spans multiple locations—Madhya Pradesh, Gujarat, Rajasthan, Uttar Pradesh, West Bengal, and Assam—encompassing 170 patients and reporting a higher pooled prevalence of 39% (95% CI: 32–47%). Lastly, the East region, represented by 7 studies with 1048 patients, presents a pooled prevalence of 47% (95% CI: 32.2–62.4%) with high heterogeneity (I² = 92%).

Fig. 3
figure 3

Forest plot illustrating the results of sub-group analysis-based location

Meta-regression and sensitivity analysis

Meta-regression was performed to determine the effect of sample size on the pooled prevalence of HPV in HNC cases, as depicted in Fig. 4. However, the meta-regression analysis indicated that sample size was not significantly associated with the pooled prevalence results (p = 0.20). Additionally, a leave-one-out sensitivity analysis was conducted to identify individual studies that might affect the overall prevalence rate. This analysis revealed that the exclusion of no single study resulted in any significant changes to the overall pooled prevalence, as illustrated in Fig. 5.

Fig. 4
figure 4

Bubble plot depicting the results of meta-regression based on sample size

Fig. 5
figure 5

Sensitivity analysis of pooled prevalence

Publication bias

Publication bias was assessed using funnel plot asymmetry and Egger’s test, as shown in Figure S1. The results of our analysis did not provide evidence of publication bias (Egger’s test, p = 0.71).

Discussion

In the best of our knowledge, this is the first meta-analysis to determine the prevalence of HPV among HNC patients in India. In our analysis, we found an overall prevalence of 33% for HPV among HNC patients. South India and the eastern part of India have shown slightly greater prevalence compared to the north and west parts of India. Meta-regression, sensitivity analysis couldn’t resolve the s0000ource of heterogeneity.

Previous systematic reviews have established evidence that infection with HPV is a distinct risk factor for developing HNCs [61, 62]. The risk associated with tonsil carcinoma is notably high and aligns with what one would anticipate from an infectious cause. Our findings indicate that HPV’s involvement in tonsil cancer is similar to the role of Epstein-Barr virus in nasopharyngeal cancer, suggesting that this is yet another instance of a virus-induced cancer in the pharyngeal lymphoid tissue [62].

The risk associated with the progression or death, along with the likelihood of developing a secondary primary HNOSCC, increases by 1% and 1.5% respectively for each additional year of smoking. This remains true even when accounting for the HPV tumor status and other important factors. Importantly, the risk of death doubles for patients who continue smoking during radiation therapy. The detection of HPV in HNCs is highly significant for prognosis and can influence the modification of treatment plans based on the HPV status [63]. Thus, the accurate identification of HPV as a contributing factor in HNCs is of paramount importance. Typically, a biopsy or cytological analysis from the primary tumor site or enlarged lymph nodes is crucial for an initial diagnosis. PCR or RT-PCR (Reverse Transcriptase-PCR) are commonly employed methods for detecting HPV in tumor tissues, particularly for identifying E6/E7 in fresh frozen samples. However, these methods are associated with high setup costs and longer turnaround times [64].

The presence of HPV in HNCs significantly alters the disease’s prognosis and therapeutic approach. HPV-positive HNCs typically present with a better response to standard treatment modalities like radiotherapy and chemotherapy, leading to an overall better prognosis compared to HPV-negative cases [65]. This is primarily attributed to the distinct biological behavior of HPV-positive tumors, which tend to be more sensitive to radiation and cytotoxic agents. Studies has indicated that individuals diagnosed with HPV-positive oropharyngeal cancer have higher survival rates and lower risks of recurrence [66].

Given these differences, there is an emerging consensus on tailoring treatment strategies based on HPV status. For instance, the concept of treatment de-escalation for HPV-positive HNCs is gaining traction. This approach involves reducing the intensity of standard treatments to minimize long-term side effects without compromising the efficacy of cancer control [67]. Clinical trials are currently exploring various de-escalation strategies, including reduced-dose radiotherapy, omission of chemotherapy, and the use of minimally invasive surgeries [68, 69]. However, it’s important to approach de-escalation cautiously, ensuring that patient selection is based on robust biomarkers and clinical criteria to avoid under-treatment.

The accurate determination of HPV status in HNCs is pivotal for both prognostic assessment and guiding treatment decisions. Polymerase Chain Reaction (PCR) and Reverse Transcriptase-PCR (RT-PCR) are the gold standards for detecting HPV, particularly E6/E7 mRNA, in tumor tissues [70,71,72]. These methods are highly sensitive and specific but are often hindered by high costs and longer turnaround times, posing challenges in resource-limited settings. Emerging diagnostic techniques, such as liquid biopsies and next-generation sequencing (NGS), are promising alternatives. Liquid biopsies, which detect circulating tumor DNA (ctDNA) in blood samples, offer a less invasive method for HPV detection and monitoring [73,74,75]. NGS, on the other hand, allows for the comprehensive analysis of HPV integration sites and co-mutations, providing a more detailed tumor profile [76]. These technologies not only improve the accessibility of HPV testing but also enhance our understanding of the tumor biology, which is crucial for personalized medicine.

The significant role of HPV in HNCs necessitates public health interventions, particularly in countries like India where the burden of these cancers is high. HPV vaccination, which has been successful in reducing the incidence of cervical cancer, presents a viable strategy to curb HPV-related HNCs. The expansion of HPV vaccination programs to include both girls and boys could significantly reduce the future burden of these cancers [77, 78]. It is crucial to integrate HPV vaccination into national immunization programs and to raise public awareness about its benefits. Furthermore, the compounded risk of tobacco use in HPV-positive HNC patients highlights the urgent need for effective tobacco cessation programs. Tobacco, being a well-established risk factor for HNCs, exacerbates the risk even in the context of HPV-positive cancers. Public health campaigns targeting tobacco cessation are not only essential for preventing HNCs but also for improving outcomes in patients with existing HPV-related cancers [79,80,81].

Our study acknowledges certain limitations that should be considered. Firstly, our analysis was restricted to articles published in the English language, potentially omitting relevant research published in other languages. Additionally, the geographic coverage of our data is not comprehensive; studies from all regions of India were not available, which may affect the generalizability of our findings. While subgroup analysis was conducted, it was not sufficient to completely address the sources of heterogeneity observed in the results. We were unable to perform subgroup analysis based on site-specific cancer prevalence due to the unavailability of reported data. Future studies should include detailed data on site-specific HNC and HPV to facilitate more comprehensive analyses. This heterogeneity could stem from various factors such as differences in study design, populations, or methodologies, which our subgroup analysis could not fully disentangle. The overall sample size of the included studies was relatively small. This limitation could impact the statistical power of our findings and might lead to less precise estimates. Given these constraints, we recommend that future research should include a broader range of languages and geographic areas, especially underrepresented regions of India. More extensive studies with larger sample sizes would also be beneficial to provide more robust and generalizable results, and to further explore the sources of heterogeneity observed in this study. The absence of publication bias in our findings further strengthens the reliability of these insights.

Conclusion

This comprehensive study underscores the significant role of HPV in the epidemiology of HNCs in India. Through a detailed systematic review and meta-analysis, we determined that the pooled prevalence of HPV among HNC patients in India is 33%, revealing a substantial impact of this virus on such cancers. Notably, the study highlighted regional variations, with the highest prevalence in eastern India (47%) and lowest in the western regions (∼ 20%). This variability suggests the influence of regional factors in the prevalence and impact of HPV in HNCs. Overall, our study provides valuable data for healthcare professionals and policymakers, emphasizing the need for targeted interventions and policies to address the HPV-related burden in HNC patients in India, while also considering the regional disparities in prevalence and risk factors.