Introduction

The human papillomavirus (HPV) is an 8 kb double-stranded circular DNA virus that belongs to the large family of papillomaviruses (PVs). Currently, there are known more than 200 types of HPV but only a subgroup is predicted to cause cancer. These are classified by the International Agency for Research on Cancer (Wild et al. 2020) as high-risk types of HPV (16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 63, 73, and 82) and probable cancer-related types (26, 53, and 66). The low-risk viruses can cause warts (1 and 2) or benign changes within genital organs (6, 11, 40, 42, 43, 44, 54, 61, 70, 81) with no or poorly proven relation to human pathology (Clifford et al. 2003; Muñoz et al. 2003; Burd 2003; Kiwerska et al. 2019). The high-risk viruses are well known for cervical cancer pathogenesis where almost 100% of the tumors are attributed to the oncogenic activity of the HPV 16 and 18. However, HPV-related tumor pathogenesis is observed also in other sites like the anus (90% of cases), tumors of the vagina or penis (50–75% of cases), and the head and neck region (about 10% of cases) (de Martel et al. 2020; Serrano et al. 2018; Wierzbicka et al. 2021). The NIH estimates that worldwide, high-risk HPV infections cause about 5% of all cancers emphasizing on one hand the importance of papillomaviruses in the pathogenesis of human tumors and on the other the relevance of HPV vaccination („https://www.cancer.gov/”, n.d.).

HPV 16-driven oncogenesis

Canonical oncogenic mechanism that involves the E5, E6, and E7 oncoproteins

The main mechanism of HPV 16 and 18-driven oncogenesis is already well known, and for its explanation, Harald zur Hausen was awarded the Nobel Prize in Medicine in 2008 (Dürst et al. 1983; Boshart et al. 1984). The main players in this process are the oncoproteins E6 and E7 encoded by the viral genome (Münger et al. 1989). The expression of the E6 and E7 proteins is initiated after infection from the episomal form albeit on low levels because of the inhibition by the viral E2 protein. The proteins become significantly higher expressed after integration of the virus into the host genome which requires the transition from the circular to linear form resulting in the disruption of the E2 gene (Janicek and Averette 2001). The E6 and E7 proteins interact with crucial proteins of the cellular machinery responsible for cell cycle regulation and tumor suppression (Z.-A.-S. Ghoreshi 2023). E6 binds the cellular TP53 protein of well-known suppressor function triggering its ubiquitination and subsequent degradation by the E6AP ligase activity (E6-associated protein ligase) (Scheffner et al. 1993). This leads in consequence to loss of the regulatory function of TP53 towards the cell cycle and reduced expression of proapoptotic proteins such as BAX which is physiologically mediated by TP53 (Martinez-Zapien et al. 2016; Fischer 2017; Ruttkay-Nedecky et al. 2013). On the other hand, the E7 protein interacts with the cellular RB1 protein leading to the release of the potent transcription factor E2F/DP that in the G1 phase is normally bound to RB1 and inactive. The viral protein mimics therefore the physiological conditions of the late G1 phase where RB1 is inactivated by phosphorylation what leads to E2F/DP release (Rubin 2013). The E2F/DP factor drives the transcription of several genes responsible for the G1 → S transition, accelerates the cell cycle, and leads to proliferation. Eventually, the activity of the oncoproteins triggers the process of oncogenesis (Fig. 1).

Fig. 1
figure 1

Oncogenic activity of the E6 and E7 proteins encoded by HPV 16 (based on Janicek and Averette (2001) and edited)

Several cancer-related pathways are involved in this process. PI3K/AKT/mTOR is a well-known oncogenic pathway induced by E6 and E7 in head and neck cancer (Molinolo et al. 2009). These viral oncoproteins are able to active Akt pathway via upstream regulators, such as RPTK (receptor protein tyrosine kinase) and PI3K (phosphoinositide 3-kinase) resulting in increased proliferation (Aguayo et al. 2023).

Furthermore, E6 and E7 oncoproteins activate the Wnt pathway causing the accumulation of β-catenin, which may increase transcription of cell proliferation genes. This effect may be associated with a decreased protein level of β-catenin degradation inducer—Siah-1 (Bello et al. 2015). Moreover, accumulation of nuclear β-catenin is found in human HPV-positive cancers (Shinohara et al. 2001).

Similarly, NOTCH is affected in HPV-infected cells. The influence of E6 oncoprotein on NOTCH pathway regulation occurs through disruption of the interaction of MAML (mastermind-like protein) and NICD proteins, a critical component of the Notch signaling pathway. This results in turn in the loss of NOTCH target gene expression, including HEY and HES, which function as potent transcriptional repressors (Das et al. 2021; Galloway and Laimins 2015).

Another interesting aspect of E6 oncogenicity is the ability to interact with cellular proteins containing PDZ domains, such as PSD-95/Dlg/ZO-1 and Scribble, via the PBM motif (PDZ domain binding motif), presents in the C-terminus of the protein. This interaction disrupts intercellular connections between epithelial cells leading to loss of cell polarity which is a known characteristic of neoplastic cells (Gardiol et al. 1999).

In addition to the oncogenic potential of the E6 and E7 proteins, viral integration into the host genome directly triggers genomic instability (Akagi et al. 2014). Moreover, the introduced structural abnormalities may result in the disruption of tumor suppressor genes in the host genome and possibly also upregulate the expression of oncogenes (Schmitz et al. 2012; Lu et al. 2014) and reviewed in Chen et al. (2017) thus altering the function of several tumor-related genes.

Interestingly, one of the factors associated with increased risk of HPV-related malignant transformation may derive from bacterial or viral infections which can trigger alterations in the E6 oncoprotein expression. In this regard, Szostek et al. showed that the presence of Ureaplasma urealyticum–stimulated E6 expression in SiHa cells derived from human cervical squamous cell carcinoma cell line (Szostek et al. 2014).

Although the most known HPV-associated oncoproteins are E6 and E7, the E5 oncoprotein also plays a role in carcinogenesis. Recent studies have confirmed the importance of E5 oncoprotein in the modulation of the immune system and cell transformation. E5 cooperates with E7 and E6 to fuel the development of malignant cells (de Freitas et al. 2017). E5 oncoprotein inhibits apoptosis by increasing the ubiquitination and proteasomal degradation of the pro-apoptotic protein BAX consequently promoting the accumulation of cells with mutations (Maufort et al. 2010). Moreover, as shown for HPV 18, E5 potentiates EGFR signaling leading to increased ERK/MAPK activity and via this pathway is implicated in cell cycle progression. An important function of E5 oncoprotein was also confirmed in the metastatic process and is mediated by the upregulation of the MET oncoprotein (Scott et al. 2018; Hemmat and Baghi 2018).

The non-canonical oncogenic mechanism related to the function of APOBEC enzymes

Recent reports suggest that the canonical mechanism of HPV-driven oncogenesis is not the only mechanism by which the virus contributes to tumor formation. The non-canonical mechanism of viral (including HPV) oncogenicity is related to the function of the APOBEC—apolipoprotein B mRNA editing enzymes. These enzymes belong to the class of APOBEC3s (A3A, A3B, A3C, A3DE, A3F, A3G, A3H) deaminases which become activated in the cell by an interferon-mediated signal cascade triggered by sensing foreign nucleic acids by the protein machinery of the cell (Stenglein et al. 2010). Several proteins like TLR9 and DAI, AIM2, and RNA polymerase III have been identified to sense single-stranded and double-stranded DNA, respectively (Uematsu and Akira 2007; Takaoka et al. 2007; Bürckstümmer et al. 2009; Chiu et al. 2009). Innate immune sensors play a crucial role in recognizing viral nucleic acids, including those from HPV viruses. They enable a rapid immune response to infections by secreting interferon (IFN), regulating IFN-stimulated genes, and activation of the inflammasome complex. For example, the nuclear DNA sensor IFI16 detects single and double-strand DNA particles upon which it moves to the cytoplasm and activates the TBK1-IRF3 and the pro-inflammatory NFκB signaling pathway. Interestingly, it contributes further to IFN expression through interaction with STING1, a major regulator of the innate immune response. It has been recently shown that STING1 can impact virtually all aspects of tumorigenesis (Z. Ghoreshi et al. 2022; Samson and Ablasser 2022).

The overexpressed APOBEC enzymes mediate the deamination of cytidines to uridines in the foreign DNA. As uridines are atypical DNA nucleosides, they are recognized by the UNG2 glycosylase and turned to abasic lesions leading to downstream degradation of the foreign DNA by uracil excision mechanism (Stenglein et al. 2010) (Fig. 2). In fact, it has been demonstrated that this class of enzymes edits HPV DNA in infected cells (Z. Wang et al. 2014). Together, this formulates a protective mechanism that prevents DNA transmission between species.

Fig. 2
figure 2

Degradation of foreign genetic material by the APOBEC deaminases (based on Stenglein et al. (2010) and edited)

As the C → T transition is the predominant mutation in cancer genomes (Greenman et al. 2007), it allows to speculate that the virus-induced APOBEC enzymes might erroneously generate mutations in the genome of the host cell promoting malignant transformation (Suspène et al. 2011). In line with this presumption, the AID deaminase that is physiologically active during B-cell maturation in the germinal centers and mediates the process of somatic hypermutation (Maul and Gearhart 2010) was found involved in malignant transformation not only in B-cell lymphomas but also in nonlymphoid tumors (Okazaki et al. 2007). Moreover, the presence of typical mutations caused by deamination of cytosines introduced by APOBEC enzymes in a characteristic nucleotide context and thus having an APOBEC mutational signature has been demonstrated in genomes of multiple tumors (Alexandrov et al. 2013; Roberts et al. 2013). Interestingly, besides lymphoid neoplasms, APOBEC mutation signatures have been recognized in all HPV-related tumors including cervical cancers, esophageal cancers, or head and neck cancers suggesting an important role of this pathogenetics mechanism in these solid tumors. In fact, the frequent activating mutations E542K and E545K of the PIK3CA oncogene can be to a large extent attributed to the APOBEC-driven mutagenesis (Henderson et al. 2014) and reviewed in Chen et al. (2017). Faden et al. observed recently that APOBEC was the dominant mutational signature in somatic exomes in 79 HPV 16 + oropharyngeal squamous cell carcinoma cases analyzed. The authors assessed APOBEC mutational load in human and viral genomes concluding that there is a direct link in host and viral genome APOBEC-driven mutagenesis during infection. These provide an important link concerning APOBEC mutagenesis in host and virus and support a common mechanism driving APOBEC dysregulation (Faden et al. 2021).

It has been observed that in head and neck cancer patients, the APOBEC3B expression was significantly associated with treatment outcome (Janecka-Widła et al. 2021). In details, patients with high APOBEC3B expression detected in tumors had over two times higher risk of cancer progression than those with lower levels of this protein. In line with this observation, in the subgroup of HPV 16-positive HNSCC patients, those patients having tumors with low APOBEC3B expression survived significantly longer than those with high APOBEC3B expression (Janecka-Widła et al. 2021). Moreover, APOBEC3B overexpression differentiated HPV-negative low-grade oral epithelial dysplasia, which showed intermediate APOBEC3B expression, from oral cancers. The highest levels were found in grade II and III oral cancers suggesting that APOBEC3B may be used as a marker for advanced HPV-positive cancers (Argyris et al. 2021).

Thus, the erroneous functioning of the APOBEC enzymes induced by HPV infections may be an important mechanism of HPV oncogenicity, strictly related to the canonical mechanism described above. The continuous expression of oncoprotein E6 seems to be pivotal for APOBEC3B overexpression in HPV-associated cancers (Vieira et al. 2014). In line with this finding, Mori et al. demonstrated that E6-mediated degradation of the TP53 protein induces expression of the TEAD transcription factor, which is a positive regulator of APOBEC3B transcription (Mori et al. 2017). In addition, TEAD stabilizes the Yes-associated protein (YAP), an oncogenic component of the Hippo pathway (He et al. 2015).

The role of APOBEC enzymes in HPV-driven oncogenesis extends beyond the canonical mechanisms contributing to malignant transformation. Recent studies highlight the complexity and significance of APOBEC-related pathways in HPV-associated cancers; however, this non-canonical mechanism is likely secondary to the dominant mechanism based on HPV oncoproteins.

The putative oncogenic mechanism related to viral miRNAs

Virus-encoded miRNAs not only show auto-regulatory function but have also the potential to interfere with various physiological functions of the infected cell. Viral miRNAs are found in several viral genomes including herpesviruses (Grundhoff and Sullivan 2011; Dass et al. 2023), adenoviruses (Xu et al. 2007), polyomaviruses (Sullivan et al. 2005), and also plant viruses (Sharma and Singh 2016). Such viral miRNAs have been identified in EBV (Epstein-Barr virus) and HCMV (cytomegalovirus) (Pfeffer et al. 2004; Kuzembayeva et al. 2012). It has been shown that the encoded miRNAs have important functions in the process of infection and malignant transformation. For example, in EBV-infected B-cells viral miRNAs reduce immunogenicity by suppressing proinflammatory cytokines such as IL-12. This results in reduced recruitment of CD4( +) effector T cells and subsequent impaired elimination of the infected B-cells (Tagawa et al. 2016). The expression of EBV miRNAs is also brought together with worse survival of EBV-associated gastric cancer (Kang et al. 2017). Furthermore, EBV-encoded miRNAs contribute to Endemic Burkitt lymphoma clinical presentation, progression, and poor outcome (Oduor et al. 2017). Moreover, viral miRNAs were shown to contribute to cell cycle deregulation by posttranscriptional silencing of cell cycle-related genes like CCNE2 (cyclin E2), BRCC3, and EID1 targeted by miR-US25-1 encoded by the HCMV genome (Grey et al. 2010).

In light of the presented findings, it is intriguing whether HPV genomes harbor miRNAs of similar functionalities. A decade ago, two manuscripts were published wherein the authors bioinformatically as well as experimentally demonstrate that the HPV 16 genome encodes beside protein coding also miRNA genes that are expressed, albeit on low level, in human cervical lesions (Qian et al. 2013; Virtanen et al. 2016). The authors identified five miRNAs HPV 16-miR-H1, 2, 3, 5, and 6 dispersed in the viral genome and encoded on the plus or minus strand (Table 1, Fig. 3). These findings have been further validated in parallel studies of other groups (Thakur 2015) and by bioinformatic analysis of NGS data (Weng et al. 2018). Several functions promoting viral infection have been implicated for these miRNAs; however, experimentally validated data demonstrating the viral miRNA:host mRNA interaction is still lacking. Viral HPV 16-miR-H1-1 and miR-H2-1 were predicted to target several genes involved in immune system regulation and development. HPV 16-miR-H1-1 via putative targeting of BCL11A, CHD7, ITGAM, RAG1, and TCEA1 genes is able to inhibit T-cell activation leading to the inhibition of immune system development. Similarly, miR-H2-1 potentially inhibits genes involved in T-cell activation like PKNOX1, SP3, and XRCC4 as well as immune system development (JAK2, PKNOX1, SP3, XRCC4, FOXP1) (Qian et al. 2013). Also, typical cancer-related tumor-suppressive genes are predicted targets of H1 including CAV2, PTEN, and SEMA3F (Qian et al. 2013). The same holds true for the other delineated HPV miRNAs as reviewed by Gallo et al. (Gallo et al. 2020). Gutiérrez et al. identified approximately 900 target genes potentially regulated by HPV-encoded miRNAs. Within this cohort, the largest sub-group, that is 15% of the genes, was involved in cell proliferation followed by regulation of ribosomes (11%) and translation (11%) (Gutiérrez et al. 2018).

Table 1 Putative HPV 16 (reference genome NC_001526.2)–encoded miRNAs and their target genes (Qian et al. 2013; Li et al. 2008; Qureshi et al. 2014)
Fig. 3
figure 3

MiRNAs putatively encoded by the HPV 16 genome (based on Table 1)

Therefore, the cited literature data allow to carefully speculate that the putative HPV-encoded miRNAs, if real, might be involved in three crucial processes in the infected cell. That is (I) immune escape, (II) deregulation of cell cycle and attenuation of tumor suppressor genes leading to increased proliferation, and (III) takeover of the control of the cell’s transcriptional machinery. However, it needs to be stressed that the putative miRNA-driven oncogenic mechanism has a significantly inferior role compared to the function of the oncogenic virus-encoded proteins. Moreover, the functional validation of many of these findings is lacking; the proposed mechanism is speculative and not supported by direct target validation.

Controversies

Although several studies reported the identification of HPV-encoded miRNAs, mainly in human cervical lesions as reviewed before, other authors came to opposite conclusions. Wang et al. by cloning and sequencing of a small RNA library from the HPV 16 + CaSki cell line identified 174 host miRNAs but none of the reported HPV 16 miRNAs and therefore concluded that the HPV 16 virus in the analyzed cell line does not express viral miRNAs (X. Wang et al. 2008). Similarly, Cai et al. reported no viral miRNAs to be expressed from the HPV 31 genome, and Lui et al. reported the lack of HPV-encoded miRNAs in direct sequencing of HPV-positive cervical cancer cell lines (Lui et al. 2007; Cai et al. 2006).

Interestingly, Chirayil et al. developed an approach based on a library of overlapping genomic segments from an analyzed genome subcloned behind a RNA polymerase II promoter resulting in forced expression of the putative miRNAs (miDGE—miRNA discovery by forced genomic expression) (Chirayil et al. 2018). Thereafter, miRNA candidates were identified using small RNA sequencing and downstream assays. Using this approach, the authors screened 73 different PV genomes (most were human-PVs) and identified five viral miRNA candidates, 1 in HPV 17, 1 in HPV 37, 1 in HPV 41, and 2 in FcPV1. However, no miRNA candidates were identified in the genomes of high-risk HPV 16, 18, and others. The authors further screened transcriptomes of cervical carcinomas deposited in The Cancer Genome Atlas (TCGA) (Cancer Genome Atlas Research Network et al. 2017), but similarly did not observe expression of any miRNA from the high-risk HPV genomes (Chirayil et al. 2018). Another argument against the existence of HPV-encoded miRNAs is the fact that none of the predicted top putative mRNA targets suggested in the cited studies like BCL11A, ITGAM, RAG1, CAV2, PTEN, and SEMA3F was identified in the miRNA target search using miRDB performed by us (Table 1) thus showing large inconsistencies between studies.

Altogether, the reviewed findings raise several unanswered questions. What are the apparently HPV-derived sequences found by various groups? Why are the results of different researcher groups not reproduced? Might they derive from co-infection by other PVs or are they only inactive sequences found in the host genome that are remains from past infections? Assuming that these sequences do not encode functional miRNAs, why do they show such enrichment in potential targets encoding important tumor-related genes hardly possible by pure chance? Are these putative miRNAs expressed only at specific time points in the viral life cycle? Or why should high-risk HPVs, in contrast to several low-risk PVs and other viruses, lack miRNAs that facilitate the process of infection and immune evasion? What was the selective force during the evolutionary process? These and other questions need to be answered in further studies.

Conclusions

The mechanism of HPV-related carcinogenesis is much more complex than initially assumed. Since the discovery of the viral oncoproteins in the 80 s of the twentieth century, we witnessed the accumulation of experimental data which have significantly expanded the picture of how the virus infects the cell, takes over the control of its transcription machinery, and eventually leads to tumor formation. In this process, the function of the virus-encoded oncoproteins is supported by triggered genomic instability, accelerated mutagenesis via the APOBEC enzymes, and perhaps also by yet not fully understood viral miRNA sequences.