Introduction

Cervical cancer screening and management guidelines published by the United States Preventive Task Force on Cervical Cancer Screening (2018), widely endorsed by professional societies, rely on a combination of cytology, human papillomavirus (HPV) testing, colposcopy and tissue biopsy to determine appropriate clinical management of women participating in screening (Force et al. 2018). This screening program has been monumental in reducing cervical cancer incidence in the U.S. through treatment intervention in women with high-grade cervical intraepithelial neoplasia (HGCIN). A byproduct of this highly effective cancer prevention program is that, in previously published estimates, between $1 and 2 billion USD in healthcare expenditures was attributed to the follow-up management of women with screen positive tests (Chesson et al. 2012), and nearly a quarter of a million U.S. women were diagnosed with low-grade cervical intraepithelial neoplasia (LGCIN) (Henk et al. 2010). Vaccine uptake and changes to screening guidelines (HPV co-testing and increasing screening intervals for some women) have undoubtedly impacted these estimates in more recent years; however, one thing remains clear: low-grade dysplasia resolves without treatment for most women and progresses to high-grade dysplasia in fewer than one in ten women (Baseman and Koutsky 2005). While treatment intervention is compulsory for HGCIN, the clinical management algorithm for LGCIN remains complex and relies heavily on management-by-observation approaches with repeat screening tests at short-term intervals. It has been reported that on average, women with a LGCIN diagnosis undergo between 6 and 8 clinical encounters to monitor their dysplasia (Henk et al. 2010; Insinga et al. 2004). This observational approach has undesirable consequences. Repeat patient encounters in women with dysplasia poses risks for adverse effects (pain, cramping, bleeding, vaginal discharge, infection, dysmenorrhea and scarring associated with coloposcopy/biopsy) and psychosocial trauma (Gray et al. 2006; Sharp et al. 2009; McCaffery et al. 2010). Further, this approach delays treatment for those women who will experience dysplasia progression, creating the potential to miss the opportunity to intervene and prevent cancer development. The latter is of particular concern, as a significant proportion of cervical cancer cases can be attributed to inadequate management of dysplasia despite adequate screening (Spence et al. 2007). This failure in the cancer prevention care continuum is exacerbated in populations experiencing barriers to healthcare access that interfere with adherence to follow-up care, thereby contributing to socioeconomic and racial/ethnic disparities in cervical cancer incidence (Musselwhite et al. 2016). Notably, observational management guidelines were developed to address concerns of over-treatment of dysplasia in the absence of strong predictive biomarkers of LGCIN progression and pharmaceutical interventions that promote resolution of LGCIN. A better understanding of the biology underlying progression of LGCIN is needed to address these limitations in the cervical cancer prevention program.

MicroRNAs (miRNAs) are ~ 21 nucleotide, non-coding RNAs that regulate protein expression through binding of cognate sequences in the regulatory regions of mRNA transcripts. Dysregulation of miRNA expression is observed in disease states, and disruption of the miRNA-associated epigenetic regulatory mechanisms is often associated with loss of tumor suppressor function or gain of oncogene expression in solid tumors (Chen et al. 2012). MicroRNA dysregulation has been observed in cervical cancer and CIN2 + compared to healthy cervical tissue (Cheung et al. 2012; Wilting et al. 2013). Further, miRNA dysregulation has been observed in pre-neoplastic conditions and has demonstrated the potential to reveal cellular pathways involved in disease progression (Oberg et al. 2011; Stachowiak et al. 2017; Craig et al. 2020). To explore the miRNA regulatory pathways associated with cervical dysplasia progression, we conducted a case–control study in which we performed comprehensive miRNA expression profiling in low-grade cervical dysplasia tissue specimens from women with observed clinical outcomes. By comparing the miRNA expression profiles obtained for women whose low-grade dysplasia resolved naturally to that of women whose low-grade dysplasia progressed to high-grade, we identified miRNA dysregulation associated with dysplasia progression and revealed cellular pathways potentially involved early in advancement of cervical pre-neoplasia.

Methods

Screening of candidates for the study

Women who attended the University Medical Center/Interim LSU Hospital (New Orleans, Louisiana, USA) for cervical cancer screening from 2006 to 2018 were considered for inclusion in this study. Women with a recorded diagnosis of low-grade cervical intraepithelial neoplasia (LGCIN/CIN1) were screened for eligibility as a case (women with LGCIN and subsequent high-grade cervical intraepithelial neoplasia (HGCIN/CIN3)) or control (women with LGCIN and evidence of resolution without medical intervention). Two methods were employed to capture suitable candidates. The first method, which pre-dated the implementation of the hospital electronic medical record system, relied upon query of an electronic database of archived pathology specimens. A list of potential study candidates was generated by including women who had at least one cervical specimen within two calendar date windows separated by ≥ 6 months. The captured records were then further screened for predetermined eligibility criteria (described below). This method yielded a sufficient sample size of women eligible for inclusion as controls in the study but failed to yield sufficient cases for the study. As the hospital implemented the comprehensive electronic medical record system, queries were conducted of this system to capture all women with a diagnosis of high-grade cervical dysplasia using the medical billing diagnostic codes ICD-9-CM 622.1 (Dysplasia of cervix uteri) and ICD-9-CM 233.1 (Carcinoma in situ of cervix/CIN3) for dates prior to October 2015, and ICD-10-CM N87 (Dysplasia of cervix uteri) and ICD-10-CM D06 (carcinoma in situ of cervix uteri/CIN3) for dates including and after October 2015. The electronic medical record of each woman returned in the query was then reviewed to identify women with prior LGCIN diagnosis. Complete history of gynecological health encounters was reviewed for each study candidate, including results of cervical cytology, HPV co-testing, cervical histopathology, p16 immunostaining and any treatment interventions (cervical conization [cone], loop electrosurgical excision procedure [LEEP], or other surgical procedure) performed.

The study baseline visit was defined as the clinical encounter at which a woman was determined to have biopsy-proven LGCIN/CIN1 on histopathology. Women for whom a LGCIN diagnostic biopsy was not available in pathology archives were excluded. Women were also considered ineligible for the study if they were known to be HIV seropositive, if they had a history of high-grade cervical dysplasia or cervical cancer that pre-dated the baseline visit, if they had other gynecological comorbidities or if they lacked a cervix due to prior hysterectomy. A candidate who passed this initial screening was included as a case in the study if she had biopsy-confirmed high-grade cervical dysplasia (grade 3 cervical intraepithelial neoplasia, CIN3, HGCIN) in the vicinity (within two clock positions) of the baseline LGCIN biopsy. Further stringency was added by eliminating candidates with concurrent low-grade and high-grade CIN diagnoses at the study baseline visit. Women were included in the control cohort if they had cytological evidence of cervical health following the baseline visit in the absence of documentation of treatment (cone, LEEP, etc.). Further stringency was added by excluding candidates with excessive tissue removal by biopsy and those with insufficient clinical encounters to determine a clear clinical outcome. Women with persistent LGCIN were excluded from both cohorts.

MicroRNA expression analysis

The formalin-fixed, paraffin-embedded tissue biopsy corresponding to the baseline visit LGCIN diagnosis for each case and control subject was obtained from pathology archives. Total RNA including small RNA component was isolated from three 5-micron slices of tissue using the miRNeasy FFPE kit (Qiagen, Germantown, Maryland, USA) according to the manufacturer’s protocol. Total RNA extracts were shipped on dry ice to LC Sciences, LLC (Houston, Texas, USA) for microRNA microarray profiling. Quality control tests were run on each specimen upon arrival and specimens that failed QC were excluded from further analysis. Comprehensive microRNA microarray profiling services included probes for all human microRNAs (2632 mature miRNAs) annotated in miRbase 22 (Griffiths-Jones et al. 2006) with additional small non-coding RNA controls. Raw data were background-subtracted and normalized using cyclic locally weighted scatterplot smoothing (LOWESS) pair-wise regression (Bolstad et al. 2003). In preparation for clustering and differential expression analyses, additional data adjustment included filtering, log2 transformation, gene centering and normalization. Differential expression analysis compared miRNA expression values in cases to those of controls using t-test. Hierarchical clustering analysis was performed using average linkage and Euclidean distance metric, and results were visualized using TIGR Multiple Experiment Viewer software (the Institute for Genomic Research). Dot plot diagrams were created and analyzed in GraphPad Prism (v.7). Average normalized miRNA expression values were calculated, and Student’s t-test was used to compare cases to controls. Alpha level p ≤ 0.05 was considered statistically significant.

Predicted miRNA target search

Potential mRNA targets of the miRNAs of interest were predicted using TargetScan 7.2 (Agarwal et al. 2015). Due to the high potential for false positives, the results were limited to a cumulative weight context score of < − 1.0. Predicted targets were further validated using additional online target prediction algorithms miRDB, miRWalk, and miRMap (Chen and Wang 2020; Sticht et al. 2018; Vejnar and Zdobnov 2012). MirPathDB2.0 was utilized to identify gene ontology pathways enriched for miRNA:targets discovered in the dataset (Kehl et al. 2020). Dysregulated miRNAs were input into the database and unions were analyzed for all pathway databases MiRPathDB2.0 contained, which included Kyoto Encyclopedia of Genes and Genomes (KEGG), Gene Ontology (GO), Wikipathways, and Reactome. Heat maps were generated with pathways enriched for ≥ 10 miRNAs (KEGG) or ≥ 16 miRNAs (GO), and all miRNAs with ≥ 1 predicted target, included in the map.

Results

Case–control cohort

Women participating in cervical cancer screening at the University Medical Center hospital in New Orleans, LA with a diagnosis of LGCIN (n = 837) were screened for inclusion in the study. The study enrollment diagram is shown in Fig. 1. Women with low-grade CIN and a subsequent diagnosis of high-grade CIN were further screened for inclusion as cases in the study, for a total enrollment of 29 women. Women with low-grade CIN and subsequent clinical indications of restoration to healthy cervical tissue without evidence of medical intervention were included as controls in the study, for a total enrollment of 31 women. The final dataset included only those samples with high-quality miRNA microarray results (22 cases, 28 controls). Characteristics of the cohort are reported in Table 1. All participants with available HPV results had tested HPV positive. Most participants (74%) entered colposcopy follow-up with a prior atypia or low-grade squamous intraepithelial lesion (LGSIL) cytology result. Cases were marginally older than controls (mean age 34 vs 29, p = 0.049). Results of p16 immunostaining supported the HGCIN diagnosis in all cases for which immunostaining study was performed. The average observation period for controls (39 months) exceeded the average time to progression for cases (21 months).

Fig. 1
figure 1

Diagram of LGCIN cohort enrollment

Table 1 Characteristics of cohort

Differential expression of miRNAs in low-grade CIN stratified by clinical outcome

We performed comprehensive miRNA microarray analysis for expression of 2632 known human miRNAs in low-grade cervical biopsy tissues. Differential expression analysis of miRNA profiles obtained from cases and controls revealed 29 miRNAs to be significantly dysregulated (p < 0.01; Fig. 2). Of these, 24 were downregulated while only 5 miRNAs were found to be upregulated in cases compared to controls (See Supplemental Information 1). Next, we plotted the normalized expression values for each case and control patient for four of the dysregulated miRNAs (Fig. 3). On average, increased expression of miR-1260a and decreased expression of miR-638, miR-4508, and miR-4488 was observed in biopsies of LGCIN that progressed compared to biopsies of LGCIN that resolved.

Fig. 2
figure 2

Heat map of microRNAs significantly dysregulated (p < 0.01) in LGCIN that progressed to HGCIN (cases) compared to LGCIN that resolved (controls). Green gradient represents downregulation and red gradient represents upregulation of the miRNA

Fig. 3
figure 3

Dysregulation of microRNAs in LGCIN that progressed to HGCIN. Normalized average fluorescence intensity of replicate miRNA probes are plotted for each case (LGCIN that progressed) and control (LGCIN that resolved) subject. Middle bars represent the group mean and whiskers represent the standard deviation. Two-tailed Student’s t-test was used to compare the means of the groups and p < 0.05 was considered significant. Panel A miR-638; panel B miR-1260a; panel C miR-4488; panel D miR-4508

Gene ontology analysis to identify pathways enriched in progressive LGCIN

To identify cell signaling pathways potentially activated in cervical dysplasia progression, we performed pathway enrichment analysis using miRPathDB 2.0 (Kehl et al. 2020). MicroRNAs (n = 24) found to be downregulated in progressive LGCIN were input into the discovery algorithm, and predicted unions were analyzed using both the KEGG and Gene Ontology pathway databases. Gene Ontology discovery identified 67 pathways enriched with putative mRNA targets of the downregulated miRNAs (Fig. 4 and Supplemental Information 2). Pathways related to tumor growth and progression were identified among the enriched pathways, including the GO terms transcription activation, signal transduction (small GTPase and Ras), angiogenesis, and metastasis (cellular motility, migration and locomotion). KEGG pathway analysis confirmed enrichment of miRNA targets in Ras signaling and revealed other well-known cancer related pathways including Wnt, ErBb and MAPK signaling (see Supplemental Information 3). These data suggest that miRNA dysregulation during LGCIN contributes to the progression of LGCIN to HGCIN by promoting and enhancing pathways traditionally associated with cancer.

Fig. 4
figure 4

Heat map of the gene ontology biological processes pathways enriched for predicted mRNA targets of the 24 miRNAs significantly downregulated (p < 0.01) in LGCIN that progressed to HGCIN. Blue gradient represents enrichment score. Pathways were included if they met a threshold of ≥ 16 predicted miRNA:mRNA target pairs within the pathway

Predicted transcript targets of dysregulated microRNAs

In the KEGG pathway analysis, Wnt signaling emerged as the most targeted pathway with 14 miRNAs predicted to regulate the pathway (Additional file 3). Specific transcript targets within the Wnt signaling pathway were explored by comparing target predictions of four databases: TargetScan, miRDB, miRmap, and miRWalk (Agarwal et al. 2015; Chen and Wang 2020; Sticht et al. 2018; Vejnar and Zdobnov 2012). Several members of the Wnt family of proteins were predicted by multiple databases to be targeted by one or more miRNA(s) in our dataset. WNT6 was predicted to be targeted by miR-2861. MicroRNAs miR-1275 and miR-3620-5p were strongly predicted to target WNT5A, and WNT7B was a predicted target of miR-1275 and miR6756-5p. Interestingly, miR-4492 was highly predicted to target TCF7L1, a transcription factor activated by Wnt signaling. Reduced expression of miRNAs collectively targeting Wnt signaling family members suggests the possibility that Wnt signaling is activated in the progression of low-grade cervical dysplasia.

Discussion

In this report, we have identified differential expression of miRNAs that may be exploited to triage women with LGCIN for appropriate clinical management. These miRNAs may also have a mechanistic role in dysplasia progression. MicroRNA-638 was found to be downregulated in women with LGCIN that progressed to HGCIN. This miRNA is a widely studied tumor suppressor found to be downregulated in cervical cancer as well as leukemia, glioma, melanoma, and cancers of the lung (non-small cell), colon, liver, stomach, breast, esophagus, oral cavity and endometrium (Chong et al. 2021; Wei et al. 2017; Yuan et al. 2020). MicroRNA-638 has been shown to target apoptotic pathways, cell proliferative pathways and metastatic pathways (Chong et al. 2021). Conversely, miR-1260a was upregulated in women with LGCIN that progressed. Upregulation of miR-1260a has been linked to many cancers including cancers of the sex organs (ovarian cancer in women and prostate cancer in men) (Ghafour et al. 2021; Said et al. 2018), and based on studies of the closely related miRNA miR-1260b, may promote invasion/metastasis (Morita et al. 2020; Xu et al. 2018). The dysregulation of miR-638 and miR-1260a at the early stages of cervical dysplasia may facilitate advancement to high-grade cervical dysplasia by causing dysregulation of pathways involved in oncogenesis.

Other differentially expressed miRNAs identified in our study have potential to be involved in oncogenesis; however, their dysregulation is inconsistent across cancer biology studies. MicroRNA-4508, found to be downregulated in progressive LGCIN in our study, was reported to be downregulated in bladder cancer (Sabo et al. 2020). Likewise, Nagy et al. analyzed colorectal precancerous lesions and found miR-4508 to be downregulated twofold as the severity of the polyps increased from tubular to tubulovillous adenoma (Nagy et al. 2017). However, miR-4508 was shown to be upregulated in breast cancer (Boo et al. 2016). Like miR-4508, miR-4488 also demonstrates conflicting dysregulation patterns in cancer. Consistent with our finding of miR-4488 downregulation in progressive LGCIN, miR-4488 has been found to be downregulated in liver and breast cancer (Cui et al. 2014; Zheng et al. 2020; Matamala et al. 2015). However, upregulation of miR-4488 was reported in esophageal adenocarcinoma and colon cancer (Ames et al. 2017; Drahos et al. 2016). Furthermore, Yuan et al. reported 2.85-fold log2 induction of miR-4488 in cervical cancer tissue compared to healthy tissue from the same patient (Yuan et al. 2020). It is possible that there may be a bimodal pattern of miR-4488 dysregulation during the progression of cervical dysplasia to cancer that permits different oncogenic processes at different stages of cancer development.

It is important to note that no single miRNA emerged as a clear, stand-alone predictor of LGCIN progression in our study. This is not surprising given that multiple miRNAs target single mRNA transcripts; multiple transcripts are involved in individual signaling pathways, and individual miRNAs regulate multiple mRNA transcripts. This complex and dynamic relationship between miRNAs and their targets must be taken into consideration when investigating the role of miRNAs in the pathophysiology of disease. Our gene ontology analysis that included the entire network of downregulated miRNAs and their targets revealed Wnt signaling as a putative dysregulated pathway in LGCIN that progressed to HGCIN, which is consistent with reports of aberrant Wnt signaling in cervical carcinoma (Perez-Plasencia et al. 2007; Liang et al. 2016; Zhao et al. 2019). Previous reports have implicated overexpression of Wnt5A and Wnt-11 in cervical cancer invasion and metastasis (Lin et al. 2014; Wei et al. 2016); our analysis suggested possible acquisition of cell motility and migratory traits in LGCIN that progressed, potentially through deregulation of Wnt signaling mediators secondary to loss of miRNA-mediated translational suppression. The enhancement of these putative signaling pathways and the phenotypic consequences of miRNA dysregulation in progressive LGCIN are areas ripe for further empirical investigation.

Our study has notable limitations. First, we relied upon the subjective evaluation of the histopathology grade of baseline LGCIN specimens that was documented in the medical record by the clinical pathologist. Notably, misclassification of baseline histopathology may account for some individuals that progressed, underscored by the one subject in the case group that had HSIL cytology result at baseline (Table 1). We argue that whether or not the LGCIN was an accurate diagnosis, it was in fact the documented diagnosis on the clinical chart that informed the course of clinical management for women in our cohort. Thus, if our study included cases that were misdiagnosed at baseline in addition to those that truly progressed from LGCIN at baseline to HGCIN at the end of the study period, then this reflects the real-world clinical experience, and our molecular findings are pragmatic from the perspective of clinical application. Indeed, the potential for misclassification of subjectively scored histopathology grade highlights the need for more objective molecular tests to better identify women who will ultimately receive a diagnosis of HGCIN.

The retrospective study design restricted our ability to account for known modulators of cervical dysplasia risk, including HPV genotype and lifestyle factors such as smoking, sexual activity, and hormonal contraceptive use (Nagelhout et al. 2021; Cancer 2009; Cancer et al. 2007; Iversen et al. 2021). However, we excluded women with gynecologic comorbidities and human immunodeficiency virus infection, which were predicted to influence miRNA expression patterns. This study also suffered from small sample size, in part due to inconsistent intervals between clinical encounters that obscured definitive clinical outcomes for many women with LGCIN diagnoses. Large prospective cohort studies that can systematically account for confounding factors and execute consistent follow-up intervals are needed to improve the predictive value of miRNAs for LGCIN outcomes. In addition, the limited amount of residual tissue remaining in the archived specimens precluded further confirmatory studies such as repeat histopathology and/or p16 immunostaining to verify the documented clinical diagnosis, in situ hybridization for dysregulated miRNAs or immunostaining for markers of putatively dysregulated signaling pathways. Despite these limitations, the data presented offers novel insights into early neoplastic changes that may lead to cervical dysplasia progression.

Conclusions

MiRNA dysregulation observed in our study was consistent with prior reports in cancer tissues, including cervical cancer. Our data suggest that early changes in miRNA expression may aid in LGCIN progression by altering cancer-associated pathways within the dysplastic tissue. Further studies are necessary to understand the functional role of dysregulated miRNAs and their cognate targets in cervical dysplasia advancement. Increased understanding of the biological processes associated with LGCIN progression will create opportunities for innovative pharmaceutical interventions and objective, prognostic-focused adjunct tests that will improve clinical management of this common women’s health issue.