Introduction

Cervical cancer, a malignant tumour of the genital tract, is the fourth most frequently diagnosed cancer and the fourth leading cause of cancer death in women worldwide, and it poses a serious threat to women’s lives. According to a report that presented global cancer incidence and mortality data from the GLOBOCAN 2018 database produced by the International Agency for Research on Cancer (IARC), there were 569,847 new cases of cervical cancer and 311,365 deaths globally in 2018 [1]. In regions with a low or medium Human Development Index (HDI), as well as in China, cervical cancer ranks second among women in both incidence and mortality, behind breast cancer [1, 2]. According to World Health Organization data [3], cervical cancer accounted for 106,430 new cases and 47,739 deaths in China in 2018. Based on existing epidemiological evidence from urban and rural areas of mainland China, the annual number of new cervical cancer cases is predicted, in the absence of any intervention, to increase dramatically from ∼27,000–130,000 in 2010 to ∼42,000–187,000 in 2050 [4]. The prevention and treatment of cervical cancer is therefore a major public health problem in China.

Persistent infection with human papillomavirus (HPV), the most common sexually transmitted virus [5], causes cervical cancer and cervical intraepithelial neoplasia (CIN). Because of this clear causal relationship, cervical cancer is, unlike many other cancers, a disease that can be prevented, treated and eradicated. CIN is graded on a scale of 1–3 according to how much epithelial tissue is affected, with CIN3 the most abnormal grade. In this study, CIN1 is equivalent to low-grade squamous intraepithelial neoplasia (LSIL), while ≥CIN2 is termed a precancerous lesion or high-grade squamous intraepithelial neoplasia (HSIL). CIN1 carries a risk of progressing to ≥CIN2, and ≥CIN2 carries a risk of progressing to cervical cancer. Of the more than 150 HPV types identified, HPV16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, and 68 are classified as “high-risk HPV” (HR-HPV) because of their relatively high carcinogenic potential [6].

Large-scale cervical cancer vaccination programs have now been launched and have saved many women’s lives [7, 8]. Gardasil® is a commonly used quadrivalent vaccine against HPV6, 11, 16 and 18; HPV6 and 11 are low-risk genotypes that can induce benign genital warts or condylomas [9]. Gardasil 9 was approved by the US Food and Drug Administration (FDA) in 2014 and provides protection against HPV6, 11, 16, 18, 31, 33, 45, 52, and 58 [10]. The preventive effect of HPV vaccination on cervical cancer has been confirmed in multiple studies [11, 12]. However, Gardasil 9 does not cover all HR-HPV types that can cause cervical cancer, and the distribution of HPV genotypes varies between regions and countries, so that the incidence and mortality of cervical cancer also vary geographically [13, 14]. Identifying the distribution of HPV types among cervical lesions of different grades will therefore provide baseline information for decisions on the HPV vaccination program in China, allowing the effectiveness of large-scale vaccination to be assessed and geographical differences in HPV type distribution to be identified. Because of the limitations of current HPV testing technology, the distribution of all HPV types cannot be obtained. Establishing the distribution of the most carcinogenic HPV types will nevertheless provide crucial information for developing a new generation of HPV vaccines suited to China’s national conditions. Although many studies have reported the prevalence of HPV genotypes in precancerous lesions and invasive cancer [15,16,17], reliable large-population studies of HPV distribution have rarely been reported in many developing countries or regions.

In this retrospective cross-sectional study, we recruited patients who had undergone HPV testing and cervical pathological biopsy at the PLA General Hospital from January 2009 to June 2019, identified their HPV genotypes, and analysed the distribution of HPV types in different cervical lesions. In recent years the PLA General Hospital has ranked among the top three hospitals in China for comprehensive strength [18]. The hospital is located in Beijing, the capital of China and the most prosperous city in northern China. Most women from northern China therefore prefer to undergo opportunistic cervical cancer screening, or evaluation of abnormal findings, at this hospital, which provides a patient population large enough to reflect the relationship between HPV genotypes and precancerous cervical lesions. The hospital’s strong clinical and laboratory capabilities also made this study feasible. The purpose of this study is to investigate the distribution of HPV types in northern China and their relationship with the grade of cervical lesions, in order to provide comprehensive scientific evidence to support the future development of regional vaccines.

Methodology

Study population and samples

Patients who underwent HPV DNA testing and cervical pathological diagnosis at the Chinese PLA General Hospital (Beijing, China) from January 2009 to June 2019 were included in this cross-sectional study. To ensure that the two examinations could be meaningfully correlated, the interval between them was limited to 180 days (see Fig. 1). Single and multiple HPV infections were assessed across the different grades of cervical lesion. The overall infection rate of each HPV type and the type-specific HPV prevalence in the different age groups and precancerous lesions were calculated. According to the severity of the cervical lesion, the study population was divided into 3 groups: HSIL, LSIL, and Normal. To examine the relationship between age, HPV type and cervical lesion, five age groups were defined: < 20, 20–34, 35–49, 50–64, 65–79.
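The inclusion rule and the age grouping described above can be sketched as follows. This is a minimal illustration and not the authors' actual procedure: the function names are hypothetical, and the code assumes that the 180-day limit is inclusive and that ages are recorded in whole years.

```python
from datetime import date

MAX_INTERVAL_DAYS = 180  # interval limit stated in the text (assumed inclusive)

# Age bands as listed in the text; lower and upper bounds are inclusive.
AGE_GROUPS = [
    ("<20", 0, 19),
    ("20-34", 20, 34),
    ("35-49", 35, 49),
    ("50-64", 50, 64),
    ("65-79", 65, 79),
]


def is_eligible(hpv_test: date, biopsy: date) -> bool:
    """Return True when the two examinations are at most 180 days apart."""
    return abs((biopsy - hpv_test).days) <= MAX_INTERVAL_DAYS


def age_group(age: int) -> str:
    """Map an age in whole years to one of the five study age bands."""
    for label, lower, upper in AGE_GROUPS:
        if lower <= age <= upper:
            return label
    raise ValueError(f"age {age} is outside the study range")


# Hypothetical patient: HPV test on 1 March 2015, biopsy on 15 June 2015, aged 42.
print(is_eligible(date(2015, 3, 1), date(2015, 6, 15)), age_group(42))
```

The sketch treats the interval symmetrically; if only biopsies performed after the HPV test are to be counted, the `abs` call would be replaced by a check that the difference is between 0 and 180 days.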

Fig. 1 Screening flowchart. CIN, Cervical Intraepithelial Neoplasia; HSIL, high-grade squamous intraepithelial neoplasia; LSIL, low-grade squamous intraepithelial neoplasia; HPV, human papillomavirus

The pathological diagnosis of cervical lesions was used as the gold standard. Cervical cytology was classified according to the 2001 Bethesda system. Experienced pathologists at the PLA General Hospital reviewed every histology slide and classified each finding as negative or as CIN grade 1, 2 or 3. In this study, CIN1 is referred to as low-grade squamous intraepithelial neoplasia (LSIL), and CIN2/3 as high-grade squamous intraepithelial neoplasia (HSIL).

Genotype-specific test

DNA extraction and HPV genotyping were carried out using an HPV genotyping real-time PCR kit (Shanghai ZJ Bio-Tech Co., Ltd) to detect the following 18 HPV types: HPV6, 11, 16, 18, 21, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 67, 68, 82. The lowest detection limit of the kit was 1 × 10⁴ copies/mL. Amplification was performed on a SLAN®-96P instrument (Shanghai Hongshi Medical Technology Co., Ltd) for quantitative estimation of HPV DNA copies.

Statistical analysis

A database was established using Excel 2016, and the results were analysed with SPSS 22.0 software (SPSS Inc., Chicago, IL, USA). The Chi-square test was used for categorical (count) data, and the t-test was used for continuous data. A p value of < 0.05 was considered significant.
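As an illustration of the Chi-square comparison described above, the following sketch computes a Pearson chi-square statistic for a 2 × 2 table using only the Python standard library. It is not the SPSS procedure used in the study: the function name is hypothetical, no continuity correction is applied, and the example counts are the HPV16 figures reported in the Discussion (926/1640 in HSIL, 289/1195 in LSIL).

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]].

    Returns (statistic, p_value) for 1 degree of freedom.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    statistic = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# HPV16-positive vs HPV16-negative counts in HSIL and LSIL.
hsil_pos, hsil_total = 926, 1640
lsil_pos, lsil_total = 289, 1195
statistic, p_value = chi_square_2x2(
    hsil_pos, hsil_total - hsil_pos,
    lsil_pos, lsil_total - lsil_pos,
)
print(f"chi-square = {statistic:.2f}, significant at 0.05: {p_value < 0.05}")
```

The closed-form p value holds only for 2 × 2 tables; larger contingency tables would need the general chi-square distribution from a statistics package.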

Results

Baseline characteristics

A total of 3134 eligible patients who underwent histopathological examination within 180 days after HPV genotype-specific testing were included in the analysis of the proportion of precancerous lesions attributable to each HPV type. As shown in Fig. 2, China can be divided into a southern and a northern part along the Qinling Mountains-Huaihe River line. Although the PLA General Hospital attracts patients from all over the country, 95% of the patients enrolled in this study were from northern China because of the hospital’s location. The mean age of the subjects was 42.06 ± 10.82 years; the youngest was 17 years old and the oldest was 79 years old. Of the 3134 patients, 3029 (96.65%) were positive for HPV and 2747 (87.65%) were positive for HR-HPV. The top five HPV genotypes were HPV16, 58, 52, 51 and 56. As for the pathological results, 1745 (55.68%) women had HSIL, 1354 (43.20%) had LSIL, and 35 (1.12%) had a normal cervix. The distributions of age, cervical pathology, HPV genotypes and single/multiple HPV infection among the 3134 patients are presented in Table 1.
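The baseline percentages above follow directly from the reported counts and the total of 3134 patients. The short sketch below recomputes them; the variable names are illustrative only.

```python
TOTAL = 3134  # eligible patients

# Counts as reported in the text.
counts = {
    "HPV positive": 3029,
    "HR-HPV positive": 2747,
    "HSIL": 1745,
    "LSIL": 1354,
    "Normal": 35,
}

# Percentage of the whole cohort, rounded to two decimals as in the text.
percentages = {name: round(100 * n / TOTAL, 2) for name, n in counts.items()}

for name, pct in percentages.items():
    print(f"{name}: {pct:.2f}%")

# The three pathology groups partition the cohort.
pathology_total = counts["HSIL"] + counts["LSIL"] + counts["Normal"]
print("pathology groups sum to cohort size:", pathology_total == TOTAL)
```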

Fig. 2 Geographical distribution of the recruited patients (n = 3134) in China

Table 1 Distribution of age, cervical pathology, HPV prevalence, the relationship between cervical pathology and HPV infection and Single/multiple HPV infection in patients (n = 3134)

Age distribution of patients with different HPV-type infections

The age-stratified HPV distribution of the patients is shown in Table 2 and Fig. 3. The < 20 age group contained only 3 eligible patients, one of whom was infected with two HPV types, giving infection rates of 25, 50 and 25% for HPV16, 39 and 58, respectively. Among patients aged 20–34, the top five HPV genotypes were HPV16 (26.91%), 58 (12.54%), 52 (10.70%), 51 (7.33%) and 56 (6.09%); among those aged 35–49, they were HPV16 (28.51%), 52 (12.76%), 58 (12.71%), 18 (5.77%) and 51 (5.57%); among those aged 50–64, they were HPV16 (25.92%), 58 (12.14%), 52 (12.14%), 56 (8.88%) and 51 (6.63%); and among those aged 65–79, they were HPV16 (25.17%), 58 (12.58%), 52 (9.27%), 56 (9.27%) and 31 (7.28%). The overall prevalence of HPV genotypes was similar in the 20–34 and 50–64 age groups, which shared the same top five genotypes in a slightly different order. In the 35–49 age group, however, HPV18, which was not among the top five genotypes overall, accounted for 5.77%; likewise, in the 65–79 age group, HPV31, which was not among the top five overall, accounted for 7.28%. HPV16 was the most prevalent genotype in every age group except the < 20 group, in which the sample size was too small.
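The percentages for the < 20 group are consistent with a denominator of four genotype detections rather than three patients. The sketch below reproduces that arithmetic. The per-patient genotype lists are hypothetical, chosen only to match the description (three patients, one of whom carries two genotypes), and the assumption that shares are computed over detections is an inference from the reported figures.

```python
from collections import Counter

# Hypothetical per-patient genotype lists: three patients, one with two genotypes.
patients = [[16, 39], [39], [58]]

# Count every detected genotype, so a co-infected patient contributes twice.
detections = Counter(g for genotypes in patients for g in genotypes)
total_detections = sum(detections.values())

# Share of each genotype among all detections (not among patients).
shares = {g: 100 * n / total_detections for g, n in sorted(detections.items())}
print(shares)
```

With so few patients, a single additional detection shifts each share by many percentage points, which is why this group is excluded from the comparison of genotype rankings.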

Table 2 Age-stratified HPV distribution (15-year interval) in patients (n = 3134)

Fig. 3 Age-stratified HPV distribution in patients (n = 3134, including multiple-genotype infection; HPV6, 11, 16, 18, 21, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 67, 68 and 82; %)

Age distribution of patients in different grades of cervical lesions

In this study, the prevalence of HSIL peaked in patients between 35 and 49 years of age, which was later than the peak for LSIL. The age-stratified distribution of the different cervical lesions is presented in Table 3 and, for clarity, as a line chart in Fig. 4. As shown in Fig. 4, the 35–49 age group accounted for the largest share of the HSIL, LSIL and Normal groups, at 49.70, 44.20 and 42.90%, respectively. In the LSIL group, however, the 20–34 age group also had a relatively high rate of 31.50%, and in the Normal group the 50–64 age group had the second highest rate, at 37.10%. LSIL therefore showed two peaks, at 20–34 and 35–49 years, with no statistically significant difference between them. The onset of LSIL thus peaked as early as 20–34 years of age, around 7 years earlier than that of HSIL, which peaked at 35–49 years. Patients with a normal cervix also showed two peaks, at 35–49 and 50–64 years, with no statistically significant difference between them.

Table 3 Age distribution of different grades of cervical lesions (n = 3134)

Fig. 4 Age distribution of different grades of cervical lesions

Frequency of infection with a single or multiple HPV genotypes in cervical lesions

As shown in Table 4, among all HPV-infected patients, infection with one, two, three, four, five, six, and seven HPV genotypes was detected in 1732, 726, 248, 65, 22, 8 and 2 cases, respectively. The frequency of infection with a single HPV genotype was 55.26%, while that of infection with multiple genotypes was 34.18%. Infection with two genotypes was the most common form of multiple infection, and the largest number of genotypes detected in one patient was seven. Among HSIL patients, infection with one, two, three, four, five, six and seven genotypes accounted for 60.7, 21.6, 7.3, 3.3, 0.8, 0.2 and 0.1%, respectively. As the number of HPV genotypes increased, the proportion attributable to HSIL decreased (Table 4). There were no statistically significant differences in the frequencies of multiple HPV genotypes among the different cervical lesions, suggesting that a larger number of HPV genotypes did not increase the risk of HSIL. Fig. 5 shows the distribution of single and multiple genotype infections across the HSIL, LSIL and Normal groups.

Table 4 Frequency of multiple HPV types in women with different precancerous grades

Fig. 5 Multiple HPV genotype infections stratified by cervical pathological result: HPV (−), single type, two types, three types, four types, five types, six types (%)

Distribution of HPV genotypes in women with different cervical lesions

Among the 3134 patients who underwent histopathological examination within 180 days after HPV testing, there were 3099 precancerous cases and 35 normal cases. In total, 3029 (96.65%) cases were positive for HPV and 2747 (87.65%) were positive for HR-HPV. According to Table 5, HPV16 (56.46%) was the most frequent genotype in the HSIL group, followed by HPV58 (18.41%), 52 (16.22%), 31 (8.23%) and 51 (7.68%). Notably, HPV16 contributed more to HSIL than HPV52 or 58, and there was no significant difference between the distributions of HPV52 and 58. HPV16 (24.18%), 52 (22.01%), 58 (21.34%), 56 (14.23%) and 51 (13.05%) were the most frequently detected types in LSIL, and there was no statistically significant difference among the first four of these types in their contribution to LSIL. The proportion of cases with HPV16 was markedly higher in the HSIL group (56.46%) than in the LSIL group. In patients with a normal cervix, HPV16 was also the dominant genotype, accounting for 40.0%. HPV31 (25%), 6/11 (20%), 52 (15%), 58 (15%) and 66 (15%) were the other common types detected in normal cases.

Table 5 Distribution of HPV infection with a single genotype and multiple genotypes in women (n = 3134)

Discussion

Although this study was a single-institution survey and was therefore geographically limited, most of the study population came from northern China because of the location of the PLA General Hospital (Beijing), so the sample is broadly representative of women in northern China. In addition, this was a long-term study, spanning more than 10 years from January 2009 to June 2019, and it provides the most current data for a large population in northern China.

Among patients who underwent both HPV testing and pathological examination, the peak age of onset of precancerous lesions was between 35 and 49 years, whereas a previous study [19] published in 2017 reported a peak age between 30 and 39 years. There was no statistically significant difference in the age distribution of the two study populations: the mean age was 42.06 ± 10.82 years in our study and 40.93 ± 11.87 years in that study [20]. The peak age of onset of HSIL in our study was around 7 years later than in that study [20], probably because the studies covered different parts of China and because attention to cervical cancer prevention has increased in recent years. The relationship between single/multiple HPV infection and the grade of cervical precancerous lesion was also examined. Single HPV infection had the highest prevalence in both HSIL and LSIL, and there were no differences in the frequencies of multiple types among the different cervical lesions, suggesting that a larger number of HPV types did not increase the risk of HSIL, which is consistent with previous studies [20, 21].

HPV16 was the most common type in patients with HSIL (CIN2+), the most severe classification in this study. HPV16 had the highest prevalence at every level of cervical lesion: HSIL (56.46%, 926/1640); LSIL (24.18%, 289/1195); Normal (40%, 8/20). In other studies, HPV16 was also the most common type in patients with precancerous lesions in Pishan county, Shenyang city, Shenzhen city and other parts of China [22,23,24], as well as worldwide [15, 25, 26]. However, some studies found that HPV52 was the most common genotype in rural North China [27] and in Jiangsu, Guangdong and other provinces of China [28, 29]. Together, these studies indicate that HPV genotype distributions vary between regions.

In this study, although neither HPV52 nor HPV58 was detected as often as HPV16, their combined contribution was substantial and deserves special attention. According to our data, HPV16, 58 and 52 together accounted for 91.1% of all HSIL (CIN2+) lesions, and HPV58 and HPV52 together accounted for 34.6% of HSIL (CIN2+) cases. Nevertheless, under the updated cervical cancer screening guidelines [30], only women who are HPV16/18 positive are recommended to undergo colposcopy directly; women who are not HPV16/18 positive are referred for cytology first and for colposcopy only if the cytology is abnormal. Our findings suggest that special attention should also be paid to HPV58 and 52 in screening procedures in northern China. Combining specific HPV types with their attributable proportions of HSIL can provide more accurate and effective information for cervical cancer screening programs in specific regions. Currently, the bivalent Cervarix® and both the quadrivalent and 9-valent Gardasil® vaccines are approved by the National Medical Products Administration of China. However, the HPV vaccine has not been added to the National Immunization Program. Recently, the National Infectious Disease Diagnostic Reagent and Vaccine Engineering Technology Research Centre of Xiamen University developed the first domestically produced bivalent HPV vaccine against HPV16 and 18, which is expected to reduce patients’ economic burden because of its relatively low price. This study is intended to provide important data that will benefit the further development of a national multivalent HPV vaccine.
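The combined figures quoted above (91.1% and 34.6%) correspond to simple sums of the genotype-specific HSIL percentages reported in the Results. The sketch below reproduces that arithmetic. It assumes that the percentages can be added directly, which would overstate the combined share if some patients are co-infected with more than one of these genotypes; the text does not state how co-infections were handled.

```python
# Genotype shares in HSIL as reported in the Results (percent).
hsil_share = {"HPV16": 56.46, "HPV58": 18.41, "HPV52": 16.22}

# Direct sums, assuming no overlap between genotypes (an assumption, see above).
combined_all = sum(hsil_share.values())
combined_58_52 = hsil_share["HPV58"] + hsil_share["HPV52"]

print(f"HPV16 + 58 + 52: {combined_all:.1f}%")
print(f"HPV58 + 52: {combined_58_52:.1f}%")
```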

It is important to note that the overall distribution and attributable proportion of each HPV genotype elsewhere in the world differ from those of the northern Chinese population in this study. HPV16, 18 and 45 were the most prevalent HPV types associated with cervical lesions in a worldwide pooled study [31]. Consistent with that study, we also found that HPV16 accounted for the highest proportion of HSIL (CIN2+). However, there were some clear differences between the worldwide pooled study [31] and our study: 1) HPV18 had a higher infection rate than HPV58 and 52 worldwide, which was not the case in our study; 2) HPV52 and HPV58, rather than HPV45, were among the most frequently detected types in our study; 3) HPV51 and HPV56 were detected in 9.03% and 8.71% of the examined women, respectively, more than the 8.04% for HPV18. There are three possible reasons: first, we did not investigate the prevalence of HPV types in invasive cervical cancer, and HPV18 is more frequently present in adenocarcinoma than in precancerous lesions [32, 33]; second, HPV58 is more frequently detected in cervical cancer cases in Asia than in Europe or Africa; third, the distribution and attribution of HPV types vary geographically, not only between different parts of China but also between different regions of the world.

Conclusion

This study suggests that, in northern China, the peak age of onset of HSIL/LSIL is between 35 and 49 years. Infection with multiple high-risk HPV types did not increase the risk of high-grade squamous intraepithelial lesion. HPV16, HPV58, and HPV52 were the dominant high-risk genotypes, together accounting for 91.1% of all HSIL (CIN2+) lesions. In addition to HPV58 and 52, HPV51 and HPV56 should also be taken into consideration when developing vaccination programs in northern China. The results of this study provide critical epidemiological evidence and baseline data for the further development of a national multivalent HPV vaccine program.