Introduction

Globally, in 2018, about 70% of cervical cancers were attributable to high-risk human papillomavirus (HPV) types 16 and 18 [1]. In South Africa, HPV16 and 18 are important causes of cervical cancer and are included in the currently available HPV vaccine.

The HPV early (E6 and E7) oncoproteins and the late (L1) protein are encoded by the HPV genome. The HPV oncoproteins play an important role in the tumorigenesis of cervical cancer [2]. Humoral immune response against major capsid late protein (L1) is generated during infection with HPV; which is a marker of past or present infection [3]. Furthermore, integration of the HPV genome in the host cell results in overexpression of E6 and E7 oncoproteins, which are involved in the transformation and progression of cervical and other HPV-related cancers [4]. Different serological assays have been used to screen for HPV-16 and -18 (E6 and E7) [5,6,7,8] oncoprotein antibodies. Serological markers for HPV-antibodies provide useful data about past and current HPV infections and their relationship to cervical cancer [9, 10].

Previous studies have shown that HPV types 16 and 18 L1, E6 and E7 antibody proteins are associated with cervical cancer susceptibility [6, 8, 11,12,13] however, with varying degrees of strength of association. For example, in a case–control study from Russia, Zumbach et al. [8] utilized an enzyme-linked immunosorbent assay (ELISA) and reported odds ratios (ORs) for cervical cancer of 64.4 (95% CI 3.8–1085) for HPV16 E6 and 4.9 (95% CI 1.3–18.7) for HPV16 E7. In another case–control study from Algeria and India, Combes et al. [10] used a multiplex immunofluorescent HPV serology assay and reported ORs for cervical cancer of 37.1 (95% CI 13.4–103) for HPV16 E6, 12.5 (95% CI 6.3–24.8) for HPV16 E7 and 5.9 (95% CI 3.1–11.1) for HPV16 L1. Earlier work on the Johannesburg Cancer Study (JCS) from 946 human immunodeficiency virus (HIV)-negative women with cervical cancer and 1,342 controls using an ELISA assay and different cut-offs found ORs between moderate and high HPV-16L1 seropositivity of 1.5 (95% CI 1.2–1.9) and 2.4 (95% CI 1.9–3.0) [14]. Thus, multiple serologic methods had been used to characterize the relationship between cervical cancer incidence and HPV.

Less is known about the role of high-risk HPV oncoproteins among cervical cancer patients in black African women who are HIV-positive and HIV-negative. In South Africa, despite high HIV prevalence (24.1%) among black South African women [15], studies on HPV16 and 18 (E6 and E7) and late (L1) seropositivity and the risk of cervical cancer are still lacking. We therefore assessed the seropositivity of HPV -16 and -18 early (E6 and E7), and late (L1) antibodies among cervical cancer cases and infection unrelated cancer controls using a multiplex HPV serology assay [16], and estimated their association with cervical cancer risk among HIV-positive and HIV-negative women.

Methods

Setting and participants

The study population were participants recruited into the Johannesburg Cancer Study (JCS) between 1995 and 2016. The aims of JCS included measuring the relative importance of known and emerging risk factors for cancer in a black African population in Johannesburg, South Africa. The details of the JCS have been described elsewhere [17]. Briefly, the JCS collected serum samples from 26,000 consecutive consenting black South African patients with newly diagnosed cancers (> 90% histopathology confirmed), referred to the medical oncology and radiation therapy wards of (the main tertiary public) Charlotte Maxeke Johannesburg Academic Hospital and associated referral hospitals and clinics. Self-reported data on demographics and key lifestyle risk factors were collected using a structured questionnaire. The JCS also collected venous blood in one serum separation tube (SST) and one Ethylenediaminetetraacetic acid (EDTA) tube. SST serum was separated from whole blood within two days of sample collection and the sample was divided into a maximum of 4 aliquots. HIV testing was done using the Vironostika HIV Ag/Ab kit [18]. The study was approved by the University of the Witwatersrand Human Research Ethics Committee (Medical) (certificate number: M200252).

Cases and controls

The JCS is amenable to a case–control design and analysis by selecting controls that are unrelated to the exposures of interest. The current study was restricted to women aged 25 to 54 years because a large proportion of the older women did not have serology samples available. Cases were 1346 women with newly diagnosed cervical cancer (C53). Controls were 2532 women with newly diagnosed infection unrelated cancers (breast (C50) (n = 1953), colon (C18-20) (n = 197), oesophagus (C15) (n = 48), endometrium (C54-55) (n = 75), lung (C33-34) (n = 76), pancreas (C25) (n = 21) and minor cancer types (n = 162) (Additional file 1: Table S1)). Figure 1 outlines the criteria used to select cases and controls.

Fig. 1
figure 1

Flow chart of the case–control selection for the subject with HPV-serology results among women aged 25–54 years in the Johannesburg Cancer Study

Serology data and laboratory methods

Serum aliquots were stored at − 25 °C and one aliquot was shipped on dry ice to the German Cancer Research Center (Deutsches Krebsforschungszentrum (DKFZ)) in Heidelberg, for antibody testing for HPV16 and 18 (L1, E6 and E7) using a multiplex serological assay. This was based on a glutathione S-transferase (GST) capture immunoassay in combination with fluorescent beads on a Luminex platform [16]. The final net (bead and GST background subtracted) Median Fluorescence Intensity (MFI) values of all samples were analysed at a dilution of 1:1000. Previously established standard cut-offs were generated at 1:100 serum dilution [19, 20] and were thus not applicable. Using the Visual Inflection Point (VIP) method [21], we defined net MFI cut-offs of 175 for HPV16 and 18 L1, 75 for HPV16 and 18 E6, 120 for HPV16 E7, and 70 for HPV18 E7. For HPV antibodies, we generated five extra binary variables to describe high-risk combinations for the presence of either E6 /E7 singly or in combination (HPV16 E6 & E7, HPV18 E6 & E7, HPV16&18 E6, HPV16 &18 E7, and HPV16/18 E6/E7).

Statistical analysis

Data on demographic characteristics were summarized using frequencies and percentages for the categorical variables, medians and the interquartile range (IQR) for the continuous variable that was not normally distributed. A Pearson's Chi-squared test (for categorical variables) and t-test (for age) were used for the comparison of demographics between cases and controls. To assess cross-reactivity between the proteins, a tetrachoric correlation coefficient rank test [22] with Bonferroni adjustments (to take account of multiple comparisons) was used to analyze the correlation between each HPV16 and 18 (L1, E6 and E7) antibodies and HIV antibodies (See Additional file 1: Fig. S2). In the control group, we calculated the seroprevalence of antibodies to HPV16 and 18 (E6 and E7) oncoproteins and L1 protein across different demographic and lifestyle factors, such as age (25–34, 35–44, 45–54 years), place of residence (rural, urban), period of interview (1995–1999, 2000–2004, 2005–2009, 2010–2016), marital status (never married, married, previously married), educational level (none, primary, secondary and above), number of sexual partners (0–1, 2–5, 6+) HIV-status (negative and positive) and parity (0, 1, 2, 3, 4+). We have previously shown that cancer types unrelated to the exposure of interest, resemble background population prevalence for that exposure [18, 23]. We performed a test for heterogeneity on categorical variables and a score test for trends in proportions for ordinal categorical variables. On the assumption that the selected controls should have similar seroprevalences to each other, we compared heterogeneity in seroprevalence of HPV16 and 18 (L1, E6 and E7) antibodies across different control cancer types using the Cochran-Mantel–Haenszel test (See Additional file 1: Fig. S1).

We used an unconditional logistic regression model to assess the association by calculating adjusted Odds Ratios (AOR) and 95% confidence intervals between HPV16 and 18 (E6 and E7) and L1 proteins seropositivity and the risk of cervical cancer. We stratified by HIV status to assess if the seroprevalence of the HPV proteins differs in HIV-positive and HIV- negative patients. We adjusted for age, education level, number of sexual partners, marital status, period of interview and place of residence. To assess whether the clinical performance of HPV-16 and -18 antibodies as a diagnostic marker for cervical cancer we performed the same in HIV-positive and HIV-negative cancer patients. We calculated the sensitivity, specificity of the various HPV antibodies for detecting cervical cancer using the "diagti” command in STATA. In addition, the area under the ROC curve (AUC) of the receiver operator characteristics was computed to compare the performance of the best combination of HPV E6 and E7 antibodies to discriminate cervical cancer from controls (See Additional file 1: Table S4). All the statistical tests were computed using STATA software version 16.0 (Stata Corp, college station, Tx) and SAS v 9.4 (SAS Institute, Cary, NC). All tests were considered significant at a two-tailed alpha of 5%.

Results

Out of a total of 2,795 cervical cancer cases and 5,569 infection unrelated cancer controls participating in the JCS, 3,878 women (1,346 cases and 2,532 controls) aged 25–54 years were included (Fig. 1). Cases were relatively younger (Median age: 42 (IQR: 37–46) compared to controls (Median age: 44 (IQR: 39–50) (p value < 0.001). At least two-thirds of both cases (63.5%) and controls (72.4%) had a secondary school education (Table 1).

Table 1 Comparison of demographic characteristics of study participants from the JCS (1995 – 2016)

Overall, 50.2% of women with cervical cancer were HIV positive vs. 26.3% HIV positive controls (Table 1). Furthermore, 13.5% of HPV16 L1 seropositive women were also HPV18 L1 seropositive (Additional file 1: Table S3).

Stratifying by HIV-status, HPV16 E6 & E7 and HPV18 E6 & E7 seropositivities were higher among HIV-negative women amongst both cases HPV16 E6 (42.7%) and controls HPV16 E6 (3.0%) as compared to HIV-positive women cases HPV16 E6 (26.8%) and controls HPV16 E6 (2.8%) (Fig. 2).

Fig. 2
figure 2

Seroprevalence of HPV-16 and -18 (L1, E6 and E7) proteins antibodies in cervical cancer cases and controls by HIV-status

Among HIV-negative controls, the overall antibody seroprevalence for HPV16 L1 was 14.3% and 0.5% for combined HPV16 E6&E7. Similar observations were made for HPV18 L1 (16.3%) and for combined HPV18 E6&E7 (0.2%) (Table 2). In HIV-positive controls, the overall antibody seroprevalence was 15.9% for HPV16 L1, HPV18 L1(15.7%), 0.5% for HPV16 E6 and E7 and 0.2% for HPV 18 E6 and E7 (Table 3).

Table 2 Seroprevalence of antibodies to HPV-16 and- 18 L1, E6 and E7 proteins in infection unrelated cancer controls in HIV-negative women
Table 3 Seroprevalence of antibodies to HPV-16 and- 18 L1, E6 and E7 proteins in infection unrelated cancer controls in HIV-positive women

In general, we did not observe differences in seroprevalence to HPV types 16 and 18 L1 and (E6 and E7) antibodies by demographic and sexual/reproductive factors (p value > 0.05). Among HIV-negative controls, the prevalence of HPV16 E7 antibody was higher among women that lived in rural areas (8.6%) as compared to women who lived in urban areas (3.7%) (p value for heterogeneity = 0.005) (Table 2). A similar pattern was observed for HPV18 E7 (rural (7.9%) vs urban (2.0%); p value = 0.001). Among HIV-positive controls, the prevalence of HPV16 E6 &E7 antibodies decreased with an increase in number births up to 2 births (p value = 0.006) (Table 3).

Being seropositive to HPV16 antibodies was significantly associated with cervical cancer: combined HPV16 E6 and 7 (AOR = 69.20, 95% CI 37.07–129.18), HPV16 E6 (AOR = 21.96, 95% CI 16.61–29.02), HPV16 E7 (AOR = 7.93, 95% CI 6.16–10.22) and HPV16 L1 (AOR = 1.74, 95% CI 1.45–2.07). For HPV18, antibody seroprevalence ORs for cervical cancer ranged from AOR = 38.61 (95% CI 15.13–98.53) for combined HPV18 E6&E7, AOR = 8.94 (95% CI 6.64–12.03) for HPV18 E7, AOR = 4.67 (95% CI 3.30–6.50) for HPV18 E6 and AOR = 1.31 (95% CI 1.10–1.56) for HPV18 L1 (Table 4).

Table 4 Association between HPV-16 and -18 L1 and (E6 and E7) seropositivity and cervical cancer

In general, HPV L1 cervical cancer ORs were higher in HIV-positive women but lower in HIV-negative women. The AORs of cervical cancer for combined HPV16 E6 & E7 seropositivity were 97.40 (95% CI 46.68–203.23) in HIV-negative, and 33.10 (95% CI 10.22–107.20) in HIV-positive women (Fig. 3).

Fig. 3
figure 3

Relationship between HPV-16 and -18 (L1, E6 and E7) seropositivity and cervical cancer risk by HIV-status. Odds ratios adjusted for age, education level, number of sexual partners, marital status place of residence and period of interview

Among HIV-positive women, HPV16/18 E6/E7 antibodies had a sensitivity of 43.0%, specificity of 90.6% and AUC of 67% to detect cervical cancer. Among HIV-negative women, the sensitivity, specificity and AUC for HPV16/18 E6/E7 antibodies was 70.6%, 89.7% and 80% respectively for detection of cervical cancer. The sensitivity, specificity and AUC for discriminating cervical cancer based on HPV16 E6 positivity was 26.8%, 97.2% and 62% respectively in HIV-positive women. For HIV-negative women who were HPV E6 seropositive, the sensitivity was 42.7%, specificity was 95.6% and the AUC 69% (Table 5).

Table 5 Clinical performance of HPV16 and 18 antibody positivity as diagnostic markers for cervical cancer by HIV-status

Discussion

In this study, we assessed antibody seropositivity to each of HPV-16 and -18 oncoproteins (E6 and E7), and L1 protein and calculated their association with cervical cancer. While the seroprevalence of HPV16 L1 in the previous JCS study among HIV-seronegative women was higher [14] than in this study, ORs between HPV16 L1 and cervical cancer were about the same (overall crude OR 2.2; 95% CI 1.8–2.6, vs 2.1; 1.5–2.7, Fig. 3). Previous data on HPV16 L1 seroprevalence in controls reflect different assay methods and cutoffs used, and perhaps some cross-reactivity with other HPV types. Our key findings, using larger sample size, more recent serological methods measuring a range of high-risk HPV types showed the important role of HPV-16 and -18 (E6 and E7) oncoproteins in cervical cancer risk in both HIV-positive and HIV-negative cancer patients. HPV-16 and -18 (E6 and E7) oncoproteins were associated in cervical cancer risk in both HIV-positive and HIV-negative cancer patients. HIV-positive patients compared to HIV-negative patients had lower sensitivity but with both high specificity for HPV16 and HPV 18(E6 and E7). Findings from our study confirmed that, in HIV-negative controls, HPV16 E7 and HPV18 E7 seroprevalence differed by place of residence. Among HIV-positive women, HPV16 E6&E7 seroprevalence increased with an increase in number of births.

The higher seroprevalence of HPV16 E6, HPV16 E7 and HPV18 E7 in women from rural areas might indicate a greater number of women with early undetected cervical or related cancer lesions. In rural areas of South Africa, women have poorer access to health care services, are less likely to present themselves to cervical cancer screening services and have less access to cancer diagnostic facilities, which are mainly found in tertiary hospitals in urban areas [24].

The high seroprevalence of HPV16 E6 (35.9%) antibodies in cervical cancer cases found in our study are similar to the seroprevalence of HPV16 E6 (32%) in cervical cancer cases reported from the United Kingdom (35%) and India and Algeria (32%) [5, 10]. We found a relatively low seroprevalence of HPV16 E6 & E7 and HPV18 E6 &E7 antibodies among controls in keeping with a low prevalence of HPV oncoprotein antibodies among women without cervical cancer [25]. Our study demonstrates that the high seroprevalence of HPV16 E6 and E7 antibodies in cervical cancer development is in accordance with other studies from Russia and Italy [8, 26].

Antibodies to the E6 and E7 oncoproteins are late markers of invasive cervical cancer [27]. In our study, seropositivity to each of the HPV-16 and -18 (E6 and E7) antibodies was associated with high risks of cervical cancer. Notably, seropositivity to HPV16 (E6 & E7) and HPV18 (E6 & E7) exhibited the highest ORs for cervical cancer (Fig. 3). Similar findings were reported in a case–control study of patients recruited from India and Algeria [10]. Our findings support those from other cross-sectional, case–control and prospective studies, albeit each using different assays [27,28,29,30].

Individuals with suppressed immune systems are susceptible to chronic infection, including HPV [31]. Thus, HIV infection increases the risk of HPV persistence [32]. Existing evidence of the association between HPV (E6 and E7) and L1 proteins antibodies and HIV has come from studies on men who have sex with men [10, 33, 34]. In contrast, most of the existing studies on the association between antibodies to HPV-16 and -18 L1, E6 and E7 proteins and cervical cancer risk have been conducted in low HIV prevalence settings [6, 8, 13, 27]. In our study, access to HIV data made it possible to stratify our analysis by HIV status. Our finding showed that antibody seropositivity to each of HPV-16 and -18 (E6 and E7) oncoproteins was strongly associated with the risk of cervical cancer among HIV-positive women. Our data support the hypothesis that HIV plays an important role in the persistence of HPV infection.

Unexpected findings in our study were that HPV-16 and -18 (E6 and E7) oncoproteins seropositivity and cervical cancer risk were higher in HIV-negative cancer patients compared to HIV-positive cancer patients. Similar findings have been observed in previous studies of HPV antibodies among HIV-positive and HIV-negative patients [35,36,37,38]. In our study, the possible explanation could be that cervical cancer risk in HIV positive women may be related to more than the two main types HPV16 and HPV18 that we tested for so their relative contribution appears attenuated (Additional file 1: Table S3). Other studies on HPV genotype distribution by HIV-status that were conducted in South Africa reported that HPV types 35, 58 and 33 were more commonly identified [39, 40] in addition to types 16 and 18. However, if this is indeed true, then the current HPV vaccine (16/18) may be less effective for HIV positive women. Another possible explanation is that in HIV-positive patients, antibodies could be an indication of reactivation of a latent HPV while in HIV-negative patients antibodies could be reinfection with a new type [35]. Another unexpected finding in our study was that the number of sexual partners was not associated with HPV L1 and (E6 and E7) antibodies. The plausible explanation could be that peak acquisition of HPV occurs at an age earlier than the age of participants in our study [41].

In the new guidelines for screening cervical cancer, the World Health Organization (WHO) has included HPV antibodies and oncoproteins as a future potential screening test [42]. We, therefore, assessed the clinical performance of HPV E6 /E7 antibodies as possible screening tests. We found that the sensitivity of the HPV antibodies for detection of cervical cancer was low but the specificity was high among HIV-positive women. In contrast, sensitivity and specificity for the detection of cervical cancer were high among HIV-negative women. Predictive values would thus vary considerably depending on the background prior likelihood of disease. Our finding is in line with the previous findings on the sensitivity and specificity of HPV16/18 E6/E7 antibodies as a screening test [28, 43].

There are strengths to our study. The seroprevalence of HPV antibodies was conducted on a large sample size, and we used a high throughput multiplex serology so all tests were done under the same laboratory conditions. The laboratory tests were conducted ‘blind’ without prior knowledge of the case/control status of the participants. Nonetheless, our study has some limitations. Serology data on exposure and cervical cancer diagnoses (of all stages) were collected at the same time. We did not have tumor HPV Deoxyribonucleic Acid (DNA) data to determine the causative type(s). Since only 50–70% of women with detectable HPV DNA in the cervix seroconvert [44], our study underestimates the true prevalence of HPV infection. The JCS did not recruit women with cervical pre-cancerous lesions. Study results are based on black women aged 25- 54 years who were recruited from the catchment area of the largest public tertiary hospital in Johannesburg. Therefore, our findings might not be generalizable to other South African regions and women aged 55 years and above.

Conclusions

Our data contribute to the evidence on the importance of HPV-16 and -18 E6 and E7 oncoproteins antibodies in discriminating cervical cancer from controls in a black South African population. HPV L1 antibodies show to be exposure markers of past HPV infection. HPV E6 and E7 show to be important markers of invasive cervical cancer. Furthermore, antibodies to HPV-16 and-18 E6 and E7 seropositivity are strongly associated with cervical cancer in both HIV-positive and HIV-negative patients.