Background

The papillomavirus is a heterogeneous group of DNA virus with circular, non-enveloped, double-stranded DNA genomes [1, 2]. This virus infects humans and different species of animals [2]. The virus is discovered from the horn of Cottontail rabbit at the beginning of 1930s [3] and also revealed as a main cause of human cervical cancer in 1970s [4]. More than 300 papillomaviruses have been identified and completely sequenced, including over 200 human papillomaviruses [5]. The high-risk carcinogenic types of HPV currently designated by the International Agency for Research on Cancer (IARC) are HPV16, HPV18, HPV31, HPV33, HPV35, HPV39, HPV45, HPV51, HPV52, HPV56, HPV58, and HPV59. The HPV68 is classified as probably carcinogenic, and HPV26, HPV30, HPV34, HPV53, HPV66, HPV67, HPV 69, HPV70, HPV73, HPV82, HPV85, and HPV97 have been associated with rare cases of cervical cancer and are considered probable carcinogens [6, 7]. Genotype 6 and 11 are low-risk types that cause genital and skin warts [8]. Genital HPV infections are very common and prevalent in the age range of 18 to 30 years [9, 10]. Infection of the cervix with HPV is necessary to cause cervical neoplasia and cervical cancer [11, 12], and integration of viral DNA into the host genome is necessary for persistent infection which could lead to the development of cervical dysplasia [11].

The prevalence of HPV is variable across the world. The study reported from developed countries indicate that the prevalence of HPV was 11 to 12% [13]. The recent global estimate indicates 11.7% of the HPV infection burden in the world [14]. The occurrence of about 85% of infected cases and 88% of the deaths due to cervical cancer is in developing countries [11].

The highest prevalence was reported in sub-Saharan Africa (24%), Eastern Europe (21%) and Latin America (16%) [15].

The burden of HPV infection is higher in HIV infected women (50.8%) than un-infected (22.6%) [16] and 78.8% among HIV infected than 34.4% of un-infected women [17]. Similarly, high-risk oncogenic HPV types is higher among HIV infected than un-infected women (48.4% vs. 17.3%) [16]. Other studies reported a prevalence of 68.0% [18] and 33.2% [19]. Moreover, the study reported from developing countries indicated extremely variable prevalence of HPV that ranges from 20 to 70% [20]. The prevalence of low-risk HPV types were 3.6 to 5.6 times higher in HIV-sero-positive women when compare to HIV sero-negative's [8].

Several risk factors are reported to be associated with HPV infection and these include HIV infection, other STIs (e.g., chlamydia, herpes simplex virus), and multiple sexual partners [11, 21]. There are also other factors that mediate HPV infection such as cigarette smoking, oral contraceptive or hormonal contraceptive use, chronic inflammation and immunosuppressive conditions [10, 11, 21, 22]. Dietary factors, socioeconomic status, race/ethnicity, geographic disparity and polymorphisms in the human leukocyte antigen system are additional factors that could mediate HPV infection [10, 11, 21, 22]. Being young age and having active sexual behavior are key risk factors for HPV acquisition and persistence of the infection [22].

HIV infection increases the risk of cervical infection due to high-risk HPV genotypes that induces high-grade cervical squamous intraepithelial lesions (HSILs), which in turn leads to the development of pre-invasive cervical lesions and invasive cervical cancer (ICC) [23,24,25]. HIV infection could alter the natural history of HPV infection through decreasing the self-clearance rate of infection and increasing progression to high grade and invasive lesions [24]. Furthermore, the incidence of HPV infection is three times higher in HIV-positive women [25], and can cause cervical cancer than their counterparts [26]. Nonetheless, with the exception of the systematic review and meta-analysis done in Kenya [27], evidences in this regard showing the burden and molecular distribution of HPV in low and middle income countries (LMICs) is limited [28]. Therefore, this review aims to fill the identified gaps by estimating the pooled prevalence of HPV, and investigating the factors associated with HPV infection among HIV infected women in LMICs.

Methods

Search strategy and screening of papers

We conducted a systematic review and meta-analysis of published articles to estimate the pooled prevalence of HPV in LMICs. We systematically searched the papers published in the following electronic databases; PubMed/MEDLINE, SCOPUS, Science Direct, Excerpta Medical Database from Elsevier (EMBASE), Web of science, Cumulative Index of Nursing and Allied Health Sciences (CINAHL) and Google scholar. The review was conducted in accordance with Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) standard [29]. We used a search strategy by combining the following key terms: molecular, molecular epidemiology, human papillomavirus, or HPV, papillomavaridae, Human immunodeficiency virus (HIV), AIDS (acquired human immunodeficiency syndrome), HIV infected, HIV positive, HIV sero-reactive, women, female and girl. We used Truncation(*) to manage spelling variation during search: infect* or positive, wom*n or female* or girl*. We used both free text and Medical subject heading [MeSH] terms during electronic database search.

PubMed database search strategy was:((((molecular[tiab] OR "Molecular Epidemiology"[Mesh]) AND ((Human papillomavirus[tiab] OR HPV[tiab]) OR "papillomaviridae"[MeSH Terms])) AND (((Human immunodeficiency virus[tiab] OR HIV[tiab]) OR "hiv"[MeSH Terms]) OR "hiv"[MeSH Terms])) AND (infected[tiab] OR positive[tiab])) AND (((women[tiab] OR females[tiab]) OR "women"[MeSH Terms]) OR "female"[MeSH Terms]) AND (("1966/01/01"[PDAT]: "2019/08/11"[PDAT]) AND "humans"[MeSH Terms] AND English[lang] AND "female"[MeSH Terms]).

The search was repeated to identify the consistency of search terms and results. Two authors independently reviewed the titles, abstracts and full articles of the retrieved studies.

Study inclusion and exclusion criteria

We included a cross sectional and cohort studies conducted in LMICs based on World Bank Country Classifications, 2018 [30] and that reported prevalence of HPV genotypes. The inclusion was restricted to the papers published in English language without limiting publication year till 11th August 2019. We excluded studies that did not clearly state the study design, outcome measured, the study conducted on HIV negative women alone, conducted in high-income countries, and the study reported HPV genotype from anal and oral organ types (Fig. 1).

Fig. 1
figure 1

Flow diagram of studies reviewed, screened and included

Study quality assessment

We assessed the quality of included studies by using the 14 items Quality Assessment Tool for Observational Cohort and Cross-Sectional studies NHLBI, NIH [31]. This assessment tool mainly focused on research question, study population, eligibility criteria (inclusion and exclusion criteria of study participants), sample size justification, exposure measures and assessment, sufficient time frame to see an effect, outcome measures and blinding of outcome assessors, follow up rate, and statistical analysis. The quality assessment was rated as good, fair and poor based on the quality assessment tool criteria. The maximum score indicating high quality was 14 and the lowest possible score was zero. The rating values of the included studies in terms of their quality were based on their design. Cross-sectional type do not consider the items which fit for cohort and taken as not-applicable (NA) and thus, the rating values were not taken from the possible maximum score (i.e. 14). In this review, all scores are written in percentage.

Data extraction

Data from eligible abstract and/or full text of the articles were extracted by considering the outcome variables (i.e. prevalence or proportion of HPV genotypes, magnitude of cancer causing HPVs or high risk (HR) HPV genotypes and low-risk HPV types), and factors that could potentially be associated with these outcomes. The characteristics of study participants of an eligible paper such as age range, mean or median age, sex, HIV sero-status, the prevalence of HPV genotype were also extracted. Study characteristics such as first author, year of publication, study duration, study setting, study location or country, study design, sample size were also extracted (Table 1). Other extracted data include the prevalence of different HPV genotypes (Table 2), factors which could potentially be associated with HPV infection and diagnostic methods applied to detect HPV infection (Table 3).

Table 1 Characteristics of included studies to estimate the pooled effect of HPV among HIV-infected women in LMICs
Table 2 Prevalence of different HPV genotypes included in the meta-analysis of women infected with HIV in LMICs
Table 3 Molecular genotyping techniques and associated factors for HPV infection

The majority of the studies included in our review had more than eighty percent and the lowest score observed was 62.5% in terms of quality. There was however one abstract included in the review, which was difficult to assess the quality of the article (Table 3).

Statistical analysis

We estimated the pooled prevalence of HPV with its 95% Confidence Interval (CI) using random effects meta-analysis model assuming the true effect size varies between studies [32]. The proportion of HPV reported in each study is multiplied by its sample size to express patients with HPV infection in number, and data presented in forest plot. Heterogeneity in the prevalence of different studies was assessed using Chi-square (χ2) based Q test with significant level of p value < 0.1 and I2. The I2 value of25% indicates low heterogeneity while 50% moderate and 75% high [33]. The potential publication bias was assessed using a funnel plot. If the 95% of the point estimate of studies lie within the funnel plot defined by straight lines, then that indicates the absence of heterogeneity [34]. The potential sources of heterogeneity were assessed by doing subgroup analysis and moment based meta regression. Meta-regression extends subgroup analyses and allows to estimate effect size. Data analysis was conducted using STATA version 14.

Results

Study characteristics

We included 19 studies in our review (Fig. 1). These studies were reported from Rwanda [36, 37], Brazil [38,39,40], Nigeria [41, 42] Thailand [43], South Africa [44,45,46,47], Zambia [48], Burkinafaso [35, 49], Senegal [23] and Colombia [50, 51]. There was one study conducted in two countries Burkinafaso and South Africa [52]. Five studies were from South America (three from Brazil and two from Colombia), one study from Asia (Thailand) and the rest were from African countries. All of the studies were from health facilities (Hospital and clinic) and the majority were cross sectional studies. The publication year varied from 2003 to 2017 while the majorities (13 articles) were published after 2009. Eight studies were published in 2013 and 2014. The maximum sample size was 1371 [44] and the minimum was 98 [41]. The age of participants ranged from 14 to 73 years [39, 50]. Three studies didn't mention the upper age range of the participants [23, 36, 42] (Table 1).

Pooled prevalence of HPV

We pooled data from 8175 HIV infected women to estimate the pooled prevalence of HPV infection using meta-analysis. The overall pooled prevalence of all types of HPV infection was 63.0% (Fig. 2) with high heterogeneity across the studies (χ2 = 3,782.80 (d.f. = 15); p value < 0.001 and I2 = 99.6%). The pooled prevalence of high risk HPV was 51.0% (Fig. 3) with heterogeneity of χ2 = 1069.84 (d.f. = 12); p value < 0.001 and I2 = 98.88%. Similarly, the estimated pooled prevalence of low risk HPV was 28.0% (Fig. 4) with heterogeneity of χ2 = 296.83; (d.f. = 5); p value < 0.001 and I2 = 98.32%.

Fig. 2
figure 2

Forest plot to estimate the pooled prevalence of human papillomavirus infection among HIV infected women with 95% CI (the estimate weighted based on random effects model): ES-Effect Size equivalent to prevalence, CI—confidence interval. In the plot, the diamond shows the pooled result and the boxes show the effect estimates from the single studies. The purple dotted vertical line indicates pooled estimate. The purple solid vertical line indicates the reference line at zero indicating no effect. The horizontal line through the boxes illustrate the length of the confidence interval and the boxes show the effect estimates from the single studies

Fig. 3
figure 3

Forest plot to estimate the pooled prevalence of high risk human papillomavirus infection among HIV infected women (the estimate weighted based on random effects model): ES—effect Size equivalent to prevalence, CI—confidence interval. In the plot, the diamond shows the pooled result and the boxes show the effect estimates from the single studies. The purple dotted vertical line indicates pooled estimate. The purple solid vertical line indicates the reference line at zero indicating no effect. The horizontal line through the boxes illustrate the length of the confidence interval and the boxes show the effect estimates from the single studies

Fig. 4
figure 4

Forest plot to estimate the pooled prevalence of low risk human papillomavirus infection among HIV infected women with 95% CI (the estimate weighted based on random effects model): ES—effect size equivalent to prevalence, CI—confidence interval. In the plot, the diamond shows the pooled result and the boxes show the effect estimates from the single studies. The purple dotted vertical line indicates pooled estimate. The purple solid vertical line indicates the reference line at zero indicating no effect. The horizontal line through the boxes illustrate the length of the confidence interval and the boxes show the effect estimates from the single studies

The pooled prevalence of high risk HPV genotypes was also estimated in the studies (i.e. HPV genotypes 16, 18, 26, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 67, 68, 69, 70, 73, and 82). The highest prevalence was observed for genotype 16 (20%) followed by 18 (15%) and 52 (13%). Almost all genotypes indicated heterogeneity and the highest heterogeneity was observed in genotype 16 (I2 = 98.53%) followed by 18 (I2 = 97.18%) and 31 (I2 = 96.17%) (Table 4). The HPV genotypes 26,67,69, 70, 73 and 82 reported less frequently in the included studies which is difficult to interpret.

Table 4 The pooled prevalence of different genotypes of HPV among HIV-infected women in LMICs

Subgroup analysis

The result of subgroup analysis based on the continent from where the studies were include shows significant heterogeneity between and within the group. The pooled prevalence of HPV in African was 69.0% (95% CI: 49.0–89.0) with heterogeneity of I2 = 99.74% and p value < 0.001. The pooled prevalence of HPV in South America was 57.0% (95% CI: 44.0–71.0) with heterogeneity of I2 = 95.93% and p value < 0.001) (Fig. 5).

Fig. 5
figure 5

Forest plot of the subgroup analysis based on continent from where the studies were reported. In this plot, three diamond shapes are observed. The first two indicates subtotal prevalence's and the third one indicates the pooled estimate of the prevalence of HPV

Meta-regression analysis

We assessed the effects of sample size and year of the study on heterogeneity between the studies using meta-regression model. Both sample size and publication years significantly predicted the heterogeneity of the effect sizes (Table 5). In the adjusted model, both the sample size and publication year indicated heterogeneity in the effect size which is equivalent to the prevalence (p < 0.001). When we interpret the finding using β-coefficient, one unit increase in the sample size increases the effect size or the outcome of 1.04 points and the outcome decreases by 11.8 points for every one unit increase in the publication year (Table 5).

Table 5 Meta-regression analyses for year of study and sample size as a reason of heterogeneity on the prevalence of HPV among HIV-infected women in LMICs

Publication bias

The funnel plot (widely used to examine bias in the results of meta-analysis) for the pooled prevalence of all genotypes HPV, high risk HPV and low risk HPV indicated that there is a publication bias (Fig. 6a–c). Figure 6a, indicates the funnel plot of the 16 estimates of the HPV types available for meta-analysis (SE-Standard error, ES-Effect size: prevalence), (b) Funnel plot of the 13 estimates of high risk HPV types available for meta-analysis (SE-Standard error, ES-Effect size: prevalence), (c) Funnel plot of the 6 estimates of low risk HPV types available for meta-analysis (SE-Standard error, ES-Effect size: prevalence). The majority of included studies were out of 95% confidence limit. The outer dashed lines indicate the triangular region within which 95% of studies are expected to lie in the absence of both biases and heterogeneity. In this funnel plot, scatter properties of the included studies made by medium small size with white background color and the scale of 1.

Fig. 6
figure 6

Publication bias assessment: a funnel plot of the 16 estimates of HPV types available for meta-analysis (SE—standard error, ES—effect size: prevalence), b funnel plot of the 13 estimates of high risk HPV types available for meta-analysis, c funnel plot of the 6 estimates of low risk HPV types available for meta-analysis. In this plot, the blue broken line indicates Pseudo 95% CI, the solid red line indicates pooled estimate of the prevalence of HPV, and the scattered circle dots indicates included studies in the meta-analysis. The scale on the X-axis indicates Effect size estimate or proportion and the Y-axis indicates the precision estimate using standard Error

Laboratory techniques applied to detect HPV infection in the included studies

Molecular genotyping and HPV detection techniques applied for selected studies were Linear Array HPV Genotyping Test (LA), careHPV, genotyping using PCR-restriction fragment length polymorphism analysis, Reverse line-blot hybridization, INNO-LiPA HPV genotyping Extra® assay (Table 3).

Factors associated with HPV infection

High HIV viral load and low CD4 count were the most frequently reported factors that associated with high-risk HPV infection [23, 47, 48, 50]. Hormonal contraceptive use, CD4count < 200 cells/mm3, history of three or more sexual partners were reported as the factors associated with HPV infection [37, 38] (Table 3).

Discussion

In the current review, the pooled estimate of HPV infection prevalence was 63.0%. The estimates of high risk and low risk genotypes were 51.0% and 28.0%, respectively. Of high risk genotypes, HPV genotype 16 was high (20%) followed by 18 (15%) and 52 (13%), respectively. Low CD4count and high HIV viral load were the risk factors that most frequently reported in this review.

This finding was lower than the findings in Kenyan which reported 68.0% overall pooled prevalence of high risk HPV among HIV positive women [27]. Genotype 16 was the most prevalent HPV genotype (20.0%) in our review. This finding, however, was different from previous review which reported HPV 52 with pooled estimate prevalence of 26% among HIV infected women with normal cytology and HPV 16 which was 26% among women with abnormal cytology [27]. This difference is likely to be due to the number of included studies and the difference in the data included in the analysis, study setting and participants exposure to risk factors including HIV.

Lower CD4 count most frequently reported in this review is concordant with previous reviews in which low CD4 was strongly associated with HPV infection [53,54,55]. Previous study also revealed that the most frequent high risk genotype observed in HIV positive women (i.e. 46.7%) [54] which is closer to the current pooled estimate for high risk HPV (51.0%), indicating that HIV infection might increase the susceptibility to latent HPV infection [55].

The review conducted to estimate prevalence of HPV genotype among African women, including Ethiopia revealed that HPV16, 52, 35, 18, 58, 51, 45, 31, 53, and 56 were the ten most common genotypes in the normal cervical cytology while HPV 16, 18, 35, and 52 were the four common types [56]. Another review from Ethiopian women depicted that HPV 16, 52, 18, 58, 45 were top five genotypes with the proportion of 45.3%, 9.4%, 8.2%, 6.9%, 5.2%, respectively [57]. Correspondingly, the pooled estimates of about 21 high risk HPV genotypes among HIV infected women were reported in this review with the estimated prevalence of genotype 16 (20%), 18 (15%), 52 (13%), 31 and 58 (11% each), 35 and 53 (10% each), 33, 51, and 66 (8% each), 45 (7%), and the remaining genotypes had the pooled estimate of less than seven percent.

The original research article conducted in Korea reported prevalence of 16.7% with the high risk HPV type of 12.5% [58] which is too far up when compared with the pooled estimates of the current review which focused on HIV positive women. In addition, the study among Arab women indicated 6.2% among Qatari women and 5.9% non-Qatari women [59] somewhat concordant with the study conducted in Lebanon which reported HPV prevalence of 6.7% [60]. This variation is probably attributed to the differences in the study settings, sample sizes used and the studied population.

Our finding indicated heterogeneity on the outcome variable which is the effect size equivalent to the prevalence of HPV genotypes. Therefore, careful interpretation of the heterogeneity chi-square test (variation in effect estimates beyond chance) is required, since it has low power in the situation of a meta-analysis when studies have a small sample size or are few in number. It is worth noting at this junction that while a statistically significant result may indicate a problem with heterogeneity, a non-significant result must not be taken as evidence of no heterogeneity.

Strength and limitations of the study

This review is conducted by searching more than five biomedical databases and a large number of pooled participants are involved in the study. Another strength is that this review assessed HPV prevalence studies among HIV infected women in developing countries at large and reported pooled estimates of all HPV genotypes, high risk HPV genotype and low risk HPV. The limitation of this study was inclusion of studies published only in English language. This could be one of the possible causes for observed publication bias and heterogeneity of the estimated effects.

Conclusion

This review indicated that the pooled prevalence of all genotypes HPV and high risk HPV among HIV infected women in LMICs were considerable. To enhance the well-being of HPV/HIV co-infected women it is necessary to strengthen programs for diagnosis, treatment, and provide HPV vaccination based on common high-risk genotypes.