Introduction

Despite overall reductions in total HIV incidence in Eastern and Southern Africa, rates of new infections among 15- to 24-year-old adolescent girls and young women remain unacceptably high, with an estimated 7000 new infections each week [1, 2]. The risk of HIV infection increases progressively with age, with a significant increase among 16- to 17-year-old adolescent girls and 20- to 24-year-old young men [3]. This persistent incidence coincides with a rapidly growing adolescent population in the region, from 229 million in 2015 to an estimated 435 million in 2030 [4, 5]. Preventing new HIV infections among adolescents in Eastern and Southern Africa is critical in order to reduce adolescent mortality and morbidity [6].

Evidence examining exposure to sexual risk has largely focused on 15- to 24-year-old adolescent girls and young women, particularly given the strong association between sexual risk exposure and heterosexual HIV infection in this population in sub-Saharan Africa [7,8,9]. Early sexual debut, sex in exchange for goods or money, and age-disparate relationships among young women and older sexual partners also contribute to high HIV infection risk [10,11,12,13,14,15,16], including unintended pregnancies [17].

Evidence suggests that multi-level factors affect early exposure to sexual risk. Specifically, individual-level factors (poor knowledge of HIV transmission and prevention, low self-efficacy, and violence victimization), family-based factors (orphanhood, parent/caregiver-adolescent engagement and parental monitoring), household-based factors (poverty and food insecurity) [18], and school dropout [19,20,21,22] are closely associated with early exposure to sexual risk, including HIV infection. Supporting adolescents to remain in school protects them from HIV infection, unintended adolescent pregnancy, and sexually transmitted infections [23], especially for adolescent girls [24, 25]. To prevent new HIV infections among adolescents and young people, interventions need to address risk factors across the socio-ecological continuum [26,27,28,29]. However, among adolescents who remain enrolled in school, data is still needed to indicate which specific school-based provisions are most effective at preventing HIV infections and reducing early exposure to sexual risk. In particular, evidence on which types of subsidies should be applied to which subgroups of adolescents would support more effective delivery of HIV and sexual risk prevention services.

While we know about risk and protective factors for HIV infection among older adolescent girls and young women, there is less evidence on sexual risk among adolescents living with HIV [29]. Research on reducing sexual risk exposure among young adolescents, for those who are HIV uninfected as well as those living with HIV, is critical to breaking the cycle of HIV transmission. Reducing exposure to sexual risk has been associated with reduced new infections among young people and adults [30, 31]. For instance, increasing condom use amongst adolescents is a key strategy found to effectively reduce exposure to HIV among adolescents and young people [32]. Despite this data, evidence is needed to know what works in early adolescence, when many of these sexual risks first emerge, including which combinations of provisions have the highest impact to prevent exposure to sexual risk [33].

The evidence on HIV prevention efforts for adolescents and young people highlights the need to combine biomedical, individual and structural interventions to address the causal pathways to HIV infection during this vulnerable age [33,34,35]. Programming for younger adolescents is critical to developing resilience, initiating safe behaviors, and enabling early detection of risk factors [36]. However, improved access to quality and consistent services and programming for adolescents and young people is needed to achieve the highest reduction in HIV incidence [37]. Findings from a large dataset of South African adolescents showed that consistent access to the government-issued Child Support Grant was associated with reduced HIV risk behaviors [38]. Additional evidence on the effect of the timing and consistency of accessing HIV prevention programming for adolescents is needed, beyond government cash transfers. Longitudinal analyses are needed to show if, and how, access to single or combinations of services or provisions shapes early exposure to sexual risk, particularly among an age group rarely included in such research: 10- to 14-year-old adolescents.

We use data from two longitudinal data sets from two South African surveys to answer three research questions: (1) which provisions (alone and in combination) are associated with preventing exposure to sexual risk in early adolescence, (2) do associations of provisions with reduced sexual risk exposure vary by gender and residential location (urban/rural), and (3) how do differing levels of access to provisions over time influence early exposure to sexual risk?

Methods

This analysis draws upon the pooling of individual-level data from two existing independent longitudinal cohort studies, Young Carers and Mzantsi Wakho, based in three provinces of South Africa. The Young Carers study is a prospective cohort of n = 3401 adolescents aged 10–17 living in two South African provinces, Mpumalanga and Western Cape. The cohort collected data between 2010 and 2012. Participants were recruited through door-to-door recruitment of two randomly-selected census enumeration areas in a rural and urban district in each province [39]. The Mzantsi Wakho study is a prospective cohort of n = 1410 10- to 19-year-old adolescents of sero-assorted HIV status living in a health sub-district in the Eastern Cape province in South Africa, with data collected from 2014 to 2017. The longitudinal Mzantsi Wakho sample includes 1080 adolescents living with HIV, an important population for secondary HIV prevention efforts. The studies shared investigators and used similar data collection procedures, which are explained in more detail in prior publications [39, 40]. The studies had high acceptability with low refusal rates (< 4% at both baseline and follow-up). During follow-up, 96.7% of Young Carers and 93.8% of Mzantsi Wakho participants were re-interviewed. Follow-up times averaged 353 days for Young Carers (range 77–829 days) and 571 days for Mzantsi Wakho (range 210–1091 days). Ethical approvals for data collection were obtained from Universities of Oxford (SSD/2/3/IDREC and SSD/CUREC2/12-21), Cape Town (HREC 389/2009 and CSSR 2013/04), and the relevant provincial South African Departments of Health, Basic Education, and Social Development [39, 41].

Data Pooling

To minimize measurement bias, we pooled data following a rigorous examination of study data collection tools for conceptual and measurement validity [42]. All items were measured using identical tools across the two surveys, except for HIV knowledge, as detailed below. Questions measured the same phenomena within comparable time frames, and answer options were comparable for categorical variables. In cases where variables had a different number of categories between studies, the variable with the larger number of categories was reduced to be equivalent to the variable with the least categories [43]. We conducted a quality control check following variable harmonization, using cross-tabulations for categorical variables and five-number summary for continuous variables by sample. We investigated dissimilar category counts to ensure that differences in frequencies reflected actual differences between population, and not difference in definitions or data collection procedures. After completing this process, we computed estimates on the resulting dataset as one larger sample, controlling for original study location in each analysis.

Ethics and Data Collection Procedures

All participants 18 and over provided voluntary informed consent, and both caregiver and adolescent provided assent/consent if participants were under 18 years old. Researchers fluent in local languages (Xhosa, Swati, Tsonga, and Sotho), who were trained to discuss sensitive topics with children and adolescents, conducted face-to-face interviews. Questionnaires were co-designed in consultation with a teen advisory group and piloted with adolescents from the study areas. Participants were interviewed in a location of their choosing, which maximized safety, confidentiality, and comfort throughout the interview. No financial remuneration was awarded for participating, but all adolescents received a snack pack and a study participation certificate. Participants at risk of harm or experiencing harm at the time of interview received support to link with emergency or support services, based on the recommendations of a trained social worker.

Measures

Full questionnaires are available at www.mzantsiwakho.co.za and www.youngcarers.org.za. The primary study outcome was exposure to sexual risk, measured as the incidence of at least one of four high-risk sexual experiences between baseline and follow-up: inconsistent condom use, transactional sex, age-disparate relationship, or early sexual debut. Inconsistent condom use referred to ‘never, ‘rarely’ or ‘sometimes’ using condom during sexual intercourse in the past year. Transactional sex was past-year incidence of sex in exchange for food, shelter, school fees, transport, or money. Age-disparate relationship was past-year incidence of having a sexual partner more than 5 years older. Early sexual debut referred to having had a first sexual intercourse before the age of 18 years old. Past-year pregnancy/having made someone pregnant was recorded through self-reports for both adolescent girls and boys.

Provisions

A total of eight provisions were hypothesized to delay sexual risk exposure based on the existing evidence base: (1) strong parental/caregiver supervision; (2) positive parenting/caregiving; (3) abuse-free homes; (4) school feeding; (5) affordable school materials; (6) affordable school fees; (7) government cash transfers; and (8) HIV prevention knowledge. Adolescents were recorded as experiencing or having access to these provisions or not at baseline and follow-up, and access was categorized as (1) no access at either baseline or follow-up; (2) intermittent access, defined as access at baseline or follow-up; (3) consistent access, measured as access at both baseline and follow-up.

Parenting/caregiver support was measured using two sub-scales of the Alabama Parenting Questionnaire [44, 45]. Strong parental/caregiver supervision was computed as adolescents choosing ‘very good’ on all six items about home rule-setting or monitoring of adolescent socializing, while positive parenting was measured as scoring ‘very good’ on all five items about caregiver-provided praise and positive reinforcement. Living in an abuse-free home was measured through adolescent self-reported monthly or more frequent experiences of physical or emotional violence at home [46]. Adolescent access to three types of educational subsidies was measured through adolescent self-reported access to: school feeding, defined as accessing at least one meal a day at school; affordable uniforms and school materials, measured as the adolescent’s family being able to afford uniforms and school stationery; and affordable school fees, defined as access to fee-free schooling. Each subsidy was included as a separate provision in analyses. Government cash transfers was defined as access to at least one child-related government grant into the home, such as South Africa’s Child Support Grant or Foster Child Grant. HIV prevention knowledge was measured as adolescent’s correct knowledge of four items on different modes of HIV transmission and prevention methods, for example ‘HIV cannot be passed from a HIV-positive mother to her unborn child.’

Additionally, analyses controlled for eight covariates across analyses, pre-selected for their potential effects on sexual risk exposure or access to social protection: (1) age; (2) gender; (3) urban/rural location; (4) formal/informal housing type; (5) province of residence; (6) socio-economic status (households were defined as poor if the adolescent reported missing at least one of eight basic necessities, such as “money to go to the doctor”, or “three meals a day”); (7) orphanhood, defined as being both maternally and paternally orphaned, and (8) HIV status, determined through self-reported HIV-positive test or being on antiretroviral therapy.

Analyses

The final pooled dataset consisted of n = 4811 participants. All participants who had been exposed to sexual risk prior to baseline data collection were excluded from analysis (n = 1149), to allow the analyses to identify provisions associated with exposure to sexual risk. Of the n = 3662 (76%) participants who reported no exposure to sexual risk at baseline, 97% were enrolled in school at both baseline and follow-up. We focused our analyses among adolescents enrolled in publicly-funded government schools in both data points, given high rates of enrolment at both times in this young sample, which reflect enrolment rates in other studies in South Africa [47]. To explore which provisions were associated with the outcome, we conducted final analyses using Stata 15 among n = 3635 adolescents who did not report sexual risk at baseline and were enrolled at school at both times, with exposure to sexual risk at follow-up as the outcome. Missing data was minimal (< 1%) for all included variables.

First, we calculated descriptive statistics for all covariates, provisions included. We used comparison tests to compare adolescents exposed to sexual risk at follow-up and those who were not exposed, using two-sample t-tests for continuous variables and Pearson’s χ2 tests for categorical variables (Table 1). We reported frequency distributions for longitudinal access to provisions (Table 2).

Table 1 Socio-demographic characteristics
Table 2 Access to supportive and protective provisions

Second, to assess the validity of the self-reported study outcome, we tested whether exposure to sexual risk was associated with incident self-reported pregnancy or having made someone pregnant, using Pearson’s χ2 test.

Third, we assessed associations between each level of access to provisions and exposure to sexual risk in multivariable logistic regressions using the three-step Hosmer-Lemeshow approach [48] (Table 3). In Model 1, we included all factors in a multivariable logistic regression with exposure to sexual risk as the outcome, controlling for all covariates. In Model 2, only variables associated with reduced sexual risk exposure at p < 0.1 in the previous model were included. In Model 3, we only included variables associated with the outcome at p < 0.05 level in the previous model.

Table 3 Multivariable regression model of provisions associated with incident sexual risk exposure

Fourth, we tested potential interactions between all significant provisions to test for multiplicative effect. We then tested whether gender or rural residence moderated the association between the provisions and early sexual risk exposure, in light of evidence suggesting that HIV prevention programming may have different results by gender [49] and location of delivery. We tested moderation using two-way interaction terms of each factor and either gender or rural residence in multivariable regressions, including all other covariates.

Lastly, we entered all provisions significantly associated with the outcome in the final model into a marginal effect model using multivariable logistic regression with covariates held at their mean values (Table 3). We calculated predicted probabilities of the outcome according to level of access to provisions (no access, intermittent access, consistent access) as well as according to the type of access (no intervention, single intervention or combination of all interventions) (Fig. 1).

Fig. 1
figure 1

Probability of incident sexual risk exposure by combination of consistent access to provisions

Results

Participant Characteristics and Outcome

Participants were, on average, 12.8 years old (SD 2.0, range 9–19 years) at baseline and 14.1 years old at follow-up (SD 2.1, range 10–21 years), and 56% were female. They lived in three provinces (Eastern Cape 29.4%, Western Cape 31.8% and Mpumalanga 38.8%), with over half living in urban areas (58%), the majority of whom were participants from urban communities in the Eastern Cape. Three-quarters lived in formal housing; another three-quarters lived in households lacking at least one basic necessity, such as food or sufficient money for school fees. Overall, 6.7% of the sample were orphaned.

Notably, 8.6% of participants reported sexual risk exposure between baseline and follow-up. This self-reported outcome was strongly associated with reporting pregnancy or having made someone pregnant during the study period incident pregnancy (Pearson’s χ2 (1) = 109.5, p ≤ 0.001). In bivariate analyses, adolescents who reported exposure to sexual risk were older, more likely to be orphaned, living in urban areas, and living in the Eastern Cape (Table 1).

Access to Provisions

Three levels of access to a total of eight provisions are reported in Table 2. Only one-quarter of adolescents reported consistent access to strong parental supervision, and less than one in five received consistent positive parenting. A third of adolescents reported living in abuse-free homes at both baseline and follow-up, while nearly three-quarters accessed government cash transfers consistently at baseline and follow-up. Only 2.1% of the sample had correct HIV knowledge, which may be explained due to the young age of the participants.

Provisions Associated with Exposure to Sexual Risk

Four provisions were associated with reduced odds of exposure to sexual risk. Adolescents receiving consistent parenting/caregiver supervision were 47% less likely to report sexual risk exposure (OR 0.53 95%CI 0.35–0.80 p = 0.002). Living in abuse-free homes at both baseline and follow-up was associated with a 45% reduction in the odds of sexual risk exposure (OR 0.55 95%CI 0.37–0.81 p = 0.002). Receiving at least one meal a day at school consistently reduced the odds of sexual risk exposure by 45% (OR 0.55 95%CI 0.35–0.88 p = 0.012). HIV prevention knowledge was also associated with reduced sexual risk exposure, with a stronger impact on consistent correct HIV prevention knowledge: adolescents who had the correct knowledge at baseline and follow-up were 57% less likely to report sexual risk exposure compared to no access (OR 0.43 95%CI 0.21–0.88 p = 0.021), while those who had inconsistent HIV prevention knowledge were likely to report a 32% reduction in sexual risk exposure (OR 0.68 95%CI 0.47–0.99 p = 0.044).

Combinations of consistent provisions, compared with single consistent provisions, were associated with lower odds of sexual risk exposure (Supplementary Table 1). Consistent access to each factor reduced the probability of reporting sexual risk exposure from 12.9% (95%CI 7.2–18.7) without any access to provisions, to 1.0% (95%CI 0.2–1.8) with consistent access to all four provisions (Fig. 1). Most of the participants accessed at least one provision (51.9%), followed by 30.8% of adolescents that accessed two provisions, 8.5% who accessed three of the four provisions and a handful (n = 7, 0.2%) who accessed all four of the provisions. About 10% of the adolescents did not access any of the provisions.

Older age and living in the Eastern Cape province remained significantly associated with exposure to sexual risk in the final model. There were no two- or three-way interactions among the four provisions significantly associated to reduced exposure to sexual risk (data not shown). Gender moderated the effect of the association between school feeding and sexual risk exposure, with no other significant moderation effects (Supplementary Tables 2 and 3). The effect of accessing school feeding (whether intermittently or consistently) was stronger among young adolescent boys than adolescent girls, with consistent access to school feeding among boys resulting in the highest reduction in the probability of reporting exposure to sexual risk (Fig. 2).

Fig. 2
figure 2

Probability of incident sexual risk exposure by gender and access to school feeding scheme

Discussion

The findings from this secondary analysis of data from two large, longitudinal studies in South Africa demonstrate significant associations between four provisions and reduced exposure to sexual risk in early adolescence. These longitudinal data provide additional evidence that can shape current and future programming for HIV prevention for adolescents in South Africa and similar HIV risk contexts. It capitalizes on two large-scale prospective surveys of early adolescents to draw lessons on factors that shape the onset of exposure to sexual risk and HIV, in the transition from early to middle adolescence, which are critical for programming for this early age group [50]. Most adolescent-focused research and programming targets older adolescents, particularly 15- to 24-year-old girls and young women, but these new findings suggest an additional impact of providing such programs to younger girls and boys. These provisions—parental supervision, abuse-free homes, school feeding and HIV prevention knowledge—could be provided through home- and school-based programming.

Four provisions were associated with reduced odds of sexual risk exposure. Each provision individually reduced the odds of sexual risk exposure from 32% (OR 0.68 95%CI 0.47–0.99 p = 0.044) to 57% (OR 0.43 95%CI 0.21–0.88 p = 0.021), highlighting the importance of accessing provisions in early adolescence consistently to address long-term vulnerabilities and risks faced by South African adolescents and young people. Importantly, combinations of interventions resulted in a 92% reduction in the probability of reporting sexual risk exposure, from 12.9% (95%CI 7.2–18.7) to 1.0% (95%CI 0.2–1.8). Only two out of five participants accessed multiple provisions, although just over half accessed at least one of them. One in 12 adolescents in this sample reported sexual risk exposure over a one-year period—a high rate considering that 99% of participants were under 18 years old at baseline. Access to government cash transfers was not associated with sexual risk exposure, but this may be linked to lower access to cash transfers in the highest-risk group of older girls. Our findings confirm the importance of investing in early adolescence to reduce poorer health and educational outcomes in later adolescence [36]. This is particularly relevant considering findings from the Violence Against Children Survey suggest high rates of coercion and violence during first sexual encounters in Southern Africa [51].

An important finding of our analyses is that intermittent access to provisions—access at either baseline or follow-up, but not both—does not reduce the likelihood of sexual risk exposure, with the exception of HIV knowledge. However, consistent access over a full year in early adolescence to provisions may have substantial impacts in delaying sexual risk exposure. Gender did not moderate the association between the provisions and sexual risk exposure, with the exception of school feeding. This finding suggests that in early adolescence, the same combination of interventions may support both boys and girls, whether they live in rural or urban communities, to prevent sexual risk exposure and the possibility of HIV infection. The significantly stronger effect of school feeding schemes on reducing sexual risk exposure among adolescent boys needs to be further investigated. As programming increasingly focuses on 15- to 24-year-old adolescent girls, it is critical to provide consistent access to provisions that, in early adolescence, may have long-term benefits for both adolescent boys and girls.

This study has several limitations. Our sample of adolescents was still young, and consequently their reported exposure to sexual risk is lower than the risk they will experience throughout the course of adolescence and early adulthood. We can expect higher rates of sexual risk exposure among older adolescents. However, there is insufficient research with young adolescents, and this study provides critical information about this age group. Another limitation is linked to the number of provisions measured. The pooled data only included shared measures for eight possible factors. Third, this study did not collect biomarker data, such as HIV incidence using clinical test results. Nonetheless, in light of strong linkages documenting sexual risk exposure in early adolescence and HIV incidence [13, 52, 53], our findings contribute to an understanding of how to break pathways leading to HIV infection among adolescents and young people.

Despite these limitations, our longitudinal analyses with an under-researched age group provide valuable information for future programming seeking to reduce HIV incidence among adolescents and young people. While sustaining programming that reaches the highest-risk groups, namely 15- to 24-year-old young women, is essential, these findings show a substantive impact on younger adolescents—including boys—and those who have not yet initiated sexual activity or been exposed to sexual risk. They suggest that early, consistent access to protective programming at home and school—programming that focuses on reducing food insecurity, boosting caregiver-adolescent engagement, improving correct HIV prevention knowledge, and preventing violence in the home—is critical to preventing sexual risk exposure among vulnerable young people.