Background

Youth physical activity (PA) levels worldwide [1] and in the UK [2] are low despite the physiological and psychological benefits [3]. Adolescent females have been highlighted as a particular group at risk of declining PA with recent data showing that only 16% and 9% of girls aged 11–12 and 13–15 years, respectively, are sufficiently active [2]. Although the decline in PA may begin prior to the transition between primary and secondary (high) school, and occurs in both boys and girls [4], there is a need to identify ways to stem the decline in PA in girls once they enter secondary schools as the decline is steeper and faster in girls than boys. Schools have been highlighted as a setting to tackle the inactivity crisis [5,6,7] using a variety of strategies [8,9,10,11,12]. Effects are generally small and short-term [11, 13]. Furthermore, potential mediators of physical activity changes (i.e. to help explain effectiveness) are not robustly or routinely assessed [14]. Reviews have identified that much of the evidence comes from the US, [8, 9] and there is an absence of robust “proven” or “promising” practice within the UK [15].

In an effort to tackle youth inactivity the Youth Sport Trust (YST; the largest non-profit organisation focussing on youth sport and activity in the UK) have developed programmes designed to support schools in engaging girls in PA. ‘Girls Active’ is one such programme which uses peer leadership and marketing to empower adolescent girls to influence school decisions, develop themselves as role models, and promote PA to peers. Schools are provided with training and resources to review the their PA, sport and physical education (PE) provision, culture and practices to girls and to create action plans for how they will implement changes. The programme had not been robustly or independently evaluated.

The aim of this study was to assess the effectiveness of the Girls Active PA programme in UK secondary schools.

Methods

Ethics approval and study sponsorship were obtained from the University of Leicester. The sponsor had no role in the design, undertaking or reporting of the study. School principals provided written consent for their school to participate. Parents/guardians were provided with an opt out consent form and only pupils who did not return the opt out consent form participated. Participants provided verbal assent prior to each measurement session and could withdraw at any time.

Design

This cluster RCT had follow-up at seven and 14 months. Following baseline measurements, schools were randomised by an independent statistician to one of two groups stratified by school size (pupil median: < 850, ≥850) and percent black and minority ethnicity (BME) pupils (median: < 20%, ≥20%). Sequentially numbered sections within a folder were used to implement the group allocations. The investigator team were not aware of the sequence until after randomisation. Measurement team members, except the team lead for the day, were blinded to group randomisation. The trial statistician was not blinded. However, the statistical analysis plan was signed off prior to database lock and any deviations from the analysis plan are reported herein.

Stakeholder involvement

Girls Active is built on over 10 years of work by the YST that includes consultations with school senior leaders, teachers, young people, national agencies and corporate partners. This extended to consultations with adolescent girls and their teachers in order to refine the programme prior to this evaluation study. The research team consulted with two groups (n = 8 and n = 6) of adolescents and PE teachers (n = 3) at local schools in Leicester City ahead of the funding application. This informed the decision around outcome measures: i.e. preference of a wrist worn accelerometer over one worn on the hip or the thigh, economic evaluation cost diaries and process evaluation themes and questions. Key stakeholders were involved in the study as two lay members sat on the trial steering committee (TSC). Lead teachers co-designed their own school reports so that anonymised baseline data specific to their school could be used within their schools. Further dissemination will be undertaken to the study participant through a further anonymised school level report as well as a briefing event for school leads and other personnel involved in education, sport and physical activity decision making. Stakeholders were not involved in decisions around study design, recruitment or conduct of the study and the burden of the programme was not assessed.

Participants and school clusters

All state (government funded) secondary schools in Leicester City, Leicestershire and Rutland (LLR), UK (n = 56) with a Key Stage 3 (KS3: age 11–14 years) were eligible and were sent a letter of invitation to the study. Of these, 25 agreed to come to a briefing event about the study, 15 attended of which 14 obtained consent from the school principal for their school to participate. In tandem, a further 26 state schools that were geographically close to LLR but in neighbouring counties were approached and 6 of these consented to participate. Schools provided the research team a list of all eligible girls between the ages of 11 and 14 years and in years 7, 8 and 9. All eligible pupils were provided with an information pack that contained a separate participant and parent/guardian information sheet and opt out consent form as well as an invitation letter. Pupils had two weeks to return the opt out consent form. Using a random number generator, 90 girls from each school were chosen at random (split between year groups).

Sample size

This study was designed to provide adequate power to detect a meaningful group difference [16] in MVPA of 10 mins/day assuming a standard deviation of 18 mins/day in MVPA [17], a power of 90%, a significance of 0.05, a cluster size of 56 girls and an intra-class correlation of 0.1. Twenty schools and ≥ 80 girls/cluster allowed for cluster attrition and 30% loss to follow-up and non-compliance with accelerometer wear.

Participant assessments

Measurement sessions took place at school during the school day. There was an explanation of methods, the assent process, question and answers and the accelerometer protocol.

Objective PA

Methods used in this study have been described in the protocol paper [18]. Briefly, girls wore a GENEActiv accelerometer 24 h/day for seven days on their non-dominant wrist at all time-points and were given a £5 gift voucher on return of the device with at least 4 days of data collected. The GENEActiv devices were initialised with a sampling frequency of 100 Hz and set to start recording at midnight on the first day of data collection and stop recording at midnight seven days later.

Data processing

GENEActiv .bin files were analysed with R-package GGIR v1.2–11 (http://cran.r-project.org) [19, 20]. Variables of interest were calculated over the 24 h day using published thresholds: MVPA, overall PA (average acceleration; derived using the Euclidean Norm Minus One method), light PA and sedentary time [21, 22]. Sedentary time was partitioned out from sleep using the nocturnal sleep detection algorithm in GGIR [23]. The meeting of PA guidelines [24] was calculated as those achieving ≥ 60 mins MVPA on each measurement day. Accelerometer variables were calculated for the whole day and by specific periods: during school, after school (up to 9 pm), school days and non-school days. Each school’s own start and end times were used to define school hours variables. PA data analysis include participants with ≥ 16 h of wear-time during each 24 h period [25] on ≥ 2 days including ≥ 1 school day.

Psychological outcomes

A range of psychological factors that may mediate PA participation were self-reported on a paper-based questionnaire as described in the protocol paper [18]: intentions and motivation to be active; attitudes to PA; perceived family, peer and teacher social support for PA; perceptions of the school social and physical environment; PA self-efficacy and enjoyment; perceived importance of PA, and physical self-perceptions (self-esteem, body attractiveness and physical self-worth).

Anthropometric and body composition measurements

Standing and sitting heights (Seca 213 stadiometer, Seca, Birmingham, UK) and weight (Tanita SC-330ST, Tanita Europe BV, Middlesex, UK) were assessed to the nearest 0.1 cm and 0.1 kg, respectively, using standardised procedures. Body mass index (BMI) was calculated and converted into z-scores relevant to the UK population [26]. Percentage body fat was estimated using pediatric scales (Tanita SC-330ST, Tanita Europe BV, Middlesex, UK).

Potential covariates

Participant age (in months) was calculated from date of birth, year group was self-reported, and socioeconomic status was represented by calculating the index of multiple deprivation (IMD) from participants’ self-reported postcode. Age at peak height velocity (APHV) was used as an indicator of biological maturity category [27].

School level data

School level socioeconomic position/deprivation was represented by the percentage of pupils eligible for free school meals (%FSME) from the relevant 2015 school census [28]. School pupil numbers were reported by the lead teacher and verified by 2015 school census data.

Intervention group – Girls Active programme

Details of the Girls Active programme have been described in the protocol paper [18]. The aim of Girls Active is to provide a support framework for schools to review and change their PA, PE and school sport culture and practices with the support of the YST and a hub school. Teachers completed a school self-review and attended an initial training day delivered by a YST national tutor. At this day, teachers discussed the programme elements (including establishing a peer leader group) and started developing their school action plan. They received a folder of case studies and electronic versions of marketing materials for promotion within their school. One of the key elements to the programme was the formation of a girls leadership and peer marketing group to empower girls to influence PE, sport and PA in their school, develop as role models, and promote and market PA to other girls. Over the course of the programme, resources for lead teachers were uploaded to an online file sharing system. School leads attended a peer review day to share practice with other teachers facilitated by the development coach and a hub school teacher. Lead teachers were offered in-person or phone support through the hub school or development coach. Lead teachers were free to implement the programme flexibly in whatever way they wished but were encouraged to set up a peer leader group who were to market PA to their peers and help prioritise PA decisions within their school. Teachers were encouraged to identify pupils that are not necessarily the sporty pupils but those who bring a range of perspectives and could communicate with, and motivate their peers. They were provided with two £500 capacity funding instalments to coincide with action plan submission. The programme was delivered in the same manner as would have been done in the real-world setting and, in theory, Girls Active has the potential to continue on in schools if implemented with sustainability and embedding in mind. The elements of the “off-the-shelf” programme which was originally developed without specific reference to theory were mapped to constructs in social cognitive theory post-hoc by the academic team (Fig. 1). Seven core components of Girls Active were identified: submission of the first self-review and action plans; attendance of lead teacher at initial training; use of package of resources or use of an alternative; engagement of young people as peer leaders; use of online, in person or phone support of hub and/or development coach; lead teacher attendance at peer review day; and submission of the second mission analysis. The intervention was delivered at the cluster level but the participant data reported herein were collected at the individual level.

Fig. 1
figure 1

Proposed logic model for the Girls Active programme

Control group - usual practice

Control arm schools were not given any specific guidance or advice and were assumed to carry on with their usual practice of PE and sport provision.

Primary and secondary outcomes

The primary outcome was difference in the change in mean mins/day of MVPA at 14-month follow-up between groups. Secondary outcomes included: overall PA; sedentary time; and all PA outcomes split by weekday and weekend day, and during school hours and after school hours; BMI z-score; percentage body fat; and psychological factors that may mediate PA participation as described above.

Economic analysis

A full description of the economic assessment methods of the Girls Active trial are available [29]. Briefly, micro-costing methodology [30] was applied to calculate the costs of delivering the programme over a whole school year for the intervention arm schools. This provides a mean cost per school. Money and time spent on delivering Girls Active were self-reported by lead teachers using two types of bespoke cost diaries and a questionnaire and, if needed, these were followed up with a phone call for clarification. The cost-consequence analysis was conducted from a public sector, multi-agency perspective. Health related quality of life measured by the Child Health Utility-9D [31] was used as the measure of effect. Use of GP and school based services (school nurse and school counsellor) were used as the measure of costs. These were all self-reported by participants at each timepoint.

Statistical analysis

The results are reported according to the CONSORT statement for cluster RCTs [32]. Primary and secondary outcome analyses were based on a complete case analysis. Intention to treat (ITT; all schools and recruited pupils were analysed in the group they were randomised to) and per protocol analyses were also undertaken for the primary outcome as sensitivity analyses. The per protocol population included schools that engaged with 70% of the seven core components (as detailed above) of the programme over the 14 months and had complete data for the analysis concerned on ‘by analysis’ basis. In the control arm, the per protocol population included all schools/pupils randomised to that arm.

Generalised estimating equations, accounting for school level clustering, and adjusting for baseline MVPA, stratification factors of school size (< 850, ≥ 850) and % of non-White pupils (< 20%, ≥ 20%), %FSME and participant year group, were employed. Subgroup analyses involved within subgroup stratification and between subgroup interactions effects between the intervention arm and pre-specified subgroups: baseline school (social deprivation and size) and pupil (ethnicity, maturation status, year group, PA level) characteristics. Sensitivity analyses explored the effect of the number of valid accelerometer days and the season of data collection. All analyses were performed using Stata (v.14.0), with statistical significance set as p < 0.05. Changes from the agreed statistical analysis plan included the addition of %FSME (school SES) and participant year group as covariates in the primary and secondary outcomes analyses.

Results

In Spring 2015, 20 secondary schools were recruited (Fig. 2). Of these, 18 schools agreed to be followed-up at seven months, and 19 at 14 months. From these schools, 1752 adolescent girls provided assent and participated at baseline (Feb – April 2015), 1405 (80.2%) at seven months (Sept – Nov 2015), and 1361 (77.7%) at 14 months (April – June 2016). Tables 1 and 2 present the baseline characteristics of the schools and participants, respectively. Participants who did not complete the 14 month assessment (n = 301) were older (p < 0.001), had a higher BMI z-score (p = 0.021) and provided 0.2 days less accelerometer data (p < 0.001) at baseline (Table 3). Complete accelerometer data (i.e., ≥ 2 valid days, including at least one school day at baseline and 14 months) was available for 1211 participants (69.1%) for the primary outcome analysis at 14-month follow-up. No serious adverse events/reactions were reported.

Fig. 2
figure 2

CONSORT flow chart for the Girls Active cluster randomised controlled trial

Table 1 Baseline characteristics of each school (cluster) by randomised group; usual practice (control) or the Girls Active programme (intervention)
Table 2 Baseline characteristics at the individual participant level by randomised group; usual practice (control) or the Girls Active programme (intervention)
Table 3 Baseline characteristics of completers versus non-completers at 14 month follow-up

Primary outcome analysis

There were no differences between intervention and control groups at 14 months for the change in mean mins/day of MVPA in the complete case (1.8 mins/day; 95% C.I. -0.8 to 4.3; p = 0.178), intention-to-treat (1.7 mins/day; 95% C.I. -0.6 to 3.9; p = 0.158) or per protocol (1.7 mins/day; 95% C.I. -1.15 to 4.48; p = 0.246) analyses (Table 4). At seven months, a difference in MVPA between groups was found in the complete case (2.4 mins/day; 95% C.I 0.1 to 4.7; p = 0.039), intention-to-treat (2.3 mins/day; 95% C.I. 0.2 to 4.3; p = 0.028) and per-protocol (3.1 mins/day; 95% C.I. 0.93 to 5.35; p = 0.005) analyses.

Table 4 Changes in minutes per day of MVPA at 7 and 14 (primary outcome) month follow-up between participants randomised to usual practice (control) or to the Girls Active programme (intervention)a

Subgroup analysis

Between subgroup interaction effects revealed subgroup effects (i.e. the programme had a differing effect depending on certain baseline characteristics) at 14 and seven months (Figs. 3 and 4, respectively). At 14 months, the between subgroup interaction effects (p < 0.001) revealed a difference between randomised arms of 3.9 mins/day (95% C.I. 1.39 to 6.09; p < 0.001) favouring the intervention arm in larger schools (≥ 850 pupils). In smaller schools (< 850 pupils) there was a difference between randomised arms of -4.4 mins/day (95% C.I. -7.34 to -1.41; p = 0.004) favouring the control arm. At seven months, in White European and early maturers there was a difference between randomised arms of 3.1 mins/day (95% C.I. 0.60 to 6.02; p = 0.017) and 5.1 mins/day (95% C.I. 1.69 to 8.48; p = 0.003), respectively, favouring the intervention arm.

Fig. 3
figure 3

Forest plot of the effect of the intervention at 7 months on the primary endpoint by subgroups

Fig. 4
figure 4

Forest plot of the effect of the intervention at 14 months on the primary endpoint by subgroup

Accelerometer secondary outcomes

No differences were found in the MVPA variables at seven or 14 months when stratified by weekday/weekend or time of day (Table 5). Generally, there were no changes in the other accelerometer variables at seven or 14 months except a significant difference in total PA at seven months (1.4 mg; 95% C.I. 0.1 to 2.2; p = 0.030), in sedentary time during the after school period at seven months (-4.7 mins/day; 95% C.I. -8.9 to -0.6; p = 0.026), in total light PA at seven months (5.7 mins/day; 95% C.I 1.0 to 10.5; p = 0.018), and light PA on school days at 7 months (4.5 mins/day; 0.3 to 8.8; p = 0.038). No differences were found in the proportions of participants active at least 60 min over the measurement period at either seven or 14 months.

Table 5 Objectively measured activity variables at follow-up between participants randomised to usual practice (control) or to the Girls Active programme (intervention)a

Body composition secondary outcomes

No differences between the groups were seen for BMI z-score at either seven (0.003 kg/m2; 95% C.I. -0.06 to 0.07; p = 0.908) or 14 months (0.02 kg/m2; 95% C.I. -0.06 to 0.09; p = 0.636). Similarly, no differences between the groups were seen for % body fat at either seven (-0.15%; 95% C.I.-0.83 to 0.52; p = 0.656) or 14 months (0.25%; 95% C.I. -0.68 to 1.17; p = 0.600).

Psychosocial secondary outcomes

There was no pattern in the differences found and they were sporadic (Table 6) and slight (questionnaire scales were on a five or seven point scale). At 14 months there was a difference in intentions to being active in favour of the control group (-0.21; 95% C.I. -0.37 to -0.05; p = 0.012). At seven months there was a difference in perceived importance in favour of control group (-0.42; 95% C.I. -0.66 to -0.18; p < 0.001). At 14 months there was a difference in participants confidence in being active in favour of the control group (-0.08; 95% C.I. -0.14 to -0.02; p = 0.013). A significant difference in self-esteem was seen at seven months (0.06; 95% C.I. 0.01 to 0.11; p = 0.025) in favour of the intervention group. At 14 months there was a difference between groups perceptions of their school physical environment in favour of the control group (-0.13; 95% C.I. -0.24 to -0.01; p = 0.032). At 14 (-0.09; 95% C.I. -0.18 to -0.01; p = 0.032) months there was a difference in identified motivation (motivated by engaging in activities that are a means to an end) in favour of the intervention group.

Table 6 Scores for psychosocial measures at follow-up for participants randomised to usual practice (control) or to the Girls Active programme (intervention)

Sensitivity analyses

Sensitivity analyses showed similar results to the main analysis with no differences between groups at 14 months when the various levels of accelerometer data provision were used. The differences found at 7 months in the main analysis were similar when accelerometer data provision was one day or more (2.4 mins/day; 95% C.I 0.13 to 4.62; p = 0.038), any 2 days or more (2.4 mins/day; 95% C.I 0.12 to 4.72; p = 0.039), and any 3 days or more (2.4 mins/day; 95% C.I 0.02 to 4.7; p = 0.048), any 4 or more days (2.3 mins/day; 95% C.I. -0.03 to 4.7; P = 0.053) and any three weekdays plus one weekend day or more (2.2 mins/day; 95% C.I. -0.2 to 4.6; p = 0.078). The addition of baseline season of data collection to the analysis revealed a difference in MVPA change at 14 month of 0.8 mins/day (95% C.I. 1.2 to 3.5; p = 0.027) only. When follow-up season was included in the model there was a difference in MVPA change of 2.8 mins/day at seven months (95% C.I. 0.4 to 5.1; p = 0.020).

Economic analysis

Depending on how Girls Active was implemented costs ranged from an estimated £1054 per year to £3498 per year per school. There were no statistically significant differences found between the groups for health-related quality of life (CHU9D utility index scores) and frequencies and costs of GP and school service use.

Process evaluation

How each schools implemented the programme was addressed specifically within the process evaluation. In particular, how peer leaders identified and the groups established and run; the positive feelings lead teachers and pupils had about the programme; the challenges and opportunities to implementing a flexible programme within the current educational landscape and school-level constraints to implementation.

Discussion

Girls Active was designed by the YST to be, and is being delivered as, a flexible programme for UK schools. This paper investigated the effectiveness of the programme in impacting MVPA of adolescent females. PA of this population is of significant public health interest. The complete case, intention-to-treat and per protocol analyses found no evidence of a sustained intervention effect on MVPA between intervention and control groups after 14 months (primary outcome). At the shorter follow-up of seven months, significant differences in MVPA were observed between groups in the complete case, intention-to-treat and per protocol analyses, but the differences in change were small: 2.4, 2.3 and 3.1 mins/day, respectively. When observing MVPA changes in the per-protocol analysis it would seem that implementation of more of the seven core components (as presented in the methods) of Girls Active prevented the MVPA decline i.e. control schools declined by 6.2 mins/day while the intervention schools declined by 1.9 mins/day between baseline and seven months. Due to the flexible nature of the programme lead teachers spent a wide range of time and money on delivering and implementing Girls Active within their schools. The cost-consequence analysis showed no effect on health related quality of life or service use (typical measures utilised in health economic analysis).

The pre-specified subgroups analysis found that the intervention was effective at 14 months in larger schools (+ 3.9 mins/day) but caused an MVPA decrease in smaller schools (-4.4 mins/day). This warrants consideration when designing programmes for roll-out at scale. Although the relative success in larger schools makes intuitive sense, findings from the process evaluation of this study will provide information of the perceived challenges and opportunities in intervention schools. Different strategies within a programme, not necessarily different programmes, may be needed for non-White European girls and late maturing girls based on the findings of the sub-group analysis at seven months.

We saw some differences between groups at seven months for accelerometer variables in favour of the intervention group, but again these differences were relatively small. Little evidence of any changes in potential psychosocial mediators during the course of the programme were found. Process evaluation data will map out timelines of programme activity to help explain these findings.

Comparison with extant literature

School-based PA interventions targeting adolescents have shown limited success on objective measures of PA, particularly for older adolescents [33]. Recent evidence from school-based trials with a primary outcome of objectively measured MVPA from the UK [34,35,36,37], Australia [38, 39] and the Netherlands [40] has emerged. Of these, two were aimed at adolescent girls in secondary schools, [37, 39] and only one reported a significant effect on objective MVPA but in males only [38]. Other studies are currently underway in UK secondary schools which capitalise on influential pupils, mentors, or in-class peer leaders [41, 42].

The findings of this study add to evidence from well-designed, adequately powered trials on the lack of effectiveness of school-based PA programmes on objectively assessed PA [34, 37, 39]. When there is a significant effect, it is ‘small’ (a standardised mean difference ≤ 0.49) [33] highlighting the challenges with intervening on adolescents’ PA within the school setting. Although small effects are seen multi-component programmes and those underpinned by theory may be more effective [9, 13]. Extensive support may be required to show significant but modest intervention effects (e.g. PA4E1 [38]).

This was an evaluation of a programme currently implemented in UK secondary schools so addressing the limitations to the Girls Active programme itself was beyond the scope of this study. Accompanying process evaluation and cost-effectiveness papers will underscore barriers to implementing school-based PA programmes and also demonstrate what resources were actually used to implement the programme, which will be highly valuable for future research and roll-out at scale.

Strengths and limitations

The strengths of this evaluation study include a fully powered cluster RCT design. The sample was multi-ethnic, incorporating diverse schools. A random sample of 90 girls/schools were included in the evaluation to ensure it was not the “most active” girls who would sign-up. The baseline PA levels would shows that meeting PA levels are low (2.3%) the IQR of MVPA of 29.9–57.0 mins/day would suggest pupils with a wide variety of activity levels were included in this study. Overall, 25.9 to 78.2% of all KS3 girls/school (mean 37.2%) were included making our results generalisable to a whole school population in this age group. However, this may have also acted as a limitation in that over a 14 month period Girls Active is likely to only have had a modest reach, meaning our sample may have not been fully exposed to the intervention. A targeted evaluation sample may have yielded different results. Objective measurement of MVPA as the primary outcome was in line with public heath priorities and meets the need for “objective and comprehensive evaluation” methods for evaluating programmes [43]. Although flexible programme such as Girls Active may benefit from more pragmatic study designs, from a research methods standpoint the RCT design was well received by partners, schools and pupils. Overall, 24% of the schools that were contacted entered the trial (20 out of 82) and schools and pupils were recruited to target. There was good compliance with the accelerometer, low levels of missingness from questionnaires, and participant loss to follow-up was in line with other studies. A full economic analysis was also completed for the Girls Active trial. Based on best-practice, the methods used with schools were designed for this study and the methods used and results found add to the literature on the costs of delivering programmes in the school setting.

Conclusion

At 14 months, our primary measure of effectiveness was change in MVPA at 14 months. No difference in change in MVPA between Girls Active control and intervention schools was found. At seven months there was less of a decline in MVPA in the intervention schools compared to the control schools. Differences in sub-groups may mean the programme has potential in certain types of schools or pupils. Process evaluation and complete and detailed economic analysis papers will give context and detail of the delivery of such a flexible programme.