The results of a randomized controlled trial of police body-worn video in Australia

Objectives We report the results of a randomized controlled trial of police body-worn video (BWV) cameras in an Australian context, with a focus on how cameras influence evidence gathering, court processes/outcomes, and police/public behavior. Methods The 6-month trial undertaken by the Western Australia Police Force involved a sample of officers ( N = 498) acting as their own controls with camera use ( “ treatment ” ) randomly allocated across shifts. A range of parametric and non-parametric tests were conducted to explore the influence of BWVon interview efficiency, rate/timing of guilty pleas, conviction rates, sanction rates, police use-of-force, assaults against police, and citizen complaints against police. Results The trial generated mixed results in support of this technology within this Australian context. BWV recordings did result in evidence-gathering benefits by producing cost/time efficiencies when taking field interviews. BWV footage had limited impact on court processes/outcomes, with indication that camera evidence encouraged earlier guilty pleas but no corresponding increase in the rate of guilty pleas or convictions. BWV did influence police operational decision-making, with increased sanction rates and use-of-force on treatment days. The extent to which officers engaged with the trial compounded these patterns. There was no evidence that BWV prevents problem behavior, with citizens ’ complaints increasing on treatment days and no influence of BWVon rates of assaults against police. Conclusions These findings highlight the need for additional context-specific clarity about why police use BWV cameras. In particular, BWV users should clearly specify the causal mechanisms through which cameras will achieve administrative, evidentiary, operational, and/or problem-prevention goals.


Introduction
There has been extensive uptake of body-worn video (BWV) cameras by Australian, UK, and US law enforcement agencies in the absence of clear evidence demonstrating camera efficacy (Lum et al. 2019). This paper reports the findings of a 6-month randomized controlled trial of BWV undertaken by the Western Australia Police Force (WAPF) in 2016. The rest of this section briefly explains the link between BWV and the focuses of this trial: evidence gathering, court outcomes, and police/public behavior. 1 First, with respect to evidence, BWV has produced efficiencies when dealing with complaints against police ) and facilitated evidence collection (Spencer and Cheshire 2018). Second, there is limited and mixed evidence to support a position that BWV is useful for court outcomes. While Spencer and Cheshire (2018) found that BWV positively influenced court outcomes in their study, an analysis of the relationship between BWV footage and judicial outcomes by Yokum et al. (2017) found no such benefits.
Moving on to the influence of BWV on police/public behavior, there is a large body of research across a range of contexts to believe people will behave differently when they think they are being watched (termed the Hawthorne effect: for a systematic review, see McCambridge et al. 2014b). However, there is a range of results across studies, which means it is uncertain the extent to which BWV cameras produce the Hawthorne effect in practice. Two categories of metrics that demonstrate these inconsistencies are operational decisions made by officers wearing BWV and the frequencies of problem behaviors (by police and public) in the presence of BWV.
Focusing on police use-of-force and sanction rates provides insight into the influence of BWV on operational decisions. There are mixed findings regarding the link between BWV and police use-of-force. One body of research does indicate that cameras can reduce use-of-force (e.g., ). However, a contrasting set of studies demonstrates no distinguishable impact of BWV on officer use-of-force behavior (e.g., Peterson et al. 2018). BWV camera-wearing officers have also demonstrated an increase in their sanction rates relative to officers without cameras (e.g., Braga et al. 2017). Contrasting these findings, Yokum et al. (2017) concluded that BWV did not influence policing activity. Furthermore, McClure et al. (2017) and Peterson et al. (2018) observed significant reductions in arrests for officers wearing cameras. The varied spectrum of inconsistent outcomes across trial contexts could be due, at least in part, to the interaction between BWV cameras and police discretion (Rowe et al. 2018).
Complaints and assaults against the police are also useful proxies to examine the potential for BWV cameras to prevent problem behavior. There have been some instances where BWV cameras have reduced complaints against police (e.g., Peterson et al. 2018), but this effect is inconsistent, with Ariel et al. (2017) finding no such outcome. There is also a lack of confidence that cameras would protect police from assaults (Ellis et al. 2015;Headley et al. 2017), likely explained by the moderating influence intoxication and familiarity with the criminal justice system would have on "potential" offenders' decision-making (Owens and Finn 2018).
Building on these previous research findings, the trial examined the following hypotheses: 1 For the expanded introduction with additional references, see the online only technical appendix.
& Evidence: BWV cameras and associated digital systems will produce cost and time efficiencies when taking field interviews. & Court: BWV cameras will increase rates of guilty pleas, result in earlier lodging of guilty pleas, and increase conviction rates. & Behavior (operational decisions): BWV will increase sanction rates and decrease police use-of-force. & Behavior (problem prevention): BWV will decrease citizens' complaints against police and assaults against police.

Design
This trial took place in two Western Australian locations: Perth (the state capital) and the regional town of Bunbury (located approximately 2 h drive to the south of Perth). Perth has a population of over two million people and the Perth policing sub-district encompasses 21 km 2 of densely populated residential area, the central business district, and a major entertainment district. In comparison, approximately 45,000 people live in Bunbury, and this regional policing subdistrict covers over 1000 km 2 including the town center and a mixture of rural and coastal areas. Selecting these locations enabled WAPF to determine the utility and practicality of implementing BWV in regional and metropolitan settings. Both locations typically generate a wide variety of policing tasks that require officers to have regular contact with the public in diverse settings (ranging from nighttime entertainment precincts through to commercial areas). Furthermore, these areas were likely to generate sufficient incident volume to allow reasonable data gathering during the trial and, from a logistical perspective, both areas could accommodate the necessary training and management requirements for the trial. During the trial, days were randomly designated as either "treatment" (all participating officers commencing a shift wore and used BWV), or "control" (no officers wore BWV). The treatment/control calendar was prepared at the outset of the trial by a random number generator. Access to this calendar was restricted and, prior to 6 am each day, all officers, supervisors, and district office hierarchy involved in the trial received emails advising the treatment/control status of the day. This design decision maximized the likelihood of "saturation" treatment on treatment days and minimized the likelihood of treatment on control days. For analysis purposes, the treatment/control status for any job, incident, or recording was calculated based on date-time because participating units' shifts did not cover 24-h periods and a late shift commencing one day may have finished at 06:00 h the following day. Any matter occurring between midnight and 06:00 h inherited the treatment/control status of the previous day.
On trial days, officers adopted a "limited discretion" approach when attending criminal incidents (officers were directed to record all domestic violence incidents, all incidents involving aggression/violence, all incidents where evidence capture was possible, and all incidents involving use-of-force). Officers retained a high level of discretion as to whether recording was continued and in what other circumstances recording was appropriate. During the trial, WAPF officers were operating under the following guidelines 2 : & Wear cameras overtly and warn members of the public they were being recorded (using an RCT-specific script), as soon as practicable; & Ask permission to film in private situations (such as inside homes) and abide by any request to cease filming in such situations; and & Use mobile telephones linked to BWV to record evidence of incidents and interactions with witnesses/victims of crime in the field.

Sample
The trial ran from 13 June 2016 to 16 December 2016 and involved 498 officers: 80% male, average age 36 years, and average 8.3 years' service (Probationary Constables, 5%; Constables, 28%; 1st Class Constables, 25%; Senior Constables, 23%; and officers ranked Sergeant and above, 16%). Due to random variation (treatment/control day allocation) and individual differences in work attendance because of a range of factors (including annual leave, sick leave, alternative duties, court appearances, etc.), officers had varying opportunities to use the cameras during the trial (86 treatment and 101 control days).

Measures
This evaluation utilized the following operational datasets: & Incident management system (IMS): criminal and non-criminal incidents, charges laid (including assaults against police 3 ), sanction rates, other enforcement activities, and information relating to recorded interviews. & Criminal code infringement notices (CCINs): discretionary, proactive policing action issued for antisocial behavior and/or liquor infringements. & Computer aided dispatch (CAD): public calls for service and internally generated tasking. & Use-of-force: reportable use-of-force includes drawing and pointing/discharging a Taser or firearm, or use of baton, handcuffs, police dog/horse, other weapons, or empty hand tactics that result in bodily injury.

Officer engagement with the trial
We anticipated there would be variation in the extent to which officers "bought-in" to the initiative (e.g., Spencer and Cheshire 2018). Consequently, this review also examined outcome differences for high-and low-engagement officers, determined as follows. First, the ratio for treatment day CAD attendances per treatment week each officer was rostered was calculated (median = 7.8). Officers whose average CAD attendance exceeded this median had reasonable opportunity to be involved with the trial and technology, if they wanted to. This subset of officers was then ranked on four metrics: (1) domestic violence (DV) attendances that were recorded by BWV (%, as directed by trial guidelines), (2) video files produced per CAD job (avg.), (3) audio files recorded per CAD job (avg.), and (4) video file recorded (avg. duration). Each officer's equally weighted rank scores were summed (with the lowest ranked score indicative of the greatest engagement with the use of the BWV). The high-and low-engagement officers were the 70 at either end of this ranking distribution. The relative compositions of these two "engagement" groups are displayed in Table 1.
Evidence: Did BWV produce cost/time efficiencies when taking field interviews?
Officers were directed to conduct an audio record of interview (ARI) using a BWV-linked mobile phone, rather than a "traditional," signed, written (statement) interview wherever possible when interviewing victims or witnesses. Table 2 shows that officers recorded 38% of their interviews on treatment days and 4% of interviews on control days. The average duration for these interviews was 14 min (treatment days) and 10 min (control days). By calculating the difference between the average duration for ARIs and statements, the BWV system was estimated to have saved 328 h on treatment days and 29 h on control days. The rate of interviews per 1000 CAD jobs was also significantly higher on treatment days. Furthermore, as displayed in Table 3, high-engagement officers were more likely to generate ARIs on treatment days and recorded more interviews on all days relative to low-engagement officers.
Court: Did BWV affect guilty pleas or conviction rates?
We examined all court records with an offence date within the scope of the trial where the arresting officer's unit was a unit participating in the trial. Because of the way the court data is structured, it was not possible to determine individual officers involved with enough consistency to be able to draw engagement comparisons. Given the time lag between arrest and finalization in court, with a view to maximizing the number of  events that had been finalized, this data extract was undertaken in April 2017 (4 months after the trial had been concluded). In total, 5006 of 7803 offences (64% of the eligible offences) had entered a guilty plea at this time.
Examination of guilty plea rates found no significant difference between cases from treatment (65.3 guilty pleas per 100 offences) and control (63.9 per 100 offences) days (Z-test for the rate ratios, Z = 0.75, p > .22). With the intent of capturing incidents where the BWV footage may have been used in prosecution and/or disclosed to the offender or their legal team, a sub-set of cases (n = 210) with BWV-relevant phrases entered into the Statement of Material Facts (SOMF) was identified. Analysis of this sub-set also demonstrated a non-significant difference between treatment (60.5 guilty pleas per 100 offences) and control (63.9 per 100 offences, Z-test for the rate ratios, Z = − 0.60, p > .27). There was also no significant difference between the total treatment and control day conviction rates ( There was support for the expectation that BWV encourages earlier guilty pleas. Of the 5006 offences for which a guilty plea had been entered, 55.5% were entered on the first or second appearance and 89.6% had been entered by the sixth appearance, by which time there were significantly more treatment day offences finalized (91.0% vs. 87.7% of control cases, Z = 3.63, p < .001, odds ratio = 1.41 (95% CI low = 1.17, high = 1.69)). There was indication that the timing of pleas was influenced for the sub-set of matters where the existence of BWV footage was recorded in the SOMF (47.2% of the 127 matters resolved by plea at the first appearance vs. 35.2% of first appearances for control offences, Z = 2.65, p < .005, odds ratio = 1.65 (95% CI low = 1.15, high = 2.36)).

Behavior (operational decisions): Did BWV affect sanction rates and use-of-force?
To test whether BWV increased sanction rates on treatment days, we examined CCINs, criminal charges, police-issued restraining order 4 issue, and move-on notices 5 (see Table 4). The rates of criminal charges and move-on notices issued were significantly greater on treatment days. Officer engagement also influenced sanction rates (Table 5), with high-engagement officers generally demonstrating greater sanction rates on treatment days relative to their own rates on control days and low-engagement officers on all days.
High-end use-of-force events in the WAPF are very rare. Despite this, for the 106 incidents recorded during the trial period, the use-of-force rate was higher on treatment days (1.0 per 1000 CAD jobs) relative to control days (0.7 per 1000 CAD jobs, Z = − 2.04, p < .03, odds ratio = 1.51 (95% CI low = 1.02, high = 2.24)).
Behavior (problem prevention): Did BWV affect complaints and assaults against police?

Discussion
There were indications that BWV recordings have an evidentiary value through cost/time efficiencies when taking field interviews. However, the ARI capability interacted with officer engagement to influence the likelihood of an interview being undertaken and the likelihood of the interview being captured using the cameras/phones. Further research should seek to uncover the drivers for these patterns and also examine how the potential time savings from recorded interviews hold-up downstream in the investigative process. From the court perspective, there was limited indication that BWV evidence encouraged earlier guilty pleas, but no evidence that the rate of guilty pleas or convictions increased. Anecdotal evidence from officers involved in the trial may partially explain this, with a large majority indicating at the time of analysis very few of the incidents they had attended when using BWV had appeared in court, and when cases had appeared in court there was a feeling that the BWV footage was not readily accepted (Clare et al. 2019). Furthermore, we note that this study was limited because not all of the cases were finalized at the time of the analysis. This issue warrants further research to allow all cases to be finalized, examine issues such as offence types involved, and explore the extent to which the existence and nature of BWV was disclosed/accepted during trial.
Looking at behavioral change and operational decisions, there was evidence that BWV increased sanction rates and this pattern was moderated by officer engagement. When considering police use-of-force and the potential preventive value of BWV, expectations based on the Hawthorne effect were not supported. Both police use-offorce and citizens' complaints increased on treatment days and there was no influence of BWV on rates of assaults against police. The potential influence of context-specific "floor effects" on these results should be considered, as it was difficult for already-lowfrequency events to decline as the result of an intervention, and small-number variation makes it difficult to reliably interpret the patterns surrounding low-frequency events.
To conclude, the authors want to propose some questions for consideration by other police agencies interested in implementing this technology. First, what is the purpose of BWV in a police setting? For example, are they intended to improve evidence gathering and provide efficiencies, improve the public's sense of police legitimacy, or change public/police behavior (and if so, when, where, and how)? Recent works by Piza (2018) and Malm (2019) critique current approaches to using BWV, highlighting the inconsistent, unclear causal mechanism(s) through which cameras are intended to simultaneously address multiple, competing goals. It is likely that future BWV camera interventions would benefit from additional pre-implementation focus on determining the context-specific mechanisms by which BWV is a part of the solution to existing problems. Additional clarity around purposes for the cameras will help determine how to utilize this technology and appropriately measure its impact.
Next, how do you minimize the dilution of impact caused by disengaged officers? Unfortunately, randomization does not guard against disengaged participants (McCambridge et al. 2014a), so understanding more about why some people engage with the cameras is crucial to maximizing and evaluating the efficacy of BWV. The significant impacts of engagement level identified in this review suggest that police forces would benefit from identifying and monitoring drivers of high and low engagement.
Finally, prior to implementing BWV, police agencies should consider the (publically funded) cost-benefit ratio of deployment (see Piza (2018), for a discussion of these issues). This is not to argue that cameras cannot have a cost-saving effect (e.g.,  discuss cost savings in terms of reduction in citizen complaints resulting from camera use), nor that cost savings should be a sole/primary driver of BWV deployment, but instead to ensure police departments consider factors such as starting rates for complaints, mechanism-specific, targeted use of cameras, and develop realistic expectations of their likely benefits/impact before commencing wholescale rollout.