The planning and reflection of police use of force training: a German case study

The current study aimed to elicit the planning and reflecting processes of police trainers with regards to the delivery of police training. Four police trainers were explicitly asked about their planning for and reflecting on training sessions. In total 34 interviews were conducted (17 pre, 17 post) and analysed. The results indicated that police trainers employ two main strategies to progress their learners towards the aims of the training session. First, they focus on making the learning experience fun and second, they point out the relevance of the to be learnt skills by creating the demand, showcasing mistakes and then subsequently focusing on developing the needed skills in isolated contexts. However, the data indicated that police trainers were generally deficient in their capability to set training session objectives and to align their delivery of training in a coherent and effective way. Furthermore, higher levels of reflection of the delivery of the training session were almost absent. The results identify a need for professional development for police trainers in the areas of planning and reflection.


Introduction
In the law enforcement domain recruits, regular officers and specialised personal regularly engage in practice geared towards developing the skills to cope with different operational situations, ranging from non-conflictual police-citizen encounters to use of force situations (Birzer 2003;Nota and Huhta 2019). Police trainers are in charge of planning, delivering and reviewing the respective training sessions (Cushion 2020;Staller and Zaiser 2015). This process-in line with current conceptualisations of coaching (Lyle 2018a)-is a decision-making process at its core (Abraham et al. 2006;Lyle 2002;Cushion et al. 2003;Abraham et al. 2015). Police trainers, as coaches, draw from a number of knowledge structures to inform their decision-making when planning, delivering and reflecting (Abraham and Collins 2011a;2011b). However, research in the context of police training indicates, that police trainers do not regularly engage in planning and reflecting processes (Cushion 2020). In order to further investigate these findings, the current study focuses on how police trainers actually plan and reflect on their training sessions and the relative depth of their reflections. By expanding the methodology employed by Cushion (2020), the current study presents interview data of discussions stimulated by the delivery of police training in Germany reported elsewhere (Staller et al. 2021a, b, c).

Planning training sessions in police training
When police trainers plan their coaching, a key aspect is knowing what learners should be able to know and do as a result of coaching (Staller et al. 2021a, b, c). The intended learning objectives arise from the analysis of the learners needs relative to his/her current context and form the basis for long-, medium-and short-term plans with specific outcome, performance and process goals. These objectives serve as a reference points from which coaches can monitor and adjust their planning, delivery and reflection . As such, planning provides a "tentative map" to follow (Till et al. 2019), providing a sense of direction and expectations against which current development can be continually monitored, on which alternate coaching strategies can be decided upon in order to accommodate and from which responses to changing needs of learners or contextual changes can be developed (e.g. resources). Coaches can only intervene if the need to act is noticed within long-, medium-or short-term planning or, more immediately, within the actual training activity. In order to notice the need to act, coaches have to be continuously attentive to moments of importance or disruption. Engaging in deliberate and purposeful planning can help coaches to detect anomalies by having clear expectations against which current observations of the reality can be compared against and that might otherwise be overlooked (Jones et al. 2013). Within the teaching and coaching process, the capacity to think in this manner while events are happening has been described as reflectionin-action (Martindale and Collins 2012;Schön 1983). Improving the clarity of expectations before a coaching event facilitates reflection-in-action, and thus provides an important stimulus for reflection after the coaching event; described as reflection-on-action (Schön 1983;Kovacs and Corrie 2017).
Coaching is a continuous planning, delivery and reflection process . To help police trainers cope with the ongoing demands of a dynamic teaching learning environment (Till et al. 2019;Kiely 2012;Abraham et al. 2015) the Coaching Practice Planning and Reflective Framework (CPPRF) has been developed (Muir et al. 2011;Till et al. 2019). It can be understood as a thinking tool to help coaches clarify their expectations and encourage connections between the desired objectives and coaching strategies. In particular the CPPRF encourages thinking and reflection around the (a) intended learning objectives (on a group / individual level), (b) the design of learning activities, (c) coach behavioural strategies and (d) learner engagement. As a planning tool, the CPPRF encourages coaches to think about their coaching goals and the alignment with learner needs and wants (the who), the field-specific demands (the what) and the learning environment (the how) to clarify their expectations. Furthermore, coaches are encouraged to consider how each coaching interaction is nested within the long-, medium-and short-term objectives of learners' development (Abraham and Collins 2011a). The intra-coordination of session activities with the inter-coordination of sessions over a period of time to progress the learner towards agreed overarching goals is referred to as constructively aligned learning practice Biggs 1996).
In police training, the on-going complex process of planning, delivering and reflecting as the heart of coaching best practice (Till et al. 2019;Muir et al. 2015;Abraham and Collins 2011a) has not been fully acknowledged . However, attempts to incorporate practical coaching models to frame the delivery of police training have begun to emerge (Nota and Huhta 2019;Körner and Staller 2018;Staller 2021, a, b, ctaller 2021). Police trainers need to exhibit the ability to integrate ideas from the interdependent areas of coaching knowledge (e.g. police-specific content knowledge, knowledge about skill acquisition, motivation etc.) to inform their reasoning and decision-making when they plan, deliver and reflect on police training sessions (Till et al. 2019). This includes identifying the target performance in relation to the learners' current state to formulate input demands as well as performance, process and outcome goals (Abraham and Collins 2011a;Till et al. 2019;Abraham et al. 2015). This includes a careful consideration of the "who", "what" and "how" to develop a coaching plan that is coherent, progressive and nested in the bigger picture of a competent police officer. However, the study of how coaches plan, deliver and reflect is scarce, particularly in the domain of police training. Some data is provided by Cushion (2020) from his observations of officer safety training in the UK. This work surmised that police trainers invested limited time in discussions about the planning and delivering of courses and did not explicitly consider any criteria for delivery success, such as outcome, performance or process goals. Cushion (2018) concludes, that "training was in a self-confirming 'loop'" (p. 7) manifesting traditional training approaches, consisting of large amounts of teacher-centred practices aiming at developing skill in an isolated fashion. Such approaches are regularly observed in police training (Basham 2014;Birzer 2003;Cushion 2020;Staller, et al. 2021a, b, c). This lack of self-reflection has recently been outlined as one of the major obstacles in advancing the delivery of police training (Staller et al. 2019a, b).

Coach learning: reflection and learning-loops
From a coach learning perspective, indications of a lack of reflection in police training coaching practice (Cushion 2020;Staller et al. 2019a, b) points towards that learning in police use of force coaching lacks awareness of underlying assumptions (Brookfield 2013), such as the underlying pedagogical approach (Birzer 2003;Körner and Staller 2018) or the inclination towards a more warrior or guardian-oriented policing approach (McLean et al. 2019;Staller et al. 2019a, b). Being aware of the underlying assumptions of ones coaching practice is a necessary ingredient for higher levels of learning (Argyris 2003;Tosey et al. 2012). "Higher" levels of learning in this context means that new perspectives with a successively wider scope are involved. Tosey et al. (2012) conceptualized these higher levels in recursive loops that function as feedback loops from all levels for previous levels and vice versa. Whereas single-loop learning is concerned with doing things right, double-loop learning takes place by examining and altering the governing causal and prescriptive assumptions of such practices (Argyris 2003), answering the question of "Am I doing the right things?". For example, a single-loop reflection of a police trainer, Oliver, teaching a take-down technique for a resisting citizen revolves around questioning himself about when to intervene in case a learner does not perform the technique like it was shown by him. Further reflecting on a higher level on the same example will incorporate questioning the assumption of why our trainer adheres to an ideal technique. Oliver becomes aware that he was taught in a traditional linear him/herself leading to the assumption of an ideal technique (Moy et al. 2015). Our trainer now becomes aware of other governing assumptions like non-linear pedagogy and is able to connect the underlying assumption with his current practice, resulting in double-loop learning. In the current training session, our police trainer now guides the learner to find their individual solution to taking down a person rather than impressing upon them a prescribed technique. By further acknowledging, challenging and unpacking assumptions with increased awareness of the relativeness and role of theoretical concepts a higher level of reflection and learning takes place; this has been described as a triple loop learning process (Tosey et al. 2012). The guiding question here is "How do I decide what is right?", involving questioning of what Brookfield terms "paradigmatic assumptions" (Brookfield 2017): The structuring assumptions we employ to order the world into fundamental categories and that are hard to recognize. In our example, our police trainer Oliver-triple-reflecting on the situation-asks himself about when and why to apply a non-linear or linear approach to learning. He becomes aware that his drive towards a specific pedagogical approach is partially fuelled by what other coaches in his milieu favour at the moment. Oliver now starts asking why, when and for who he should apply a specific pedagogical approach. Reflecting on his current problem, he thinks about the pros and cons of linear and non-linear approaches to training in the context of the specific situation he encounters with his learner and reaches the tentative conclusion that he will currently employ a more linear approach to training with more augmented feedback on the take-down execution, because he feels, that his learner has difficulties in coordinating his movements, lacks creativity and is easily frustrated when he is not immediately succeeding in taking down his partner.

Aims and scope of the study
The reflective practitioner has been highlighted as a desirable outcome of coach education in police training (Körner and Staller 2018). Reflective practice is generally considered as a continuous interaction between planning and delivery that deepens understanding of the learner's own practice and experience and leads to more skilled practice (Schön 1983). From this viewpoint it seems problematic that police trainers seem to lack planning and reflecting processes as evidenced by Cushion (2018). However, it cannot be inferred that police trainers do not engage in planning and reflection of their teaching and delivery of programs from observation studies alone. Even though they do not have the time to discuss issues with each other (Cushion 2020) it can be assumed that they still engage in some form of planning and reflecting process. However, the content and the thought processes associated with these procedures is unknown. As such, the current study aims at eliciting these planning and reflecting processes by explicitly asking police trainers about their planning and reflecting on their training sessions.

Methods
The case study was conducted at a German Police Academy, where the recruits of the Hessian State police are trained and educated. The research reported builds upon observational study data collected during a case study of police training (Staller et al. 2021a, b, c). A study section with five full days of police training for two groups provided the analytical frame of the study with the object being the process of training (Thomas 2011), resulting in approx. 46 h of observed police training. The training was delivered by 4 police trainers with a mean age of 39 (SD = 6.16) years and an average of 13.00 (SD = 9.87) years of experience as a police trainer. All police trainers were male. The sample of police trainers consisted of the complete police trainer staff that delivered the training within the observational study. As such, the current study was constrained by the available timeframe and access to the police trainers within the observational study.

Data collection
Before and after the training sessions, the police trainers were interviewed using a semi-structured approach. Prior to the sessions, the interview was structured using the CPPRF. Specifically, questions pertained to (a) the place of the sessions within the bigger picture of police training; (b) the planned structure of the learning activities; (c) the planned teaching behaviours; and (d) the planned strategies to engage recruits in learning activities. After the sessions, police trainers were asked to comment on the training session. The complete data set comprised of pre-and postinterviews of 17 training sessions (5 Self-defence and Arrest training; 4 Firearms training; 8 Tactical training). The interviews were recorded and transcribed verbatim and subjected to further analysis. Copies of the anonymised interview transcripts are available upon request.

Data analysis
The analysis of the interviews was based on sound qualitative research techniques, using qualitative thematic analysis (Braun and Clarke 2006). The analytical strategy was chosen according to the objectives of the study aiming at investigating the broader context of planning from the coaches' perspective and identifying the respective thematic focus. Interviews were analysed using MAXQDA data analysis software (Version 18.2.0). The analysis involved independently familiarizing oneself with all of the data by reading transcriptions and listening to interview tapes. Transcripts were examined both inductively and deductively, with the deductive analysis based on the research question (CPPRF, single-loop, double-loop and triple-loop reflection on the training session). Raw data themes were identified and built up into meaningful themes and sub-themes. The number of times a raw data theme was referenced (a meaning unit) was tallied. Subsequently, samples of these data sets were re-examined by another member of the research team (SK). Issues of contention between the researchers (MS, SK) were resolved using questioning and debate (Abraham et al. 2006).

Results
The higher order themes included planning and considerations before the training session and reflection levels about the training session (see Table 1: Results of qualitative analysis of interviews displaying hierarchical themes). Concerning the planning of the training session, results revealed fiver lower order themes categorized according to the CPPRF: (a) the learning objectives of the training session, (b) the embedding in the overall structure of police training, (c) the structure of the specific training session, (d) the planned coach behaviour to support the learner in reaching their aims of the session and (e) the facilitation of learner engagement. Reflections of the training session included single-loop reflections and one incident of doubleloop reflection.

Planning and considerations before the training session
With regards to the learning objectives of the training session coaches most often referred to the topic of the session (e.g. weapon handling, arrest techniques): Table 1 Results of qualitative analysis of interviews displaying hierarchical themes Raw data theme (number of meaning units)

Lower order theme
Higher order theme Topic of the training session (e.g. Simulation training) (6) Learning objectives Planning and considerations before the training session Repetition of previous content (7) New technique / tactic (5) Determine performance status (3) Sensitisation (2) Relates to the previous training session (3) Embedding in the overall structure Important later on in the studies (3) According to the curriculum (2) Important as police officer (1) Distinct training segments (6) Design of learning activities Demand creation (5) Increasing complexity (2) Starting with repetition of previous content (2) Flexibility (2) Showing mistakes (4) Planned coach behaviour to support reaching aims of the training session Being supportive (4) Fun (4) Personal responsibility (3) Let them try themselves (3)  (2) Being motivating (2) Being positive (2) Show understanding (1) Pointing out relevance as police officer (6) Facilitation of learner engagement Make it fun (6) Create demand (4) Teach new things (2) Training activity per se (2) Participation as coach (2) Adaptation of one's own teaching (5) Reflection-in-action Single-loop reflection about the training session Implemented what I wanted (10) Positive evaluation of own teaching Happy with the result (7) Everybody had fun (4) Was a round thing (4) Participants improved in the course of the training session (3) Was able to add additional content (2) Debriefing took too long (1) Negative evaluation of own teaching Was not able to reach the aim of the training session (2) Not possible to engage all participants (1) Relativization of teaching practice Warm up too long-but yet necessary (1)  (1) Participants did not engage in independent practice (2) Satisfied with the performance of the participants (3) Satisfied with the performance of the participants Not enough coaches for too many participants (3) Organisational problems Too little time (2) Too much load from studies in the morning (2) Uncertain about giving group or individual feedback (1) Uncertainty about own behaviour Double-loop reflection about the training session "The goal is the shooting with two weapon systems, long weapon, short weapon, and to practice switching between the weapons." (Day 3, Firearms training) What learners should achieve by the end of a training session was not mentioned. Reference was also made to the repetition of previous content, and the introduction of new techniques or tactics.
"Today is a complete repetition, refreshment of the basic training and of course then one or two additional techniques or extensions of the basic knowledge." (Day 1, Tactical training) Coaches provided scarce information about how the specific training session is embedded in the overall structure of police training. They mainly referred to the content relating to previous training sessions and the content of the upcoming session being an essential element of police training, which will be built upon in future training sessions: "Then we'll pick up where we left off last week." (Day 3, Tactical training) However, coaches gave no further detail on how specifically the content fits into the overall structure of the curriculum.
Concerning the design of learning activities, coaches often described distinct training segments that appeared to be isolated from each other. For example, the self-defence and arrest coach outlining the structure of a training session on day 4: "A short general warm up; then I will explain the circuit training. Then we do techniques. Then repeating the arrest concept. We finish with boxing." (Day 4, Self-defence and arrest training) Coaches provided no rationale for this structure. Another common point of reference was the "creation of demand". Coaches refered to this term, when they created training tasks, where learners failed due to a technique or tactic, that they had not been introduced to yet. Demand creation was especially prevalent in tactical training: "Demand generation scenario, as I said. Then new things like opening doors and team behaviour in a 360 environment." (Day 3, Tactical training) The creation of demand was also identified within the context of the planned behaviour of the coach and applied by use of the strategy of showcasing the recruits' mistakes. Police trainers described this as a strategy with the intent to support the learners: "Yes, to show mistakes, and to improve them. That they can then better apply the techniques." (Day 5, Self-defence and arrest training) Coaches planned at being supportive in correcting the mistakes. They articulated an intent to help the recruits in improving their mistakes and to perform better: "[I want] to always stand aside to answer the questions, to do justice to everyone, to clear up all mistakes, to uncover all mistakes" (Day 3, Tactical training) The importance of projecting "fun" was mentioned several times as a planning principle. A coach referred to having fun himself, in order to make the training session fun for the students.
"Having fun. That I have fun in class myself and I think that's very important. When I have fun in class, so that my colleagues also notice that. So together we then can achieve goal of the training session in connection with fun." (Day 2, Tactical training) The element of fun was also stated quite often with regards to facilitating the engagement of the learners.
"Going in and out of class with fun". (Day 5, Self-defence and arrest training) Besides having fun, coaches also planned in pointing out the relevance of the content of the training session for the recruits as police officers. Specifically, police trainers mentioned the relevance with regards to training content for worst case scenarios, where officers do not have a second chance if they fail (e.g. a lethal knife attack).
"They [the recruits] should be aware that they need to be motivated. Because in a worst-case scenario we don't have a second chance. Here you must succeed the first time, otherwise they get injured or even die. And I think that should be the greatest motivation." (Day 1, Tactical training) Besides skills for worst-case scenario, skills that are needed on a daily basis were also pointed out as relevant for the recruits: "The need to know the importance of the arrest technique. That they need the technique in their daily life. Because of that, they should practice accordingly and be serious during the training session." (Day 3, Self-defence and arrest training) Coaches stated a belief that the relevance the content itself provides the motivation to engage the learner with the subject material.
"The training content must be enough motivation." (Day 5, Tactical training) Taken together, concerning the planning before training sessions the results showed, that police trainers reported a specific topic as the learning objective without referencing what the learner should achieve during the session. The topic was embedded in the long-term development insofar that the current content follows the previous content and that it is necessary for the content in the next training session. The learning activities were designed and structured as isolated elements. Police trainers reported they designed training tasks where recruits fail in order to create demand and to highlight their mistakes. Being supportive and having fun themselves were reported as dominant planned coaching behaviours. Finally, learner engagement was reported to be facilitated by the relevance of the content and the element of fun.

Single-loop reflection about the training session
The interviews immediately following the training sessions resulted in a number of different types of reflections, which, for the most part, were single-loop in nature. The interviews showed that coaches reflected upon their reflection in-action while conducted their training session. Police trainers mentioned that they adapted their teaching when they encountered problems in the delivery of the session. Coaches spoke of adapting the task difficulty when they saw that recruits were not experiencing success of hitting the assigned target area. A firearms trainer provided a rationale for this decision.
" […] in the further process I designed the exercises a bit simpler, more balanced, in order to give a positive feeling" (Day 4, Firearms training) Concerning adaptation in other fields of police training, one coach of tactical training recalled that he scanned the recruits for their motivation and how they were engaging to adapt his program accordingly. The tactical trainer further reported that he perceived that the recruits were tired from the lessons in the morning and adapted the pace of the training session accordingly by teaching "step-by-step".
Reflecting on their teaching performance, police trainers evaluated their teaching very positively in terms of being able to implement the specific planned content: "And yes, so it worked out well, training goal achieved. So all components that we had planned, they were implemented accordingly". (Day 1, Tactical training) It was often mentioned that they were pleased with the result of the session and that the delivery itself was fun. One police trainer regularly mentioned that the session itself was a "round thing", meaning that everything worked out nicely without any problems. "Yes, it was a round thing today". (Day 3, Self-defence and arrest training) The analysis revealed one account in which a police trainer negatively evaluated an aspect about their teaching. It was an admission from the coach that they did not achieve the aim of the training session. They attributed this to an organisational problem.
"Then the main part, the [entering of] unclear areas I had been able to implement. But, I have to admit that due to the fact that I am alone and the size of the group I did not completely meet the aim of the training session." (Day 3, Tactical training) Although this was the only negative single-loop reflection of police trainers teaching performance recorded; there were accounts of relativization of teaching practice. These accounts gave the impression that coaches were not explicitly satisfied with a session; however, a rationale was put forward to relativize the decisions made. In the following example, the coach appeared to be well aware of the inclusion of a very long warm-up that included a fitness circuit. Even though this part of the session was not connected to the curriculum content of self-defence and arrest training, the coach provided a rationale for his decision: "The warm-up might have been a bit long, but I think it's necessary to keep people physically fit." (Day 3, Self-defence and arrest training) Concerning the performance of the learners, coaches pointed out the deficits of the recruits on a regular basis. It was mentioned that the recruits had forgotten a lot: "The problem is after one week a lot has already been forgotten." (Day 3, Tactical training) The perception that recruits had forgotten or were not able to perform the taught skills was attributed to periods of field training where recruits did not have access to police training in an academy setting. However, this was not an argument for the problem that recruits were perceived having forgotten the content from a training session two weeks earlier.
Coaches also pointed out deficits in the motivation of the young officers to practice, even though it was clear that their skills needed improving. A firearms coach complained that recruits did not use the opportunity to practice with inert weapons at the back of the shooting range. Instead, they waited passively until it was their turn.
"What I didn't like was that the offered dry training with red weapons was not used [by the recruits], because you could see that there were still difficulties in shooting while moving". (Day 3, Firearms training) Although coaches reported deficits in skill retention and motivation, the coaches did also comment on the satisfactory performance of recruits over the course of a training session: "And in the further process [in the training session] shooting while moving and the weapon change were constantly repeated and in the end I had a good impression that they can shoot well while moving and when they have to change their weapon." (Day 5, Firearms training) However, reflections on satisfactory performance were reserved for advances made by recruits within a session. It is noteworthy that coaches did not mention performance improvements between the training sessions.
Several coaches mentioned organizational problems impacting their delivery of police training, including too little time for training to be effective, the general load, both cognitive and physical, recruits had to endure on a training day, and a low coach to recruit ratio. For example, after a tactical training session a coach commented on the class size too big for him as a single coach.
"But due to the size of the group, I couldn't get all the way to the aim of the training session. And when larger study groups come, it will get more difficult." (Day 2, Tactical training) The coach, in this example, is referring to an anticipated rise in numbers of recruits in the near future. He further explained, that due to current increases in the recruitment of police officers large group sizes will be the standard for coaches to handle.

Double-loop and triple-loop reflection about the training session
There was one account of uncertainty regarding an adopted coaching behaviour that can be considered a double-loop reflection. Specifically, a coach mentioned a discussion with his peers about addressing issues with a recruit individually or in front of the whole group: "We have also just sat down ourselves within the coaches and discussed that we might have been able to optimize a few things. Possibly we should address things in front of the whole group and not only individually, thus problems, so that they don't arise in the next group, because we have already seen them visually or heard them from the others." (Day 1, Tactical training) This was the only evidence in the interviews that suggested levels of reflection that went beyond single-loop learning. The data showed not events of triple-loop reflection.

Discussion
The current study investigated the planning and reflecting processes of police trainers around the delivery of their training sessions. The interviews of the police trainers provided insights into the question of (a) what and how they planned for the delivery of their training session and (b) how and to what level they reflected upon the training session.

(Reflective) planning decisions
Concerning police trainers planning up-front of their training sessions to progress police recruits towards their long-term development goal, the results yielded several findings that related to the interdependent areas of the CPPRF (Muir et al. , 2011. Given the centrality of aligning the controllable elements of a training session (i.e. the structure and design of learning activities and coach behaviour) to maximise learner engagement with the long-term goals of learner development, we first discuss the alignment of the learning objectives in the long-term development plan before, secondly, focusing on strategies of trainers to maximise learner engagement in order to reach their learning objectives.
The interviewed police trainers did not appear to have a clear conceptualization of the learning objectives of the training session and its place within the macrostructure of police training. Connections to the long-term development plan were only made concerning to the last and the following training session. The rationale for the practicing on a specific training content (e.g. a takedown) was that it followed the last content and precedes the next. This likely contributed to the isolated, disconnected elements of the curriculum that were observed in the observational study of the police trainers' delivery (Staller et al. 2021a, b, c). This disconnection between elements of the curriculum has also been reported by other studies in the context of policing (Cushion 2020). This disconnection is problematic insofar that no systematic progression towards an overall development goal is possible. A constructive alignment of a police training program would demand that the intended learning objectives arising from the police officers needs in the field become the basis for designing long-, medium-, and short-term plans. These learning objectives then provide a key reference point from which police trainers and decision-makers can monitor and adjust the effectiveness of their programs, Staller et al. 2021a, b, c). The key reference point in our study seemed to be the number of the training session, with the police training manual stating what should be taught in which training session (Staller et al. 2021a, b, c), an approach that was also evidenced by Cushion (2020). This suggests that the purposeful alignment and structure of training content seems an issue to be optimized within police training coach learning.
Second concerning the controllable elements of coaching (i.e. coaching behaviour & activity design), the police trainers focused on two overarching aspects that they planned to compliment through their coaching behaviour and the structure and design of learning activities: (a) making the learning experience fun and (b) the relevance of the to be covered material.
As it concerns "fun", police trainers tried to have fun by themselves and to have fun with the recruits. Police trainers provided no explanation of how they create a fun learning environment. However, as interview data from recruits about the same coaches in another study showed (Staller et al. 2021b) police trainers regularly achieve this aim. Aligning coaching behaviours to make the learning fun is in line with current recommendations (Beni et al. 2017), since having fun has recurrently been stated as a reasonto participate in youth sports and its absence as a reason for dropout (Visek et al. 2015;Bengoechea et al. 2004). Furthermore, it has been described as central to meaningful activity experiences (Jakobsson et al. 2014;Smith and Parr 2006). However, Beni et al. (2017) recommend not to prioritize fun at the expense of other criteria of meaningful experiences.
Police trainers also plan to provide meaningful experiences by providing content, that is relevant and thus meaningful for the recruits. As such, they regularly communicated the relevance of the taught material in order to promote learner engagement. Data from interview studies showed (Staller et al. 2021b) that recruits are motivated by the perceived relevance of the content in police training. However, studies have shown that trainees sometimes perceive that there are differences in the content and its delivery between the police training learning environment and the field (Rajakaruna et al. 2017;Renden et al. 2015;Staller et al. 2021a). for developing the skills they perceive they need it may impact motivation. Police trainers seem to be aware of this by employing strategies to bolster the relevance of the taught content. They created learning tasks where recruits were "shot" to create the demand for learning a specific tactic to avoid getting shot. The benefit of learning from errors in complex environments is well-document throughout the literature (Metcalfe 2017;Piggott 2007), particularly, when receiving corrective feedback, including an analysis of the reasoning leading to the mistake (Morrison and Meliza 1999). Police trainers planned for the congruent coaching behaviour by showing the recruits the mistakes they made and by subsequently being supportive in managing the mistakes and prescribngtechniques and tactics.
Police trainers perceived the provided content (techniques and tactics) and the way of provision as useful and hence, relevant for the recruits. Data from the related observational study (Staller et al. 2021a, b, c) showed that police trainers predominately engaged recruits in repetitive, isolated practice with high amounts of corrective feedback. This points towards a problematic issue: That what police trainers think is most effective-technique/tactic to succeed in the field or pedagogical approach to best facilitate learning-is not necessarily the best approach. For example, police trainers design the isolated learning activities with the best intentions, namely to promote learner engagement, since they (the trainers) think that recruits perceive the content and how it is delivered as relevant per se. Police trainers think that through structuring learning of techniques and tactics in these linear isolated processes recruits develop the skills needed (i.e. their long-term development goal). However, research has shown, that diametral to the police trainers' intentions, repetitive and isolated practice does not facilitate learner motivation and engagement within this social milieu (Staller et al. 2021b) and that other pedagogical approaches (e.g. non-linear pedagogy) may be more effective in this specific context (Körner and Staller 2018). Likewise, concerning the content of specific techniques or tactics, interview data repetitively showed that what is learned in training is not necessarily that what is needed in the field (Staller et al. 2021a;Renden et al. 2015;Jager et al. 2013).
To tackle this issue, police trainers have to be aware of their governing assumptions about what is needed and what works in the field (e.g. techniques, tactics, skills) and which pedagogical approach governs their teaching behaviour. Being aware of theses underlying assumptions constitutes the need for double-loop reflection (Tosey et al. 2012;Brookfield 2013).

The depth of reflection
With one exception, coaches engaged in single-loop reflection when reviewing the training session they had just delivered. Coaches reflected what has happened within the training session based on their causal and prescriptive assumptions (Brookfield 2013). Based on what their assumptions about what should happen in a particular situation and under what conditions a specific behaviour changes, coaches generally evaluated the training session positively. At the same time, they pointed towards deficits within the students, especially, that they did not retain taught skills from the last sessions. For coaches, this lack of skill retention provided valuable data concerning the monitoring of their training program. From here adjustments in the planning and delivery of the program can be made in order to further progress towards the long-term goal Otte et al. 2019;Farrow and Robertson 2017). However, in the light of their governing assumptions of what constitutes a good training session (i.e. single-loop reflection), they felt they delivered the training well. It can be assumed that the consequence herein can be seen in the other areas of reflection: if they did well the lack of skill progression within the recruits must be attributed to something else, like organisational problems or the need for more physical fitness, as evidenced by the reported relativization strategies. Indeed, police trainers in a recent interview study  pointed towards the general deficits of young recruits in our society in explaining what has changed in police training over time. In the light of the current evidence that police trainers mainly engage in single-loop reflection, this may seem to be an adaptive strategy to cope with the monitored lack of skill progression (Vries and Timmins 2015; Martinie and Fointiat 2006;Shultz and Lepper 1996).
The reported incident by one police trainer of being unsure about a specific approach in highlighting the mistake of a recruit, indicates that police trainers do engage in double-loop reflection. However, compared to the extensive amount of single-loop reflection, this may be problematic with regards to delivering effective programs in police training. There is a consensus that effective coaching depends on the coach's ability to continually reflect, adapt and innovate (Gallimore et al. 2014). Not surprisingly, coach education programs in the sport domain are designed to prepare coaches for ongoing learning, emphasizing the importance of having them reflect on problems they encounter within their working environments (Gilbert et al. 2009;Trudel et al. 2010). Teaching critical thinking with regards to one's own coaching practice would involve being alert of the underlying assumptions of practice (Brookfield 2013), setting the base for being able to challenge and refine (if needed) these governing ideas (Argyris 2003;Tosey et al. 2012).
In defence of the police trainers, coaching courses in police training seem not to systematically incorporate elements of reflective practice, in Germany at least . In his case study, Cushion (2020) concluded, that "instructors […] are time pressed and have limited time for reflection on their own practice and are therefore constrained and limited by what they know" (p. 13). Since ongoing critical reflection has been advocated as an important tool for professional practice in law enforcement in general (Christopher 2015) and for police training specifically Körner and Staller 2018), coach development programs should be adapted accordingly.

Practical implications and further research
The reported data can be geared towards two major issues that are worth considering in order to improve practice. First, our data indicated a lack of constructive alignment of the program, resulting in non-specific learning objectives for each session, making it hard for police trainers to plan and adjust their behaviours and the structure and design of learning activities to reach the overarching aims of the training program. In order to improve practice, police trainers and curriculum designers might consider a clearer conceptualisation of a long-term development plan for police recruits. This may involve questioning and reasoning what a competent police officer actually needs concerning his or her operational skills in the field (Rajakaruna et al. 2017). Police trainers also could enhance their ability to coherently align training sessions towards an objective that reaches beyond the last and next training session. However, police organisations have to reflect critically if they provide police trainers with the necessary tools and skills that allow them to constructively align training programs. As such coach development and learning programs in police training may consider focus on (a) the alignment of learning objectives of specific training session to medium-and long-term developmental goals and (b) how coaches then can work towards these aims (e.g. by using the CCPRF).
Second, the current study highlights the problems associated with a lack of higher levels of reflection in police training. Not going beyond single-loop reflection may lead to the manifestation of ineffective training delivery on an individual level, since assumptions about what is needed and which approach is best to achieve it are not questioned. . In this light, we suggest that decision-makers and coach developers in police training focus on enhancing the reflexivity of police trainers, especially concerning higher levels of reflection. This includes the ability to reflect on assumptions and models that underpin the chosen learning and coaching process (Chow et al. 2016), including those premises that underpin this reflection process itself (Körner and Staller 2018).
Concerning further research, the results of the current study point out various avenues. With regard to the documented planning and reflection processes of police trainers and the practical implications aiming at coach learning and development a set of further questions arise. Studies focusing on the police trainer may ask: What are the developmental pathways of police trainers? Where do coaches get their knowledge from? What are the formal and informal sources of coaching knowledge in police training? What information concerning coaching do police trainers attend to? How does reflection of the coaching process effect future coaching practice and vice versa: How does learning improve reflection? Research questions aiming at the organisational structures supporting or hindering coach learning and development may include the following: How do police organisation implement coach learning and development for police trainers? How are coaching development courses designed and delivered? Since currently not much is known about police trainer learning and development research into such question would provide valuable insights.
From the perspective of application research (Lyle 2018b) and linked to the capacity of critical reflection of coaches, further research could focus on further developing the reflective practitioner in police training (Körner and Staller 2018;Schön 1983). This may include designing and evaluating programs to enhance reflective thinking within the domain of police training.
Finally, concerning the underlying assumptions of coaching practice and what is needed as a police officer, we would encourage researchers to further investigate two areas that would provide a basis for double-and triple loop reflection: First, to investigate the benefits and drawbacks of different pedagogical approaches in police training and, second, to further examine the needs of frontline police officers concerning their capability to manage operational situations and conflict.

Limitations
Our employed methodology has several inherent weaknesses that should be acknowledged. First, the interviews were conducted immediately before and after the training sessions. It may be possible that due to this immediacy police trainers focused on microlevel planning. In both cases this may have limited the depth of the reporting of their planning and reflecting. Second, the interviewers (SK, VH, MS) in this study had a positive reputation within the police academy studied. As a result, it remains possible that coaches wanted to impression manage the opinions of the interviewers (Leary and Kowalski 1990) by presenting their view of competence, resulting in few accounts of negative or critical evaluation of their practice. Future studies may consider highlighting reflexivity as a sign of competence towards the police trainers. Third, the current study relates to data collection in a case study (Staller et al. 2021a, b, c), which took place at a specific time period in the education of young police officers in a specific training institute. Therefore, caution should be taken in generalising the results to other law enforcement agencies at a national or international level. As such, the presented thought processes require validation. Fourth, results may contain gender bias since all trainers interviewed were male. As such, results may be limited to gender-based perspectives calling for further research including female and non-binary police trainers. However, despite the limitation, the employed methodology provided insights into a rather closed process. Especially, since police institutions in Germany have been reluctant to provide access to research coaching in police training . For the first time, the current study provides insight into the planning and reflection of police trainers.

Conclusion
The current study aimed at eliciting the planning and reflecting processes of police trainers regarding the delivery of their training sessions. Our results indicated that police trainers employ two main strategies in order to progress their learners towards the aims of the training session. First, they focus on making the learning experience fun, and, second, they point out the relevance of the to be learnt skills by creating the demand, showcasing mistakes and then subsequently focusing on developing the needed skills in isolated contexts. However, police trainers struggle to conceptualize the immediate learning objectives and embedding them in the long-term development plan for police recruits. Concerning the reflection after the delivery of the sessions, the results yielded that police trainers mainly evaluated their training positively and identified deficits in the progress of their recruits' progression. Furthermore, higher levels of reflection, such as questioning governing assumptions and beliefs about training approaches and what is needed for recruits in the field (e.g. long-term development), were almost absent.
Taken together, this indicates a need for higher levels of reflection and learning within police trainers. Not going beyond single-loop reflection processes-"Did I do it right?" -, may lead to the manifestation of ineffective training delivery strategies and providing recruits with content that is not necessarily needed for long term development. Instead, police trainers should also engage in higher levels of reflection questioning-"Am I doing the right things?" and "How do I decide what is the right thing?". In this light, we suggest that decision-makers and coach developers in police training focus on enhancing the reflexivity of police trainers, especially concerning higher levels of reflection.