Structural validation of three German versions of behavioral and motivational scales in high-risk sports

Abstract
The aim of the present research was to validate German language versions of three inventories in high-risk sports to facilitate future research in the significant population of German-speaking high-risk sports participants. We translated the Sensation Seeking, Emotion Regulation and Agency Scale (SEAS), the Risk-Taking Inventory and the Accidents and Close Calls in Sport Inventory into German, then tested the hypothesized factor structures with 719 high-risk sport participants from the European Alps using Bayesian structural equation modelling (BSEM). The final models were all good fits to the data, had good internal consistency and displayed adequate discriminant validity. All inventories displayed the same factor structure as in the English inventories bar the G-SEAS After inventory, in which a three-factor model fitted better than a two-factor model. Possible reasons for this difference include differences in the sample population, translation bias, or cross-cultural differences; however, it seems likely that the nuanced approach of BSEM allowed this study to disentangle emotion regulation transfer from agency transfer after participating in high-risk sport where previous attempts using other methods have failed to. This will allow future research in high-risk sport to be conducted beyond English-speaking populations and, more significantly, facilitate the investigation of differences between the transfer effects of agency and emotion regulation.


Introduction
10) defined high-risk sports as "all sports where you have to reckon with the possibility of serious injury or death as an inherent part of the activity". High-risk sports, such as freeride skiing, paragliding and mountaineering, are no longer fringe activities with few participants, but are increasingly popular and have become a socially acceptable form of risk-taking (Pain & Pain, 2005; Turner, McClure, & Pirozzo, 2004).
Risk-taking research has long been dominated by Zuckerman's Sensation Seeking Theory (Zuckerman, 2008). The construct of Sensation Seeking was discussed as the major motive for starting and maintaining health-risk behaviors such as drug taking, gambling and also participation in high-risk sports. Indeed, Zuckerman's sensation seeking questionnaire, the Sensation Seeking Scale V (SSS-V; Zuckerman 1994), has been termed "synonymous" (p. 414) with risk-taking research (Llewellyn & Sanchez, 2008). However, measuring motivation to engage in high-risk sports through the SSS-V is biased. Many items of the Thrill and Adventure Seeking subscale of the SSS-V relate to the willingness of the participants to engage in high-risk sports (e.g., mountain climbing); however, these items are somewhat tautological when assessing sensation seeking within a population of high-risk sports participants (Llewellyn & Sanchez, 2008).
Suggesting that sensation seeking is the single panoptic explanation for voluntary engagement in sports as diverse as Himalayan mountaineering (e.g., extended duration, long periods of boredom, physically painful) and skydiving (e.g., very limited duration, characterized by thrilling enjoyable sensations) seems overly simplistic, and several studies have shown that the motives for participation in high-risk sport are more varied than this (Barlow et al., 2015; Barlow, Woodman, & Hardy, 2013; Castanier, Le Scanff, & Woodman, 2010; Castanier, Le Scanff, & Woodman, 2011; Frühauf, Hardy, Pfoestl, Hoellen, & Kopp, 2017; Kerr & Houge Mackenzie, 2012; Lafollie & Le Scanff, 2007; Woodman et al., 2013; Woodman, Hardy, Barlow, & Le Scanff, 2010; Woodman, Huggins, Le Scanff, & Cazenave, 2009). A number of qualitative studies have uncovered additional and alternative motives for participation in high-risk sports (e.g., emotion regulation, agency, challenge, nature) (Brymer, 2010; Brymer & Gray, 2010; Frühauf et al., 2017; Kerr & Houge Mackenzie, 2012; Willig, 2008; Woodman et al., 2010). In light of these developments in understanding the motivations for participation in high-risk sports, a number of quantitative tools have been developed. Barlow et al. (2013) established the Sensation Seeking, Emotion Regulation and Agency Scale (SEAS), a series of inventories that measure the following: the need for sensation, difficulty with emotion regulation, and lack of agency between bouts of participation in high-risk sports; the experience of sensation, emotion regulation, and agency while participating; and the transfer of sensation, emotion regulation, and agency following participation. This was based on research showing that participants in prolonged high-risk activities have difficulty with emotion regulation and a diminished sense of agency in aspects of their life and thus might participate in those high-risk sport activities to experience agency and become aware of their emotions.
Barlow et al. (2013) developed the SEAS using a variety of participants who took part in both high-risk sports (e.g., mountaineering and skydiving) and low-risk sports (e.g., basketball and hockey). In doing so, they found that some activities might be motivated by the sensations of the activity (e.g., skydiving) and others by the emotion regulation and agency transfers (e.g., mountaineering).
Understanding the motives for participation in high-risk sports allows researchers to better comprehend the potential benefits and risks to participants. Nevertheless, the objective risk of the activity is undeniable, which is underlined by the higher rates of both accidents and close calls in high-risk sports than in low-risk sports (Barlow et al., 2015; Gosteli et al., 2016). Close calls are defined as "incidents that come very close to resulting in a negative outcome" (Woodman et al., 2013, p. 480). To improve safety in high-risk sports, two further aspects have to be taken into account, namely objective risks and participants' behavior. Objective risks (e.g., environmental hazards such as avalanches) must be accepted as an inherent aspect of participation in high-risk sport, but participants are not risk-takers per se since they are able to influence their risk exposure by adapting their behavior (Gosteli et al., 2016; Leiter & Rheinberger, 2016; Llewellyn & Sanchez, 2008). Whereas objective risks cannot be modified, participants' behavior appears to comprise two orthogonal dimensions: deliberate risk-taking and precautionary behavior. To contribute to the understanding of these behaviors, the Risk-Taking Inventory (RTI) was developed to measure precautionary behavior (PB) and deliberate risk-taking (DRT) in high-risk sport participants. Recent research suggests that behavior in high-risk sport, namely in freeriding, changes based on individuals' experiences of accidents and close calls (Frühauf et al., 2017). Thus, it is important to quantify accidents and close calls and relate them to participants' behaviors. This can be done by using the Accidents and Close Calls in Sports Inventory (ACCSI; Barlow et al., 2015). Research showed that accidents and close calls were positively correlated with DRT and negatively correlated with PB (Barlow et al., 2015).
At present, the SEAS, RTI, and ACCSI are available only in English, and the scales were validated with English-speaking participants. However, there are differences in the amount of leisure time physical activity across European countries (Martínez-González et al., 2001), and there is also evidence for cross-cultural differences in risk-taking (Mata, Josef, & Hertwig, 2016) regarding risk behaviors like gambling or speeding (Molinaro et al., 2014; Wallén Warner, Ozkan, & Lajunen, 2009).
Furthermore, there is a dearth of validated measures for carrying out research in non-English-speaking high-risk sport populations. When considering that "The Alps comprise the largest and most popular sports region in Europe" (p. 1) and that many alpine sports are classified as high-risk sports (e.g., ski touring, mountaineering, mountain biking, rock and ice climbing and paragliding; Burtscher, 2008), it becomes clear that there is a need for validated measures for conducting research in non-English-speaking high-risk sport populations. Austria is just one German-speaking alpine country, with almost one third of the 180,000 km² of mountainous area in the Alps (Burtscher, 2008). Thus, as a first step towards filling the lacuna highlighted above, the aim of the present research was to validate German language versions of the SEAS, RTI, and ACCSI.

Procedure
Following institutional approval by the Board for Ethical Questions in accordance with the Declaration of Helsinki, we collected the data using a web-based questionnaire in a cross-sectional design. We recruited participants from a number of different high-risk sports via emails to students and employees of the first author's university and to local sports clubs (e.g., paragliding association). All participants completed the survey online. Participants who did not finish the survey were excluded from the analyses.

Participants
The final number of participants was 719 (25% female), with the highest number of individuals performing various disciplines in paragliding (59%). Further, the sample consisted of freeride skiers (14%), mountain trail runners (16%), freestyle skiers/snowboarders (7%), as well as mountaineering athletes (4%). The participants had a mean age of 35.4 (±11.6) years and reported an average of 7.3 (±6.1) years of experience. Age and experience varied between sport activities, with the youngest age and lowest years of experience in freestyle skiing and snowboarding (age: 23.0 ± 3.8 years; years of experience: 6.0 ± 3.6 years) and the oldest age and most years of experience in paragliding (age: 38.9 ± 11.4 years; years of experience: 7.9 ± 6.5 years). Female participation ranged from 19% in freestyle skiing/snowboarding to 32% in mountain trail running.

Sensation Seeking, Emotion Regulation and Agency Scale (SEAS)
The SEAS comprises three separate inventories which measure three different factors at three different times, namely Between participation, While participating, and After participation. The Between participation inventory evaluates the time when not participating for a significant amount of time and measures need for sensation, difficulty with emotion regulation and lack of agency. The While inventory evaluates the experience of sensation seeking, emotion regulation and agency during participation. The After inventory measures the transfer of sensation, emotion regulation, and agency in the time following participation. Each inventory contains 18 items with a seven-point Likert scale response mode ranging from one (completely disagree) to seven (completely agree). Barlow et al. (2013) found evidence to support a three-factor structure for the Between and While inventories; however, they found that a two-factor model was a better fit to the data for the After inventory, with Agency and Emotion Regulation being combined into a single factor (i.e., agentic emotion regulation). Cronbach's alpha (α) displayed good internal consistency for each factor: Between participation inventory (α ≥ 0.84), While participating inventory (α ≥ 0.70), and After participation inventory (α ≥ 0.89). The SEAS factors correlated with established measures of sensation seeking, emotion regulation and agency.

Risk-Taking Inventory (RTI)
The RTI measures risk-taking in high-risk sports across two orthogonal factors, deliberate risk-taking (DRT, three items) and precautionary behaviors (PB, four items), on a seven-item, five-point Likert scale.

Keywords
Risk-taking · Sensation seeking · Emotion regulation · Agency · Bayesian statistics

Accidents and Close Calls in Sport Inventory (ACCSI)
The ACCSI (Barlow et al., 2015) is a six-item, two-factor inventory asking about experienced accidents (three items) and close calls (three items) on a seven-point Likert scale from one (never) to seven (always). A good model fit was confirmed in varying samples (Barlow et al., 2015). Moderate correlations between DRT and Accidents (r = 0.31 to 0.54) and Close Calls (r = 0.52 to 0.64) were shown. PB showed weaker, negative correlations with Accidents (r = -0.02 to -0.33) and Close Calls (r = -0.10 to -0.34).

German scale development
We translated the items following the guidelines of Guillemin, Bombardier, and Beaton (1993). The items were translated into German by a group of Sports Science Masters students, who were all fluent in German, and they were asked to note any remarks and questions while translating. The group met to discuss items and phrases until full consensus was reached. In the next step, the translated and original items were sent to three experts from the fields of health psychology and sport science who were equally fluent in English and German. They were asked whether the German items were an accurate translation of the original English items. When they identified problematic items, we modified them, and the process was repeated until the experts agreed that all of the German items accurately represented the meaning of the original English ones.

Statistical analyses
We tested the hypothesized factor structure of each inventory using Bayesian structural equation modelling (BSEM) in Mplus 7 (Muthén & Muthén, 2012), estimating BSEMs with weakly informative priors for approximate zero cross-loadings and residual correlations, as recommended by Muthén and Asparouhov (2012). Each BSEM was estimated using the Markov chain Monte Carlo (MCMC) simulation procedure, using the Gibbs sampler over 200,000 iterations across two MCMC chains in order to assess model convergence and stability of estimates. Model convergence can be assessed in a number of ways (Kaplan & Depaoli, 2012). In this study, we used the Gelman-Rubin convergence diagnostic (potential scale reduction factor; PSR) and Kolmogorov-Smirnov (K-S) tests. We assessed model fit by examining factor loadings, the posterior predictive p (PPp) value, and the results of likelihood χ² tests, which examine differences between the model-generated and the observed data. Excellent model fit is indicated by PPp values of approximately 0.50 together with a symmetric 95% credibility interval (CI) centered on zero (Muthén & Asparouhov, 2012). We assessed internal consistency using the composite reliability coefficient (Fornell & Larcker, 1981) and discriminant validity using the latent variable correlations obtained from the BSEMs.
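The PSR criterion named above can be illustrated with a minimal sketch. It implements the textbook Gelman-Rubin form for one parameter sampled in several chains; the function name and the toy chains are hypothetical, and the exact computation inside Mplus may differ in detail.

```python
from math import sqrt
from statistics import mean, variance


def potential_scale_reduction(chains):
    """Textbook Gelman-Rubin PSR for one parameter.

    `chains` is a list of equally long lists of posterior draws.
    Values close to 1 suggest the chains agree with one another.
    """
    n = len(chains[0])
    if len(chains) < 2 or n < 2 or any(len(c) != n for c in chains):
        raise ValueError("need at least two chains of equal length >= 2")
    # Average of the within-chain sample variances.
    within = mean([variance(c) for c in chains])
    # Between-chain variance of the chain means, scaled by chain length.
    between = n * variance([mean(c) for c in chains])
    # Pooled estimate of the posterior variance.
    pooled = (n - 1) / n * within + between / n
    return sqrt(pooled / within)


# Hypothetical draws: two chains stuck in different regions give a large PSR.
stuck = potential_scale_reduction([[0, 1, 0, 1], [10, 11, 10, 11]])
print(stuck > 1.1)
```

In practice the diagnostic would be computed for every free parameter, and the largest value compared against a cut-off such as the 1.1 threshold reported in the results.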
Where good model fit or convergence was not reached, we re-examined the items in each scale to identify which items were problematic. To improve model fit, we removed items from each inventory; items were considered for removal based on the following criteria: low factor loadings (< 0.6); theoretical (ir)relevance; substantive cross-loadings or correlated residual variances (exceeding ±0.2); parameters with significant K-S tests; the highest PSR; or trace plots which showed poor mixing of the MCMC chains. Once identified, we removed these items and re-estimated the BSEM. We repeated this process until the final model for each inventory was both statistically and theoretically sound; it was important that item removal could be justified both statistically and theoretically to avoid making modifications based on sample-specific, chance characteristics of the data, which then may not represent the relationships among variables in the wider population (Biddle, Markland, Gilbourne, Chatzisarantis, & Sparkes, 2001).
Given that Barlow et al. (2013) found a two-factor structure to be most appropriate for the SEAS After inventory, we analyzed our data with two- and three-factor models in order to see if the translated version confirmed these initial findings. In addition to the criteria mentioned above, we used the Deviance Information Criterion (DIC) to compare the two- and three-factor G-SEAS After inventory, as recommended by Asparouhov, Muthén, and Morin (2015).
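The numeric screening criteria above (major loadings below 0.6, cross-loadings or residual correlations exceeding ±0.2) can be sketched as a simple flagging routine. This is only an illustration under stated assumptions: the function name, data layout, and item labels are hypothetical, and it covers none of the theoretical, K-S, PSR, or trace-plot criteria that also informed item removal.

```python
def flag_items(major_loadings, cross_loadings, residual_corrs,
               min_loading=0.6, max_abs=0.2):
    """Return {item: [reasons]} for items that meet a numeric removal criterion.

    major_loadings: {item: loading on the intended factor}
    cross_loadings: {item: [loadings on the other factors]}
    residual_corrs: {(item_a, item_b): residual correlation}
    """
    flags = {}
    for item, loading in major_loadings.items():
        reasons = []
        if loading < min_loading:
            reasons.append("low major loading")
        if any(abs(v) > max_abs for v in cross_loadings.get(item, [])):
            reasons.append("substantive cross-loading")
        if any(abs(r) > max_abs
               for pair, r in residual_corrs.items() if item in pair):
            reasons.append("correlated residual")
        if reasons:
            flags[item] = reasons
    return flags


# Hypothetical estimates for three items of one inventory.
example = flag_items(
    major_loadings={"item1": 0.49, "item2": 0.75, "item3": 0.81},
    cross_loadings={"item1": [0.05, -0.03], "item2": [0.02, 0.01], "item3": [0.0, 0.04]},
    residual_corrs={("item2", "item3"): 0.27},
)
print(example)
```

A flag only marks a candidate; following the procedure described above, an item would be removed only if the removal was also theoretically defensible.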
Once the final models were established, we performed a sensitivity analysis, as the choice of priors can affect the parameter estimates (Muthén & Asparouhov, 2012; Stenling, Ivarsson, Johnson, & Lindwall, 2015). To do so, we re-ran the final models for each inventory with smaller (0.005) and larger (0.015) prior variances for the cross-loadings, before comparing the parameter estimates for discrepancies between these models and the estimates obtained with a prior variance of 0.01. We also weighted items with their respective factor loadings and then calculated Pearson's correlation coefficients to examine the relationships between subscale means.
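The loading-weighted subscale means and their correlations can be sketched as follows. The text does not state how the weights were normalized, so the weighted mean used here (sum of loading times response, divided by the sum of loadings) is an assumption, and all names and values are hypothetical.

```python
from math import sqrt


def weighted_subscale_mean(responses, loadings):
    """Loading-weighted mean of one participant's item responses (assumed form)."""
    return sum(l * r for l, r in zip(loadings, responses)) / sum(loadings)


def pearson_r(x, y):
    """Pearson product-moment correlation between two equally long sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical data: three participants, two three-item subscales.
drt_loadings = [0.8, 0.7, 0.9]
ss_loadings = [0.6, 0.8, 0.7]
drt = [weighted_subscale_mean(r, drt_loadings) for r in ([1, 2, 1], [3, 3, 4], [5, 4, 5])]
ss = [weighted_subscale_mean(r, ss_loadings) for r in ([2, 3, 2], [4, 4, 5], [6, 7, 6])]
print(pearson_r(drt, ss))
```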

Model fit and convergence
Visual inspection of trace plots for all parameters supported convergence (i.e., showed good mixing and no upward or downward trends). In addition, the PSR value for all models fell below 1.1 during the warm-up phase of the simulations and remained below 1.1 for the remaining iterations (Table 1), and there were no significant K-S tests for any of the five models. For each G-SEAS inventory, the initial 18-item BSEM models with non-informative priors achieved adequate convergence and all items had significant loadings on their intended factors only. However, a number of other problems meant that the models were not deemed suitable. In the Between inventory, two items had problematic correlated residuals and one major factor loading was low (0.49). In the While inventory, there were ten problematic correlated residuals and three major factor loadings were low (0.53 to 0.58). The three-factor After inventory had five problematic correlated residuals and one low major factor loading (0.56). The two-factor After model had one low major factor loading (0.56) and 17 correlated residuals that exceeded their a priori limits. Through an iterative process, four items were removed from each G-SEAS inventory using the criteria outlined earlier (the remaining items can be seen in Tables 2, 3 and 4).
The BSEMs for the three 14-item G-SEAS inventories, the G-ACCSI, and the G-RTI with informative small-variance priors for cross-loadings and residual correlations had excellent fit, with PPp values of approximately 0.5 and symmetric 95% posterior predictive CIs centered on zero (Table 1). The major factor loadings in each inventory were significant, acceptable, and in the hypothesized direction. Furthermore, there were no cross-loadings that exceeded their a priori limits (Tables 2, 3, 4, 5 and 6). There were no correlated residuals that exceeded their a priori bounds in the Between, While, or three-factor After participation G-SEAS inventories, the G-RTI, or the G-ACCSI. However, there were four correlated residuals that exceeded their a priori limits in the two-factor After inventory, all in the Agentic Emotion Regulation factor; correlated residuals within a factor indicate that there is shared variance that is unaccounted for by the model (e.g., there is another latent factor influencing the data).
The DIC for the 14-item, two-factor After model was 19,097.742 and the DIC for the 14-item, three-factor After model was 19,096.132, which is lower than that of the two-factor model despite the three-factor model having 16 more parameters. Although the difference in DIC is small (1.610), the three-factor After model was considered the better fit to the data because none of its correlated residuals exceeded their a priori limits.

Model sensitivity
Sensitivity analyses for each inventory showed that the factor loadings and cross-loadings were relatively stable when specifying smaller (0.005) and larger (0.015) prior variances. With both smaller and larger prior variances, 100% of discrepancies across the three inventories were within ±0.05; the maximum discrepancy was -0.043 with the smaller prior variance and -0.029 with the larger prior variance.
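The discrepancy summary reported above can be reproduced in principle with a short comparison of two sets of estimates. The sketch below is illustrative only: the function name and the parameter values are hypothetical and are not the study's estimates.

```python
def sensitivity_summary(baseline, alternative, tolerance=0.05):
    """Compare estimates from two prior settings.

    baseline / alternative: {parameter name: estimate}
    Returns the discrepancy with the largest absolute size and the
    share of parameters whose discrepancy lies within +/- tolerance.
    """
    diffs = {name: alternative[name] - baseline[name] for name in baseline}
    largest = max(diffs.values(), key=abs)
    share_within = sum(abs(d) <= tolerance for d in diffs.values()) / len(diffs)
    return largest, share_within


# Hypothetical loadings under prior variances of 0.01 (baseline) and 0.005.
baseline = {"item1_on_F1": 0.70, "item1_on_F2": 0.10, "item2_on_F1": 0.82}
smaller_prior = {"item1_on_F1": 0.66, "item1_on_F2": 0.12, "item2_on_F1": 0.83}
largest, share = sensitivity_summary(baseline, smaller_prior)
print(round(largest, 3), share)
```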

Internal consistency, discriminant validity, and concurrent validity
Table 7 shows the latent factor subscale means, standard deviations, composite reliabilities and latent factor intercorrelations for the three G-SEAS inventories, the G-RTI, and the G-ACCSI. The composite reliability of each subscale exceeded 0.8 in every inventory, and the subscales within each G-SEAS inventory were all positively correlated. The G-RTI subscales had a weak inverse relationship and the Accidents and Close Calls subscales were positively correlated (0.61). None of the 95% CIs for interfactor correlations encompassed 1.00, thus supporting the discriminant validity of the subscales within each inventory (Anderson & Gerbing, 1988).
Table 8 shows correlations between subscale means, where the items were weighted with factor loadings. Sensation seeking of the Between participation G-SEAS was positively correlated with DRT, accidents and close calls and negatively with PB and age. Accidents and close calls were positively correlated with sensation seeking, which was also negatively related to both PB and age. Accidents and close calls were both positively correlated with DRT and negatively with PB. Age displayed negative correlations with all subscales except PB, with which it showed a weak positive relation. Difficulty with emotion regulation, as measured by the Between participation emotion regulation subscale, correlated positively with DRT, accidents, and close calls and negatively with PB and age.
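For readers unfamiliar with the two checks above, the following minimal sketch shows the composite reliability coefficient of Fornell and Larcker (1981) computed from standardized loadings, together with the rule that a 95% CI for an interfactor correlation should not include 1.00. Function names and all numbers are hypothetical; the error variances are assumed to equal one minus the squared standardized loading.

```python
def composite_reliability(standardized_loadings):
    """Composite reliability: (sum of loadings)^2 over itself plus summed error variance."""
    total = sum(standardized_loadings)
    # Assumes standardized indicators, so each error variance is 1 - loading^2.
    error = sum(1 - l ** 2 for l in standardized_loadings)
    return total ** 2 / (total ** 2 + error)


def supports_discriminant_validity(ci_lower, ci_upper):
    """True when the interval for an interfactor correlation excludes 1.00."""
    return not (ci_lower <= 1.0 <= ci_upper)


# Hypothetical subscale with three items loading 0.8 each.
cr = composite_reliability([0.8, 0.8, 0.8])
print(round(cr, 3))
# Hypothetical 95% CI for a correlation between two factors.
print(supports_discriminant_validity(0.52, 0.70))
```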

Discussion
The aim of this study was to validate three different German versions of inventories in high-risk sports. The BSEM analyses supported a good model fit in all three scales (G-SEAS, G-RTI, G-ACCSI). All subscales of each inventory showed good internal consistency and supported good discriminant validity. The correlations between the G-RTI and G-ACCSI scales yielded results similar to those shown in the original paper (Barlow et al., 2015). While in the development of the original RTI and the ACCSI the relationship with the SEAS inventories was not tested, Barlow et al. (2015) tested the relationship between the original RTI, ACCSI, and alexithymia (i.e., difficulty describing, feeling, or identifying emotions). The relationships between alexithymia and the RTI and ACCSI were similar to the relationships between the Between emotion regulation subscale and the G-RTI and G-ACCSI subscales. These correlations suggest that a difficulty with emotion regulation is associated with higher deliberate risk-taking (i.e., the positive correlation of the G-SEAS Between emotion regulation subscale with DRT and its negative correlation with PB; Table 8). Deliberate risk-taking in the G-RTI correlated with the experience of Sensation Seeking in the G-SEAS as much as DRT in the RTI correlated with the Brief Sensation Seeking Scale (Hoyle, Stephenson, Palmgreen, Lorch, & Donohew, 2002) in the study by Barlow et al. (2015). This suggests that there is a link between the motivations for participation in high-risk sport and participants' behaviors.
The factor structure of the original scales was replicated in the G-RTI and G-ACCSI (Barlow et al., 2015; Woodman et al., 2013), with similar factor loadings in the G-ACCSI and higher factor loadings in the G-RTI. Similarly, the Between and While inventories of the G-SEAS replicated the factor structure of the original SEAS; Sensation Seeking, Emotion Regulation and Agency were shown as separate constructs. This supports the view of a multidimensional construct of motivation in high-risk sports. In the analysis of the After inventory of the G-SEAS, a three-factor BSEM was a better fit to the data than a two-factor BSEM. This contrasts with the original SEAS After inventory, for which a two-factor structure was found, with satisfaction of sensation needs and agentic emotion regulation transfer as the two factors.
Four possible reasons for the difference in factor structure between the SEAS and G-SEAS are as follows: differences in analytical methods, translation bias, cross-cultural differences, and differences in sports included in the samples. In contrast to the SEAS analyses, the present analyses used BSEM, which allows more complex, and thus more realistic, models to be specified. BSEM was recently used by Niven and Markland (2016) to establish motivational factors in walking and was favored over the independent clusters model using a maximum likelihood approach to confirmatory factor analysis (ML-CFA ICM). The authors criticized the ML-CFA ICM approach because "ICM approach channels unspecified covariation between indicators through their factors, upwardly biasing interfactor correlations" (Niven & Markland, 2016, p. 97). This artificial inflation of interfactor correlations may be the reason why Barlow et al. (2013) rejected the three-factor model in the original SEAS After inventory. Given that emotion regulation and agency are conceptually related, a highly restrictive model (e.g., ML-CFA ICM) would be less suitable from a theoretical standpoint than a model that does allow for small cross-loadings and residual correlations (e.g., BSEM with weakly informative priors). We only analyzed the data using BSEM; therefore, we cannot conclude how, if at all, factor structures may have been different when using ML-CFA ICM. However, if these differences in the factor structure are due to differences in the analytical methods, it may be that BSEM could reveal a three-factor structure in the After inventory of the original SEAS data. Another possible explanation for the differences in items and factor structure is translation bias and/or cross-cultural differences.
Validation studies of translated health constructs repeatedly showed differences in factor structure and number of items (Hwang, Kim, Kim, Kim, & Ahn, 2013;Kim, DeCoster, Huang, & Bryant, 2013;Nagels et al., 2013). Cross-cultural differences were especially noticed between different ethnic populations, e.g., Korean and Greek (Hwang et al., 2013) or Hispanics and non-Hispanics (Sanchez & Vargas, 2016).
It is now evident that high-risk sport participants are not one homogeneous population (Barlow et al., 2015; Castanier et al., 2010; Woodman et al., 2010); therefore, it is possible that differences in the sports included in the sample populations may account for the differences in factor structures between the G-SEAS and SEAS. The sample used by Barlow et al. (2013) included skydivers, mountaineers and low-risk sport participants; the sample in this study included paragliders, mountain runners, freeride skiers, freestyle skiers, and mountaineers.
It was shown in the health surveillance of clinical populations that analyzing different clinical samples resulted in a different factor structure (Thaler et al., 2015). In the present study, participants from a wide range of high-risk sports have been included, going some way towards reflecting the heterogeneity of high-risk sport populations, thus, making the questionnaire usable for a large population.
The differences between the G-SEAS and the SEAS should not be considered a negative outcome, since the constructs showed a good model fit and the adjustments of the G-SEAS (shorter questionnaire, three factors for the After inventory) could be interpreted as improvements to the scale. Shorter questionnaires have higher response rates (Nakash, Hutton, Jørstad-Stein, Gates, & Lamb, 2006); therefore, response rates and adherence to the 14-item scale may be higher than to an 18-item scale. From a measurement perspective, the three-factor structure of the After inventory presented in this article identifies emotion regulation and agency as two distinct transfer mechanisms in the current sample. Barlow et al. (2013) showed, using the two-factor solution in the After inventory, that mountaineers experience a significantly higher agentic emotion regulation transfer than both skydivers and low-risk controls. While participating, only the experience of emotion regulation could distinguish between mountaineers and skydivers, which means that skydivers experienced feelings of agency while participating similar to those of mountaineers. This raises the question of whether skydivers experience an agency transfer effect, which could not be shown in the study by Barlow et al. (2013) due to confounding results of emotion regulation and the limitation of a two-factor structure. Duration of the activity may be more important to the transfer of emotion regulation than to the transfer of agency. Freeride skiing comprises elements of thrill seeking (using lift-supported access for the activity) as well as a prolonged activity (using ski touring/hiking for the activity). Frühauf et al. (2017) reported that freeriders appreciate the fact that they are in charge of what they do and that they are not forced to follow strict rules (i.e., experience agency) and for some, this was crucial to their wellbeing (e.g., transfer effect of agency).
The three-factor model in the G-SEAS After inventory might contribute to a better understanding of transfer effects from high-risk sports by offering the possibility to distinguish between agency and emotion regulation transfer effects in the German-speaking population for future studies using the G-SEAS.

Strengths and limitations
First, one might mention a lack of validation with established measures (e.g., the Sensation Seeking Scale). However, because the concurrent validity of the original inventories has already been established in relation to such measures (Woodman et al., 2013), validating the three scales against one another was seen as sufficient. Second, both the exclusion of SEAS items and the structural validation were done in the same sample. This approach might result in biased estimates for structural validation. Future studies using the G-SEAS might consider calculating indices for internal consistency and factorial structure. As discussed, there are several strengths resulting from the differences in the factor structure of the G-SEAS; nevertheless, a limiting factor is that, because of those differences, the G-SEAS and the original SEAS cannot be used to directly compare German- and English-speaking populations. This study has provided evidence for a three-factor structure for each of the G-SEAS scales. Without collecting new data in both German- and English-speaking populations to carry out a measurement invariance analysis, it is not possible to make direct comparisons between data collected using the SEAS and G-SEAS. We would recommend that this analysis be carried out using BSEM, as the differences in factor structures of the SEAS and G-SEAS may be due to differences in analytic methodologies. Therefore, we recommend that the G-SEAS be used in German-speaking populations and not be used to make direct comparisons with data collected using the SEAS. Interscale correlations do not differ between the 14-item and the 18-item German version; this suggests that, despite missing four items, the G-SEAS still measures the same constructs. Another limitation, which is not necessarily tied to this research only, is the, in our opinion, problematic operationalization of high-risk sports.
Despite the definition we cited at the beginning of the introduction, we think future research might benefit from a more detailed definition including additional components related to skills and experience. Though this is beyond the scope of this article, it is worth mentioning that researchers in high-risk sports need to address this problem in future studies.
This study has a number of strengths, including the variation within the population; in addition to being of sufficient size, the sample included a number of different high-risk sports and participants from a variety of experience levels and of different ages and sexes, thus ensuring a heterogeneous sample population. The analytical methods used in this study should also be seen as a strength of this article. It has been shown that BSEM is a more appropriate method than traditional CFA methods as it better reflects the complexities of reality (Niven & Markland, 2016; Stenling et al., 2015).

Conclusion
The current study validated three different inventories for high-risk sports in German (G-SEAS, G-RTI, G-ACCSI) and showed good internal consistency, discriminant validity, and a good model fit in all scales. The BSEM analyses support a 14-item, three-factor structure of the G-SEAS despite the SEAS having an 18-item, three-factor structure for the While and Between inventories and a two-factor structure for the After participation inventory. The G-SEAS is seen as an improvement to the SEAS since shorter questionnaires increase response rates (Nakash et al., 2006) and a three-factor structure might help to distinguish between agency and emotion regulation transfer effects in future studies. However, a limiting factor of the differences in the factor structure is that the G-SEAS and SEAS cannot be used to directly compare German- and English-speaking populations with one another. Correlation analyses displayed relations between motivational and behavioral components of the high-risk sport activity, with Sensation Seeking showing the highest correlation with deliberate risk-taking. The present study validated the three different inventories in German, which is a first step towards the development of cross-cultural motivational and behavioral constructs in high-risk sport participants.