1 Introduction

Throughout the past decade, the use of statistical modeling in the analysis of sport performance has gained a rapidly increasing interest. In the approximate 20 years, there has been a remarkable change in both the collection of sports data and the opportunities to address sports questions using statistical models. Methods for measurement and evaluation of player performance have become a relevant issue as such as the prediction of game outcomes or the study of experimental sports science data involving non-standard data structures collected through the use of recent technologies (Albert et al. 2016). Motion-tracking or remote sensor technologies, for instance, have made it possible to accumulate detailed information on player-level dynamics and athletes’ characteristics (Liebermann et al. 2002), and large data archives for sports have become easily accessible even to non-experts. Computational advances have also determined a growing number of applications using sports data for measuring players’ and teams’ abilities, and decision-making within a game (see Albert et al. 2016, and reference therein). Apart from the value of the performance indicators, used to categorize athletes’ and teams’ performance, recent contributions focused on why and how some behaviors emerge in performance contexts (McGarry 2009) and in the ways in which performance interacts with personality traits (Laborde et al. 2020). Therefore, recent studies investigated which types of personality have more success in sport (Allen et al. 2013), widening the classical perspective focused on the measurement of athletes’ abilities through tracking data. This led to the development of a branch of psychology—sports psychology—geared to study and explain performance and well-being of athletes in terms of their psychological traits. Researchers tried to define the personality of the successful athlete, comparing the personality test scores of lower performance athletes with those of higher performance ones, or exploring differences between professional and amateur athletes (Aidman 2007; Allen et al. 2011). Several studies investigated the association between personality and participation in organized sports, also accounting for gender differences (Allen et al. 2013; Malinauskas et al. 2014; Mckelvie et al. 2003; Paunonen 2003). Further analyses focused on athletes attending different sport types, showing differences in personality traits between team sport and individual sport athletes (Allen et al. 2011; Eagleton et al. 2007; Nia and Besharat 2010; Steca et al. 2018). Some contributions focused on interpersonal relationships in athlete–athlete and coach–athlete dyads and team functioning (Bell 2007; Jowett and Nezlek 2012; Rhind and Jowett 2011). These studies explored the role of dissimilarity in personality traits and the contribution of personality to team performance. Further approaches moved toward a deepening study of athletes’ mental strategies and skills (Olmedilla et al. 2018), often based on the study of one of the main theoretical framework for analyzing personality in sport, that is the five-factor model (McCrae and Costa 2008). Although the huge number of mentioned researches, based on different statistical contents, fewer studies have focused on the relationships among personality traits, specific athletic skills and strategies to enhance performance. The present paper is framed in this line of research and aims to identify personality traits which can be successfully used to assess the performance of athletes engaged in individual sports. In the study, the performance is measured by means of an overall observable variable which allows overcoming the drawbacks of the measurements obtained with game statistics solely (Piedmont et al. 1999). Furthermore, the identification of psycho-social athletes’ profile may update skill and training to improve the performance during competitions. The paper proposes a component-based structural equation modeling (component-based SEM) to analyze the relationships between some psychological constructs, measuring personality traits and mental skills, and a combination of indicators (composite), measuring sports performance. The main focus is on identifying personality traits and mental skills having the largest impact on sports performance. Rather than focusing only on objective behaviors, such an approach evaluates also variables not directly observable related to sport performance, such as cognition and affect, considering measurement error and measurement invariance, as well as the validity and reliability of the latent constructs. In the present paper, we consider data collected in 2019 for the STATSPO project (Statistical modeling and Data Analytics for Sports. Psychosocial aspects to assess the performance: the case of swimmers). The project was supported by University of Naples Federico II and Italian Swimming Federation ISF (Campania Regional Committee). Data consist of a small number of observations, due to the representative sample selected by ISF, along with a high number of both latent and corresponding manifest variables. This structure supports the choice of the component-based SEM as reference model for the proposed research as remarked in the next section. Specifically, the work is organized as follows. Section 2 presents methods for data collection and implementation in personality trait analysis. Section 3 provides a detailed description of the survey, the data and the constructs considered in the research. Section 4 presents the component-based SEM, the reference methodology for analyzing the collected data. Results, separately for the measurement and the structural model, along with the evaluation of the model reliability, are reported in Sect. 5. Finally, a discussion on the main results and the conclusions with some further research developments to be explored are included in Sects. 6 and 7, respectively.

2 Methodologies for personality traits analysis

To collect data relevant to the quantitative measurement of constructs like attitudes and personality traits, multi-item scales are among the most popular measures used in questionnaires. More in detail, constructs are theoretical concepts not directly observable that can be inferred from a series of observable indicators (MacCorquodale and Meehl 1948). An example in our analysis is the athletes’ personality, inferred by responses given to questions on behaviors and choices (Schweizer et al. 2020; Steca et al. 2018). According to the classical theory of measurement (Spearman 1904; Thurston 1947), the measurement of latent constructs almost always relies on the detection of a multivariate set of indicators of the construct itself. Since no single indicator can capture the overall theoretical meaning, classical theory exploits multiple measurements of the same latent concept. Many aptitude and personality tests, for instance, use questionnaires with a certain number of items which are combined to give a total score relative to some dimension of attitude or personality. When the phenomenon to be analyzed has a complex nature, as for the analysis of athletes’ personality, several multi-item scales are jointly used to measure the different aspects of the phenomenon. In some cases, causal connections are also hypothesized among the different scales either making reference to underlying theories or setting some research hypotheses (Simon 1954).

Regarding the statistical methods in sport psychology research, the most popular included regression analysis, mediation and moderation analysis, analysis of variance, cluster analysis, and SEM (Biddle et al. 2001); in recent applications, also more complex methods such as latent class analysis, latent profile analysis, and growth mixture modeling are being to be used (Myers et al. 2018). Among them, SEM is the reference methodology to deal with latent constructs interconnected by a network of causal relationships.

SEM is a class of models aiming to analyze the relationships between a set of latent variables (LVs), measured through multiple manifest variables (MVs) (Bollen 1989). SEM combines the principles of factorial analysis (Spearman 1904) with those of path analysis (Wright 1918). In particular, factorial analysis relates the observed variables (items) to the respective constructs. This part of the model is named measurement or outer model. Instead, path analysis captures the structural connections relating constructs to each other. Structural or inner model is the name used to refer to this part of the model. Two main approaches have been proposed in the SEM literature. They are distinguished by the different estimation procedure (Henseler et al. 2009; Tenenhaus et al. 2005): covariance-based SEM (Jöreskog 1978) and component-based SEM (Wold 1982). Covariance-based SEM estimates the model parameters by minimizing the discrepancy between the empirical covariance matrix and the theoretical covariance matrix implied by the model. Instead, component-based SEM estimates the parameters after determining the construct scores as linear combinations of the corresponding observed variables while maximizing proximity to the connected constructs. Covariance-based SEM commonly exploits maximum likelihood estimation (Bollen 1989), whereas partial least squares path modeling (PLS–PM) is the usual choice for component-based SEM (Tenenhaus et al. 2005; Wold 1982).

In the last two decades, there has been a heated debate among supporters of the two approaches (Henseler et al. 2014; Rönkkö and Evermann 2013; Rönkkö et al. 2016). Very briefly, advocates of component-based SEM have outlined some particular advantages in terms of greater flexibility. Covariance-based SEM poses indeed distributional assumptions (continuous and normal data), and special requirements about the number of indicators per construct (generally three or more) and the sample size (generally several hundred, Iacobucci 2010). Recently, the debate focused on the type of constructs considered in the model. On the one side, the traditional constructs, typical of behavioral sciences, that assume a latent variable underlying the set of indicators; on the other side, the artificial constructs (emergent variables), typical of business and social sciences, that emerge from the indicators (Henseler 2017). Personality traits are an example of constructs of the first type, business process performance of the second type. The two types of constructs give rise to different models. In the first case, the hypothesis is that the manifest variables are explained by a common factor, that is the latent variable, and a unique random error. For this reason, indicators have to be strongly correlated since they are expression of the same underlying LV. In the case of emergent constructs, a linear combination of indicators provides the constructs. Therefore, the indicators of emerging variables are not required to be correlated with each other. Latent variables are also called reflective constructs, while emergent variables are also called formative constructs if they are intended to be formed and not caused by their indicators (Henseler 2020). The nature of the constructs, latent vs emergent, establishes the choice between covariance-based SEM and component-based SEM, the former being preferred when the model includes only latent variables, the latter when both latent and emergent variables are considered (Benitez et al. 2020; Henseler 2017).

As discussed above, component-based SEM is the proper choice to test models that simultaneously encompass latent and emergent variables as in the presented contribution.

3 Data

The presented survey aimed to investigate the relationships among sports performance, personality and athletes’ mental skills. It involved a sample of young athletes enrolled in Italian Swimming Federation (Campania Regional Committee). A sample of 161 elite swimmers (from now on simply swimmers) was examined. Leading details on the sample are reported in Table 1.

Table 1 Summary statistics concerning the characteristics of the 161 respondents of the survey

Personality was assessed by using a list of 25 adjectives representative of the Big Five (B5) dimensions in the Italian lexical context (Barbaranelli et al. 2007; Caprara and Perugini 1994). The list consists of 5 adjectives for each of the five personality dimensions, namely Extraversion, Emotional stability, Openness, Agreeableness, and Conscientiousness. Participants were required to fill out the questionnaire indicating how appropriate each adjective is for describing themselves on a 5-point scale (not at all, slightly, moderately, quite a lot, at all). A previous study in sport psychology conducted by Steca et al. (2018) also used this self-report measure to assess athletes’ personality traits. Mental skills were evaluated using the sport performance psychological inventory (IPPS-48), made up of 48 items in which respondents state how often (from 1 = never to 6 = always) they describe their sporting experience (Robazza et al. 2009). IPPS-48 factors assess both cognitive and emotional aspects relevant to athletes’ performance. This scale measures the following eight dimensions: Self-talk, Goal setting, Self-confidence, Emotional arousal control, Cognitive anxiety, Concentration disruption, Mental practice, and Race preparation. Finally, in this contribution, the observable variable related to performance (P) takes into account win on competed races on a scale with five anchored options: 1 refers to no success in any competitions, 5 refers to success in all participated competitions. A detailed description of the scales and the analyzed traits/constructs is reported in Table 2, while Figs. 1 and 2 depict the divergent stacked bar charts for the B5 and the IPPS-48 dimensions, respectively. Each panel of the two figures refers to a given dimension, the corresponding items being on the vertical axis while the stacked bins represent the response rates associated with the different response modes. The bars in each panel are located with reference to the neutral point scale (moderately for the Big Five scale and 3.5 for the IPPS-48 scale). Therefore, in case the bar for a given item tends to lie in the right part of the plot, this denotes a percentage of respondents with points in the upper part of the correspondent scale. Inversely, in case the major part of the bar is located in the left part of the plot. Course, if respondents with low points balance respondents with high points, the bar is centered around the neutral point scale, depicted by the solid vertical line in each panel. In addition, the bins composing a single bar (greyscale/different colors) express the rate of respondents for the given response mode. Segments of the same level in greyscale/same color are comparable across items and panels. Inspection of Fig. 1 shows that the majority of indicators has a negative asymmetric distribution, highlighting a marked agreement for all. With reference to each dimension of the B5 scale, this underlines the consistency of the items with respect to the construct they intend to measure. On the other hand, Fig. 2 points out discordant dimensions of the IPPS-48 scale. In fact, the dimensions representing Cognitive anxiety and Concentration disruption present positive asymmetric items, i.e., a disagreement of respondents on them. The same type of graph is used in Fig. 3 to report the observable variable related to performance. In this case, the chart presents a rather uniform distribution of athletes in terms of performance.

Table 2 Description of the scales and related constructs used in the analysis
Fig. 1
figure 1

Divergent stacked bar chart for the five dimensions of the B5 scale

Fig. 2
figure 2

Divergent stacked bar chart for the eight dimensions of the IPPS-48 scale

Fig. 3
figure 3

Divergent stacked bar chart for the observable variable related to performance

4 Method

SEM consists of two submodels, as already stated in Sect. 2 namely a measurement (outer) model, which describes the relationships between each construct and the corresponding indicators, and a structural (inner) model, which describes the relationships between the different constructs. In turn, the structural model is composed of as many structural equations as there are the dependent constructs in the model. A construct that assumes the role of dependent variable in at least one of the structural equations is defined endogenous, while a construct that assumes only the role of predictor is defined exogenous. In SEM framework, a graphical representation called path diagram is commonly used to represent in a visual way the variables and the relationships in the models: ellipses are used to represent latent variables, hexagons for emergent variables, squares for indicators, and arrows for relationships. The latter originate in the independent variables and point to the dependent variable and are used to depict both the relationships between variables and constructs (measurement model) and between the various constructs (structural model). The directionality of the arrows is defined by the theory that explains the analyzed phenomenon or by new research hypotheses. Note that most common SEMs are those in which relationships do not create cycles, i.e., feedback effects, but all flow in one direction. These models are called recursive, and we will refer to them throughout this work. Figure 4 depicts the path diagram proposed in this paper, limited to the inner model; indicators (measurement model) are not included for reasons of readability. From now on, we will refer to this model as the STATSPO model. As evident from the path diagram, the model proposed for the assessment of athletes’ personality involves a high number of constructs, unlike what is typical in social and economic fields. This produces a dense network of relationships. The model assumes that both the exogenous constructs of the B5 scale and the endogenous ones of the IPPS-48 scale impact on the endogenous construct performance. Moreover, dependence relationships are also assumed between the two scales B5 and IPPS-48, with the latter assuming the role of mediator in the relationship between the B5 constructs and the P construct.

Fig. 4
figure 4

Path diagram limited to the structural model for the assessment of athletes’ personality (STATSPO model)

4.1 The model

The set of involved constructs can be formally represented by the vector

$$\begin{aligned} {\mathbf {Y}} = [Y_{1}, \ldots , Y_{n}, \ldots Y_{n+1}, \ldots , Y_{n+m} ]^\prime , \end{aligned}$$
(1)

where, without loss of generality, the exogenous constructs are placed at the first n positions, and the endogenous ones are positioned at the remaining m positions. The indicators, observed on the N statistical units and corresponding to the j-th construct, are denoted by \(X_{jh}\), where \(h=1, \ldots , p_{j}\), with \(p_j\) denoting the number of indicators in the construct j. Below we use the index i to refer to units (\(i=1, \ldots , N\)).

Following the conventional notation, the structural model can be formulated as:

$$\begin{aligned} Y_{j}= & {} \sum _{k=1}^{j-1}\beta _{jk}Y_{k} + \zeta _{j}, \ \ j=n+1, \ldots , n+m, \end{aligned}$$
(2)

where \(\beta \) denotes the so-called path coefficients, \(\zeta \) the error term, and where each endogenous construct \(Y_{j}\) can depend on one or more exogenous constructs \(Y_{1}, \ldots , Y_{n}\), or on some preceding endogenous construct. This is because the present work focuses exclusively on recursive models. In addition, we assume the prediction specification hypothesis, that is \(E(\zeta _{j}|Y_{1}, \ldots , Y_{j-1})=0\) (Wold 1982).

According to the recent literature, different types of measurement model can be distinguished (Bollen and Bauldry 2011; Henseler 2017), and the selected candidate depends on the type of construct considered. In sciences, the constructs hypothesize the existence of a latent variable, and the reference model is the reflective measurement model:

$$\begin{aligned} X_{jh}= & {} \lambda _{jh}Y_{j} + \varepsilon _{jh}, \ \ j=1, \ldots , n+m, \end{aligned}$$
(3)

where each indicator is assumed to be a manifestation of a common factor \(Y_{j}\) with an error term \(\varepsilon _{jh}\). The parameter \(\lambda _{jh}\) denotes the loading associated with the h-th manifest variable in the j-th construct. Also, in this model the prediction specification hypothesis is required, i.e., \(E( \varepsilon _{jh}|X_{j1}, \ldots , X_{jp_{j}})=0\). Alternatively, when the construct does not assume any latent variable but corresponds to a mixture of elements without any kind of causal relationship, then the composite measurement model is the choice:

$$\begin{aligned} Y_{j}= & {} \sum _{h=1}^{p_{j}}\pi _{jh}X_{jh}, \ \ j=1, \ldots , n+m. \end{aligned}$$
(4)

Here, the \(Y_{j}\) construct is composed as a linear combination of the \(X_{jh}\) indicators with weights \(\pi _{jh}\).

The STATSPO model represented in Fig. 4 includes both latent and emergent variables. The B5 and IPPS-48 scales measure the personality and mental characteristics of athletes and are therefore operationalized by reflective measurement model (3). In contrast, the performance is composed by a single item measuring win on competed races, and the composite measurement model (4) is employed.

4.2 The algorithm

Component-based SEM commonly exploits the PLS–PM algorithm as estimation method (Wold 1975, 1982). From hereinafter, the indicators are assumed to be centered. The algorithm starts with an initial phase where the weights for the scores of the constructs are obtained through an iterative procedure. The algorithm estimates then the parameters of the structural model (path coefficients) and of the measurement model (weights and loadings) (Lohomöller 1989). The main idea of PLS–PM is to obtain two alternative estimates of scores associated with each construct. One takes into account the measurement model (outer), and one takes into account the structural relationships (inner). In the first step, each construct is calculated as a linear combination of its own centered indicators using a set of arbitrary weights \(w_{j1}, \ldots , w_{jp_{j}}\) summing to 1:

$$\begin{aligned} {\hat{Y}}_{j}= & {} \sum _{h=1}^{p_{j}}w_{jh}X_{jh} \end{aligned}.$$

The first outer estimate of the construct scores is obtained. Then, the first inner estimate of each construct is obtained as a normalized linear combination of the outer estimations of the connected constructs:

$$\begin{aligned} {\hat{Z}}_{j}= & {} \sum _{k=1}^{n+m}\tau _{jk}{\hat{Y}}_{k}, \end{aligned}$$

where the weights \(\tau _{jk}\) can be defined according to different weighting scheme (see Lohomöller 1989 for more details). At this point, the algorithm updates the weights on the basis of two different modes, defined mode A and mode B:

mode A:

The weights are obtained through the simple regression of each indicator on the inner estimate of the corresponding construct.

mode B:

The weights correspond instead to the multiple regression coefficients of the inner estimate of the construct on the corresponding set of indicators.

The choice of the mode is strictly related to the nature of the measurement model: Mode A is more appropriate for a reflective model, i.e., for latent variables (Eq. 3), Mode B for a composite model, i.e., for emergent variables (Eq. 4). Once the weights have been updated, the algorithm iterates through the previous steps until the convergence criterion is satisfied:

$$\begin{aligned} \left[ \sum _{j}\sum _{h}\left( w_{ih}^{(s)}-w_{jh}^{(s-1)}\right) ^2\right] ^{1/2} \le \varepsilon \end{aligned},$$

where \(\varepsilon \) is an appropriately chosen positive convergence tolerance value. After convergence, the final weights \(w_{ih}^{s}\) are used to estimate the construct scores as a linear combination of the indicators:

$$\begin{aligned} {\hat{y}}_{ji}= & {} \sum _{h=1}^{p_{j}}w_{jh}^{(s)}x_{jhi}. \end{aligned}$$

The final scores are used to obtain the least squares estimate of the structural coefficients (\(\beta _{jk}\)) according to Eq. 2, and those of the measurement model (\(\lambda _{jh}\) and \(\pi _{jh}\)) according to Eqs. 3 and 4.

Note that in principle PLS-PM estimates composites and not common factors, so the scores obtained for the latent variables contain measurement errors. To overcome this issue, a variant called consistent PLS-PM (PLSc) (Dijkstra and Henseler 2015b) has been proposed to estimate reflective measurement model. However, the new approach could lead to not admissible solutions. This concerns the so-called Heywood cases (Krijnen et al. 1998), which occur when one or more variances implied by the model are negative. The small sample size is one of the causes related to the occurrence of Heywood cases. The STATSPO model, as previously described, has a very complex structure, with many constructs and a small sample size. The use of the PLSc is therefore not possible as some not admissible solutions are obtained. Therefore, we will consider Mode A to estimate the B5 and the IPPS-48 constructs. Although Mode A does not consistently estimate reflective measurement model, it provides weights that are proportional to the true correlations between the indicators and their common factor (Dijkstra and Henseler 2015a).

4.3 Model assessment

The PLS–PM is generally validated separately for the measurement and the structural model using a sequential approach: once the assessment of the measurement model is satisfactory, it is possible to move on to the structural one. The assessment phase follows different tracks for reflective models and composite models. The reflective measurement model assumes a latent variable responsible for the correlation structure among the indicators, and several are the aspects to be verified; among them:

  • Composite reliability: it checks the amount of random error contained in the construct scores, which is expected to be limited. The indicator \(\rho _{a}\) (Dijkstra and Henseler 2015b) is a valid measure for PLS–PM. A reasonable threshold is the value of 0.707, which indicates that more than \(50\%\) of the variance in the construct scores is explained by the corresponding latent variable.

  • Convergent validity: it represents the amount of the indicators’ variance explained by the latent variable. It is measured by the average variance extracted (AVE) index. A value greater than 0.5 is considered acceptable because it would mean that more than half of this variability is explained by the latent variable (Bagozzi and Yi 1988).

  • Indicators reliability: it measures the amount of variance presented in latent variable in terms of the contribution of each indicator. The loadings are good candidates to measure this issue (Hair et al. 2010). Since loadings in PLS–PM are standardized, a value higher than 0.707 indicates that more than \(50\%\) of the indicator variance is explained by the corresponding latent variable. It should be mentioned that slightly lower values are not a concern as long as the construct validity and reliability are assured.

  • Discriminant validity: it allows differentiation between the different aspects measured by latent variables. It can be measured through the heterotrait–monotrait ratio of correlations (HTMT) criterion (Henseler et al. 2015). A value lower than 0.85 indicates a good discriminant validity.

With respect to the composite measurement model, there are no stringent constraints on the correlations between indicators. The only aspects to be evaluated are the sign, size, and significance of the weights. It is also necessary to check for a possible multicollinearity among the indicators. In a composite model, weights are crucial as they determine the construct’s scores, just like a weighted sum of their indicators (see Eq. 4). Furthermore, since they are obtained through a multiple regression with the construct scores as the dependent variable and the indicators as the independent variables, multicollinearity must be checked. High levels of collinearity could produce unexpected signs of the coefficients or large confidence intervals. Furthermore, to determine if the indicators really contribute to compose the construct, it is necessary to test that they are significantly different from zero.

With respect to the structural model, the assessment is similar to that of any regression model. Therefore the sign, size, and significance of the coefficients must be evaluated and tested. In case of mediator variables, indirect and total effects have to be considered. The total effects correspond to the sum of the indirect and direct effects, where the latter are expressed by the path coefficients. Furthermore, the \(R^{2}\) index allows to evaluate the amount of variability of each endogenous construct explained by its predictor constructs. The \(Q^{2}\) index (Chin 2010; Hair et al. 2016) is a further measure of the predictive relevance of the model. It measures how accurately the model predicts the omitted data points of indicators in reflective measurement models of endogenous constructs and single-item endogenous constructs (the procedure does not apply for composite models). Values of \(Q^{2}\) greater than zero indicate highly predictive model for a specific construct, whereas values less than zero represent a lack of predictive relevance. Finally, for the significant effects, it may be useful to measure their effect size \(f^{2} = \frac{R^{2}}{1-R^{2}}\). Values of \(f^{2}\) greater than 0.35, 0.15, and 0.02 indicate strong, moderate and weak effects, respectively (Cohen 1988).

It is worth noting that all statistical inference in PLS–PM is based on bootstrap techniques, and in particular on percentile bootstrap confidence intervals (Aguirre-Urreta and Ronkko 2018). Recent developments propose a bootstrap-based test for the overall model fit (Dijkstra and Henseler 2015b; Henseler 2020; Henseler and Schuberth 2020). The null hypothesis is that the model fits perfectly, i.e., the variance–covariance matrix \({\hat{{\varvec{\Sigma }}}}\) implied by the SEM does not differ from the observed variables’ population variance–covariance matrix \({{\varvec{\Sigma }}}\). Since the latter is typically not available, the empirical variance–covariance matrix \({\mathbf {S}}\) of the observed variables is considered.

The discrepancy between the two matrices is measured using the squared Euclidean distance:

$$\begin{aligned} d_\mathrm{ULS}=\frac{1}{2} \mathrm{trace} ({\mathbf {S}}-{\hat{{\varvec{\Sigma }}}})^2 \end{aligned}.$$
(5)

An alternative measure exploits the geodesic discrepancy:

$$\begin{aligned} d_{G}=\frac{1}{2} \sum _{k=1}^{P} (\mathrm{log}(\varphi _{k}))^2 \end{aligned},$$
(6)

where \(\varphi _{k}\) is the kth eigenvalue of \({\mathbf {S}}^{-1}{\hat{{\varvec{\Sigma }}}}\) and P is the total number of indicators in the model. A discrepancy value larger than the \(95\%\) (or \(99\%\)) quantile of the corresponding reference distribution HI95 (or HI99) leads to rejection of the null hypothesis. The evaluation of the overall model fit of a saturated model, i.e., a model in which all constructs are allowed to be freely correlated, is used to assess the measurement and composite models’ validity. Moreover, the standardized root mean squared residual (SRMR) is the reference measure of approximate model fit. A SRMR value smaller than 0.080 indicates an acceptable model fit (Henseler et al. 2015).

5 Results for STATSPO data

The following subsections report the PLS–PM results, first for the measurement model, and then for the structural model. In particular, the measures of validity of the latent constructs for the measurement model are reported in Sect. 5.1, while for the structural model tests for the hypothesized relationship among constructs are provided in Sect. 5.2.

Statistical analysis was carried out using R version 4.0.2 (R Core Team 2020), the semPLS (Monecke and Leisch 2012) and the cSEM (Rademaker and Schuberth 2020) packages. Additional custom R routines were used for creating specific graphs to display the different outputs.

5.1 Measurement model results

The assessment of the measurement model starts with the evaluation of the overall fit of the model with the saturated structural model. Table 3 reports three discrepancy measures and both the \(95\%\) and \(99\%\) quantiles of their corresponding reference distributions. Only the \(d_{G}\) discrepancy measure is below the \(95\%\) quantile of the reference distribution (HI95), thus providing empirical evidence for the latent variables and the emergent variable inside the model. Moreover, the SRMR is below the suggested threshold of 0.080 (Henseler et al. (2014), indicating acceptable model fit.

Table 3 Evaluation of the overall fit of the saturated model

The quality of the reflective measurement model is then assessed through several measures aimed to evaluate reliability, convergent validity, and discriminant validity. Table 4 shows the average variance extracted (AVE) and the Dijkstra–Henseler’s rho (\(\rho _{A}\)). The constructs and the corresponding numbers of items are reported on the first two columns of the table. The value of \(\rho _{A}\) is greater than 0.70 for all latent variables, pointing at good internal reliability. The reflective measurement model also meets convergent validity, as indicated by the values of AVE. For all latent variables, indeed, AVE value is greater or close to 0.50.

Table 4 Reliability analysis

Indicator reliability can be assessed through Fig. 5 that depicts the results for the loadings: each panel corresponds to a block of the measurement model, the indicators are reported on the vertical axis, the values of the loadings on the horizontal axis. In each panel, the indicators are sorted in decreasing order according to their values. The points represent the estimates of the loadings, the segments the 95% bootstrap confidence intervals. The vertical line placed at 0 in each panel highlights the significance of all the loadings. Finally, a shaded area is used to mark values greater than 0.70. Figure 5 shows that all indicators are statistically significant. Moreover, all but B5_I12 report values greater than 0.50 (minimum threshold still considered acceptable, Hair et al. 2010).

Finally, we assess discriminant validity through the HTMT ratio. As shown in Table 5, all HTMT values are less than 0.85, and then we can conclude that discriminant validity is satisfied for all constructs in the model.

In sum, results provide evidence for the reliability and validity of all the latent variables we considered.

Table 5 Discriminant validity (HTMT)
Fig. 5
figure 5

Loadings (points) for the several constructs (panels) of the reflective measurement model. The indicators are reported on the vertical axis. In each panel, the indicators are sorted in decreasing order according to their values. Segments depict the 95% bootstrap confidence intervals. A vertical line is located at 0, and a shaded area highlights the zone in which loadings are greater than 0.70

As regards the assessment of the composite model, the multicollinearity, the weights, the loadings and their significance should be evaluated. However, the only emergent variable of the STATSPO model is the P construct, which is a single-indicator measurement (Diamantopoulos et al. 2012). In this case, the construct scores are identical to the standardized indicator values and the model assessment is then not required.

5.2 Structural model results

The evaluation of the structural model starts from the analysis of the overall fit of the estimated model, shown in Table 6. The \(d_{G}\) discrepancy value is below its corresponding HI95 value, indicating that the estimated model is not rejected at a \(5\%\) significance level.

Table 6 Evaluation of the overall fit of the estimated model

The results for the path coefficient estimates, their significance, and the coefficient of determination (\(R^{2}\)) are summarized in Fig. 6. The figure follows the same style of the previous representation of the loadings: each panel refers to an endogenous construct, and the predictor constructs are on the vertical axis, sorted in decreasing order according with their values. The points depict the path coefficients, and the segments the correspondent 95% bootstrap confidence intervals (CIs). Lighter segments/points, crossing the vertical lines at 0 are not statistically significant, darker segments correspond to significant coefficients. The values of \(R^2\) are reported in the subtitle of each panel.

Fig. 6
figure 6

Path coefficients (points) for the several endogenous constructs (panels) of the structural model. Segments depict the 95% bootstrap confidence intervals. The vertical lines located at 0 in each panel allow to visually catch the significance of the path coefficients: lighter segments correspond to nonsignificant path, darker segments to significant paths. The values of \(R^2\) are reported in the subtitle of each panel

As we expected, personality traits are related to athletes’ mental skills. In particular, the Extraversion dimension affects Goal setting (\(\beta = 0.44\), \(95\%\) CI = 0.22–0.56), Self-confidence (\(\beta = 0.50\), \(95\%\) CI = 0.35–0.65), and Race preparation (\(\beta = 0.31\), \(95\%\) CI = 0.06–0.46) in a positive way and the Concentration disruption (\(\beta = -\,0.32\), \(95\%\) CI = \(-\,0.52\) to \(-\,0.12\)) in a negative way. On the other hand, Emotional stability has a significant positive effect on Self-confidence (\(\beta = 0.31\), \(95\%\) CI = 0.20–0.43) and Emotional arousal control (\(\beta = 0.47\), \(95\%\) CI = 0.33–0.60), and a negative impact on Cognitive anxiety (\(\beta = -\,0.55\), \(95\%\) CI = \(-\,0.70\) to \(-\,0.40\)) and Concentration disruption (\(\beta = -0.17\), \(95\%\) CI = \(-\,0.40\) to \(-\,0.02\)). Openness affects more mental skills: Self-talk (\(\beta = 0.16\), \(95\%\) CI = 0.02–0.40), Cognitive anxiety (\(\beta = 0.22\), \(95\%\) CI = 0.01–0.39), and Concentration disruption (\(\beta = 0.25\), \(95\%\) CI = 0.05–0.42). Finally, Conscientiousness and Agreeableness have a less widespread effect: the former is only related to Mental practice (\(\beta = 0.22\), \(95\%\) CI = 0.08–0.38), whereas the latter to Self-talk (\(\beta = 0.22\), \(95\%\) CI = 0.03–0.41). The values of \(R^2\) for the IPPS-48 dimensions are reported in the subtitles of each panel of Fig. 6. They turn out to be good, ranging from 0.158 to 0.478, especially for Self-confidence (\(R^{2}= 0.48\)), Emotional aurousal control (\(R^{2}=0.38\)), Goal setting (\(R^{2}=0.28\)), and Cognitive anxiety (\(R^{2}=0.27\)).

Regarding the relationship between mental skills and personality traits on performance, Self-talk, Emotional arousal control, and Cognitive anxiety affect swimmer’s performance, accounting for \(18\%\) of the variance (\(R^{2}=0.18\)). In particular, Self-talk and Emotional arousal control have a positive effect on the performance (\(\beta =0.23\), \(95\%\) CI = 0.03–0.44, and \(\beta =0.29\), \(95\%\) CI = 0.07–0.54, respectively), whereas a negative effect is found for Cognitive anxiety (\(\beta =\,-\,0.23\), \(95\%\) CI = \(-\,0.44\) to \(-\,0.03\)). Contrary to our hypotheses, personality traits do not significantly affect performance directly (see results related to path coefficient). However, indirect effects (mediated by mental skills) and the total ones (direct and indirect) can be evaluated. This decomposition of the effects is depicted in Fig. 7: the dimensions are reported on the vertical axis in decreasing order according to the total effects (top bars for each dimension), the middle bars and the bottom bars represent the direct and indirect effect, respectively. Results in Fig. 7 show a relevant indirect effect for the Emotional stability trait (indirect effect = 0.25, CI = 0.12–0.41). In addition, the effect size of the significant path coefficients ranges from 0.02 (Openess \(\longrightarrow \) Self-talk) to 0.31 (Emotional stability \(\longrightarrow \) Cognitive anxiety), and thus, it is small-to-medium, according to the Cohen’s category (\(f^2\) index). Detailed information on the \(f^2\) values for all constructs is given in the supplementary material. The significance of the indirect effect highlights the role of mediator of mental skills in the relationship between Emotional stability and Performance. It is a full mediation (Henseler and Schuberth 2020; Nitzl et al. 2016) since the direct effect of Emotional stability is not significant (as evidenced by the confidence interval in Fig. 6), while the indirect one is. This means that mental skills absorb the direct relationship to change its direction, as evidenced by the positive sign of the total effect. The results in Table 7 report the decomposition of the total effect into direct and indirect effects and the decomposition of the indirect effect into the various components (partial indirect effects) relating to each of the mental skills. It is worth noting that the indirect effect through Emotional arousal control and Cognitive anxiety is substantial, implying that they totally mediate the effect of Emotional stability on Performance.

Fig. 7
figure 7

Decomposition of the effects on Performance for the personality traits dimensions. The top bars associated with each dimension (vertical axis) depict the total effects, the direct and indirect effects are represented using the middle bars and bottom bars, respectively. The dimensions are sorted in decreasing order according to their total effects on performance

Table 7 Effect estimates of Emotional stability on Performance

The \(Q^{2}\) index was calculated to measure the model’s predictive relevance for the indicators of the reflective measurement models of the endogenous constructs. The \(Q^{2}\) values are all greater than zero (Cognitive anxiety = 0.127, Concentration disruption = 0.048, Emotional arousal control = 0.152, Goal setting = 0.144, Mental practice = 0.047, Race preparation = 0.055, Self-confidence = 0.299, Self-talk = 0.056), thus highlighting a predictive relevance for all the reflective constructs examined.

6 Discussion

The study examined by component-based SEM methodology how the personality traits mostly affect swimmers’ mental skills and, in turn, the effect of mental skills on their performance.

Our results show that Self-confidence and Emotional arousal control are the skills strongly related to Big Five personality traits. As reported in Bellou et al. (2018), the lack of Emotional stability (also defined as neuroticism) generally causes low self-confidence levels. In our specific context, this means that the tendency to be emotionally reactive and insecure lead swimmers to feel not being up to gain expertise, achieve goals, and express their potential. On the other hand, experiencing positive emotions (being extravert) positively affects the swimmers’ confidence in their skills. Emotional arousal control is greatly influenced only by the Emotional stability personality trait. Therefore, swimmers who are able to manage stress and emotions also succeed in controlling competition stress and anxiety and channeling the concentration and energy on the performance. This finding is in agreement with the results in Petito et al. (2016). Regarding the other two mental skills related to emotional aspects, namely Cognitive anxiety and Concentration disruption, we found that both are related to the personality traits of Emotional stability and Openness. In particular, openness to experience can be defined as the tendency to seek new experiences: swimmers who enjoy new and exciting tasks tend to report a higher cognitive anxiety level and concentration disruption. As far as we know, these relationships have not been clearly explored in the literature. We can speculate that being fascinated by many different stimuli could not allow athletes to focus their attention on sporting goals, generating more anxiety for their performance and concentration problems. Conversely, swimmers with a low score on Emotional stability, and with a high level of apprehensiveness and predisposition to anxiety, tend to experience more cognitive anxiety and worry before and during competitions, and a more impaired concentration. These results are in line with those reported in Allen et al. (2013) and Petito et al. (2016). Concentration disruption is also influenced by Extraversion. This appears to be in contrast with the other evidence in sport psychology research (Allen et al. 2020). Indeed, in our sample, extraverted swimmers present a higher level of concentration. However, a study conducted by Kaiseler et al. (2019) showed that extraversion negatively predicts the use of distraction-oriented coping in sport. In other words, this means that when extraverted athletes cope with stressful situations (i.e., what we mean by coping strategy), they are less likely to shift their attention on non-sport-related aspects causing mental distraction.

Compared to emotional aspects, mental skills involving cognitive aspects are less influenced by personality traits. The frequency of inner dialogue recourse to instruct and motivate oneself, namely Self-talk (Van Raalte et al. 2016), is related to the personality traits of Agreeableness and Openness. In particular, swimmers with a more trusting and helpful way of thinking (Agreeableness) and an imaginative, creative, and flexible personality (Openness) are more prone to use self-talk to enhance their performance. These results are consistent with what was reported in Brinthaupt (2019) and Depape et al. (2006). In addition, Extraversion affects two cognitive aspects related to swimmer’s mental skills: Goal setting and Race preparation. Being determined, resolute, dominant, and energetic increases the swimmer’s ability to define adequate action plans and strategies to reach athletes’ goals, practicing them during their training. Finally, swimmers who are more organized, hardworking, and responsible (i.e., Conscientious) tend to simulate reality: they mentally reproduce their movements (Mental practice) to improve and refine their abilities. Other works exploring the relationships between personality and mental skills for other sports report similar results (Budnik-Przybylska et al. 2019; Judge and Ilies 2002). Furthermore, results highlight the role of mediator of mental skills in the relationship between Emotional stability and Performance.

As regards the latter, swimmers involved in this study are heterogeneous: 39% did not participate in many competitions and did not often win, about 22% reported a moderate success, whereas 40% participated in many competitions and had good performances. Our study suggests that three of the considered mental skills are particularly important to have successful performances in swimmer competitions. Two skills involve emotional aspects (Self-confidence and Cognitive anxiety) and one cognitive practice (Self-talk). Firstly, self-talking allows swimmers to focus on the present moment, avoid distractions, acquire new skills, modify incorrect automatic movements, and consequently improve their performance. Secondly, negative expectations about performance and concern about the possibility of failure negatively impact the performance, lowering swimmers’ possibility of success. Thirdly, swimmers able to manage stress and emotions channeling their energy on the performance reach also better results in very stressful competitions.

Finally, only indirect and positive effect of Emotional stability on Performance has been detected when mental skills were used as mediators between personality traits and performance. In such a case, no direct effect of personality on performance emerges from the research. The total effects suggest a high impact of Self-talk, Emotional arousal control, Cognitive anxiety, and Extraversion on swimmers’ performance. We also highlighted the specific mental skills particularly related to participation and success in swimmer competitions. From a practical point of view, we retain that these findings could be really useful to design strategies and interventions both for coaches and swimmers to maximize the performance and well-being.

7 Conclusion

The interest in the role of personality in sport has been increasing in recent years. Several studies have pointed out the effect of personality traits on athletes’ performance and success (see Allen et al. 2013; Allen and Laborde 2014; Steca et al. 2018 among the others). In contrast, fewer studies have focused the relation among personality traits and specific athletic behaviors, skills, and strategies to enhance performance. To fill this gap, our work provides evidence on what personality traits mostly affect swimmers’ mental skills and, in turn, the effect of mental skills on their performance. The current study advances an important line of inquiry studying the interdependence of psycho-social aspects and performance of athletes and providing evidence for a very specific athlete’s population (elite swimmers). Our sample’s homogeneity, very uncommon in research on organized sports (Steca et al. (2018), allows us to provide findings on an athlete’s category that has received little attention in the literature. Although this study addressed some relevant questions, it has its limitations. One limitation is that the data are about a sample of young swimmers; therefore, claims may arise on some specific questions concerning the personal sphere. It remains unclear whether the results are cyclically related and together are caused by such factors as individual difference characteristics and sociocultural variables, as well as other factors. Longitudinal and experimental research on different clusters could uncover these potential multi-layered variables. Another extension is about the possibility to consider further objective measures to evaluate the performance also with respect to the swimming style that may be considered in future studies. With reference to the last point, it would be interesting to investigate the role of coach and his/her contribution in objective and subjective performance-related outcomes and athletes’ skills or well-being (e.g., win-loss record, personal bests, technique acquisition, competence, conflict). Finally, future research should extend the main findings in both other sports and contexts.