Introduction

In recent decades, the focus of study in psychology has expanded, with more attention being paid to positive functional aspects that are the basis of human well-being (Chou et al., 2013). The promotion of well-being has been associated with multiple physical and psychological benefits (Hoen et al., 2013; Czekierda et al., 2017). Moreover, well-being has been positively related to physical health and the implementation of healthy behaviors, especially in young people (Frisch, 2000). In the educational context, higher levels of well-being are associated with fewer behavioral problems in the school environment (McKnight et al., 2002) and a greater locus of control, self-esteem, and intrinsic motivation (Huebner, 1991). In fact, recent meta-analysis found positive correlations between academic performance and well-being (Bücker et al., 2018). However, some studies have shown that this relationship is low and inconsistent (Topham & Moller, 2011). Anyway, well-being and academic achievement are considered central indicators of positive psychological functioning in education systems (OECD, 2015). Similarly, several studies have shown that promoting employee well-being in the educational context is important. For instance, Kern et al. (2014) found that school staff with higher engagement and better relationships reported greater job satisfaction and organizational commitment.

Most of the proposed well-being models use a combination of hedonic and/or eudaimonic domains (Steptoe et al., 2015). The hedonic perspective of well-being (also called subjective well-being) includes, on the one hand, positive and negative affective aspects and, on the other hand, cognitive aspects, including overall satisfaction with life and specific domains such as education, health and housing (Diener et al., 1999; Vázquez et al., 2015). Eudaimonic well-being focuses on the development of natural or innate potential (Jongbloed & Andres, 2015) and stems from the implementation of actions consistent with personal values that imply a full commitment to oneself (Ryan & Deci, 2001). This movement toward the development of one’s own maximum potential generates benefits not only for oneself but also for society (Waterman, 2008).

Currently, there is growing interest in measuring individual and community well-being (Helliwell et al., 2020). Improving quality of life has become a priority for many social, academic and political sectors (Layard, 2013). This interest has favored the development of tools to measure the well-being of the population (Layard, 2013).

PERMA Model

Seligman (2011) proposed the PERMA model, a renewed and expanded version of his previous model of multidimensional well-being (Seligman, 2002). The PERMA model identifies 5 elements: positive emotions (P), engagement (E), relationships (R), meaning (M); and accomplishment (A). This integrative theory proposes that optimal well-being requires the presence of both hedonic and eudaimonic dimensions. To make the PERMA model measurable, Butler and Kern (2016) developed the PERMA-Profiler. The English version was validated with a large international sample (N = 31,966), resulting in a final set of 15 items (3 items for each dimension of PERMA). In addition, 8 items were added to evaluate negative emotions (NEs, 3 items), physical health (H, 3 items), loneliness (1 item) and happiness (1 item). Validation studies of the PERMA-Profiler indicated adequate reliability and validity of the scale (Butler & Kern, 2016). Confirmatory factor analysis showed that a five-factor model fit the data adequately and that all 5 factors were reliable. This short valid instrument based on the well-known Seligman’s model has become one of the most commonly used tools in well-being research (Donaldson et al., 2021).

Currently, adaptations and validations of the PERMA-Profiler scale are being developed in different cultures. In general, previous studies report adequate indices of internal consistency. Regarding the factorial structure of the questionnaire, various structures are reported. For example, while the adaptations for Malaysia, Australia, Ecuador and Colombia report between 1 and 3 factors through exploratory analysis, confirmatory factor analyses corroborate a theoretical structure of 5 factors in the adaptations for Indonesia, India, Japan, Turkey, Germany, Greece, Chile, Venezuela, and Italy. Regarding population groups, the PERMA-Profiler has been validated in samples composed of students in the United States (Umucu et al., 2020), Turkey (Bülbül, 2018), Italy (Giangrasso, 2021), Indonesia (Hidayat, 2018), India (Singh & Raina, 2020), Chile (Cobo-Rendón et al., 2020) and Venezuela (Cobo-Rendon et al., 2021). Few studies have used adult samples, such as research conducted in Germany (Wammerl et al., 2019), Greece (Pezirkianidis et al., 2021) Australia (Ryan et al., 2019), Malaysia (Khaw & Kern, 2015), Japan (Watanabe et al., 2018), and Ecuador (Lima et al., 2017), and institutionalized seniors, in a Colombian study (Hernández-Suárez et al., 2018). PERMA-Profiler has been also validated for college students with disabilities (Tansey et al., 2018).

Although there are studies in which the PERMA-Profiler has been validated in Spanish (i.e., Ecuador, Chile, Venezuela and Colombia), the reliability, validity and factorial invariance of the PERMA-Profiler in other Spanish-speaking countries such as Mexico has not yet been proven. One of the main limitations of the previous versions of PERMA is that they were not created in universal Spanish, that is, with the intention of being valid for most Spanish-speaking countries. It is worth-mentioning that, beyond the linguistic differences between Mexico and other Spanish-speaking countries, these cultures vary in their levels of well-being and their determining factors (Helliwell et al., 2019). Also, these Spanish adaptations show difficulties regarding certain items. This is the case of E1 (i.e., ‘How often do you become absorbed in what you are doing?‘) and E3 (‘How often do you lose track of time while doing something you enjoy?‘) included in the Engagement factor. These items showed poor reliability in the translated versions (i.e., Chile, Venezuela, Colombia), probably due to the double meaning (positive and negative) in Spanish. Moreover, convergent and discriminant validity of PERMA-Profiler have been poorly examined. It is still unknown how PERMA-Profiler is related to other measures of general well-being widely used in Spanish-speaking countries. Similar to other validations (Butler & Kern, 2016; Wammerl et al., 2019), convergent validity may be tested using other well-being measures such as the Positive Emotions Scale (Watson et al., 1988), or the Satisfaction with life Scale (Diener et al., 1985), while discriminant validity may be tested using negative functioning variables, such as Negative Emotions Scale (Watson et al., 1988) or Fixed Mindset Scale (Dweck, 1999). As some well-being researchers have recognized, it is important to demonstrate that general well-being and other related constructs are relatively independent (Donaldson et al., 2021). Moreover, although there is empirical evidence of the underlying PERMA structure for gender or age groups in different languages (eg., Pezirkianidis et al., 2021; Wammerl et al., 2019), the equivalence of the structure and factor loadings have not yet been explored in Spanish.

Therefore, more exhaustive analyses are needed to examine previously unanalyzed psychometric properties in the Spanish-speaking population (e.g., convergent and discriminant validity between variables), based on larger samples (the sample sizes of the studies conducted in Ecuador, Colombia, Chile and Venezuela were 1247, 130, 1462 and 202 participants, respectively), with a greater age range (the study conducted in Ecuador focused only on adult populations, the study conducted in Colombia focused only on institutionalized seniors, and the study conducted in Chile focuses only on adolescent students).

The Present Study

The PERMA model has been considered a useful framework for assisting clinicians (Carr et al., 2021) or school (Waters, 2011; Wingert et al., 2020) and work leaders (Larsen et al., 2021) to design well-being interventions. Thus, having a brief tool that evaluates these dimensions in the Spanish-speaking population is essential. The general objective of this study is to evaluate the psychometric properties of the PERMA-Profiler in a sample of students and employees (i.e., teachers and administrators) in a Mexican school environment. To address the general objective, the following specific objectives were established: (1) to determine and confirm the factorial structure of the measure; (2) to evaluate the reliability of the scale and subscales; (3) to analyze the content, convergent and discriminant validity; and (4) to explore the measurement invariance across gender and age. Given the mixed evidence on the correlation between well-being and academic achievement, the relationships between academic performance and the different PERMA dimensions were also examined. On the other hand, due to inconsistencies among previous studies regarding the factorial structure of PERMA-Profiler, it is not possible to establish a specific hypothesis with respect to which model will yield the best fit.

Methods

Participants

Two different samples were used in the study. Sample 1 was composed of 23,723 students (51% women). Sampling consisted of a non-probability sampling method via the use of a voluntary sample of students enrolled in high school programs, professional or master’s degrees offered by a private Mexican university. Outliers were detected in the data on PERMA-Profiler and 36 students were excluded from the final sample. The ages of the participants in sample 1 ranged from 13 to 68 years (M = 21.50; SD = 8.10). Sample 2 was composed of 2,783 employees (i.e., teachers and administrators) from the same university (59.3% women); the ages of the participants were greater than 20 years (M = 36.08). Tables 1 and 2 provide the details of these samples.

Table 1 Sociodemographic characteristics of the sample 1 (N = 23,723)
Table 2 Sociodemographic characteristics of the sample 2 (N = 2,783)

Measures

Participants in sample 1 completed the PERMA-Profiler, the Positive and Negative Affect Scale (PANAS), the Satisfaction with Life Scale (SWLS), the Pemberton Happiness Inventory (PHI), the Growth Mindset Scale (GMS), and the Physical Health Survey (SF-12). Participants in sample 2 completed the PERMA-Profiler and PHI. In addition, the average grades of the students were used as an indicator of academic performance. The evaluation protocol developed is described below.

PERMA-Profiler (Butler & Kern, 2016). The total scale consists of 23 items (15 items that evaluate the 5 dimensions of PERMA plus 8 filler items). Respondents answer using a Likert scale (0 = ‘never or not at all’ to 10 = ‘always or completely’). Mean scores were analyzed. The levels of internal consistency reported by the original study (Butler & Kern, 2016) range from 0.60 to 0.95 for the different dimensions and the total scale.

Positive and Negative Affect Scale (PANAS; Watson et al., 1988; Spanish validation by Lopez-Gomez et al., 2015). The PANAS includes 2 subscales with 10 items that evaluate positive affect (PA) and negative affect (NA). Respondents answer using a Likert scale (0 = ‘Nothing’ to 4 = ‘Always’). In the present study, the reliability was α = 0.80 for the PANAS-PA and α = 0.85 for the PANAS-NA.

Satisfaction with Life Scale (SWLS; Diener et al., 1985; Spanish validation by Vázquez et al., 2013). The SWLS is composed of 5 items that evaluate life satisfaction. For this, the degree of agreement with each statement is indicated using a 7-point Likert scale (1 = ‘strongly disagree’ to 7 = ‘strongly agree’). Scores can range from 5 to 35 points, with higher scores indicating greater life satisfaction. In the present study, Cronbach’s alpha was 0.85.

Pemberton Happiness Index (PHI; Hervás & Vázquez, 2013). The PHI consists of 21 items, with 11 related to remembered well-being and 10 related to experienced well-being. The remembered well-being score is generated using a Likert scale, and the experienced well-being is assessed through a dichotomous response (yes/no), generating a score of 0–10 based on positive and negative experiences. In the present study, Cronbach’s alpha was 0.83.

Growth Mindset Scale (GMS; Dweck, 1999). The GMS consists of 16 items scored using a 6-point Likert scale (1 = “strongly agree” to 6 = “strongly disagree”). It has 2 subscales: growth mindset and fixed mindset. This scale has been previously used in Mexican populations, showing good reliability values (Ortiz Alvarado et al., 2019). In the present study, Cronbach alpha was 0.82 for growth mindset, and 0.82 for fixed mindset.

Short-Form Health Survey (SF-12; Ware, Kosinski, & Keller, 1996; Spanish validation by Vera-Villarroel et al., 2014). The SF-12 consists of 12 items that assess the intensity and/or frequency of the health condition of the respondent. The score ranges from 0 to 100, where higher scores indicate higher health-related quality of life. Scores for physical health (SF-P) and mental health (SF-M) can be extracted from the SF-12. The reliability of the scale presents levels above 0.70.

Academic performance: Student’s academic performance was determined by the average grade obtained in the semester of the survey. The grades ranged on a scale from 0 to 100 points. The mean performance was 86.93 (SD = 5.83).

Translation Process

The translation of the PERMA-Profiler was based on the international guidelines for cross-cultural adaptation of self-report measures (Beaton et al., 2000). The translation process consisted of the following 5 stages: initial translation, synthesis of the translation, back-translation, expert validation, and testing of the pre-final version. A committee of two bilingual experts and two psychologists participated in the back-translation process for the PERMA-Profiler scale, evaluating the conceptual and semantic equivalence until reaching a consensus. The items that showed less consensus were E1 (‘How often do you become absorbed in what you are doing?’) and E3 (‘How often do you lose track of time while doing something you enjoy?’). After the preparation of the first translation and its application in a preliminary study with 1,499 students, the reliability of the subscales was satisfactory, with the exception of engagement (E) (α = 0.48). This index improved when item E3 was eliminated. The expert committee suggested that some participants may interpret ‘losing track of time’ with a negative connotation. Therefore, a new translation for item E3 was proposed (i.e., “How often does time go by very quickly when you are doing something you enjoy?”). E1 was also nuanced (‘How often do you become absorbed, or are you into, what you are doing?’). This final translation of E1 and E3 differed from the Spanish (Mexican) version used in the Butler & Kern (2016) study. The questionnaire was administered to a sample of 96 students. The reliability of the engagement factor improved substantially (α = 0.77). Therefore, this translation was used for the study. This preliminary sample was not included in the subsequent analysis.

Procedure

The evaluation protocol was applied online through Qualtrics. This study was approved by the ethics committee of the university and the Institute for Happiness Sciences, a research center of the university. Before answering the survey, participants signed an informed consent form. At the beginning of the academic year, the university requests parental consent for participation in academic surveys of any student under the age of 18. The survey was voluntary, and the participants did not receive any remuneration for participation. The survey was part of a broader evaluation protocol aimed at determining satisfaction with the university. All participants in sample 1 completed the PERMA-Profiler and another additional questionnaire, which was administered randomly among the participants (i.e., the PANAS, SWLS, PHI, GMS, or SF-12). This strategy ensured that the length of the evaluation protocol was adequate. The number of responses to each scale were: 4,625 for PANAS; 4,787 for SWLS; 4,692 for PHI; 4,817 for GMS; 4,802 for SF-12). Regarding sample 2, all participants completed the PERMA-Profiler and the PHI.

Data Analysis

First, a descriptive analysis was performed to examine demographic information for sample 1 and 2. To explore the differences in PERMA scores by age or gender, descriptive analyses of the PERMA-Profiler for sample 1 were also performed.

Next, to evaluate the factorial validity of the PERMA-Profiler, exploratory and confirmatory factor analysis was performed using Exploratory Structural Equation Modeling (ESEM). The ESEM takes into account both the exploratory and confirmatory factor analysis and it considers significant cross-factor loadings (Asparouhov & Muthen, 2009). Models fit and parameter estimates were evaluated based on the ESEM only; fit indices and parameter estimates for confirmatory factor analysis (CFA) models were presented for information. ESEM and CFA were carried out with sample 1 (N = 23,723) and sample 2 (N = 2,783).

To test for normality, an index for acceptable limits of skewness and kurtosis of ± 2 was used (Field 2009). Mardia’s coefficient was 1,195. Maximum likelihood estimation with robust (Huber-White) standard errors (MLR) was used as an estimator to address the potential results bias generated through non-normally distributed data (MLR; Yuan & Bentler, 2000; Finney & DiStefano, 2013). The global goodness-of-fit of the models is acceptable if (1) the comparative fit index (CFI) and Tucker-Lewis index (TLI) are ≥ 0.90; and (2) the root mean square error of approximation (RMSEA) and standard root mean square residual (SRMR) are ≤ 0.06 (90% CI ≤ 0.10; Hu & Bentler, 1999). Research on the validity of fit indices has indicated that the statistical power of the chi-squared test is highly dependent on sample size, where “in large samples, virtually any model tends to be rejected” (Bentler & Bonett 1980, p. 588). For this reason, the evaluation of the global model fit using the 4 goodness-of-fit indices is critical. For model comparisons of the competing models (i.e., the 1-factor, 2-factor, 3-factor and 5-factor structures), we used the goodness-of-fit indices and the Bayes information criterion (BIC), where smaller values indicate a better model fit (Raftery, 1995).

Subsequently, to evaluate the internal consistency of the questionnaire, Cronbach’s alpha values were calculated for the total score and for each factor score of the PERMA-Profiler in sample 1. The item-test relationship was also analyzed.

Content, convergent and discriminant validity between variables were examined using a comprehensive protocol of well-being measures. First, PHI was included (Hervas & Vazquez, 2013). This measure has been widely used in Spanish-speaking countries and has not been previously used to evaluate the validity of the PERMA-Profiler. In addition, correlational analyses were carried out between the PERMA-Profiler and the other tools included in the protocol administered to sample 1 (i.e., PANAS, SWLS, GMS, and SF-12). A correlation analysis was also performed between the PERMA-Profiler and academic performance.

Finally, in order to test for measurement invariance, gender (males and females) and age group (i.e., under 18 vs. 18 or more) were taken into account. The procedure for multi-group CFA’s with increasingly restricted nested models was used (Kline 2015). We tested for the following restrictions: configural (M1; no restrictions), metric (M2; adding equal factor loadings), scalar (M3; adding equal intercepts), and strict uniqueness (M4; adding equal residual variances) measurement invariance. The pairwise comparisons were made with its less restrictive predecessor (i.e., M1 vs. M2, M2 vs. M3, etc.). For model comparison of these nested models, changes in the CFI as well as the RMSEA were used, where lack of invariance is defined as ΔCFI ≥ 0.01 (Cheung & Rensvold, 2002) and ΔRMSEA ≥ 0.015 (Chen, 2007).

The lavaan R package (version 0.6–2; Rosseel 2012) in RStudio (version 4.0.5; RStudio 2016) was used for ESEM, CFA and measurement invariance analyses. All other statistical analyses were conducted in SPSS 25.

Results

Descriptive Results

Table 3 shows the means and standard deviations of the PERMA-Profiler for sample 1 as a function of age and gender. The t test for independent samples indicated that there were no statistically significant differences in well-being by gender [t(23,440) = 1.67; p = .57]. On the other hand, analysis of variance did indicate significant differences in well-being by age group [F(3, 23,435) = 145.68, p < .001]. The post hoc Bonferroni analysis indicated significant differences among all groups, indicating higher levels of well-being for the older age groups. Table 4 shows the means and standard deviations of the PERMA-Profiler items.

Table 3 Descriptive data of the PERMA-Profiler total score by age and gender (sample 1)
Table 4 Means and standard deviations for PERMA-Profiler items (sample 1)

Factor Analysis

ESEM was conducted to explore and verify the factorial validity of the PERMA. Different factor structures were tested: the original structure of five factors (Butler & Kern, 2016), the 1-factor, 2-factor and 3-factor structures that have arisen in Spanish-speaking countries (Hernández-Suárez et al., 2018; Lima et al., 2017) Footnote 1. In addition to verifying the model fit using sample 1 (i.e., students), the fit of the models was tested using a different sample composed of professors and employees of the university (sample 2).

The goodness-of-fit indices are presented in Table 5. All models presented an acceptable fit; however, the 5-factor model yielded better indicators for both sample 1 and sample 2. Moreover, the Bayes information criterion (BIC) for the 5-factor model was slightly smaller, indicating a better model fit. Factor loadings are presented in Table 6. All factor loading in the CFA results were significant (p < .001) for both samples with values ranging from 0.46 to 0.86, thus meeting the values recommended by Brown (2015). Factor loadings in the ESEM results are meaningful when they are higher than 0.32 (Worthington & Whittaker, 2006). Factor correlations were < 0.85, as recommended by Brown (2015) (see Fig. 1).

Table 5 Model fit indices of the CFA and ESEM (sample 1 and sample 2)
Table 6 Factor loadings in the CFA and ESEM results (sample 1)
Fig. 1
figure 1

Schematic representation of ESEM model: five-factor model (sample 1). (Note. F1 = Positive emotions; F2 = Engagement; F3 = Positive Relations; F4 = Meaning; F5 = Achivement.)

Reliability Analysis

Table 7 shows the reliability coefficients for the subscales for sample 1 and 2. Cronbach’s alpha coefficients (α) greater than 0.70 are acceptable values. The 15-item scale had a reliability of α = 0.94 (McDonald’s Omega = 0.93) for sample 1, and α = 0.92 (McDonald’s Omega = 0.92) for sample 2, and the total PERMA-Profiler scale had a reliability of α = 0.90 (McDonald’s Omega = 0.89) for sample 1, and α = 0.91 (McDonald’s Omega = 0.91) for sample 2. The engagement dimension showed the lowest reliability value (α = 0.69 and 0.67, respectively). Moreover, Composite Reliability (CR) coefficient was examined (Table 8). All CR values were higher than the desirable critical value of 0.70 (Hair et al., 2010).

Table 7 Reliability PERMA-Profiler (sample 1 and 2)
Table 8 Average Variance Extracted (AVE), the Composite Reliability (CR), and the shared variances of the dimensions (Sample 1)

Validity Analysis

Firstly, convergent and discriminant validity between dimensions were assessed using the Average Variance Extracted (AVE) and the Maximum Shared Variance (Table 8). All AVE values were higher than the critical value 0.50 (Hair et al., 2010), except for engagement. This result indicate that the engagement factor has convergent validity issues (AVE < 0.05), which means that the three items that compose this factor do not correlate well with each other within the engagement factor.

Factors whose shared variance are lower than their AVE are characterized by adequate discriminant validity (Hair et al. 2010). Also, all AVE values were higher than the shared variance of the dimensions, except for positivity-engagement, positivity-relations, meaning - achievement. These results indicate poor discriminant validity between these dimensions, since a portion of each factor’s variance is better explained by items that mainly load on another factor.

To further evaluate the convergent and discriminant validity of the PERMA Profiler, the possible correlations between the PERMA-Profiler dimensions and other measures (i.e., PANAS, SWLS, PHI, GMS and SF-12) were examined for sample 1. As levels of skewness and kurtosis were high, spearman correlations were performed. Since there were significant differences in the well-being by age, age was included as a covariate in the analysis. The results are shown in Table 9.

Table 9 Descriptive statistics and bivariate correlations between the different instruments (sample 1)

First, the main factors of PERMA were strongly correlated with the PHI, SWLS and PANAS-PA, and slightly correlated with SF-Physical. Regarding the secondary components, NE showed a strong and positive correlation with PANAS-NA and a moderate correlation with fixed mindset. The health subscale was highly correlated with the SF-physical and moderately correlated with PANAS-PA, the PHI and the SWLS. In addition, the total PERMA-Profiler score was strongly correlated with PANAS-PA, the SWLS and the PHI and slightly correlated with SF-Physical and growth mindset.

Negative correlations were taken into account to clarify the discriminant validity between variables. The total PERMA-Profiler score and its principal components were slightly negatively correlated with NE, loneliness, PANAS-NA and fixed mindset.

Regarding the relationship between well-being and academic performance, significant correlations (p < .01) were observed, although of moderate size, between all the dimensions of the PERMA-Profiler and academic performance.

Measurement Invariance

In order to test the Five-Factor Model for measurement invariance, gender and age groups were taken into account (see Table 10). Regarding gender, two subgroups (i.e., male and female) were considered. The results demonstrate that the CFI differences between configural, metric scalar and strict invariance across gender were less than 0.01 indicating that factor structure, factor loadings, items intercepts, and residual variances were equivalent across gender. Regarding age, two subgroups (under 18 vs. 18 or more) were considered. Results showed that criteria for configural, metric and scalar measurement invariance were met. There was a non-significant decrease in model fit. Thus, the factor structure, loadings and item intercepts were equivalent across groups. However, the results showed that criteria for strict measurement invariance (i.e., equal residual variances) were not met. Although changes in the RMSEA were ≤ 0.015, ΔCFI was 0.029. Thus, residual errors were not equivalent across age groups.

Table 10 Model comparisons for measurement invariance of the Five-Factor Model regarding gender and age groups

Discussion and Conclusions

The main purpose of the present study was to evaluate the validity of the PERMA theory of well-being by developing and testing a translated and culturally adapted version of the PERMA-Profiler in a large Mexican student sample. For this, the psychometric properties of the adaptation of the questionnaire to Spanish were estimated, the reliability and validity of the questionnaire were tested, and the most appropriate factor structure for this population was confirmed.

First, in relation to the descriptive results, although there was no difference in well-being by gender, differences by age were found (i.e., older age groups showed higher well-being). Although many studies found that life satisfaction showed a U-shaped curve over the life span, with its low point around 50 years of age (Blanchflower & Oswald, 2008), this hypothesis has not yet been tested using other integrative well-being measures such as PERMA. Moreover, mean scores for each of the 15 items were higher than those found in other Latin American countries (such as Ecuador, Chile and Venezuela). The average scores obtained for each factor were much higher than those found in the original study and in the other PERMA-Profiler validations (i.e., Japan, India, Germany, and Australia). These higher scores in well-being in Mexico could be due, in part, to the so-called ‘Latin American paradox’, territories in which a much higher well-being is evident than would be expected due to their socioeconomic status (Macchia & Plagnol, 2019). The presence of protective factors commonly associated with well-being, such as social and cultural relationships and religiosity, could explain this paradox (Reyes, 2017). However, it should be noted that these higher well-being scores may also be due to the sample characteristics, since it was drawn from students and employees in private schools with higher socioeconomic status.

Regarding the factorial structure of the PERMA-Profiler, the original 5-factor model proposed by Butler and Kern (2016) yielded better indicators, with values within the ranges of scores accepted in the psychometric literature (Brown, 2015). These results confirm the original structure, coinciding with other validations in which CFA is used, such as in Chile, Venezuela, Greece, Indonesia, India, Japan, Turkey, Germany and Italy. Moreover, results support that the five-factor model is acceptable for both students and employees.

Focusing on reliability, the analyses indicated acceptable internal consistency for the 15 items of the PERMA-Profiler, similar to the original version (Butler & Kern, 2016) and to validations in other cultural contexts. This result supports the internal consistency of the Mexican translation.

Regarding the levels of consistency for the different dimensions of the PERMA-Profiler, the results were theoretically adequate for all dimensions, except for engagement (E). Despite having modified the translation of this dimension after the first preliminary study, the factor loading was still slightly lower than theoretically expected. This low reliability in the dimension of engagement is also found in some samples of the original study and in other validations (e.g., Pezirkianidis et al., 2021). Several studies indicate that these low levels of reliability could be due to the composition of the dimension itself, which includes cognitive, psychological and behavioral aspects (Lambert D’raven, & Pasha-Zaidi, 2016), and could be causing an internal fragmentation of the dimension. It is also possible that low levels of reliability are related to the inclusion of very different components in the same dimension (i.e., commitment, absorption, and dedication). Following this line, it is worth mentioning that AVE for engagement was slightly lower than 0.5. However, Fornell and Larcker (1981) suggested that, if AVE is lower than 0.5, but CR is higher than 0.6, the convergent validity of the construct is still adequate.

Regarding validity, there was a significant correlation among the main factors of the PERMA-Profiler, values higher than those reported in the original study, reaffirming the high correlation among the different dimensions of well-being. In any case, correlations among factors were lower than 0.85, indicating acceptable discriminant validity between dimensions (Brown, 2015). Positivity-engagement, positivity-relations, meaning-achievement showed poor discriminant validity. Only the Greek validation analyzed the shared variance, finding similar discriminant validity issues (Pezirkianidis et al., 2021). High cross-factor loadings in the five-factor structure may be taken into account or reworded in future analyses. For instance, discriminant validity improves when P2 (‘In general, how often do you feel positive?’) or A1 (‘In general, how often do you feel positive?’) are not included in the discriminant validity analysis.

On the other hand, the Latin American validations reported lower correlations between the dimensions of well-being and distress of the PERMA-Profiler than those of the original study, highlighting the need to evaluate them both independently in the Latin American context.

The results from this study show, for the first time, the relationship of the PERMA-Profiler with other relevant scales of well-being in the Spanish-speaking population. A high correlation was found between the total PERMA-Profiler score and the PHI. This measure of well-being, which includes hedonic and eudaimonic components, has high indices of reliability and validity. This strong correlation indicates that these two instruments measure very similar aspects, denoting high content validity.

Regarding convergent and discriminant validity between well-being variables, the high correlation with the SWLS is notable, implying a strong relationship with subjective well-being. However, the agreement between the 2 measures is not complete, suggesting that PERMA allows for a broader view than subjective well-being (Seligman, 2018). The PERMA model has influenced the design of multiple interventions in various contexts; therefore, having a valid and reliable multidimensional measure that fits this theoretical model can make a significant contribution.

There were strong correlations between positivity and PANAS-PA, between the negative emotions subscale and PANAS-NA, and between health and the SF-12, supporting the convergent validity of the subscales. Finally, while there was a positive correlation between the PERMA-Profiler and a growth mindset, there was no correlation with a fixed mindset, indicating the discriminant validity of the measure.

Focusing on academic performance, significant and positive correlations were observed, although slight, between this and all dimensions of the PERMA-Profiler. Moreover, academic achievement was negatively related to negative emotions. These significant correlations with academic performance coincide with the results reported for U.S. veterans, the only validation study of the PERMA-Profiler that analyzed the possible correlation (Umucu et al., 2020). This result is also consistent with what was found in a meta-analysis carried out by Bücker et al. (2018), who found a correlation with an average strength of r = .16 (varying between 0.11 and 0.22). Previous research has already shown that, while negative emotions negatively influence self-regulated learning, positive emotions are associated with better self-regulated learning, greater motivation and better grades (Mega et al., 2014). Well-being favors persistence in and commitment to academic tasks and helps students use more effective cognitive processing strategies (Tuominen-Soini et al., 2008). However, it is worth-mentioning that this relationship between well-being and academic performance can be bidirectional, where academic success could also lead to greater subjective well-being (Neubauer et al., 2018).

Regarding measurement invariance, the multi-group CFAs revealed strict measurement invariance of the Five-Factor Model for gender, indicating that the scores among men and women can be compared and interpreted meaningfully. On the other hand, the scale partially met criteria for measurement invariance regarding age. Thus, factor structure, factor loadings and indicator means can be safely compared between age groups. However, indicator residuals comparisons between age groups must be made cautiously. Despite this limitation, these are, to our knowledge, the first findings which explore measurement invariance for the PERMA-Profiler considering gender and age in Spanish-speaking nationalities. By investigating the scale’s measurement invariance, this study paves the way for further research on developmental differences in well-being.

There are several limitations that must be taken into account when interpreting the results. The first limitation relates to the sample of participants because the adaptation was carried out in the educational context. Although the age range was wide and the model was also confirmed in a sample with a different profession and age profile (teachers and administrative staff), it is possible that individuals with a lower level of education or lower socioeconomic level are not represented by this sample. In the future, it will be necessary to perform validation studies with samples of other ages, socioeconomic levels, etc., to expand the applicability of this scale in different contexts. Second, this is a cross-sectional study. For this reason, it is not possible to establish causality or directionality in the relationships with other well-being variables (e.g., academic performance). Therefore, longitudinal studies would be beneficial. Third, the reliability of the engagement factor was low. Although these data have been found in most validation studies and even in some samples of the original study, additional interviews should be conducted to evaluate the understanding of the items that compose this dimension, in addition to an investigation of cultural differences with respect to engagement.

Despite the limitations, this study allowed the validation of the Spanish version of one of the most commonly used tools in well-being research. Many interventions in the clinical, educational and work environments have been designed from the perspective of the PERMA model. Therefore, the adaptation and validation of the PERMA-Profiler to the Mexican context allows for a multidimensional tool of well-being that will facilitate the evaluation of well-being programs in the Spanish-speaking population, a tool that may be useful as an indicator of changes in positive functioning. Despite validation in other Latin American contexts, this study allowed the examination of new psychometric properties of the questionnaire (i.e., validity, measurement invariance) through a larger sample size and with a greater age range. This study confirms that the five-factor model is acceptable for different gender, ages and populations. In addition, this study incorporated in its evaluation protocol other measurement tools with a high degree of reliability already validated in Spanish, an aspect that was not taken into account in any of the Latin American studies. Considering the existence of contradictory results regarding the possible relationship between well-being and academic performance, this study allowed to expand the available evidence in a different cultural context.

The promotion of well-being is increasingly being sought in various contexts (i.e., educational, social, work). Measuring well-being in the university context helps address the needs of its constituents (Oades et al., 2011). Therefore, having innovative tools in an educational context that are reliable and easy to use for the Spanish-speaking population is essential to continue advancing the design of new interventions. In the future, it will be necessary to continue validating multi-component programs for the promotion of different dimensions of well-being in educational institutions and in society (Chaves & Kern, 2017).