Introduction

Assessing Quality of Life (QoL) has become crucial for delivering effective health and social care services to the elderly population (Van Leeuwen et al., 2019). Numerous studies have examined the intricate relationship between QoL and potential influencing factors using a variety of cohort datasets. For instance, these investigations have effectively measured the impact of familial support (Andrade et a., 2020), or medical occurrences like the diagnosis of cancer (Geessink et al., 2017) on QoL. The concept of QoL however is multifaceted, posing challenges in its measurement (Barofsky, 2012; Netuveli & Blane, 2008). Thus, focusing on a population sharing similar physical, mental, and social characteristics, such as the elderly, can enhance the consistency of QoL measurements (Netuveli & Blane, 2008; Williams, 1977). This approach ensures that any detected changes in QoL accurately mirror shifts in the underlying concept, rather than being influenced by variations in measurement techniques (Vandenberg & Lance, 2000), a critical aspect of clinical and public health research (González-Blanch et al., 2018).

One of the QoL scales, CASP-19, consists of 19 items organized into four domains - Control, Autonomy, Self-realization, and Pleasure. Developed from Maslow’s psychological theory(Maslow, 2013), CASP-19 aims to assess the QoL of individuals aged 65 to 75 (Hyde et al., 2003). Despite numerous large datasets being utilized to investigate CASP-19 through the application of its second-order common factors, inconsistencies have arisen in the results (Sexton et al., 2013). In response, Wiggins et al. (2008) proposed a more concise version of CASP-19, comprising 12 items, which more dependably captures QoL than the original scale. Oliver et al. (2021a, b) reported several versions of the CASP-12 structure factors. Limited efforts however have been made to report the measurement consistency of CASP-12. Measurement invariance refers to the consistency in how an instrument is interpreted among various groups of individuals (Vandenberg & Lance, 2000).

In many studies, researchers often forget to consider measurement errors when looking at Quality of Life (QoL) among different groups. This oversight can lead to biased results (Lin et al., 2016). Health research commonly compares subgroups to help with clinical decisions (Lin et al., 2016). It is therefore crucial to consider how personal factors might affect how people respond to different situations, including when using CASP-12.

It’s important to remember that older adults are not all the same. Factors like retirement, losing a spouse, dealing with chronic illnesses, and financial struggles can affect them differently. Also, as people get older, changes in their lives may lead to different priorities and motivations. That’s why age is a key factor in understanding the quality of life for older adults (Scott et al., 2022). Gender, age, and education level also play a role in how older individuals perform on language tests (Lin et al., 2016; Snitz et al., 2009). Life circumstances, such as women being more likely to be widowed, can influence how they respond to assessments of their quality of life (Ko et al., 2019; Scott et al., 2022). Often, studies group together personal details like gender and education when studying older people (Johnson et al., 2013; Lin et al., 2016).

Additionally, a person’s wealth affects various aspects of life, like getting good healthcare, staying active, and eating well. Having less money may sometimes mean sacrificing important things, like spending time with family. This trade-off means that people with different levels of wealth may see what makes a good quality of life in different ways (Kagan, 2022).

The authors have found no previous studies which explored the uniformity of the common second-order factor structure of CASP-12, encompassing four factors, across diverse groups in England. As a result, the current study has two aims: (1) to test three second-order structure factors of the CASP-12 using data from the English Longitudinal Study of Ageing (ELSA), and (2) to investigate whether the selected CASP-12 structure remains consistent across five variable groups: gender, age, educational levels, wealth, and two time periods.

Methods

Dataset

ELSA is a longitudinal research project which collects information from private households of individuals aged over 50 years residing in England (Steptoe et al., 2013). ELSA currently features nine waves of data collected over an 18-year period (Banks et al., 2021).

Measurements

Five categorical variables were selected for investigation, namely gender, age, education, net wealth, two time periods. Three age groups were compared: 50–59, 60–69, and 70–90 years. The reason for selecting these age groups was to assume participants within these ages share common perspectives for QoL. The participants were categorized into three education levels (high, average or foreign and low), following the approach of Alattas et al. (2023). The participants were categorized into three wealth levels (rich, average and poor), following the approach used in a previous study (Alattas et al., 2023. To increase the analysis power, new participants were included from wave one to wave nine and then classify them into two period groups: wave one participants and wave two through wave nine participants. We compared two time periods, W1 vs. w2 to W9, to see whether the perspective of the scale, CASP-12, between participants who joined at the first wave and those who joined at wave 2 onwards is invariant or not. Note that with this split, sample size for each group is comparable.

The CASP-12 instrument comprises 12 items that aim to evaluate an individual’s QoL through four domains: control, autonomy, pleasure and self-realisation (Wiggins et al., 2008). The participants were asked a series of questions and rated their responses using a four-point scale from 0 to 3 where 0 indicated ‘never,’ 1 ‘not often,’ 2 ‘sometimes’ and 3 ‘often.’ Any questions that had negative wording were given a reverse score. Reverse scoring flips the direction of the numerical scoring scale. This means that a score of 0 would be assigned to “often”, while “sometimes” would get a score of 1, “not often” would receive a score of 2, and “never” would be assigned a score of 3. The total score was obtained by adding up the responses to each question. A higher score indicated a better QoL. Table 1 presents all 12 items of the CASP-12.

Table 1 CASP-12 items with their domains

Analytical Sample

Figure 1 shows the steps taken to select the analytical sample. A total of 18,679 participants were included from the ELSA study across nine waves. They were selected based on complete information available for four variables, including gender, age (between 50 and 90), education level and net wealth. Participants were excluded if no information was available for any 12 items of the CASP-12. Additionally, repeated measures for included participants were removed as the measure with the least missing values of CASP-12 was kept. The final sample size after exclusions was 17,221 participants.

Fig. 1
figure 1

Procedure for selecting the analytical sample

Statistical Approach

First, the summary statistics for the sample and five determinants were examined. Then a graphical analysis was conducted to explore the relationship between the participants age and the total scores of the CASP-12 as well as its four domains using regression splines. To achieve this, the bs function in R was utilized which employs B-splines for the analysis (Hastie, 1992).

Since we are interested in QoL as one component, Multiple Group Confirmatory Factor Analysis (MG-CFA) was conducted for several second-order structure factors for CASP-12. The requirements applicable to establish the second-order factor of the selected CASP-12 model are: (a) lower (first) order factors are highly correlated, and (b) there is a higher order factor that accounts for their relations (Chen et al., 2005). Thus, three second-order structured factors and a first-order structure factor (CASP-12-FO) were conducted. The three second-order structure factors are as follows: the first is a common second-order factor with four first-order factors (control, autonomy, pleasure, and self-realization), denoted here CASP-12-4D. The second is a common second-order factor with three first-order factors (CASP-12-3D), so the Control and Autonomy domains had combined. The third is like CASP-12-3D but with 11 items (CASP-11-3D), which item (A3) has been removed. The reason for deleting this item is that its loading factor value was less than 0.40, as shown in the results section.

Since the response values of the CASP-12 items are greater than two and all of the items have a moderate or high skewness, they were treated as continuous variables and the robust maximum likelihood was used to estimate the parameters of confirmatory factor analysis (CFA) (Robitzsch, 2020). In our analysis, we opted for the Robust Maximum Likelihood (RML) method over the Satorra-Bentler scaled chi-squared statistic. This decision was influenced by the limitation of the latter in handling incomplete data, as noted by Savalei and Rosseel (2022). Furthermore, RML is deemed more appropriate for datasets exhibiting moderate deviations from normality, as indicated by Li (2016).

The question of whether to treat response values on a scale as continuous or ordinal is a subject of ongoing discussion. According to (Robitzsch, 2020) treating response values ordinally may impose a normal distribution on latent factors, potentially introducing inaccuracies in empirical applications. Given our substantial sample size, any distinctions between different methods may be practically negligible and thus considered inconsequential.

A MG-CFA was undertaken to ensure that measurements are consistent of the CASP-12 across five variable groups: gender, age groups, educational levels, net wealth and two time periods. Sequentially, five levels of measurement invariance (MI) were tested following Chen et al. (2005), and each level introduced more equality constraints across groups with the consideration for second- and first-order factors. Configural invariance tests the equivalence of the fixed and free factor loadings pattern. Weak invariance tests the equivalence of factor loadings across groups for first-order loadings and second-order factor loadings in two separate steps. Meanwhile, strong invariance, additionally to weak invariance, examines the equality of intercepts of indicators and means for first-order latent factors in two separate steps.

Three fit indices were considered: robust chi-square distribution with a degree of freedom (df), robust comparative fit index (RCFI) and robust root mean square error of approximation (RRMSEA). The Chi-square test is impacted by sample size, meaning that as the sample size grows, the test becomes more responsive to even minor variations between the correlation matrix of observed values and the correlation matrix of expected values. Alternatively, RCFI or RRMSEA were used to assess the goodness of fit. The CFI and RMSEA range from 0 to 1, and the values of 0.90 (acceptable fit) or 0.95 (good fit) are used as cut-points for the CFI while 0.06 (good fit) or 0.08 (acceptable fit) for RMSEA (Schumacker & Lomax, 2012). To compare five constrained models, the ΔRCFI criterion was used due to its lower sensitivity to sample size. Consequently, if there is a change in RCFI of less than 0.01 along with a change in RRMSEA of less than 0.015, the models can be considered to be comparable (Chen, 2007).

If the selected factor structure of CASP-12 did not meet the criteria for strong invariance among the five variable groups, the participants would be divided into subgroups based on the variable group with the poorest MG-CFA fit indices. Each subgroup would then be tested separately for MI. Due to missing observations in the sample, we selected an observation with the least missing values from the CASP-12 items for each participant. Also, we used all available information for participants with some missing values in the CASP-12 items by applying the FIML method. Full information maximum likelihood (FIML) was used to deal with missing data with parameter estimates being calculated using all of the available information (Cham et al., 2017). The analysis was performed in R and all of the CFA models were estimated using the lavaan version 0.6–12 package (Rosseel, 2012).

Results

Table 2 presents the distributions of gender, age, educational level, net wealth and two-time periods.

Table 2 Summary characteristics of the sample

Out of the total sample, 54% were women. Half of the sample (51%) were aged between 50 and 59. Two-thirds of the sample consists of participants who are educated at a high or average level. Both wealthy and poor participants each represented 40%, with those classified as ‘average’ wealth accounting for 20%. More than half of the sample (57%) were from wave 1, while 43% participants were from waves 2 to 9.

Figure 2 illustrates the association between the total scores of CASP-12 and its four domains with age. The number of participants who have complete data on CASP-12 items is 16,816. Figure 2(a) indicates a negative relationship between the total scores of CASP-12 and age for individuals aged 70 and above. As shown in Fig. 2b and e, participants began to lose control and self-realization when they reached this age. However, their autonomy gradually increased (Fig. 2c). Among the CASP-12 domains, the pleasure domain had the highest scores, with an average score close to 9 (upper score) for most ages.

Fig. 2
figure 2

The relationship between the CASP-12 score and several group variables using regression splines; only the participants with no missing values in the CASP-12 (N = 16,816)

A CFA was conducted on three common second-order structure factors of CASP-12 as well as four first-order factor as shown in Table 3. The results showed that CASP-12-4D and CASP-12-FO achieved an excellent fit (RCFI = 0.95 and RRMSEA = 0.067) for CASP-12 and (RCFI = 0.95 and RRMSEA = 0.066). The standardized parameter estimates for both factor structures were displayed in Fig. 3.

Table 3 Model fit statistics of Models CASP-12, CASP-12-3D, CASP-11-3D and CASP-12-FO
Fig. 3
figure 3

Standardized factor loadings for the CASP-12-4D (left side) and for the CASP-12-FO (right side). All estimates p < 0.01. C, A, P and S refer to the items for each domain of the CASP-12 that are shown in Table 1

Figure 3 shows that there is a higher (second) order factor that accounts for four lower (first-order) factor relations and four lower (first-order) were of the CASP-12 highly correlated. Thus, CASP-12 is applicable.

A second-order MG-CFA test of the CASP-12-4D was conducted separately for five variable groups: gender, age groups, educational levels, net wealth and time periods. Following this, a series of rigorous model tests was undertaken for five levels of invariance (configural, two weak invariance levels, and two strong invariance levels). As demonstrated in Table 4, RCFI value of the CASP-12-4D is around the benchmarks for a good fit as well as RRMSEA values for all five variable groups. Next, although RCFI value of CASP-12-4D met the invariance up to a second-order strong level for gender and two-time periods (ΔRCFI < 0.01 and RRMSEA < 0.015), the instrument failed to achieve that across age groups, educational level and net wealth. Note that the value of ΔRCFI exceeded the cutoff (0.01) between the weak and strong levels for these three variable groups. The value of ΔRRMSEA did not exceed the cut-off (0.015) except for age groups.

Table 4 Five levels of multi-group confirmatory factor analysis (MG-CFA) for the second-order of the CASP-12-4D grouped by gender, age, educational level, net wealth and two-time periods

Next, we tested the second-order MG-CFA for three subsamples separately based on their age: 50–59, 60–69 and 70–90 years old because the MG-CFA fit indices were the poorest for age compared to the other variable groups that did not meet the requirement of the strong level (education and net wealth). Two age categories were defined within each age group. Table 5 shows CASP-12-4D’s fit indices for all the three samples. For younger participants, CASP-12 exceeded the benchmarks for a good fit for all five variable groups (RCFI > 0.95 and RRMSEA < 0.060) while the fit indices for the older and oldest participants achieved the acceptable fit. (RCFI > 0.90 and RRMSEA < 0.080). Except for the net wealth variable group (ΔRCFI > 0.01), CASP-12 is consistent across all variable groups up to strong level for second- the first-order factors. In the oldest participants, the ΔRCFI for the first-order strong level of the two age categories (70–79 vs. 80–90) is equal to the cut-off value (0.01). In addition, the net wealth groups were reduced to two (rich/average vs. poor) since the estimated variance of the self-realization domain was negative for participants with average net wealth. The value of Δ RRMSEA of the net wealth variable group did not exceed the cut-off (0.015) in all five test levels for all subsamples.

Table 5 Five levels of multi-group confirmatory factor analysis (MG-CFA) for the second-order factor of the CASP-12-4D in three separate samples based on age, grouped by gender, two age groups, educational level, net wealth and two-time periods

Discussion

This study evaluated the short version of the CASP-12 through three second-order structure factors and reported that the common second-factor with four first-order structure factor (CASP-12-4D) and the common first-order structure factor had the highest fit indices compared to the other structure factors. Additionally, CASP-12 was consistent across genders and time periods meeting the strong second and first-order invariance criterion. However, it displayed (second and first-order factors) weak invariance in terms of age, educational levels, net wealth, and two-time points, but not strong invariance. The consistency of CASP-12-4D across gender, two age categories, education levels, and time periods was improved and met (second and first-order factors) strong invariance by considering three separate age groups (50–59, 60–69 ad 70–90). Net wealth still fails to meet the strong invariance levels, which might reflect the different perspectives of QoL between wealthy and poor people. Studies have reported inequality among them in several aspects, such as healthcare, living conditions, and engaging in leisure activities that often enable wealthier individuals in aged communities to experience a better quality of life compared to poor people who attempt to have basic needs most of the time (Steptoe & Zaninotto, 2020).

CASP-12-4D could serve as a valuable tool for healthcare providers as well as social research by considering two domains of QoL (Control and self-realization) especially for those aged 70 and over. Also, by considering peoples’ age and net wealth, the accuracy of interpreting CASP-12 score differences enhances. While there is no considerable difference in between rich and average participants (not detailed here), poor participants experience a more pronounced impact. Consequently, it is essential to conduct a sensitivity analysis if the participants’ wealth status varies.

While the short version of the CASP (CASP-12) is generally favoured over its longer counterpart (CASP-19), the literature reveals various factor structures for the short version(Oliver et al., 2021). These structures underwent international modifications to enhance their psychometric robustness (Oliver et al., 2021a, b). In our study, the CASP-12-4D and CASP-12-FO models exhibited superior performance, aligning with similar findings in European populations that shared limitations in the autonomy domain (Oliver et al., 2021a, b). Note that different factor structures were deemed optimal in other European studies. For example, CASP-12 with three domains (control/autonomy, self-realization, and pleasure) was found to fit better in some instances (Sim et al., 2011; Wiggins et al., 2008). In a separate study, CASP-11 emerged after removing one item from the autonomy domain, as it demonstrated the lowest correlation coefficient (Hamren et al., 2015). A majority of these studies were conducted across various waves in the Survey of Health, Ageing, and Retirement in Europe (SHARE) datasets (Oliver et al., 2021a, b). with only a limited number focusing on the English population (Sim et al., 2011; Wiggins et al., 2008), often presenting CASP-12-3D. Notably, information on CASP-12-4D is relatively scarce in these studies, and the sample sizes tended to be small. Considering these factors, CASP-12-4D and CASP-12-FO may be preferable for use among older individuals in England, aligning well with the theoretical background of the four domains in the scale (CASP-12-4D). Their superior goodness-of-fit indices in a larger English population suggest their suitability for comprehensive assessments of QoL in England.

Although previous studies that tested the measurement invariance for CASP-12 is limited, and they only reported the partial measurement invariance, for example, study by Oliver et al. (2021) reported partial invariance of the CASP-12 across different age groups. In relation to wealth and its influence on Quality of Life (QoL), our findings align with those of other studies (Kim & Park, 2015; Wingen et al., 2021). It is consistently observed that individuals from higher socioeconomic backgrounds tend to experience elevated status and enhanced Quality of Life.

It is essential to report several difficulties while conducting longitudinal measurement invariance for CASP-12-4D. The ELSA dataset contains nine waves, but participants’ on average respond to four waves. Moreover, the time between the two points is quite long, two years. In addition, an initial result indicated that the model identification (RCFI > 0.90) decreases with increased time points (waves). Therefore, there is a need to carry out further work to address these challenges.

The study possesses notable strengths, including substantial sample size and the inclusion of five distinct demographic groups for assessing second-order CASP-12-4D measurement invariance. Nonetheless, limitations exist. While the ELSA dataset contained several variables, we only focused on four variable groups, namely gender, age, education, and wealth. This decision was made due to more missing values in the other variables in subsequent waves.

In summary, the common second-order structure factor with four first-order factors (CASP-12-4D) shows higher fit indices than the other common second-order structure factors used in other studies. Except for wealth, second-order CASP-12-4D within three separate age groups demonstrates second and first-order strong consistency across diverse demographic groups, including gender, two age categories, educational level and two-time periods increasing the accuracy of the instrument measure when used in social and public health research applications. Further work is needed to test the longitudinal measurement invariance analysis of the second-order CASP-12 model.