Introduction

The world’s global population is graying. The UN Population Division projected the number of people over age 60 would increase from just under 800 million in 2011 to just over 2 billion in 2050 (Bloom et al. 2011). The aging population in developing countries is estimated to increase more than 2.5 times, compared with a 71% increase in developed countries (Lux and Scherger 2017). According to last census of Iran population conducted in 2011, about 8.23% of people in Iran aged 60 years and over (Statistical Center of Iran 2011). As the trend in many countries around the world, Iran started to experience graying of population as well (Shirazikhah et al. 2012; Amini et al. 2013) and there will be a rise in the rate of growth of the elderly population after 2015, so that its size will double between 2015 and 2030 (Mehryar and Ahmad-Nia 2004).

The increased life expectancy means that people have an opportunity to live a longer life. However, maintaining the quality of life in these extended years and its subjective proxy measure namely life satisfaction would be a major concern in the elderly population as well for gerontological research.

Life satisfaction has been defined as “a global evaluation by the person of his or her life”(Pavot et al. 1991). It has been described as the subjective expression of quality of life (Fernandez-Ballesteros et al. 2001).The psychological well-being and life satisfaction, as its fundamental component, is of great importance at older ages (Parker et al. 2008).

Life-satisfaction has also been used to evaluate the effects of social policies particularly those that aim to improve the quality of life. It can be used as an outcome measure for assessment of the interventions (Veenhoven 1996).

Life satisfaction is a broad concept and has been measured by using different instruments. One of the instruments that measure life satisfaction specifically in older adults, considering the entire domain of this concept, is the “Life Satisfaction Scale (LSS)”. The LSS is a revised version of “Life Satisfaction in the Elderly Scale” developed by Salamon and Conte in 1984. This is a longstanding measure of life satisfaction with optimum psychometric properties (Salamon and Conte 2003). The LSS has been proved to be convenient and user-friendly to administer. Studies of psychometric properties of LSS indicate that it is an appropriate instrument, both clinically and theoretically (Salamon 1988).

Despite the importance of the concept of life satisfaction in the elderly, there is no valid and reliable scale in Persian language regarding this concept. Therefore, the present study was conducted to translate the original version of the LSS into Persian and to evaluate the psychometric properties of the Persian version of this scale in Iranian older adults.

Materials and Methods

Procedure

The present study is a methodological research through which the life satisfaction scale has been translated into Persian and the psychometric properties of the Persian version of LSS has been evaluated.

The LSS, a background variables questionnaire including demographics (ages, sex and educational level), and the Persian version of the Short Form Health Survey (SF-36) (Montazeri et al. 2005) as a questionnaire of a related construct were administered to 334 volunteer Persian speaking elderly people living in Tehran, Iran. The participants were selected from general public using convenience sampling based on inclusion criteria. They were community-dwelling older adults who were members of the elderly centers of Tehran municipality, and members of the elderly retirement centers. The appropriate sample size was determined based on Sample Size for Factor Analysis Method that suggested 300 provides a moderately good quality for factor analysis (Comrey and Lee 1992). Among the total number of 450 survey packets that were successfully delivered, 400 were returned. 66 questionnaires were considered invalid because more than one item in a subscale were missing or the overall number of missing items was more than 20%. In addition, 70 older adults agreed to participate in the test-retest reliability study. Among them, we selected 50 people who didn’t report any recent unexpected event that might influence their judgement about their own life satisfaction. To assess the test-retest reliability the questionnaire was re-administered to 50 participants (20 female, and 30 male) a month later. The eligibility criteria for participation in this study were adults with 60 years of age and older, literacy (at least 9 years of school education), and normal cognitive function (based on the results of clock drawing test as a cognitive function screening tool) (Aprahamian et al. 2009). After selecting eligible elderly and explaining the general nature of the study, the participants who signed the consent form for participation completed the LSS and SF-36 as self- report questionnaires. Furthermore, a single question with 5 point Likert-type scale responses concerning the overall satisfaction with life was asked. The completion of the questionnaire took about 20–25 min.

Translation Process

After obtaining permission from the developers of the LSS, the scale was translated from its original source in English into Persian, according to the International Quality of Life Assessment (IQOLA) project guidelines (Bullinger et al. 1998).

Two translators who were native speakers of Persian, translated the LSS into Persian according to the standard IQOLA protocol. The two translators working independently, produced two preliminary translations of the LSS. Next, together with the researchers, the translators compared individual translations and then agreed on a common translation. Then, another native Persian translator evaluated the quality of the common translation, considering factors including clarity, common language use, conceptual equivalence and acceptability and modified the translation of items, if it was necessary (Bullinger et al. 1998).

Finally, an American bilingual translator translated the forward translation back into English to produce the backward version. The conceptual equivalence of the backward translation to the original version was verified by the developers. Then during the cognitive debriefing interview, 20 older adults were asked to complete the questionnaire to find any obscure, confusing and difficult items (Bullinger et al. 1998).

The Instruments

Life Satisfaction Scale

The LSS is a multi-factor questionnaire which has been developed by Salamon and Conte to assess the life satisfaction. The LSS was developed primarily as a scale to measure life satisfaction in older adults by considering the whole domain of life satisfaction indicators. The LSS consists of eight subscales derived from 40 items with a five points Likert type scale responses. The eight subscales of the LSS include: daily Activities, meaning, goals, mood, self-concept (positive self-concept), health, finances, and social contacts (Salamon and Conte 2003). Each subscale of LSS consists of five items. The items of LSS were designed in a summative Likert format with a five point range. Therefore, the total score of each subscale may range from 5 to 25. For the complete LSS, total scores can range from 40 to 200 (Salamon 1988).

The Short Form (36) Health Survey (SF-36)

SF-36 is a 36-item, self-administered measure of health-related quality of life. It consists of eight subscales including physical functioning, bodily pain, role limitations due to physical health problems, role limitations due to emotional problems, emotional well-being, social functioning, energy/fatigue, and general health perceptions. Scores for each subscale range from 0 (poor health status) to 100 (good health status) (Ware Jr and Sherbourne 1992). The Persian version of SF-36 has been previously validated in Iran (Montazeri et al. 2005).

Clock Drawing Test

The Clock Drawing Test (CDT) is a simple neuropsychometric measure that can be easily administered to assess cognitive functions especially in elderly (Cahn et al. 1996; Brodaty and Moore 1997). The CDT score provides reasonably good screening test for dementia (Aprahamian et al. 2009). In a standard way of the clock drawing test the person is asked to draw the face of a clock, put all the numbers in correct positions, and set the hands at “10 after 11” (Hubbard et al. 2008). There are two general clock drawing test scoring approaches, including qualitative and quantitative scoring systems (Nair et al. 2010).

Data Analysis

The structural equation modeling (SEM) is a powerful method for validating a measurement in the field of psychology (Yu and Hsu 2012). In this statistical methodology, structural theory bearing on some phenomenon, is analyzed by taking a confirmatory approach (Byrne 2001). Confirmatory factor analysis (CFA) is a technique of structural equation modeling which is used to assess the goodness of fit between a hypothesized model and the data gathered from the study samples (Kline 2011).

The partial least squares approach to SEM (or PLS path modeling) is a powerful data analytical tool applied in different fields of research.The PLS could be used as a first step to find the existing relations and to analyze the real world data (Rosipal and Krämer 2006). PLS can simultaneously test the structural model (relationships between constructs) and the measurement model (relationships between indicators and their corresponding constructs) to ascertain the overall psychometric properties of the scales, and the important relationships among the variables (Barclay et al. 1995; Hulland and Business 1999). PLS path modeling has the advantage of not assuming the normality of data distribution (Bagozzi 1994a, b).

In the present study, a confirmatory factor analysis followed by PLS analysis was performed, using the software SmartPLS2.0, to analyze the validity of the model’s constructs and the relationships between the constructs. Construct validity evaluated through the presence of two important aspects of measurement model; convergent and discriminant validity (Gefen et al. 2000).

Convergent validity can be evaluated by assessing indicator reliability, composite reliability, and average variance extracted (Fornell 1982).

Item’s factor loadings for their respective construct were calculated to assess indicators reliability (Fornell and Larcker 1981). The internal consistency reliability is indexed by the composite reliability. The recommended threshold value of 0.7 was considered (Nunnally et al. 1967). Cronbach’s alpha also was measured to assess internal consistency. Cronbach’s alpha values between 0.70 and 0.95 are considered satisfactory for internal consistency (Terwee et al. 2007).

The average variance extracted (AVE) represents the amount of variance a construct captures via its items relative to the amount of variation due to measurement error. According to the guidelines of Fornell and Larcker, if the AVE is less than .50, the validity of the construct is questionable (Fornell and Larcker 1981).

The discriminant validity determined through comparison of item loadings with item cross loadings. Each item should load more highly on its respective constructs than on other constructs (Gefen and Straub 2005).

Furthermore, to establish discriminant validity, Fornell and Larcker (1981) suggest that the square root of AVE in each latent variable should be greater than other correlation values among the latent variables (Fornell and Larcker 1981).

The structural model represents the relationships between constructs that were hypothesized in the model. The overall model goodness of fit determined through paths (statistical and practical significance) and coefficients of determination (R-square) (Gil-Garcia 2008). Also, a global criterion of goodness-of-fit (GoF) calculated as an index for validating the PLS model globally (Tenenhaus et al. 2005). For global validations of a PLS model, Wetzels, Odekerken-Schroder, and van Oppen have formulated indicative GoF values as baseline values: GoFsmall = .1, GoFmedium = .25, and GoFlarge = .36 (Wetzels et al. 2009).

In addition, a priori hypothesized patterns of associations between a scale and other validated instruments provide further evidence for the scale construct validity. It was hypothesized that the correlations between the LSS mood scores and SF-36 emotional wellbeing scores should be high; also, the correlations between the LSS health scores and SF-36 general health perception scores should be high. The Spearman’s rank correlation coefficient (rs) was used to assess construct validity. In order to further investigate validity of the construct, the correlation between the LSS and the single question about life satisfaction as a single item measure was calculated, as well.

To assess the test-retest reliability, relative and absolute reliability indices were calculated. The relative reliability was measured by the two way random effects model of intraclass correlation coefficient (ICC2,1) (Shrout and Fleiss 1979), and ICC ≥0.6 was considered satisfactory (Chinn 1991).The standard error of measurement (SEM) was calculated to measure the absolute reliability (Weir 2005). Also to detect any possible systematic bias, the paired sample t-test between test and retest mean scores was performed (Bruton et al. 2000).

Ethics

The study was approved by the in Ethics Committee of the University of Social Welfare and Rehabilitation Sciences, Tehran, Iran. All participants were informed about the purpose of the study, and an informed consent was obtained from each participant. The LSS was used in this research, with the developers’ permission.

Results

Descriptive Statistics

334 cognitively healthy elderly (mean age = 65.61, Range = 60–86), participated in the present study. Sixty percent of them were male. Fifty three percent of them had high school educational level, and others had a university degree (Table 1).

Table 1 Demographic characteristics of participants (n = 334)

According to the results of the Kolmogorov-Smirnov test, the distribution of the items’ scores of the LSS was not normal (p < 0.05), therefore the PLS path modeling can be used.

The Measurement Model

We test the measurement model to ascertain the overall psychometric properties of the scales. Indicator reliability was determined by examining construct item loading. All items had loadings above 0.50. The scale had good internal consistency as indexed by the Cronbach’s α and composite reliability. In addition, the average variance extracted (AVE) for each measure exceeded 0.50.

Convergent validity analysis results are represented in Table 2.

Table 2 Convergent validity analysis (n = 334)

Comparison of item loadings with item cross loadings was used for discriminant validity. All items loaded more highly on their respective constructs than other constructs. Therefore the constructs demonstrate adequate discriminant validity (Table 3).

Table 3 Item loadings and cross loadings for 40 items of the LSS (n = 334)

In addition, Fornell-Larcker Criterion Analysis for Checking Discriminant Validity was done (Table 4). AVE is written in bold on the diagonal of the table. These diagonal elements are greater than their corresponding correlation coefficients except the AVE for the construct Goal and the correlation between the constructs Goal and Daily activity.

Table 4 Fornell-Larcker criterion analysis for checking discriminant validity (n = 334)

The Structural Model

Figure 1 shows the structural model results. The path between life satisfaction and all eight factors were highly significant. All beta coefficients are positive and statistically significant (p < 0.001).

Fig. 1
figure 1

Structural model results of the LSS

Coefficients of determination (R-squares) are represented in Table 5. A considerable percent of the factors was accounted by their explanatory construct; life satisfaction.

Table 5 Coefficients of determination for factors (n = 334)

The goodness-of-fit index (GoF) value was 0.59 witch could be interpreted as an acceptable value (Wetzels et al. 2009).

Further Evidences of the Construct Validity and Test-Retest Reliability

The correlation between the LSS mood scores and SF-36 emotional wellbeing scores was 0.65 and it was the highest value among all correlation coefficients between SF-36 emotional wellbeing subscale and all other subscales of the LSS. In addition, the correlation between the LSS health scores and SF-36 general health perception scores was 0.55 and it was the highest value among all correlation coefficients between SF-36 general health perception subscale and all other subscales of LSS. These results confirm the validity of the mentioned subscales (Table 6).

Table 6 Spearman’s rank correlation coefficient of the LSS and SF-36 subscales (n = 334)

Also, the correlation between LSS and the single question about life satisfaction asked from participants was 0.60 (p < .001).

Table 7 represents the reliability measures including test-retest reliability measures (ICC and SEM) for each subscales and the total score of LSS. The Intraclass correlation coefficient (ICC) was 0.91 for the overall scale, and 0.76–0.96 for the subscales.

Table 7 Test-retest reliability of the LSS subscales (n = 50)

The results of paired sample t-test indicated that the questionnaire scores were not significantly different (P ≥ .05) in the two administrations of test and retest.

Discussion

The purpose of the present study was to translate the “Life satisfaction Scale (LSS)” and evaluate psychometric properties of the Persian version of this scale. The translation of the LSS followed a standard protocol developed by International Quality of Life Assessment (IQOLA), including multiple forward and backward translations, assessment of quality of translation and evaluating its conceptual equivalence with the original version (Ware 2004). The LSS consists of 40 short and clear items and response choices. In the translation process of the present study, the translators did not encounter any obscure or challenging item for translation. After producing and documenting individual translations, consensus was reached among the translation team and they agreed on a common translation. In addition, during the cognitive debriefing interview, we didn’t find any obscure, confusing and difficult items. As the LSS was a well-designed scale, we could achieve a good Persian version of this scale.

In the present study, the psychometric properties of the Persian version of the LSS were evaluated. Construct validity of the LSS was supported by satisfactory convergent and discriminant validity of the constructs. The LSS demonstrate adequate discriminant validity based on the comparison of item loadings with item cross loadings and Fornell and Larcker’s test results, considering only one exception in the case of the near values of the AVE for the construct Goal and the correlation between the constructs Goal and Daily activity.

Reliability is a major concern when we use a psychological test. The results of the present study indicate that the measures are robust in terms of their internal consistency as indexed by the Cronbach’s α (ranges from .73–.87 for each subscale, and .93 for the LSS total scores) and the composite reliability. These results are well fit with the results of the LSS original version (the Cronbach’s α ranges from .60 to .79 for each subscale, and .93 for the LSS total score) (Salamon and Conte 2003).

In the current study, Item’s factor loading results demonstrated that in some instances, items loads significantly on more than one construct. The item DA1 loads similarly on its corresponding construct i.e. Daily Activity (loading = .780) and on the other construct i.e. Meaning (loading = .714); Also, item G5 loads on Goals (loading = .689) and on Meaning (loading = .629). In other words, the items have cross loaded on the construct “Meaning” as well as their related constructs. One possible explanation for these findings is that in Persian language, the concepts “Function”, “Goals” and “Meaning” are interrelated concepts. The item DA1 exactly refers to satisfaction with individual’s daily routine. The satisfaction with individual’s daily routine is dependent on his feeling that his or her life is meaningful or not. Similarly, the item G5 refers to satisfaction with the way things turn out. This item belongs to the Goals construct. However, the item may relate to the meaning of life (Meaning construct) as well. Therefore, the nature of these items may lead to their high loadings on more than one construct. Nevertheless, the loading of each item on its corresponding construct is higher than its loading on the other construct. Therefore, the results confirmed the structural model of the LSS.

On the other hand, similar results were reported by Salamon and Conte (2003) in the Procrustes confirmatory factor analysis of original version of the LSS. Their results indicated that two constructs (the Health and the Finance) show complete congruence with their corresponding items. The other six factors produced a number of significant loadings and in some instances, variables loads significantly on more than one construct (Salamon and Conte 2003).

Moreover, the loading of SC4 on its corresponding construct was 0.364 (t value exceeds 1.96) in the present study. More analysis demonstrated that eliminating the item from the model did not improve the model fit. On the other hand, the item’s loading value was close to 0.4 and consistent with the original version (loading = 0.404). Therefore, we decided not to eliminate SC4 from the model.

The results of the test-retest reliability study might be considered as an evidence of good scale’s stability over time. In the present study, good test-retest reliability of Persian version of LSS (ICC = 0.84) was obtained. The results were consistent with original version of the LSS (the test-retest correlation coefficient of 0.67) (Salamon and Conte 2003).

Construct validity was supported by the presence of higher correlations between the LSS and the SF-36 subscale measuring similar constructs, and lower correlations between the subscales measuring dissimilar constructs. The correlation between the LSS and the single question about life satisfaction, as a single item measure, confirmed the validity of the questionnaire.

In order to assess the eight-factor structural model, the overall model goodness of fit has been determined through paths, coefficients of determination and the global criterion of goodness-of-fit (GoF). The results indicated an adequate goodness of fit between the hypothesized model of the LSS and the model derived from the sample data. It is notable that the fit measures of Covariance Based Structural Equation Modeling (CB-SEM) can be better suited for model validation compared to PLS’GoF (Henseler and Sarstedt 2013). However, in the present study we preferred to use PLS path modeling as the distribution of the data was not normal (Bagozzi 1994a, b; Hair et al. 2011). In addition, there are some studies that show PLS path modeling is appropriate for the confirmatory factor analysis which is more reliable and valid compared to Covariance Based Structural Equation Modeling (CB-SEM) (Hair et al. 2011; Afthanorhan 2013).

The “Life Satisfaction Scale” has been used as an outcome measure, measure of clinical need, and as a part of research protocols in a variety of settings. Numerous studies have confirmed its optimum psychometric properties (Salamon and Conte 2003). The results of the present study demonstrated that the Persian version of the LSS has good psychometric properties (internal consistency, test re-test reliability, and construct validity) to be used as an outcome measure in Iranian older adults.

The limitation is that the eligibility criteria for participation in this study were older adult’s literacy level (at least 9 years of school education) which may limit the generalization of its usefulness to a group of illiterate older adults. Furthermore, qualitative studies are suggested to find the meaning of life satisfaction of the Iranian elderly population in more depth.

Conclusion

In conclusion, the Persian version of the Life Satisfaction Scale (LSS) is a reliable and valid instrument for measuring life satisfaction in the Iranian older adults.