The validity of the EQ-5D-3L items: an investigation with type 2 diabetes patients from six European countries

Konerding, Uwe; Elkhuizen, Sylvia G; Faubel, Raquel; Forte, Paul; Malmström, Tomi; Pavi, Elpida; Janssen, MF Bas

doi:10.1186/s12955-014-0181-5

The validity of the EQ-5D-3L items: an investigation with type 2 diabetes patients from six European countries

Research
Open access
Published: 05 December 2014

Volume 12, article number 181, (2014)
Cite this article

Download PDF

You have full access to this open access article

Health and Quality of Life Outcomes Aims and scope Submit manuscript

The validity of the EQ-5D-3L items: an investigation with type 2 diabetes patients from six European countries

Download PDF

Uwe Konerding¹,
Sylvia G Elkhuizen²,
Raquel Faubel³,
Paul Forte⁴,
Tomi Malmström⁵,
Elpida Pavi⁶ &
…
MF Bas Janssen⁷

3075 Accesses
29 Citations
Explore all metrics

Abstract

Background

Most previous studies concerning the validity of the EQ-5D-3L items refer to applications of only a single language version of the EQ-5D-3L in only one country. Therefore, there is little information concerning the extent to which the results can be generalised across different language versions and/or different countries. Here the validity of the EQ-5D-3L items is investigated for six different language versions in six different countries.

Methods

Data came from 1341 type 2 diabetes patients (England: 289; Finland: 177; Germany: 255; Greece: 165; the Netherlands: 354; Spain: 101). The relationships of the five EQ-5D-3L items with seven different test variables (age, gender, education, previous stroke, problems with heart, problems with lower extremities, problems with eyes), were analysed for each combination of item and test variable. For each combination two logistic regression models with the dichotomised EQ-5D-3L item as dependent variable were computed. The first model contained the test variable and dummy coded countries as independent variables, the second model additionally the terms for the interaction between country and test variable. Statistically significant better fit of the second model was taken as evidence for country specific differences regarding the relationship. When such differences could be attributed mainly to one country the analyses were repeated without the data from this country. Validity was investigated with the remaining data using results of the first models.

Results

Due to lack of variation in the Spanish data only 31 of the originally intended 35 interaction tests could be performed. Only three of these yielded a significant result. In all three cases the Spanish data deviated most. Without the Spanish data only 1 of the 35 interaction tests yielded a significant result. With 3 exceptions, the tendency of reporting problems increased with age, female gender, lower education, previous stroke, heart problems, problems with lower extremities and problems with eyes for all EQ-5D-3L items.

Conclusion

The results concerning the European Spanish version are ambiguous. However, the items of the English, Finnish, German, Greek and Dutch versions of the EQ-5D-3L relate in substantially the same way to the test variables. Mostly, these relationships indicate the items’ validity.

Measurement properties of the EQ-5D-5L compared to EQ-5D-3L in the Thai diabetes patients

Article Open access 06 February 2015

Juntana Pattanaphesaj & Montarat Thavorncharoensap

Translation and psychometric properties of the Persian version of the Audit of Diabetes Dependent Quality of Life (IR-ADDQoL)

Article Open access 28 November 2022

Zeinab Ghazanfari, Mohammad Mehdi Naghizadeh, … Ali Montazeri

Cross-cultural measurement equivalence of the 5-level EQ-5D (EQ-5D-5L) in patients with type 2 diabetes mellitus in Singapore

Article Open access 16 July 2015

Ye Wang, Ngiap-Chuan Tan, … Nan Luo

The EQ-5D-3L with its 153 different official language versions and a further 19 language versions awaiting approval is currently the most widely used preference-based instrument for measuring health related quality of life [1]-[4]. The EQ-5D-3L consists of (1) a descriptive system comprising five questionnaire items and (2) a visual analogue scale. The questionnaire items address five different dimensions of health: (1) mobility; (2) self-care; (3) usual activities; (4) pain/discomfort; and (5) anxiety/depression. Three answer categories are provided for each item with the first category referring to the best state. The EQ-5D-3L can be applied for two purposes: describing health states; and constructing indices which represent the utilities of these health states. Several scoring functions for assigning utilities to health states are available and most of these refer to one specific language version of the EQ-5D-3L [5].

Whatever purpose the EQ-5D-3L items are used for, they need to be valid. This means, in the first instance, that they should adequately describe the respondents’ actual health states. This can best be investigated by analysing the relationships between the items and some more concrete indicators of actual health. There are several studies which refer to these relationships [6]-[17] and which, together, constitute a remarkable evidence basis for judging the validity of the EQ-5D-3L items. For a deeper understanding, however, it is also interesting to know how the items relate to socio-demographic variables such as age, gender and education. As far as these socio-demographic variables are related to actual health, the relationships of these variables with the EQ-5D-3L items also provide information about the items’ validity. There are several studies in which these relationships have been investigated [6],[11],[14],[15],[17]-[20] and the findings from these studies enrich the empirical knowledge relevant for judging the validity of the EQ-5D-3L items.

However, most of the studies just mentioned have been performed in only one specific country using only one specific language version of the EQ-5D-3L. There are only four studies in which different language versions have been applied. Three of these have been performed using different language versions within the same country [12],[15],[18] and only one study referred to data from different countries [19]. Analyses which are relevant for investigating validity and which have been performed separately for either the different language versions or the different countries respectively have been reported for only two of the four studies [12],[19]. Hence, there is not much empirical evidence concerning the extent to which results referring to the EQ-5D-3L items can be generalised across different language versions and/or across different countries. In fact, the results of one of the few relevant studies [12] even suggest that this is not the case. Obtaining more information about the generalizability of findings concerning the EQ-5D-3L items would require more studies in which different language versions of the EQ-5D-3L are applied in different countries.

A study with the features just mentioned is presented here, performed on patients with type 2 diabetes. Type 2 diabetes is a chronic insufficiency in the processing of glucose in the blood caused by too little insulin being produced by the body or by the available insulin working ineffectively. In the long run diabetes can lead to severe complications; particularly stroke, heart attack, kidney failure, ulcers of the lower extremities and impairment of sight [21],[22].

Patients from England, Finland, Germany, Greece, the Netherlands and Spain were investigated and the corresponding EQ-5D-3L language versions were used. The data from this study are analysed with regard to two objectives:

1) to investigate the generalizability of findings concerning the EQ-5D-3L items across the study countries, and

2) to investigate the validity of the EQ-5D-3L items on the basis of the data from those countries for which the findings concerning the EQ-5D-3L items are substantially the same.

Methods

Study participants and study conduction

The analyses were performed using data from surveys of type 2 diabetes patients in England, Finland, Germany, Greece, the Netherlands and Spain which were conducted during a major European project concerned with health provider networks [23]. Inclusion criteria for participants were 1) that they were being treated for type 2 diabetes by the health providers investigated in the project and 2) that they were at least 18 years old. The health providers investigated in the study selected the patients to be approached for participation according to criteria defined by the researchers. The patients were then contacted either by post or by being directly given the questionnaire when visiting their health care provider. The patients who participated in the survey completed their questionnaires on their own without personnel from the service provider or a researcher beside them. Depending on what was the most feasible method for the provider the participants returned their completed questionnaires either by mail directly to the local project study centres, or to the care provider who then passed them on to the study centres. All surveys had been approved by national ethical committees and took place between October 2011 and March 2012.

Questionnaire

The questionnaire used for the surveys contained items addressing socio-demographic features, health, health-related behaviour and health treatment. Most of these items were not included in the analyses presented here. The questionnaire items included were those concerning age, gender, educational attainment, and secondary complications of diabetes, and all EQ-5D-3L items. There were also two questions concerning the participant’s competence in mastering the questionnaire language. In the English version of the questionnaire the first question was ‘What is your first language?’ and the categories ‘English’ and ‘Other, please specify’ were given as answer options. The second question was ‘If English is not your first language, how well do you master it?’ with the answer options ‘Not at all’, ‘Poorly’, ‘Moderately’, ‘Well’ and ‘Perfectly’. In the other language versions the word ‘English’ was replaced with the word for the language in which the questionnaire was formulated. Both questions were relevant in selecting which questionnaires were to be included for the analyses.

Educational attainment was assessed by asking participants whether they had left school after the minimum school leaving age of their country. Those answering ‘yes’ were classified as having a lower level of educational attainment than those who answered ‘no’. After lengthy discussions with researchers from all six investigated countries this assessment turned out to fit best to all the six corresponding education systems which are all diverse in many respects.

Five secondary complications of type 2 diabetes were considered in the questionnaire: stroke, kidney failure, heart problems, problems concerned with the lower extremities and problems with eyes. Each complication was addressed by a single question. Stroke was assessed by asking if they had ever had a stroke, and kidney failure by asking if they were dependent on dialyses or had had a previous kidney transplant. The other three secondary complications were addressed by questions each of which in turn had three answer options. For heart problems these were: no problems; having had a by-pass or suffering from angina, but never having had a heart attack; and having had a heart attack. For problems with lower extremities the options were: no problems with legs, feet and toes caused by diabetes; having lesions in legs, feet or toes caused by diabetes, but no amputation caused by diabetes; and amputation of a toe, foot or leg due to problems caused by diabetes. The three answer options for eye problems were: no problems; problems caused by diabetes, but no blindness; and blindness as a result of diabetes.

Statistical analyses

Questionnaires were excluded from analyses if the participant could not sufficiently master the questionnaire language and/or if there were too many missing values for the variables under consideration. Sufficient mastering of the questionnaire language was defined as either being a native speaker or as mastering the questionnaire language at least ‘well’. Having too many missing values was defined as there being either more than one missing value for the three socio-demographic questions, or more than one missing value for the five questions concerning the secondary complications or more than one missing value for the five EQ-5D-3L items. The latter criterion was a compromise between the demands of not relying on information that was too questionable and of not excluding too many participants. If there was no more than one missing value for each of the three question categories, this could still be attributed to casual slips by the respondent. More missing values than this were taken as an indicator of fundamental insufficiencies in the respondent's answers.

To obtain a basic understanding of the data, the distributions of the three socio-demographic features and the five secondary complications were analysed for each country separately, and for all countries together. Age was treated both as a continuous variable and as a categorical variable with the categories ’18-54’, ’55-64’, ’65-74’, and ‘older than 74’. These categories were chosen because they were expected to cover qualitatively different age ranges. Country differences with regard to age were tested by computing a multivariate linear regression with age as the dependent variable and dummy coded countries as independent variables. Country differences with regard to the remaining socio-demographic variables, stroke, and kidney failure were tested using Pearson’s chi-square test. Country differences on the three ‘three-categorical’ secondary complications (problems with heart, lower extremities, or eye problems) and the five EQ-5D-3L items were analysed using the Kruskal-Wallis test. This test is specially designed for comparing values of variables with an ordinal scale level. It was chosen because the three-categorical items for the secondary complications as well as the EQ-5D-3L items possess only this scale level. Bivariate relationships of the socio-demographic features and the secondary complications with the EQ-5D-3L items were also investigated using Kruskal-Wallis tests. The remaining analyses directly refer to the two objectives presented in the introduction.

Generalizability of findings concerning the EQ-5D-3L items

As mentioned above the validity of the EQ-5D-3L items is usually investigated by analysing the relationships of these items with demographic features and with more objective features of health. Therefore, the cross-national generalizability of findings concerning the EQ-5D-3L items was investigated here with regard to exactly these relationships. To simplify these analyses the five EQ-5D-3L items and all three-categorical items referring to secondary complications were dichotomised with one category being ‘no problems at all’ and the other ‘any problems’. The secondary complications which were measured with three-categorical items were problems with heart, lower extremities and eyes. The items referring to stroke and to kidney failure were already binary variables. As country specific differences regarding the relationships between the EQ-5D-3L items and other variables can only be reasonably investigated when there are values for all levels of the investigated variables in all countries those socio-demographic features and secondary complications were excluded from the analyses which did not meet this criterion. The included variables will be referred to as the ‘test variables’ below.

To test whether there are country-specific differences regarding relationships between the EQ-5D-3L items and the test variables two logistic regression models were compared for each combination of EQ-5D-3L item and test variable. Both models contained the respective dichotomised EQ-5D-3L item as dependent variable. The first of the two models contained the test variable and dummy coded country as independent variable; the second model contained additionally terms for the interaction between countries and test variable. Statistically significant better fit of the second model was taken as evidence for country specific differences regarding the relationship. This was tested using a likelihood ratio test. To get also descriptive measures for the improvement of model fit Nagelkerke’s pseudo R-square was computed for each model. Nagelkerke’s pseudo R-square reflects how all included independent variables together explain the dependent variable. This statistic is zero when the independent variables do not explain the dependent variable at all. When, in contrast, the independent variables explain the dependent variable completely this statistic is equal to one [24].

For those combinations of EQ-5D-3L items and test variable for which the interaction terms led to a statistically significant improvement of the model fit further analyses were performed. These analyses aimed to identify that country for which the investigated relationship differed most from the other countries. For this purpose, logistic regression models with the dichotomised EQ-5D-3L item as dependent variable were computed for each country separately. These country specific regression models contained only the test variable as independent variable. This was always one dummy variable (except for age, which was modelled with three dummy variables). It was determined how the country specific regression coefficients for these dummy variables deviated from the corresponding regression coefficients in the models which were computed for all countries together without using interaction terms. These deviations directly reflect how much the relationship for the respective country differs from the common core of all countries. When it was mainly the same country which deviated most from the common core when there were statistically significant interactions then all analyses just described were repeated without the data from this country.

Validity of the EQ-5D-3L items

The validity of the EQ-5D-3L items was investigated with the data from those countries for which there were no essential interaction effects. This was achieved through interpreting the results from those regression models which were computed for all countries together and where interaction terms as independent variables were excluded.

Results and discussion

More than 6000 questionnaires were distributed of which 1638 were returned and 1341 met the inclusion criteria (see Table 1). The proportion of excluded questionnaires was largest in England (39.2%) which was due to the fact that about 40% of all respondents in this sample were of Bangladeshi ethnicity who, due to lower levels of stated proficiency in the English language, did not meet the inclusion criteria for this analysis. Altogether, 21.6% of the distributed questionnaires were finally included in the analyses and these proportions vary from 8.6% for England to 55.2% for Germany. As a consequence percentages and means determined from these data might deviate from those means and percentages which would have been obtained for the total sample. However, relationships between variables can rather be expected to be the same for responders and non-responders. Hence, the low response rates will most probably not constitute a too great danger for the validity of the analyses concerning the relationships between the EQ-5D-3L items and the test variables.

Table 1 General information about the sample

Full size table

The mean age of all included participants was 65.4 years. The majority of the participants was male (56.9%), had a higher education attainment (53.6%), and no secondary complications (ranging from 78.3% for no heart complications to 99.7% for no kidney failure) (see Table 2). For the three socio-demographic variables country differences were statistically significant except for gender. For the five secondary complications country differences were significant except for kidney failure (see Table 2). As there were three countries in which there were no participants with kidney failure, this variable could not be applied for the analyses intended here. For all four secondary complications with statistically significant country differences, the best health states were reported in the Netherlands. Data were available for all categories of the five EQ-5D-3L items; however only for ‘usual activities’ , ‘pain/discomfort’ , and ‘anxiety/depression’ were there data for all categories in every country (see Table 2). There were significant differences between the countries for all five items. The Dutch participants had the lowest rates of reporting problems for mobility (31.3%), self-care (5.2%), usual activities (22.7%) and pain and/or discomfort (43.2%) and the Finnish participants had the lowest rate of reporting problems with depression and/or anxiety (11.4%).

Table 2 Distributions of the investigated variables ^a

Full size table

Generalizability of findings concerning EQ-5D-3L items

No Spanish participant between 55 and 64 years of age reported problems with self-care and all Spanish participants with a previous stroke reported problems with mobility, usual activities and pain or discomfort (data not shown). Hence, the regression models with interaction terms could not be computed for these combinations of EQ-5D-3L items and test variables (see Table 3). All other originally intended analyses, however, could be performed, i.e. the significance of interactions could be tested in 31 cases.

Table 3 Model fits for data from all six study countries ^a

Full size table

In only 3 of the 31 cases in which interactions could be tested, the interaction terms raised the model fit significantly with a significance level of 0.05 (see Table 3). These three analyses concerned the relationships of the item ‘mobility’ with gender (increase in Nagelkerke’s Pseudo R²: 0.012, p < 0.05), of the item ‘self-care’ with problems with eyes (increase in Nagelkerke’s Pseudo R²: 0.015, p < 0.05), and of the item ‘pain/discomfort’ with problems with heart (increase in Nagelkerke’s Pseudo R²: 0.012, p < 0.05). In all three of these cases the regression coefficient for Spain deviated most from the corresponding coefficient for the model computed for all countries without the interaction terms as independent variables. When 31 significance tests with a significance level of 0.05 are performed then the probability that more than two tests yield a significant result by chance is 0.201. Therefore, the three significant results could very well be false rejections of the zero-hypothesis. Of course, with a larger sample size more effects might be detected. However, in those three cases where statistically significant interaction effects were found, the differences between the Nagelkerke’s Pseudo R² are small. Hence, effects detected with a larger sample size can also be expected to be quite small. So, the results presented here suggest that the six investigated different language versions relate in approximately the same way to the test variables. However, the fact that the regression coefficients for Spain deviate most in all three significant interactions might also indicate that either the items of the Spanish version are differently understood by the respondents, or that they actually relate differently to the test variables.

To investigate the generalizability of findings across the five countries other than Spain, the same analyses were repeated without the Spanish data. All 35 interaction effects could be tested and only one of these 35 tests, i.e. the interaction between countries and educational level with regard to the item ‘pain/discomfort’, yielded a statistical significant result on the 0.05 level (see Table 4). When 35 statistical tests with a significance level of 0.05 are performed then the probability that at least one test yields a significant result by chance is 0.834. Therefore, the one and only significant result can very well be a false rejection of the zero-hypothesis. Considering the patterns of Nagelkerke’s pseudo R-square (see Table 4) no large effects can be expected to be detected with a larger sample size. Consequently, the results provide strong evidence for the generalizability of the results across England, Finland, Germany, Greece and the Netherlands.

Table 4 Model fits without Spanish data ^a

Full size table

Validity of the EQ-5D-3L items

Relation between EQ-5D-3L items and test variables

The validity of the EQ-5D-3L items was only investigated with the data of those five countries for which there was strong evidence of generalizability. For these data the results from the regression analyses performed with the test variables and dummy coded countries but without interaction terms as independent variables were inspected more thoroughly (see Table 5). These results show that the first four items of the EQ-5D-3L, i.e. ‘mobility’, ‘self-care’, ‘usual activities’ and ‘pain/discomfort’, relate in the same way to the seven test variables. The tendency of reporting problems increases in a statistically significant manner with age, female gender, previous stroke, heart problems, problems with lower extremities and problems with eyes. The tendency of reporting problems also increases with lower education for all four items, but only for three of those items in a statistically significant manner. The statistical test for the item ‘self-care’ just misses a significant result; but the regression coefficient for this item (b = −0.35) deviates more from zero than those for the items ‘mobility’ (b = −0.33) and ‘pain/discomfort’ (b = −0.33) and the confidence intervals for all four items overlap largely (see Table 5). There are still further commonalities between the first four items. For the three socio-demographic variables the absolute value of the coefficient is always largest for the highest age category in comparison with the lowest age category and always lowest for education. For the four investigated secondary complications this value is always largest for problems with lower extremities.

Table 5 Regression coefficients for models without interaction terms, Spanish data excluded ^a

Full size table

However, the regression coefficients also indicate a slight difference between the first three items, i.e. ‘mobility’ , ‘self-care’ and ‘usual activities’ , and the item ‘pain/discomfort’. The latter item seems to be less affected by age (b for age > 74: 0.92) and by the four secondary complications of diabetes (b range: 0.65 to 1.56) than the other three items (see Table 5). For ‘age’ and for ‘previous stroke’ this tendency is even expressed in the confidence intervals, i.e. for the age category ‘> 74’ the confidence interval for ‘pain/discomfort’ (0.53; 1.31) does not overlap with the confidence interval for the item ‘mobility’ (1.40; 2.25), and for ‘previous stroke’ the confidence interval for ‘pain/discomfort’ (0.17; 1.14) does not overlap with the confidence interval for the item ‘usual activities’ (1.25; 2.20) (see Table 5). There is still a further difference between the first three EQ-5D-3L items and the item ‘pain/discomfort’. Whereas ‘problems with the heart’ constitute that secondary complication which has the smallest effect on the first three items (b range: 0.86 to 0.98), it is ‘previous stroke’ which has the smallest effect (b = 0.65) on the item ‘pain/discomfort’.

The item ‘anxiety/depression’ differs quite distinctly from the other four EQ-5D-3L items. In contrast to the other items it is virtually unrelated to age and education. Moreover, the regression coefficients indicate that it is less affected by the secondary complications with only ‘previous stroke’ being an exception. For ‘problems with lower extremities’ and for ‘problems with eyes’ this tendency is also evident in the confidence intervals, i.e. for ‘problems with lower extremities’ the confidence interval for ‘anxiety/depression’ (0.59; 1.31) does not overlap with the confidence interval for ‘mobility’ (1.44; 2.18), and for ‘problems with eyes’ the confidence interval for ‘anxiety/depression’ (0.43; 1.10) does not overlap the confidence interval for ‘usual activities’ (1.25; 2.20) (see Table 5).

Relationships to previous studies

The finding that the first three EQ-5D-3L items are most similar to each other whereas the last item, i.e. ‘anxiety/depression’ , differs most is very well in line with results from factor analyses performed in other studies [25]-[27]. The finding that health impairments as reflected by more objective indicators have an adverse effect on all five items is, with a few exceptions [6]-[8],[12],[13],[16], also in line with the findings in earlier studies [6]-[17]. As in the results presented here, other studies show that age has an adverse effect on the first four EQ-5D-3L items [6],[11],[14],[17],[19],[20]. However, in contrast to the findings presented here, age also had a statistically significant adverse effect on the item ‘anxiety/depression’ in most of the studies in which this relation has previously been investigated [11],[14],[17],[19],[20]. In only one of these studies was no such effect found [6]. As in the study presented here, women reported more problems than men for all five items in most of the previous studies in which this relationship had been investigated [11],[14],[17],[19],[20]. There was only one exception [15]. There is also rich evidence that, just as in the study presented here, the tendency of reporting problems for the first four EQ-5D-3L items increases with lower education [11],[14],[15],[17],[19],[20]. However, in contrast to the study presented here, previous studies also show a statistically significant positive effect of education on the item ‘anxiety/depression’ [11],[14],[15],[17] or at least a non-significant tendency in this direction [19],[20].

Integrative interpretation of the results relevant for validity

Considered from only a conceptual perspective the constructs addressed by the five EQ-5D-3L items are related in a different way to each other. The first three items are all concerned with bodily function which to a large degree can be affected by the same impairments; the fourth item, i.e. ‘pain/discomfort’, describes something which is physical but which is not necessarily related to functional restrictions; the fifth item, i.e. ‘anxiety/depression’, describes something which is not physical and which can be affected by quite different factors other than physical health. These conceptual relations between the items are very well reflected by the results found in the study presented here as well as by the results of the previous studies concerning the factorial structure of the items [25]-[27]. Hence, these results provide evidence for the internal validity of the items.

Both the study presented here and previous studies show that more problems are reported for the first four EQ-5D-3L items with increasing age and lower education. As physical health is known to decrease with age and lower education [28]-[31] these findings further corroborate these items’ validity. The relationships of age and education with the item ‘anxiety/depression’, however, are more complicated.

For the relationship between age and the item ‘anxiety/depression’ two antagonistic mediating mechanisms can be assumed. One mechanism is that physical health decreases with age and that, this in turn, increases anxiety and/or depression [32]-[34]. On the other hand, most people at an older age receive a pension income from which they can live and which can alleviate some reasons for being anxious and/or depressive. Moreover, older age for some people is associated with a time for reduced work responsibilities and increased leisure time which can also reduce anxiety and/or depression. When these two antagonistic mechanisms work together no universal adverse effect of age on the item ‘anxiety/depression’ can be expected. In the data presented here, both mechanisms might have neutralised each other whereas in most studies in literature the first mechanism might have been stronger.

For the relationship between education and the item ‘anxiety/depression’ two antagonistic mediating mechanisms can also be postulated. One mechanism is that higher education is associated with better physical health [28]-[31], and better physical health, in turn, is associated with less anxiety and depression [32]-[34]. The other mechanism is that higher education is very often associated both with more being expected from these people in their work positions and with their own expectations from life in general being higher, which both increases the risk of becoming anxious and depressive. Assuming these two antagonistic mechanisms implies that no universal adverse effect of education on the item ‘anxiety/depression’ can be expected. In the data presented here, both mechanisms might have neutralised each other whereas in most studies in literature the first mechanism might have been stronger.

Both the study presented here and previous studies show that males report fewer health problems than females. The interpretation of these findings, however, is also complicated. In fact, some empirical studies indicate that males are healthier than females [35]-[38]. However, the classical male self-concept expects males to be both physically and emotionally strong [39], which reduces the tendency to admit impairments even if they actually exist. So, as far as the gender effect in the answers corresponds to actual differences in health this effect constitutes further evidence for the validity of the items. However, to the extent to which the gender effect is stronger than the differences in actual health, this effect is evidence for a response bias and therefore for a specific restriction of validity. With the analyses presented here the relative influence of both factors cannot be determined and additional research would be required.

Conclusions

The items of the English, Finnish, German, Greek, Dutch, and Spanish versions of the EQ-5D-3L relate in approximately the same way to age, gender, education, previous stroke, heart problems, problems with lower extremities and problems with eyes. However, for 3 of the 31 performed tests statistically significant deviations from the same pattern were detected and all of these were largely attributable to the Spanish data. These results can still be explained by statistical variation caused by chance. However, they might also suggest that something is different either with the European Spanish language version of the EQ-5D-3L or with the actual relationships between the EQ-5D-3L items and the test variables. Further studies are required for deciding between both possible explanations. After excluding the Spanish data, only one of the 35 performed statistical tests of deviation from the same pattern produced a statistically significant result. This provides strong empirical evidence that the items of the English, the Finnish, the German, the Greek, and the Dutch version of the EQ-5D-3L relate in the same way to the test variables. The aggregated results for English, Finnish, German, Greek and Dutch data provide further evidence for the validity of the EQ-5D-3L items.

References

EuroQol: The current state of play Health Policy 1996, 37(1):53–72. 10.1016/0168-8510(96)00822-6
Article Google Scholar
Rabin R, de Charro F: EQ-5D: a measure of health status from the EuroQol Group. Ann Med 2001, 33(5):337–343. 10.3109/07853890109002087
Article CAS PubMed Google Scholar
Brauer CA, Rosen AB, Greenberg D, Neumann PJ: Trends in the measurement of health utilities in published cost-utility analyses. Value Health 2006, 9: 213–218. 10.1111/j.1524-4733.2006.00116.x
Article PubMed Google Scholar
EuroQol Group: EQ-5D-3L. . Accessed June 10th 2014., [http://www.euroqol.org/eq-5d-products/eq-5d-3l.html]
Szende A, Oppe M, Devlin N: EQ-5D value sets: Inventory, comparative review and user guide. Springer, Dordrecht, The Netherlands; 2007.
Book Google Scholar
Aburuz S, Bulatova N, Twalbeh M, Gazawi M: The validity and reliability of the Arabic version of the EQ-5D: a study from Jordan. Ann Saudi Med 2009, 29(4):304–308. 10.4103/0256-4947.55313
Article PubMed Central PubMed Google Scholar
Brettschneider C, König HH, Herzog W, Kaufmann C, Schaefert R, Konnopka A: Validity and responsiveness of the EQ-5D in assessing and valuing health status in patients with somatoform disorders. Health Qual Life Outcomes 2013, 11: 3. 10.1186/1477-7525-11-3
Article PubMed Central PubMed Google Scholar
Hunger M, Sabariego C, Stollenwerk B, Cieza A, Leidl R: Validity, reliability and responsiveness of the EQ-5D in German stroke patients undergoing rehabilitation. Qual Life Res 2012, 21(7):1205–1216. 10.1007/s11136-011-0024-3
Article PubMed Google Scholar
König HH, Born A, Günther O, Matschinger H, Heinrich S, Riedel-Heller SG, Angermeyer MC, Roick C: Validity and responsiveness of the EQ-5D in assessing and valuing health status in patients with anxiety disorders. Health Qual Life Outcomes 2010, 8: 47. 10.1186/1477-7525-8-47
Article PubMed Central PubMed Google Scholar
König HH, Roick C, Angermeyer MC: Validity of the EQ-5D in assessing and valuing health status in patients with schizophrenic, schizotypal or delusional disorders. Eur Psychiatry 2007, 22(3):177–187. 10.1016/j.eurpsy.2006.08.004
Article PubMed Google Scholar
Kontodimopoulos N, Pappa E, Niakas D, Yfantopoulos J, Dimitrakaki C, Tountas Y: Validity of the EuroQoL (EQ-5D) instrument in a Greek general population. Value Health 2008, 11(7):1162–1169. 10.1111/j.1524-4733.2008.00356.x
Article PubMed Google Scholar
Luo N, Low S, Lau PN, Au WL, Tan LC: Is EQ-5D a valid quality of life instrument in patients with Parkinson's disease? A study in Singapore. Ann Acad Med Singapore 2009, 38(6):521–528.
PubMed Google Scholar
Öster C, Willebrand M, Dyster-Aas J, Kildal M, Ekselius L: Validation of the EQ-5D questionnaire in burn injured adults. Burns 2009, 35(5):723–732. 10.1016/j.burns.2008.11.007
Article PubMed Google Scholar
Savoia E, Fantini MP, Pandolfi PP, Dallolio L, Collina N: Assessing the construct validity of the Italian version of the EQ-5D: preliminary results from a cross-sectional study in North Italy. Health Qual Life Outcomes 2006, 4: 47. 10.1186/1477-7525-4-47
Article PubMed Central PubMed Google Scholar
Shafie AA, Hassali MA, Liau SY: A cross-sectional validation study of EQ-5D among the Malaysian adult population. Qual Life Res 2011, 20(4):593–600. 10.1007/s11136-010-9774-6
Article PubMed Google Scholar
Stark RG, Reitmeir P, Leidl R, König HH: Validity, reliability, and responsiveness of the EQ-5D in inflammatory bowel disease in Germany. Inflamm Bowel Dis 2010, 16(1):42–51. 10.1002/ibd.20989
Article PubMed Google Scholar
Sun S, Chen J, Johannesson M, Kind P, Xu L, Zhang Y, Burström K: Population health status in China: EQ-5D results, by age, sex and socio-economic status, from the National Health Services Survey 2008. Qual Life Res 2011, 20(3):309–320. 10.1007/s11136-010-9762-x
Article PubMed Central PubMed Google Scholar
Faridah A, Jamaiyah H, Goh A, Soraya A: The validation of the EQ-5D in Malaysian dialysis patients. Med J Malaysia 2010, 65(Suppl A):114–119.
PubMed Google Scholar
König HH, Bernert S, Angermeyer MC, Matschinger H, Martinez M, Vilagut G, Haro JM, de Girolamo G, de Graaf R, Kovess V, Alonso J: Comparison of population health status in six european countries: results of a representative survey using the EQ-5D questionnaire. Med Care 2009, 47(2):255–261. 10.1097/MLR.0b013e318184759e
Article PubMed Google Scholar
Wang HM, Patrick DL, Edwards TC, Skalicky AM, Zeng HY, Gu WW: Validation of the EQ-5D in a general population sample in urban China. Qual Life Res 2012, 21(1):155–160. 10.1007/s11136-011-9915-6
Article PubMed Google Scholar
Kumar V, Abbas AK, Fausto N, Robbins SL, Cotran RS: Robbins and Cotran Pathologic Basis of Disease. 7th edition. Elsevier Saunders, Philadelphia, Pa; 2005.
Google Scholar
Melmed S, Polonsky K, Larsen PR, Kronenberg H, Williams : Textbook of Endocrinology. 12th edition. Elsevier/Saunders, Philadelphia; 2011.
Google Scholar
Managed Outcomes Consortium: Managed Outcomes. . Accessed 30 May 2014, [http://www.youtube.com/feed/UCGR1TTllCKmp-FHKH6cEAcg]
Nagelkerke NJD: A Note on a General Definition of the Coefficient of Determination. Biometrika 1991, 78(3):691–692. 10.1093/biomet/78.3.691
Article Google Scholar
Cherepanov D, Palta M, Fryback DG: Underlying dimensions of the five health-related quality-of-life measures used in utility assessment: evidence from the National Health Measurement Study. Med Care 2010, 48(8):718–725. 10.1097/MLR.0b013e3181e35871
Article PubMed Central PubMed Google Scholar
Essink-Bot ML, Krabbe PF, Bonsel GJ, Aaronson NK: An empirical comparison of four generic health status measures. The Nottingham Health Profile, the Medical Outcomes Study 36-item Short-Form Health Survey, the COOP/WONCA charts, and the EuroQol instrument. Med Care 1997, 35(5):522–537. 10.1097/00005650-199705000-00008
Article CAS PubMed Google Scholar
Konerding U, Moock J, Kohlmann T: The classification systems of the EQ-5D, the HUI II and the SF-6D: what do they have in common? Qual Life Res 2009, 18(9):1249–1261. 10.1007/s11136-009-9525-8
Article PubMed Google Scholar
Dupre ME: Educational differences in age-related patterns of disease: reconsidering the cumulative disadvantage and age-as-leveler hypotheses. J Health Soc Behav 2007, 48(1):1–15. 10.1177/002214650704800101
Article PubMed Google Scholar
Dupre ME: Educational differences in health risks and illness over the life course: a test of cumulative disadvantage theory. Soc Sci Res 2008, 37(4):1253–1266. 10.1016/j.ssresearch.2008.05.007
Article PubMed Google Scholar
Khang YH, Lynch JW, Kaplan GA: Health inequalities in Korea: age- and sex-specific educational differences in the 10 leading causes of death. Int J Epidemiol 2004, 33(2):299–308. 10.1093/ije/dyg244
Article PubMed Google Scholar
Silventoinen K, Lahelma E: Health inequalities by education and age in four Nordic countries, 1986 and 1994. J Epidemiol Community Health 2002, 56(4):253–258. 10.1136/jech.56.4.253
Article PubMed Central PubMed Google Scholar
Ayerbe L, Ayis S, Wolfe CD, Rudd AG: Natural history, predictors and outcomes of depression after stroke: systematic review and meta-analysis. Br J Psychiatry 2013, 202(1):14–21. 10.1192/bjp.bp.111.107664
Article PubMed Google Scholar
Mitchell AJ, Ferguson DW, Gill J, Paul J, Symonds P: Depression and anxiety in long-term cancer survivors compared with spouses and healthy controls: a systematic review and meta-analysis. Lancet Oncol 2013, 14(8):721–732. 10.1016/S1470-2045(13)70244-4
Article PubMed Google Scholar
Roy T, Lloyd CE: Epidemiology of depression and diabetes: a systematic review. J Affect Disord 2012, 142(Suppl):S8–21. 10.1016/S0165-0327(12)70004-6
Article PubMed Google Scholar
Arber S, Cooper H: Gender differences in health in later life: the new paradox? Soc Sci Med 1999, 48: 61–76. 10.1016/S0277-9536(98)00289-5
Article CAS PubMed Google Scholar
Frederiksen H, Hjelmborg J, Mortensen J, McGue M, Vaupel JW, Christensen K: Age trajectories of grip strength: cross-sectional and longitudinal data among 8,342 Danes aged 46 to 102. Ann Epidemiol 2006, 16: 554–562. 10.1016/j.annepidem.2005.10.006
Article PubMed Google Scholar
Leveille SG, Penninx BW, Melzer D, Izmirlian G, Guralnik JM: Sex differences in the prevalence of mobility disability in old age: the dynamics of incidence, recovery, and mortality. J Gerontol B Psychol Sci Soc Sci 2000, 55: S41-S50. 10.1093/geronb/55.1.S41
Article CAS PubMed Google Scholar
Nybo H, Gaist D, Jeune B, McGue M, Vaupel JW, Christensen K: Functional status and self-rated health in 2,262 nonagenarians: The Danish 1905 Cohort Survey. J Am Geriatr Soc 2001, 49: 601–609. 10.1046/j.1532-5415.2001.49121.x
Article CAS PubMed Google Scholar
Helgeson VS: Prototypes and dimensions of masculinity and femininity. Sex Roles 1994, 31: n11–12. 653–682 10.1007/BF01544286
Article Google Scholar

Download references

Acknowledgements

The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement n° 241741. The surveys have been performed with the special assistance of Tom Bowen, Eleftheria Karampli, Teresa Meneu, Paulus Torkki, and Antero Vanhala. The authors would like to express their gratitude to the participating healthcare service provider organisations for their co-operation and support in enabling the study to take place.

Author information

Authors and Affiliations

Trimberg Research Academy, University of Bamberg, Bamberg, D-96045, Germany
Uwe Konerding
Institute of Health Policy & Management, Erasmus University Rotterdam, Rotterdam, 3000, DR, The Netherlands
Sylvia G Elkhuizen
Joint Research Unit in Biomedical Engineering, IIS La Fe- Universitat Politècnica de València, Bulevar sur s/n, Valencia, 46026, Spain
Raquel Faubel
The Balance of Care Group, 39a Cleveland Road, London, N1 3ES, UK
Paul Forte
Department of Industrial Engineering and Management, Aalto University, Espoo, 00076, Aalto, Finland
Tomi Malmström
Department of Health Economics, National School of Public Health, 196 Alexandras Ave, Athens, 115 21, Greece
Elpida Pavi
Department of Psychiatry, Section Medical Psychology & Psychotherapy, Erasmus MC, Rotterdam, 3000, CA, The Netherlands
MF Bas Janssen

Authors

Uwe Konerding
View author publications
You can also search for this author in PubMed Google Scholar
Sylvia G Elkhuizen
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Faubel
View author publications
You can also search for this author in PubMed Google Scholar
Paul Forte
View author publications
You can also search for this author in PubMed Google Scholar
Tomi Malmström
View author publications
You can also search for this author in PubMed Google Scholar
Elpida Pavi
View author publications
You can also search for this author in PubMed Google Scholar
MF Bas Janssen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Uwe Konerding.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

UK has coordinated the survey in all study countries, organised the survey in Germany, elaborated the basic conception of the paper, performed the statistical analyses and written most parts of the paper. SE, RF, PF, TM and EP have organised the survey in their countries, discussed the conception of the paper, critically reviewed the paper and given important contributions to its refinement. MFBJ has given essential support in elaborating the conception of the paper, critically reviewed several earlier versions of the paper and given important contributions to its refinement. All authors read and approved the final manuscript.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Konerding, U., Elkhuizen, S.G., Faubel, R. et al. The validity of the EQ-5D-3L items: an investigation with type 2 diabetes patients from six European countries. Health Qual Life Outcomes 12, 181 (2014). https://doi.org/10.1186/s12955-014-0181-5

Download citation

Received: 20 June 2014
Accepted: 26 November 2014
Published: 05 December 2014
DOI: https://doi.org/10.1186/s12955-014-0181-5

The validity of the EQ-5D-3L items: an investigation with type 2 diabetes patients from six European countries