Background

Critical steps in improving the quality of health care include evidence-based, patient-centered, and systems-oriented care [1]. Incorporating this approach has changed medicine in all areas, from everyday practice, where shared decision making has become essential and recommended by guidelines [2] to clinical trials where patients are involved in Steering Committees [3]. Major treatment goals for patients with cardiovascular disease include a focus on health status which includes symptom burden, functional limitations, and health-related quality of life (HRQL) [4, 5]. Health status typically is evaluated with patient-reported outcome questionnaires through which patients may express feelings about their health condition without adulteration or interpretation by clinicians [6].

Both generic and disease-specific patient-reported outcome questionnaires can be used to evaluate and compare the health status. Generic instruments, such as the SF-36 Health Survey (SF-36) [7, 8], are designed for use in healthy populations, different disease populations as well as between healthy populations and populations with various diseases. On the other hand, disease-specific instruments, such as the Seattle Angina Questionnaire [9] and the Minnesota Living with Heart Failure questionnaire [10] provide more relevant disease-specific information than generic instruments. However, there is a need for core disease-specific questionnaires which permit between-diagnosis comparisons within a given disease with a single questionnaire, for example, the MacNew [11] and, more recently, the HeartQoL [12, 13] in patients with ischemic heart disease (IHD).

The HeartQoL questionnaire was developed with the objective of assessing HRQL using a single instrument in patients with angina pectoris (AP), myocardial infarction (MI), or ischemic heart failure (IHF). The HeartQoL Project was conducted between 2002 and 2011 in European and English speaking regions in two phases: first, a cross-sectional survey to identify items for questionnaire inclusion on the basis of three previously validated instruments [12], the Seattle Angina Questionnaire, [9] the MacNew Heart Disease Health-related Quality of Life Questionnaire (MacNew) [11], and the Minnesota Living with Heart Failure questionnaire [10], and a second phase to evaluate the HeartQoL questionnaire’s psychometric properties [13]. The result was a bifactorial, 14-item questionnaire that was reliable, valid, and responsive in patients with AP, MI, or IHF [13]. These results have been recently confirmed in German [14] and Chinese [15] and also in other clinical contexts, such as atrial fibrillation [16], implantable cardioverter defibrillator recipients [17], and valve surgery [18].

To optimize their use as both research and clinical tools, patient-reported HRQL questionnaires need to meet basic attributes such as reliability and validity [19]. Reliability is considered the degree to which a questionnaire is free from random error with internal consistency reliability the intercorrelations of items at one point in time. Validity is the degree to which the instrument really measures what it purports to measure and includes convergent validity (the degree to which two theoretically related measures are in fact related), divergent validity (the degree to which two dissimilar measures can be differentiated) and discriminative validity (does the questionnaire adequately discriminate between groups that should or should not differ). The purpose of this study was to examine the factor structure, reliability, convergent, divergent, and discriminative validity of the Italian version of the HeartQoL questionnaire in patients with AP, MI, or IHF.

Methods

Patients

Patients diagnosed with IHD [n = 472] were recruited at five sites in Italy (Florence, Veruno, Turin, Udine, and Naples) from 2015 to 2017. The eligibility criteria were extracted from the original cross-sectional survey phase of the HeartQoL Project [12] and included:

(a) currently being treated for AP [20] with an objective measure of IHD (e.g., previous MI, exercise testing, echocardiogram, nuclear cardiac imaging, or coronary angiography); or

(b) had experienced a documented MI between one to six months previously; or

(c) currently being treated for IHF with evidence of left ventricular dysfunction (ejection fraction ≤ 40% by invasive or non-invasive testing) and an objective measure of IHD (e.g., exercise testing, echocardiogram, nuclear cardiac imaging, or coronary angiography).

The patients were ≥ 18 years, able to complete the self-administered battery of HRQL questionnaires in Italian, without hospitalization in the last 6 weeks, without serious psychiatric disorder as well as no current substance abuse as identified by the referring physician. The Ethics Committee in each study site approved the project and informed consent was obtained from all patients.

Patient-reported questionnaires

Sociodemographic and clinical variables

Demographic characteristics of age, sex, family status, education, employment, and cardiovascular risk factors (diabetes, hypercholesterolemia, hypertension, physical inactivity, and smoking) were self-reported by patients.

HeartQoL

The Italian version of the disease-specific HeartQoL was used in the HeartQoL Project [12] and in this study to measure HRQL. The HeartQoL is a bi-dimensional instrument with a 10-item physical subscale and a 4-item emotional subscale that, when combined, provides the 14-item global scale [12]. The responders answer the question of how much they are bothered by each of the 14 HeartQoL questionnaire items on a 4-point scale ranging from “bothered a lot” (= 0) to “not bothered” (= 3) with higher scores indicating better HRQL. Both the global scale and the subscales were calculated as the mean of the scored items.

Hospital anxiety and depression scale (HADS)

The HADS is a 14-item self-assessment questionnaire including an anxiety and a depression subscale that together generate a common score for general distress; the Italian version was used in this study [21, 22]. The items are answered on a scale ranging from 0 to 3 with higher scores representing higher levels of anxiety or depression. A subscale cutoff score ≥ 8 identifies patients with symptoms of depression or anxiety [22] and a global cutoff score ≥ 12 indicates “general” distress.

MacNew

The MacNew Questionnaire is designed to assess patient’s feelings about how IHD affects daily functioning [11]. It contains 27 items with a global HRQL score and physical limitation, emotional and social function subscales, scored from 1 (low HRQL) to 7 (high HRQL) [23, 24]. Using forward–backward translation, the MacNew questionnaire had been translated into Italian and validated as part of the HeartQoL Project [25].

Statistical analyses

Means and standard deviation (SD) are reported for continuous variables and frequencies for categorical variables. Categorical sociodemographic and clinical variables were analyzed with Pearson’s Chi-square test while continuous variables and scale means were examined with analyses of co-variance (ANCOVA) with post hoc Bonferroni correction adjusting for age, sex, risk factors, and disease severity within diagnosis. A two-sided p value of < 0.05 was applied for statistical significance. The data were analyzed using IBM SPSS Statistics 24 and STATA 14.

The evaluation of the psychometric properties of the Italian HeartQoL followed criteria recommended by the Scientific Advisory Committee [19]. Floor and ceiling effects of the HeartQoL are considered moderate when > 15% patients report the lowest score (= 0; floor) or the highest score (= 3; ceiling) [26]. A confirmatory Mokken scale analysis, using Loevinger’s coefficient H of ≥ 0.50 considered a “strong,” 0.49–0.40 a “moderate,” and 0.39–0.30 a “weak” scale [27], was performed to assess whether the established two-factor structure of HeartQoL [12] could be confirmed in the Italian HeartQoL. Internal consistency reliability was determined using Cronbach’s alpha with values of ≥ 0.70 acceptable for group and ≥ 0.90 for individual comparisons [28]. The strength of the correlations between HeartQoL and related (convergent) as well as unrelated (divergent) constructs was assessed with Pearson coefficients (r < 0.10 = no correlation, r = 0.10–0.29 = low correlation, r = 0.30–0.49 = moderate correlation, r ≥ 0.50 = high correlation) with Steiger’s test used for differences in correlations [29]. Discriminative validity was tested for using the “known-groups” approach [26] with paired t tests used to compare HeartQoL scores with two-level variables and one-way ANOVA used to compare three-level variables. The following discriminative validity hypotheses test whether significantly higher HeartQoL scores are reported:

  1. 1.

    by younger than by middle-age or older patients,

  2. 2.

    by male patients than by female patients,

  3. 3.

    by patients with MI than by patients with either AP or IHF,

  4. 4.

    by patients reporting higher MacNew HRQL scores (top 33% of actual scores) than by lower scores (bottom 33% of actual scores),

  5. 5.

    by patients without HADS anxiety (score < 8) than by patients with HADS anxiety (score ≥ 8)

  6. 6.

    by patients without HADS depression (score < 8) than by patients with HADS depression (score ≥ 8) and

  7. 7.

    by patients without HADS general distress (score < 12) than by patients with HADS general distress (score ≥ 12).

Results

Patient characteristics (Table 1)

Table 1 Patient characteristics: sociodemographic, risk factor, and intervention variables in patients with angina (AP), myocardial infarction (MI), or ischemic heart failure (IHF)

A total of 472 patients with documented AP (N = 183, 38.8%), previous MI (N = 167, 35.4%) or IHF (N = 122, 25.8%) were eligible for the validation of the Italian HeartQoL Questionnaire. The mean age of the total cohort was 65.4 ± 12.5 years. The majority of patients (74.6%) were male, most were married (76.7%), 52% had less than high school education, and 54.9% were retired. Hypertension was the most prevalent risk factor (53.8%) followed by hypercholesterolemia (50.6%) and physical inactivity (47.1%) with differences in risk factors by diagnosis detailed in Table 1.

Patient-reported questionnaire score distributions (Table 2)

Table 2 Scores (mean ± standard deviation) on HeartQoL, Hospital Anxiety, and Depression Scale (HADS) and MacNew and percentage HADS anxious, depressed, and distressed in the total cohort and in patients with angina (AP), myocardial infarction (MI), or ischemic heart failure (IHF)

HeartQoL Mean HeartQoL HRQL scores for the total cohort were 2.0 ± 0.7 on the global scale, 1.9 ± 0.8 on the physical subscale and 2.2 ± 0.8 on the emotional subscale with ANCOVA results demonstrating significantly higher global scale and both physical and emotional score differences in patients with MI than patients with either AP or IHF.

MacNew The mean MacNew global HRQL score was 5.2 ± 1.1, the mean physical subscale score was 5.2 ± 1.2 and the mean emotional subscale score was 5.1 ± 1.3 in the total group. Using ANCOVA, there were significant score differences between patients with MI and those with either AP or IHF on the global scale and both physical and emotional subscales with patients with MI reporting higher HRQL than patients with either AP or IHF.

Hospital Anxiety and Depression Scale For the total cohort, the mean HADS anxiety score was 5.2 ± 3.9 with anxiety scores ≥ 8 reported by 27.1% of the total cohort. The mean HADS depression score in the total cohort was 5.5 ± 4.2 with depression scores ≥ 8 reported by 27.9% of the total cohort. The mean HADS general distress score in the total cohort was 10.7 ± 7.4 with general distress scores ≥ 12 reported by 41.1% of the total cohort. Although patients with IHF reported significantly higher depression scores than patients with MI, the anxiety scores, the general distress scores and the proportions of patients with HADS anxiety or depression scores ≥ 8 or general distress scores ≥ 12 were not significantly different by diagnosis.

Mokken scale factor structure (Table  3 )

Table 3 Mokken scale analysis in in the total cohort for the HeartQoL items with Loevinger’s Hi (item) and H (scale) coefficients (strong, ≥ 0.50, moderate, 0.49 to 0.40)

With Mokken analysis, the HeartQoL H coefficients were rated as “strong” on the global scale (0.59) and on both the physical (0.67) and the emotional (0.68) subscales confirming the original HeartQoL two-factor structure. Of the HeartQoL Hi coefficients, 85.8% of the items on the global scale (12 of 14 items; Hi range, 0.47–0.66) and 100% of the items on both the physical subscale (10 items; Hi range, 0.61 to 0.70) and the emotional subscale (4 items; Hi range, 0.63 to 0.72) were rated as “strong”.

HeartQoL floor and ceiling effects (Table  4 )

Table 4 Italian HeartQoL floor and ceiling effects and internal consistency reliability (Cronbach’s alpha) in the total cohort and in patients with angina (AP), myocardial infarction (MI), or ischemic heart failure (IHF)

Floor effects on the HeartQoL global scale and both subscales were always ≤ 1.7% in the total group and in each diagnosis.

Ceiling effects on the HeartQoL global scale and the physical subscale were always ≤ 10.8% in the total group and in each diagnosis with moderate ceiling effects on the emotional subscale as high as 16.5% in the total group and 22.2% in patients with MI.

Psychometric properties of the HeartQoL (Tables  2, 4, 5, & 6)

Reliability (Table 4) Internal consistency reliability was confirmed with Cronbach’s alpha ranging from 0.87 to 0.94 in the total cohort, from 0.88 to 0.94 in patients with AP, from 0.86 to 0.93 in patients with MI and from 0.84 to 0.95 in patients with IHF.

Convergent validity (Table 5) Convergent validity was confirmed in the total cohort and each diagnosis with correlations between the HeartQoL and MacNew physical subscales ranging from 0.71 to 0.75 and from 0.79 to 0.84 between the HeartQoL and MacNew emotional subscales. Correlations between dissimilar scales of both instruments were significantly lower according to Steiger’s test for comparing Pearson correlations confirming divergent validity.

Table 5 Convergent validity of the Italian HeartQoL subscales with the MacNew subscales in the total cohort and in patients with angina pectoris (AP), myocardial infarction (MI), or ischemic heart failure (IHF)

Discriminative validity (Tables 2, 6) Discriminative validity for the HeartQoL was fully confirmed for diagnosis, MacNew global HRQL scores, HADS anxiety, depression, and general distress, largely confirmed for sex, and essentially not confirmed for age.

Table 6 Discriminative validity of the Italian HeartQoL in the total cohort and in patients with angina pectoris (AP), myocardial infarction (MI), or ischemic heart failure (IHF)

Diagnosis (Table 2): Patients with MI reported significantly higher HeartQoL global scale and both physical and emotional subscale scores than patients with either AP or IHF.

MacNew (Table 6): Patients reporting higher MacNew global HRQL scores (top 33% of actual scores reported significantly higher HeartQoL global and both physical and emotional subscale scores than patients reporting lower MacNew global HRQL scores (bottom 33% of actual scores).

HADS anxiety (Table 6): Significantly higher HeartQoL global scale and both physical and emotional subscale scores were observed in patients who were not anxious than in patients who were anxious;

HADS depression (Table 6): Patients who were not depressed reported significantly higher HeartQoL global scale and both physical and emotional subscale scores than reported by patients who were depressed (p < 0.001);

HADS distress (Table 6): Significantly higher HeartQoL global scale, physical, and emotional subscale scores were observed in patients who were not distressed than in patients who were distressed;

Sex (Table 6): In the total group, female patients reported significantly higher HeartQoL global scale and both physical and emotional subscale scores than male patients; HeartQoL global scale and emotional subscale scores were significantly higher in females than males in patients with AP and in patients with IHF; in patients with MI, HeartQoL global scale and physical subscale scores were significantly higher in females than males.

Age (Table 6): The only significant HeartQoL score difference by age was a higher physical HeartQoL subscale score in middle-aged patients compared to elderly patients in the total group.

Discussion

The Italian-language version of the 14-item HeartQoL questionnaire demonstrated adequate internal consistency, convergent, divergent and discriminative validity and therefore may be recommended for the use in clinical practice and research to assess and compare HRQL in Italian-speaking IHD patients with AP, MI or IHF. The psychometric properties of the Italian version of the HeartQoL were evaluated on a sample of 472 patients with either AP, MI or IHF from five Italian centres in North-East, North-West, Central, and Southern Italy and therefore largely representative of the demographic, social, cultural, and economic characteristics of the Italian population. Further, these psychometric properties are consistent with the original validation study in patients with AP, MI or IHF, [13] the English HeartQoL version based on patients with AP or MI in the USA, [30] the German HeartQoL version in patients with AP, MI or IHF, [14] The Iranian HeartQoL version in patients with MI, [31] the EuroAspire IV study in chronic coronary syndrome patients [32] and also with validation studies in patients with atrial fibrillation, [16] implantable cardioverter defibrillator recipients [17] or following valve surgery [18]. In addition, the original two-factor structure [12], determined with Mokken scale analysis [27], was confirmed with strong physical (Hi 0.67) and emotional (Hi 0.68) domains being comparable with the original [13] and more recent validation studies. [14, 16, 17, 30, 32].

With higher HRQL scores (range 0–3) indicating better HRQL, the mean HRQL scores in the Italian HeartQoL total group were 2.0 ± 0.07 on the global scale, 1.9 ± 0.08 on the physical subscale and 2.1 ± 0.08 on the emotional subscale and all scores reported by patients with MI indicated significantly better HRQL than reported by patients with either AP or IHF. With moderate floor and ceiling effects present when > 15% patients report the lowest score (= 0; floor) or the highest score (= 3; ceiling), [26] floor and ceiling effects on the global scale and physical subscale in the total group and each diagnosis were minimal indicating detection of both deterioration and improvement. However, with the emotional subscale, although all floor effects and ceiling effects in patients with AP or IHF were minimal, the ceiling effects were moderate in the total group and in patients with MI indicating limited detection of improvement. As this outcome is consistent with the original HeartQoL study [13] and other validation studies [14, 16,17,18, 30, 32] assessing improvement in HeartQoL emotional HRQL may be more challenging than either global or physical HRQL.

The Italian HeartQoL demonstrated high internal consistency reliability for each subscale, exceeding the recommended criterion of 0.70 for group HRQL comparisons with Cronbach’s α ≥ 0.92 on both the global scale and physical subscale and ≥ 0.84 on the emotional subscale in the total group and each of the three diagnoses. This is consistent with the previous validation studies [14, 16, 17, 30, 32] indicating that the items captured in the HeartQoL reflect the same constructs. Convergent validity refers to how closely scales measuring similar constructs are correlated, and was demonstrated across diagnostic groups with strong correlations (r > 0.70) between the physical and emotional subscales of HeartQoL and corresponding subscales of MacNew; with significantly lower correlations between dissimilar constructs, divergent validity was also confirmed.

Discriminative validity was determined using the ‘known groups’ approach [26] with the expectation that HRQL scores would distinguish between pre-identified groups. Of the total of 84 discriminative validity calculations in the total cohort and each diagnosis, 70 (83%) were fully confirmed; of the 14 calculations that were not significant, 11 (79%) were on the age and 3 (21%) on the sex variables. The discriminative validity of the Italian HeartQoL was fully confirmed for diagnosis as patients with MI reported significantly higher HeartQoL scores than patients with either AP or IHF which is largely consistent with both the original validation study [13] and the German validation study [14] as well as in the English HeartQoL where patients with MI reported higher HRQL scores than did patients with AP. [30] Discriminative validity was also fully confirmed with HADS anxiety/depression scales in the total group and in each diagnosis which is consistent for HADS anxiety and depression in the original study [13] and previously published HeartQoL studies. [14, 16,17,18, 30, 32] As reported in the German HeartQoL validation publication [14] the Italian HeartQoL HRQL scores were significantly higher in patients not reporting HADS distress compared to those reporting distress. Discriminative validity was largely confirmed for sex in this study with females reporting significantly better HRQL than males but this is not consistent with the EuroAspire IV study in chronic coronary syndrome patients [32], the German HeartQoL observations in patients with AP, MI or IHF [14] or the studies in Danish patients with atrial fibrillation [16] or following valve surgery [18] where males reported higher HRQL, although not always significant, than females. Finally, except for significantly higher physical HeartQoL scores in middle-aged patients than in elderly patients, discriminative validity was not confirmed for age in the present study. There was a trend for lower HRQL to be reported by elderly Italian patients which is largely consistent with the EuroAspire IV study [32] but not with the German HeartQoL observations [14] where the hypothesized lower HRQL scores in older patients was not consistently met.

Limitations and strengths of the study

In every questionnaire with a relatively small number of items per scale, the amplitude of assessment of differences or changes in HRQL may be limited. Furthermore, examination of test–retest reproducibility was not possible with the cross-sectional design of the study which is also the major limitation of the present validation study not allowing for the determination of responsiveness. Future studies will be needed to determine the evaluative validity of the Italian HeartQoL questionnaire. The strength of the study lies in the demonstration of adequate key psychometric attributes of internal consistency reliability and convergent, divergent, and discriminative validity in Italian-speaking patients with ischemic heart disease.

Conclusion

The Italian version of the HeartQoL questionnaire demonstrated satisfactory psychometric characteristics showing excellent internal consistency reliability and adequate convergent, divergent, and discriminative validity in Italian speaking patients with IHD and its three major diagnoses (AP, MI, and IHF). The Italian HeartQoL can be recommended to clinicians and researchers to estimate and compare the impact of IHD on patients’ HRQL.