Background

Rheumatoid arthritis (RA) is a chronic inflammatory, systemic disease largely affecting the synovium, which can lead to joint damage and bone destruction. It can affect the heart, lungs and eyes and causes severe disability, psychological distress and increased mortality [1, 2]. Drug management aims to relieve symptoms and to modify the disease process. Despite new biologic treatments which are more efficacious and specific than other drug treatments [3, 4], the patients' improvement in health status and quality of life may depend on their ability and willingness to adhere to all their therapies and undertake self-care activities. Patient education is the process by which patients are prepared for the latter important undertaking [5].

Patient education is recommended as an integral part of rheumatic diseases management [6, 7] and ranges from supplying patient information leaflets to well-structured self-management programmes. However, systematic reviews have suggested that non-targeted education does not deliver long-term effects in RA patients [8, 9]. Consequently, recommendations have been made for patient education to be more patient-centred and tailored to address individuals' educational needs [10]. To plan effective patient-tailored education, clinicians need to assess patients' perceptions of their educational needs.

The Educational Needs Assessment Tool (the ENAT) is a patient-completed questionnaire designed to help patients with rheumatoid arthritis identify their educational needs. It was originally developed with patients and practitioners in the UK and it comprises 39 items grouped into 7 domains, namely: managing pain (6 items), movement (5 items), feelings (4 items), arthritis process (7 items), treatments (7 items), self-help measures (6 items) and support systems (4 items). The items are 5-category rating scales with descriptors: "not at all important", "a little important", "fairly important", "very important" and "extremely important". This gives a total score of educational needs ranging from 0-156. In the early development of the ENAT, two pilot studies were conducted among patients with various forms of arthritis [11]. The first one (with 20 patients) found the ENAT acceptable and easy to use and in the second (with 97 patients) the ENAT demonstrated a good test-retest reliability [11]. The original (English) ENAT was later completed by a sample of 125 patients with RA in the UK and its 7 domains demonstrated a good fit to the Rasch model indicating a good construct validity and supporting the unidimensionality of the scale [12].
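The scoring arithmetic described above can be sketched as follows. This is a minimal illustration, not the authors' scoring procedure: it assumes the five categories are coded 0 to 4 (consistent with the stated 0-156 total for 39 items) and that items are ordered domain by domain, and the function name `score_enat` is hypothetical.

```python
# Domain names and item counts as described in the text (39 items in total).
DOMAIN_ITEM_COUNTS = {
    "managing pain": 6,
    "movement": 5,
    "feelings": 4,
    "arthritis process": 7,
    "treatments": 7,
    "self-help measures": 6,
    "support systems": 4,
}

# Assumption: "not at all important" = 0 ... "extremely important" = 4.
MAX_ITEM_SCORE = 4


def score_enat(responses):
    """Return (domain_scores, total) for a list of 39 item codes (0..4).

    Assumes the responses are ordered domain by domain, in the order above.
    """
    expected = sum(DOMAIN_ITEM_COUNTS.values())
    if len(responses) != expected:
        raise ValueError(f"expected {expected} responses, got {len(responses)}")
    if any(r not in range(MAX_ITEM_SCORE + 1) for r in responses):
        raise ValueError("each response must be an integer from 0 to 4")

    domain_scores = {}
    start = 0
    for domain, n_items in DOMAIN_ITEM_COUNTS.items():
        # Sum the block of items belonging to this domain.
        domain_scores[domain] = sum(responses[start:start + n_items])
        start += n_items
    return domain_scores, sum(domain_scores.values())
```

Under these assumptions, a patient rating every item "extremely important" would score 24 on the six-item pain domain and 156 overall.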

Since patient education is a globally accepted part of treatment in RA and given the increasing need to undertake multinational studies, tools such as the ENAT also need to demonstrate a cross-cultural invariance (i.e. work in a consistent manner across countries) [13-15]. Thus cross-cultural validation of the ENAT would enable comparison of educational needs and data pooling across Europe. The objective of this research was to assess the cross-cultural validity of the ENAT in RA in 7 European countries.

Methods

Patients

This multicentre quantitative survey involved patients from the Netherlands, Finland, Norway, Portugal, Spain, Sweden and the UK. Each country was asked to provide at least 125 patients in order to achieve the minimum sample size recommended for Rasch analysis [16]. Apart from the Netherlands and Sweden, which used random sampling, all centres utilised convenience sampling methods to recruit patients from their rheumatology clinics, wards, day hospitals and databases. The inclusion criteria were age 18 or above, a positive diagnosis of RA and willingness to complete the ENAT. The exclusion criteria were (a) having any other rheumatic disease such as systemic lupus erythematosus, systemic sclerosis, psoriatic arthritis, ankylosing spondylitis or osteoarthritis, (b) inability to read or write and (c) unwillingness to participate. Participation was voluntary and each centre obtained ethical approval from its local ethics committee.

Measure

The original (English) ENAT was translated into the respective language versions using the cross-cultural adaptation process recommended by Beaton et al. [17]. The adaptation process involved 5 steps: 1) initial translation from the original (English) language to the target language; 2) synthesis of the translations; 3) back (blind) translation into the original (English) language; 4) expert committee review, which decides on equivalence between the source and target versions; and 5) test of the pre-final version, in which the "adapted" version is tested with 30 patients. This process was facilitated by an initial "set-up" meeting where the parameters for adaptation were considered and formalised. At this meeting, emphasis was placed upon the importance of the "conceptual" meaning of the statements in the ENAT.

The translated versions of the ENAT were given to patients in their respective countries to complete either as postal surveys or before their clinic consultations. The ENATs were anonymous but contained patients' demographic data such as gender, age, educational background and self-reported disease duration.

Statistical analysis

Rasch analysis was used to assess the construct validity and the cross-cultural invariance of the ENAT [18]. Rasch analysis has been used in rheumatology in the development of new scales [19, 20], to test the psychometric properties of existing scales [21, 22] and for cross-cultural validation of patient outcome measures [14, 23, 24]. Since the Rasch model provides a formal representation of fundamental measurement, fit to the model implies criterion-related construct validity [25], objectivity [26], reliability [27] and statistical sufficiency [28]. Given fit to the Rasch model, ordinal data from an instrument can be converted into an interval scale, and parametric analytical methods can be used [29]. A more detailed description of the Rasch measurement model and its use in rheumatology is given elsewhere [30].

In the current study, data from individual countries were analysed separately and as pooled data, and subjected to Rasch analysis using RUMM2020 software [31]. Since the response format of the ENAT is polytomous, we utilised the Masters Partial Credit Model parameterisation [32]. Both the country-specific datasets and pooled data were assessed for fit, reliability and unidimensionality. The ideal fit values are given at the bottom of the Rasch analysis results table (Table 2). In addition, the residual correlation matrix was examined and items with residual correlations of magnitude 0.3 or greater were considered to display local dependency [33]. These locally dependent items were combined into a "testlet". A testlet is defined as a subset of items that is treated as a measurement unit in test construction, administration and/or scoring [34]. The data, in the form of testlets, were then tested for fit, unidimensionality and invariance. Strict unidimensionality was assessed by analysis of the principal components of the standardised residuals: the loadings on the first component define two sets of items, each set generates an independent person estimate, and the two estimates are compared using the independent t-test method suggested by Smith [35]. The reliability of the ENAT was assessed by the Person Separation Index (PSI), which provides an estimate of the internal consistency of the scale using the logit value for each person as opposed to the raw score used in Cronbach's alpha [36].
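To make the Partial Credit Model concrete, the sketch below computes the category probabilities for a single polytomous item in the model's standard form. It is an illustration only and does not reproduce RUMM2020; the function name and the threshold values used in any call are hypothetical.

```python
import math


def pcm_category_probabilities(theta, thresholds):
    """Category probabilities for one item under the Partial Credit Model.

    theta      -- person location in logits
    thresholds -- item threshold locations in logits (4 for a 5-category item)

    P(X = x) is proportional to exp(sum over k <= x of (theta - threshold_k)),
    with the empty sum for category 0 taken as zero.
    """
    cumulative = [0.0]
    for tau in thresholds:
        cumulative.append(cumulative[-1] + (theta - tau))

    # Subtract the maximum before exponentiating for numerical stability.
    max_c = max(cumulative)
    weights = [math.exp(c - max_c) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]
```

For a five-category ENAT item, `thresholds` would hold four values. When the thresholds are ordered, each category is the most probable one over some range of `theta`; disordered thresholds mean that at least one category never is.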

The test of invariance (DIF analysis) was based on person factors such as gender, age, disease duration, educational background and country. To allow for comparisons, the continuous data (age and disease duration) were converted into categorical variables by splitting them at their country-specific medians. Education was categorised as basic (up to secondary education) or higher (high school - university) education. Item characteristic curves for each testlet were checked for any significant DIF with respect to any person factor. Since there were 7 countries, post-hoc analysis of cross-cultural DIF (Tukey test) was performed to ascertain cross-cultural DIF patterns. Subsequently, the testlets affected by uniform DIF were "split" for DIF in order to adjust for the bias [37]. To avoid type I errors due to multiple testing, the significance level for fit statistics and DIF analyses was Bonferroni-adjusted (i.e. α = 0.05/number of tests carried out) [38].
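The two data-preparation steps described above (the Bonferroni adjustment and the median split) can be sketched as follows. The function names are hypothetical, and placing values equal to the median in the lower group is an assumption, since the text does not say how ties were handled.

```python
from statistics import median


def bonferroni_alpha(n_tests, alpha=0.05):
    """Per-test significance level after Bonferroni adjustment."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


def median_split(values):
    """Dichotomise a continuous variable at its median.

    Assumption: values equal to the median are placed in the lower group.
    """
    cut = median(values)
    return ["low" if v <= cut else "high" for v in values]
```

With 39 item-level tests, `bonferroni_alpha(39)` gives approximately 0.0013. For a country-specific split, `median_split` would be applied to each country's ages or disease durations separately.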

Results

Sample characteristics

The sample of 1042 comprised 135 patients from Finland, 165 from the Netherlands, 137 from Norway, 123 from Portugal, 230 from Spain, 125 from Sweden and 125 from the UK. Their country-specific gender distribution, median age, median disease duration and educational background are summarised in Table 1.

Table 1 Sample characteristics by country

Individual country fit

Initially, the data from each country were fitted to the Rasch model separately (Table 2). Local dependency was observed within each domain, and so the 39 items were grouped into 7 domains (testlets) and re-analysed. Fit to the Rasch model was then satisfied in each country, including unidimensionality, with the exception of Portugal, where marginal multidimensionality was observed. An analysis of differential item functioning showed absence of bias for age, gender, educational level and disease duration across all countries except Spain, where two items (movement and feelings) showed bias for gender and disease duration.

Table 2 Rasch analysis results

Pooled data

Initial Rasch analysis of the 39 items from the pooled data revealed significant deviations from the expectations of the Rasch model (χ² = 977.055, DF = 351, p < 0.001) and a high reliability (PSI = 0.976) (Table 2). Item fit residuals had a mean of 1.018 (SD = 4.014) and person fit residuals had a mean of 0.552 (SD = 2.554). Nine items displayed fit residual values outside the expected ± 2.5 range and a probability less than the Bonferroni-adjusted α-value of 0.0013, indicating significant deviation from the model.

The patterns of the items' thresholds were examined and 5/39 items displayed borderline disordering. These items were: 1) Using acupuncture, ultrasound or hydrotherapy (pain) 2) Devices which would help me do practical things (Movement) 3) Why I am feeling down or depressed (Feelings) 4) How arthritis might affect my children or relatives (Arthritis process) 5) Times when I should call the doctor or nurse (Self-help measures). Examination of the category probability curves for the above items indicated the need to amalgamate two categories "a little important" and "fairly important" for all the five items. A trial rescoring improved the threshold ordering but the overall fit worsened, therefore the category structures of these items were not re-scored.

Local dependency and unidimensionality

Examination of the person-item residual correlation matrix revealed that most domain-specific items were locally dependent and this was significantly affecting the fit to the model. The items of each domain were then amalgamated into a testlet (each testlet corresponding to one ENAT domain) and the ENAT was re-analysed as a seven-testlet scale, which showed an acceptable fit to the Rasch model (χ² = 71.909; DF = 63; p = 0.207) (Table 2 and Table 3). The strict unidimensionality test revealed the proportion of significant t-tests to be 0.048 (95%CI = 0.034 - 0.063), confirming its unidimensionality. The reliability of the final ENAT was excellent (PSI = 0.951). All analyses were thereafter based on the domain (testlet) scores. The person-item threshold distribution indicated that only a small proportion of the sample was above the range of the measurement, indicating that the ENAT captured the educational needs of patients well (Figure 1).
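The proportion of significant t-tests and its confidence interval can be illustrated with a normal-approximation binomial interval. This is a sketch under that assumption; the interval method used by the analysis software may differ, and the counts passed to the function in any example are hypothetical rather than taken from the study.

```python
import math


def proportion_with_ci(n_significant, n_total, z=1.96):
    """Proportion of significant t-tests with a normal-approximation CI.

    Returns (proportion, lower, upper), with bounds clipped to [0, 1].
    """
    if n_total <= 0 or not 0 <= n_significant <= n_total:
        raise ValueError("need 0 <= n_significant <= n_total and n_total > 0")
    p = n_significant / n_total
    half_width = z * math.sqrt(p * (1 - p) / n_total)
    return p, max(0.0, p - half_width), min(1.0, p + half_width)
```

A common reading of this test is that unidimensionality is supported when the lower bound of the interval does not exceed 0.05, which is the case for the interval reported above (0.034 - 0.063).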

Table 3 Item (testlet) fit
Figure 1 Person-item threshold distribution.

Differential item functioning

Following fit to the Rasch model, DIF analysis on the pooled data revealed DIF by gender, age, disease duration, educational background and by country. However, apart from domain 3 (feelings) which displayed a non-uniform DIF by gender, all the DIF was uniform. Post-hoc analyses revealed that the cross-cultural DIF was the most significant contributing factor.

The Dutch data alone contributed to DIF by country on 4 testlets (pain, feelings, treatments and support). Splitting for DIF by country resolved all the cross-cultural DIF and most other sources of DIF. Testlet 2 (movement) continued to display uniform DIF by gender and by disease duration, while testlet 6 (self-help measures) had borderline DIF by educational background.

Calibration of the ENAT into an interval scale

Following the adjustment for the cross-cultural DIF, the ENAT maintained a good fit to the Rasch model (χ² = 138.311, DF = 162, p = 0.214) and an excellent reliability (PSI = 0.951). The ENAT domain raw scores were mapped against the corresponding Rasch transformed scores (in logits) and were linearly transformed to calibrate an interval-level scale of the same range (Table 4). A separate DIF-adjusted table was calibrated for the Dutch cohort (Table 5).
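The linear transformation step can be sketched as below: logit estimates are rescaled so that the lowest and highest estimates map to the ends of the raw-score range. The function name and the logit values in the usage note are hypothetical; the published conversion values are those in Tables 4 and 5.

```python
def rescale_logits(logits, new_min, new_max):
    """Linearly map logit estimates onto the interval [new_min, new_max].

    The smallest logit maps to new_min and the largest to new_max; the
    spacing between intermediate values is preserved up to a constant factor.
    """
    lo, hi = min(logits), max(logits)
    if hi == lo:
        raise ValueError("logits must span a non-zero range")
    scale = (new_max - new_min) / (hi - lo)
    return [new_min + (x - lo) * scale for x in logits]
```

For example, `rescale_logits([-3.0, 0.0, 3.0], 0, 24)` maps three illustrative logit values onto the 0 to 24 range of the six-item pain domain (assuming items coded 0 to 4), returning `[0.0, 12.0, 24.0]`.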

Table 4 Conversion of raw ENAT domain scores to Rasch-transformed values.
Table 5 Conversion of raw ENAT domain scores to Rasch-transformed values for the Dutch dataset.

Discussion

This study has demonstrated that the ENAT satisfies Rasch model expectations in seven countries, with the possible exception of marginal multidimensionality in Portugal. The ENAT has been shown to be largely invariant by age, gender, educational level and disease duration within each country. When data were pooled, some DIF manifested, but it was largely driven by country-specific DIF. When examined, most DIF was shown to cancel out, but country DIF remained. Consequently, when data are pooled across countries, adjustment must be made to accommodate the potential bias. Following such an adjustment, an interval scale transformation can be made, giving a raw-score to interval metric conversion table for general use.

A number of issues have arisen from this work. The breach of the local independence assumption has been shown to drive misfit to the Rasch model. This can be accommodated through the testlet design. This deals with the perennial problem of the tension between the clinimetric needs and the psychometric requirements of a measure when items that have clinical relevance are removed from the scale [39-41]. Leaving the items in the scale is advantageous in that the items may inform practitioners about educational needs at the finer level, while grouping them into testlets effectively accounts for the local dependence so satisfying the psychometric requirements [42].

The second issue is the implication of the lack of invariance across countries for pooling data in international studies. DIF analysis revealed that the cross-cultural DIF was responsible for most of the non-invariance in the data. Cross-cultural adjustment involved splitting the DIF-affected items by country, producing a scale with both etic (culturally-general) and emic (culturally-specific) items. This permits the scale to be culturally relevant while permitting comparisons across cultures and languages on the basis of the common etic items. Once the cross-cultural invariance was adjusted, the overall DIF improved including resolution of the non-uniform DIF by gender. When cross-cultural comparisons are to be performed, a separate DIF-adjusted conversion table for the emic data will need to be used. However, it should be stated that the magnitude of the observed DIF was only marginal, in that the maximum difference of location across countries within any educational need level (class interval) was only 0.18 logits. This suggests that the sample size of the pooled data was driving the statistical significance, and that the observed magnitude of DIF is below the level considered to be associated with measurement error [43].

As such, the ENAT can be used as a routine clinical checklist or as a research (or an audit) tool. In the former use, the clinician can use the ENAT as a simple checklist to assess perceived educational needs of patients before a clinic consultation. In this context, the perceived priority needs can be determined by looking at the completed ENAT without the need for scoring. In the latter use (measurement context) where the underlying latent construct of "educational need" is important, the Rasch-transformed scores give a common metric across all domains for comparative purposes. All domains contribute to measuring a single construct; thus adding up the domain scores is a sufficient statistic for estimating patients' educational needs (range = 0 to 156).

While the ENAT provides patients' perceived educational needs, the health professional may know more about current treatment options and guidelines, such as treat-to-target recommendations [44], which are beneficial to patients. Having assessed the patient's perceived educational priorities using the ENAT, the health professional can then discuss the needs arising with the patient, and provide more information about the available treatment goals and options in order to facilitate patient participation in their management. The main limitation of this study is that the ENAT is a self-completed questionnaire and consequently it did not reach the population of patients who cannot read and write. Further investigation of the marginal multidimensionality in Portugal would also be required. The ENAT can be obtained by contacting the corresponding author.

Conclusion

While patient education is recommended as an integral part of rheumatic diseases management [6, 7], knowing what aspect of education may be required by a patient at any specific point of their treatment is an essential prerequisite to ensure that such needs are met. The ENAT offers a simple tool to help professionals judge what is required. It satisfied the strictest standards of measurement in all countries but Portugal, where marginal multidimensionality was observed, and can offer interval scaling when required. The scale can be used with confidence within the countries studied, but if data are to be pooled, then this will require adjustment within the framework of the Rasch measurement model, so providing the ability to compare educational needs across Europe.