Introduction

It is estimated by the World Health Organisation (WHO) that, in 2016, diabetes mellitus (DM) was a top seven cause of death [1]. DM is a life-changing disease with a high incidence of micro- and macrovascular complications [2]. These include neuropathy, nephropathy, retinopathy, peripheral vascular disease, coronary heart disease, and stroke. These complications are associated with high morbidity and mortality, which markedly reduce the quality of life (QoL) of the patient [3]. QoL is defined by the WHO as “an individual’s perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards and concerns” [4]. Assessing QoL helps in uncovering the needs of patients, in setting up preventative programmes, and in planning service delivery. Unfortunately, there is still limited evidence on ‘QoL’ of diabetes patients in Sub-Saharan Africa region as compared to the large number of studies done in higher-income countries. In Rwanda, the prevalence of DM has been estimated at 5.1% [5]. A sharp increase in the prevalence on-communicable chronic diseases is anticipated over the next decade owing to urbanization and increasingly sedentary lifestyles. Much evidence has already been generated elsewhere on the impact of diabetes on QoL as well as its associations with socio-demographic characteristics such as gender, age, education, and income; clinical factors such as severity and management of the disease; lifestyle and diet; as well as acute and chronic complications [6,7,8,9,10].

The increasing understanding of the importance of measuring QoL in diabetics is driving research into specific interventions and into QoL management in clinical settings [11]. The Diabetes-39 (D-39) questionnaire is a widely-used self-reporting tool, which has been significantly associated with glycaemic control, adherence to treatment and complications, and has been linked to other associated constructs of QoL [12, 13].

There are a number of existing tools—both generic and disease-specific—for measuring QoL in diabetes [14]. Generic instruments are used in the general population to measure a wide range of domains applicable to a variety of health states, conditions, and diseases. The symptoms disease-specific instruments can include the most important aspects of health, as considered by patients or clinicians [15]. Because disease-specific instruments are more focused, they can be more responsive to changes in health and provide a more detailed and accurate assessment of patients concerns. Among generic instruments for diabetes, the Medical Outcomes Study ‘Short Form (36) Health Survey’ (or SF-36) is commonly-used, but the D-39 is the preferred instrument, as it has good psychometric properties [16]. The D-39 has been translated into multiple languages, has high internal reliability and good responsiveness to change, and has been used in a wide range of interventions [11, 14,15,16,17,18].

To the best of our knowledge, in all of Africa, the D-39 questionnaire has only been translated and adapted into Arabic [19, 20]. There is no validated version of the D-39 questionnaire for the Sub-Saharan African context, which includes Rwanda. The aim of this paper, therefore, was to report on the translation and cultural adaption of the D-39 questionnaire into a local language- Kinyarwanda, and to evaluate its psychometric properties.

Material and methods

The diabetes-39 questionnaire

The instrument used was the D-39 questionnaire, a multidimensional scale developed in United States of America [21], which consists of 39 items grouped in five dimensions: Energy and mobility (15 items), diabetes control (12), social burden (5), anxiety and worry (4), and sexual functioning (3). The D-39 is used to assess the QoL of patients with type 1 and type 2 diabetes; regardless of their treatment regimen [16]. Patient themselves could rate their QoL during the last month for each item. Each item can be answered using a seven-point scale ranging from 0.5 (not affected at all) to 7.5 (extremely affected). Each of the five dimensions were summed up, and the resulting raw scores were transformed into scales ranging from 0 to 100 using a linear transformation: (raw score − minimum value)/(maximum value − minimum value) × 100 according to the developer’s instructions. The questionnaire also includes two supplementary “overall ratings”, in which respondents use the same seven-box Likert scale to evaluate their perceived overall QoL (ranging from “lowest quality” to “highest quality”) and the severity of their diabetes (ranging from “not severe at all” to “extremely severe”). The self-rating overall health status was also evaluated with a single item asking participants to rate their overall health using a five-point Likert scale [22]. Participants responded to this item from ‘’one = very poor’’; ‘’two = poor’’; ‘’three = moderate’’; ‘’four = good’’ and ‘’ five = very good).

Translation and cross-cultural adaptation procedures

Translation of D-39 to Kinyarwanda, back-translation and consensus version in English

Permission to use the D-39 was obtained from the D-39 developers. Then, for the translation of the questionnaire, we used a standard approach [23] coupled with known steps in the process of adaption [24, 25]. The translation was carried out by two native Rwandans. One of the translators possess a university degree in English literature and has twelve years of work experience, while the other is a medical doctor with work experience of seven years, who also has taught English for more than eight years. Both have a certificate of proficiency in English, and they translated the questionnaire into Kinyarwanda independently, following an ‘item intent’ guide. The two translations were synthesised into one, addressing any discrepancies. The Kinyarwanda back-translation of Kinyarwanda questions into English were done by two English native speakers. One of the back-translators has a university degree in International Studies, while the other one has studied linguistics, African studies and computational linguistics. Both back-translators had excellent language skills in Kinyarwanda. They back-translated the Kinyarwanda questions into English, while blinded to the original version. Subsequently, the two backward translations were reconciled into one.

Assessment by expert committee

An expert committee was set up comprised of seven members, including two Rwandan forward translators, one of the back-translators, an epidemiologist, a local bilingual representative, and the two researchers conducting the study. All members of the committee were fluent in English. The aim was to appraise the results of the translations, evaluating their semantic, idiomatic, experiential and conceptual equivalence, and produce a pre-final version. A report was prepared providing an account of these steps, the controversial items, and the ways they were resolved in the consensus translation. The report and the pre-final version were shared with the questionnaire developer, and consent was received.

Pre-testing

The pre-final version was assessed by conducting interviews using a sample of patients (n = 26) with DM. The objective was to evaluate patient comprehension of the translated questions and the answer categories whether respondents could retrieve relevant information from memory, the effort required to answer the degree of interest and social desirability bias. To attain maximum variability of the participants, the interviews were conducted in four different hospitals. After each round, modifications were proposed for some items, based on the interview transcripts and notes. A new iteration of the questionnaire was then prepared and tested in the following round. Lastly, a final version was produced, and a report was made available to the original D-39 developers.

Psychometric properties and statistical analysis

Study participants, data collection and sample size

Evaluating Psychometric Properties of D-39 was part of a cluster-randomised controlled trial (RCT), that aimed at determining the efficacy of an integrated mobile-health and community-health-worker programme for the management of diabetes in primary healthcare in Rwanda. The protocol of this RCT (ClinicalTrials.gov registration: NCT03376607) consists of a mixed-methods study, and has been published elsewhere [26].

For the purpose of conducting the confirmatory factor analysis (CFA), at least 200 participants would be necessary [24, 26,27,28]. The power analysis of the RCT indicated a sample size of 324 participants, which was also adequate for conducting the CFA. Nonetheless, the pre-enrolment screening revealed that a sufficient number of patients living with diabetes could not be recruited in the specific recruitment areas selected for the RCT [26]. Furthermore, logistical challenges impeded the prompt activation of the last two of the nine hospitals (Kabutare and Ruhango).

For these reasons an additional sample was also recruited for the purposes of the evaluation, following the inclusion criteria of the RCT: patients aged 21–80 years and diagnosed with DM at least six months prior to study onset. This supplementary cohort consisted of patients residing in additional zones in the catchment areas of same hospitals, except for the hospitals of Kibungo, Kibuye and Kinihira, where the number of patients was particularly low. The recruitment was carried out between June and December 2019. Exclusion criteria for both samples were illiteracy, severe hearing or visual impairments, severe mental health conditions and pregnancy, or in the post-partum period. The classification of DM type was based on the patients’ clinical records available at the hospitals. As the precise date of the diagnosis of DM was unknown for some participants, only those with at least one year of diagnosis were included so as to limit the effect of the emotional distress linked to recent diagnosis [18, 21]. Reliability analysis and known groups validity To assess internal reliability Cronbach’s α and composite reliability were calculated. Mean differences in total score and in the scales of the model with the closest fit were investigated across socio-demographic and clinical groups with Mann–Whitney U test. Effect sizes were calculated based on z values; r of 0.10, 0.30 and 0.50 were interpreted as small, medium and large effects respectively [29]. For continuous variables, Spearman’s correlation was used to determine which of them were associated with the total score and scales. Correlation coefficients below 0.4 were considered as weak, those between 0.4 and 0.7 as moderate, and those above 0.7 as strong [30, 31].

Construct and discriminant validity analysis To assess construct validity, a CFA was conducted. The five-factor model was fitted to the 39 items of the questionnaire with no cross-loadings or correlated errors terms. The estimation strategy, in line with the standard underlying assumptions of the CFA [32], is reliable, given the relatively different nature of the 5 dimensions (energy and mobility, diabetes control, social burden, anxiety and worry, and sexual functioning) used to group the 39 items under analysis.

The discriminant validity of D-39 was studied by calculating inter-factor correlations. Inter-factor correlations have been included in Table 5, supporting the assumption of a sufficient discriminant validity for all scales. The weighted least square mean and variance adjusted (WLSMV) estimator was used in the CFA. The root mean square error of approximation (RMSEA), the comparative fit index (CFI), and the Tucker Lewis index (TLI) were used to examine the approximate model fit. For RMSEA, values of less than 0.05 were indicative of a close fit and those between 0.05 and 0.08 were interpreted as adequate fit [27, 28]. The 90% confidence intervals of RMSEA were also evaluated, as they should be less than 0.05 for the lower bound and no worse than 0.08 for the upper one [27]. For CFI and TLI, values of 0.90 and above were regarded as acceptable fit [27, 28]. Hu’s and Bentler’s recommendation of raising such cut-offs to 0.95 was also taken into account [27, 33]. The relative χ2 was also calculated and a value of 2 or less was deemed adequate [27]. Finally, although the weighted root mean-square residual (WRMR) was computed and values of 1 or lower were considered a good fit, the experimental nature of this statistic thwarted drawing conclusions based on it [27, 28, 34].

Statistical analyses were performed using Stata version 16. Mplus version 7, and JASP version 16.2.

Ethical and research clearance

The study protocol was developed, and research authorisation was sought from the Rwandan Health and Education Ministries. Ethical approval was obtained from the Rwanda National Ethics Committee and the Ethics Review Panel of the University of Luxembourg.

Results

Cultural adaptation

The expert panel evaluated all translations and reached a consensus (Additional file 1), particularly regarding the items without precise translation into Kinyarwanda. This include the following dimensions: Energy and Mobility (item question 10, 33, 35 and 36), Diabetes Control (item question 15 and 28) and Social Burden (item question 19 and 26). Three rounds of interviews were conducted thereafter, with a total of 26 diabetic participants: 22 women and 4 men, with the median age of 47 (IQR = 39–62), median years of completed education of 6 (IQR = 6–8) and the median years of diabetes 3.5 (IQR = 2–6). Comprehension of the translated items was good, and amendments were made to increase clarity and resolve any ambiguities. Table 1 summarises the consensus translation, and reasons for modification of the item questions of the D-39.

Table 1 Consensus translation and reasons for modification for the D-39 Items instrument

In the first round we used a layout similar to the original English instrument, in which the introductory phrase “During the past month how much was the quality of your life affected by:” was repeated once at the top of each page, and the questions beneath stated only the second part of the sentence (e.g. “your daily medication for your diabetes”). During that round, it was noticed that most of the interviewees could not understand the questions that referred to “how the quality of their life was affected during the past month”. To resolve this issue, a new layout was tested, in which every question was preceded with the introductory phrase, and the question was written in bold (e.g. “During the past month how much was the quality of your life affected by your daily medication for your diabetes”).

According to the developer’s scoring instructions, each item is scored with a 0.5 step depending on where the cross is placed by the participants (e.g., if a mark is placed on the right-side margin of the last box, that should be interpreted as 7.5). Hence, the effective possible scoring range for each item is between 0.5 and 7.5. However, during pretesting, we observed participants having difficulty marking with precision different parts of the box space. Consequently, we adopted a simplification of scoring by considering only the area of the seven boxes (i.e., each item could be scored from 1 to 7, with a step of 1), similarly to the method recommended in the Brazilian adaptation of D-39.

Characteristics of the subjects

Table 2 shows the patients’ characteristics. Two hundred and five participants were included from the RCT, and 122 were recruited additionally for the purposes of the evaluation. A total of 18 patients were excluded as there were marked. The total sample (N = 309). were included in the analysis of the D-39. The mean total score of D-39 for the sample was 51 (SD = 12.7), the median was 52 (IQR = 42–60) and 64% were female. More than half of the participants were married and completed secondary level education. The mean and median years of completed education were 7.6 (SD = 12.7), and 6 (IQR = 5–9) respectively. Eighty-nine percent (88.7%) of the participants reported having type 2 diabetes according to their clinical record. Six (SD = 5.8) and five (IQR = 2–9) were the mean and median years since diagnosis of diabetes in the study population respectively. All the subjects were of Rwandan nationality and spoke Kinyarwanda.

Table 2 Sample characteristics of the study participants

Internal consistency

Table 3 shows that composite reliability for all scales was acceptable (> 0.7). Similarly, Cronbach’s α ranged from 0.72 for “anxiety and worry” to 0.90 for “sexual functioning”, and McDonald’s ω ranged from 0.73 for “anxiety and worry” to 0.90 for “sexual functioning”.

Table 3 Psychometric properties of the Kinyarwanda version of D-39

The standardised factor loadings ranged from 0.39 to 0.67 for the “diabetes control” scale; from 0.54 to 0.75 for the “anxiety and worry” scale; from 0.53 to 0.72 for the “social burden” scale; from 0.90 to 0.91 for the “sexual functioning” scale, and from 0.38 to 0.71 for “energy and mobility” (Table 4).

Table 4 Mean, median and the standardised factor loading of all D-39 items

Construct validity (confirmatory factor analysis)

Construct validity was assessed with CFA based on weighted least square mean and variance adjusted estimator. The five-factor model was fitted to the 39 items of the questionnaire and did not yield an exact fit (χ2 = 1228.6, df = 692, p < 0.0001, relative χ2 = 1.8); however, the fit indices indicated a satisfactory approximate fit (CFI = 0.93, TLI = 0.92, RMSEA = 0.05 (90% CI 0.046–0.055)).

There was sufficient discriminant validity for all scales with the exception of “social burden” and “anxiety and worry” with a reported inter-factor correlation of 0.8 (Table 5).

Table 5 Inter-factor correlations in the five dimensions of D-39

Table 6 demonstrates the relationships between socio-demographic variables, the five dimensions of the D-39, and the two additional “overall ratings” items. Overall, there were significant gender differences in the “diabetes control”, “anxiety and worry” and “energy and mobility” scales (small effect sizes), and “sexual functioning” scale (medium effect size). Small correlations were observed between years of completed education and the “anxiety and worry”, “social burden” and “energy and mobility” scales. The self-rated overall health was also weakly correlated with all D-39 scales but for the “sexual functioning”. Finally, “energy and mobility” differed significantly, albeit with a small effect size, between the two types of diabetes.

Table 6 Relationships between socio-demographic variables and the five dimensions of the D-39

Concerning the two “overall rating” items, the mean perceived quality of life was 3.9 (SD = 1.3) and the mean perceived severity of the disease was 3.9 (SD = 1.4). Question X2 (mean = 3.9, SD = 1.4, median = 4, IQR = 3–5).

Discussion

Our research indicates that the tool we adapted to assess diabetic QoL was the first of its kind, being the only such tool to be tailored specifically with the Rwandan and sub-Sahara Africa cultural contexts in mind. We analysed, made cultural adaptations to, and translated the D-39 Questionnaire into Kinyarwanda. With approximately 20 million speakers, Kinyarwanda is one of the most widely-spoken bantu languages, known to have both grammatical and lexical reduplications and is a national language in Rwanda [35].

There are a number of dialects and word substitutions throughout Rwanda, and so we aimed to account for these so that the Kinyarwanda version could be understood by the majority. Sometimes, different words may be used to express a single concept, and there is precedent for this approach [35]. We aimed to assemble a varied consensus panel, in order to enable a comprehensive assessment of the translated version. The feedback from patients regarding comprehension was particularly useful in achieving consensus on highlighted discrepancies; agreement was reached not only on the wording and formulation of items, but also on the changes which needed to be made. These adaptations were intended to improve respondent understanding, and to increase consistency in responses.

The full scale showed a good internal reliability in line with previous studies [14, 15, 36, 37]. Overall, some items did not load highly in some scales (e.g., diabetes control), while others performed better (e.g., sexual functioning). Discriminant validity was assessed through inter-factor correlations. In this study, there was good discriminant validity for all scales with the exception of the “anxiety and worry” and “social burden” scale (0.80). A similar lower correlation coefficient were observed in the Brazilian study [38] for the domain “anxiety and worry” (0.21) and for the domain “social burden” (0.34) in a study from Jordan [19].

All of the D-39 domains are higher than the composite reliability standard of 0.7 that previously justified as a value to support claims of internal reliability of the instrument [37, 39]. Previous studies have shown that a Cronbach’s alpha coefficient of below 0.70 can undermine the instrument’s internal consistency [40]. For each of the five scales in the 39-item instrument, the Cronbach's coefficient alpha was calculated. The results of the D-39 item and scales tests assumption in this study showed that the internal consistency reliability Cronbach’s alpha in the diabetic population in Rwanda ranged between 0.72 and 0.92. This is similar (or higher) to those obtained in a study population of Jordan [19] (0.80 to 0.92), of the United states [21] (0.82 to 0.93 and 0.81 to 0.93 for Iowa and Carolina studies respectively) and of the Nordic countries [41] (0.83 to 0.92, 0.83 to 0.91 and 0.82 to 0.92 for Finnish, Norwegian and Danish studies respectively). The Cronbach’s alpha of this study differed from a Moroccan study [20] (0.65–0.93), and a Brazilian study [38] (0.58 to 0.85). It is worth mentioning that our sample size of 309 was approximate to the one used in the Jordan study [19] (N = 368) and higher than the studies in Brazil N = 52 and Morocco N = 92 [20, 38].

Despite the fact that a lot of effort was engaged in reaching out the communities to recruit a large sample, there have been significant logistical and systemic barriers, and this was marked as the study limitation. The presentation of diabetes specific QoL may differ between patients depending on the form of the disease, and this should be noted as a possible limitation of this study. For example, Insulin dependent diabetes mellitus patients may present with a higher fear of hypoglycaemia [42]. Although we observed no significant differences between the two forms of the disease, our sample consisted predominantly of patients with type 2 DM. Notwithstanding this, such distinctions between forms of the disease need to be treated cautiously due to the possibility of misclassification and/or atypical disease forms [42,43,44,45]. The treatment type may also cause a separate effect, particularly pertaining to the use of insulin [46]; however, information on insulin use was not collected for this study.

As it was not possible to identify another established and previously validated tool in the Rwandan population—either generic or diabetes-specific—there was a lack of testing for convergent validity, and this may also be considered a limitation of our study. Test–retest reliability was not carried out, and further research is therefore indicated. We were also unable to evaluate the correlation of D-39 with glycated haemoglobin, as in Rwanda this was not routinely measured during the time we conducted this study, and ad hoc measurements for the entire study sample were not possible. Finally, as reliable diagnoses were not easily obtained from patients’ medical records, it was difficult to effectively investigate comorbidity.

The results show the perceptions of patients and their health care providers on gaps in the readiness of the society, patients, and the health care system to ensure improved health related QoL of diabetes patients. A programme to ensure QoL would tackle many challenges that are currently being faced by diabetic patients in Rwanda while at the same time addressing the increasing prevalence of the disease in the country. Such a study would help generate new insight around factors influencing the health related QoL within the Rwandan social, cultural and demographic context [47, 48], thus informing researchers and clinical practice for better health outcomes.

Conclusion

Diabetes-39 is a questionnaire originally developed in English which was adapted and translated into Kinyarwanda for the purposes of this study. Our results confirm that this Kinyarwanda version is a both reliable and valid instrument to measure the health related QoL of diabetic patients, and could help both researchers and clinicians in their practice to improve health outcomes for patient with diabetes in Rwanda and its sub-region. It can provide insights into the factors that impact QoL, in the context of Rwandan values and culture, and also for the purposes of assessment in disease management. Further scale assessment, using larger samples with a more diverse population across sub-Saharan Africa, would strengthen the evidence for the viability of this questionnaire as a health related QoL tool for diabetic patients.