figure b

Introduction

Hypoglycaemia impacts the health and wellbeing of people with diabetes [1,2,3], which can be subjectively quantified using patient-reported outcome measures (PROMs). Valid and reliable PROMs are vital for accurately assessing real-life health impacts and the effectiveness of care for people with diabetes. PROMs can be developed further to evaluate the cost-effectiveness of healthcare interventions (i.e. preference-based scoring methods). A recent review has questioned whether existing hypoglycaemia-specific PROMs can capture the full impact of hypoglycaemia on health-related quality of life (HRQoL) [4]. HRQoL is increasingly recognised as an important outcome in healthcare settings and in clinical trials [5].

Carlton et al highlighted gaps in evidence for psychometric performance, especially among legacy measures [4]. In particular, there was insufficient supporting evidence for the content and structural validity of the reviewed PROMs to assess the quality of life (QoL) impacts of hypoglycaemia, raising questions over their suitability for use for this purpose. While a recent hypoglycaemia-specific PROM assessing the impact on QoL has been published [6], this was a rapid adaptation of an existing diabetes-specific measure. As acknowledged by the authors, this measure did not follow full developmental rigour (e.g. supporting qualitative content validity interviews [7]) and was not developed for use in health economic evaluations or cost-effectiveness analysis as a result of clinical trials [6].

The aim of this mixed-methods, multi-stage project was to develop an HRQoL PROM for use in hypoglycaemia that had demonstrable validity and reliability and enabled patient experiences to be incorporated into clinical shared decision-making and as a clinical trial endpoint. The terms HRQoL and QoL are often used interchangeably and/or not well-defined. We define HRQoL as ‘a multidimensional concept that includes the physical, psychological and social functioning associated with an illness or its treatment’ [8]. An additional aim was for the PROM to be amenable to adaptation for use in cost-effectiveness analyses of new healthcare technologies for hypoglycaemia (via preference-based scoring) [9]. The need for this PROM was endorsed by an international collaboration of clinicians, scientists, industry partners and people with diabetes via the Hypoglycaemia REdefining SOLutions for better liVEs (Hypo-RESOLVE) project [10].

The hypoglycaemia health-related QoL measure (Hypo-RESOLVE QoL) was developed following best practice (Fig. 1) [7, 11], involving a series of studies with people with diabetes. This included qualitative work eliciting a comprehensive understanding of hypoglycaemia-related HRQoL impacts on people with diabetes, further qualitative work refining the draft PROM and an assessment of its psychometric performance in a large sample of individuals with diabetes. The work had three governance groups involved at key stages, including: an Advisory Group of researchers and stakeholders external to the Hypo-RESOLVE Consortium; a patient advisory committee (PAC) comprising adults living with type 1 or type 2 diabetes and representatives from the IDF and JDRF drawn from Hypo-RESOLVE patient advisors (https://hypo-resolve.eu/network/advisors); and a Scientific Group comprising PROM developers, health economists, clinical experts and stakeholder representatives from the wider Hypo-RESOLVE Consortium.

Fig. 1
figure 1

Overview of Hypo-RESOLVE QoL development

Methods

The development of the Hypo-RESOLVE QoL is summarised in Fig. 1 and an associated protocol [12]. We used mixed-methods, as follows: concept elicitation interviews (to generate an HRQoL framework and draft PROM content); cognitive debriefing interviews (to validate and refine content); and a large quantitative survey (to assess psychometric performance). The final version of the Hypo-RESOLVE QoL was informed by psychometric analyses and complementary evidence, including a translatability assessment and stakeholder engagement. Ethics approval was obtained from the UK National Health Service (NHS; REC reference: 20/NI/0048) and all participants gave informed consent.

Stage 1: concept elicitation

Items (questions) for the draft PROM were informed by an HRQoL framework elicited through semi-structured interviews with individuals with diabetes who reported at least one self-defined hypoglycaemic event (‘hypo’) in the previous 12 months. Adult participants (≥18 years) were recruited from a large NHS site, purposively sampled across age, sex and type and duration of diabetes. Demographic data were taken from hospital records. Recruitment continued until data saturation (defined by no new codes emerging for three interviews [13]) in a sample of sufficient breadth. Interviews were conducted online or by telephone due to COVID-19. A topic guide, informed by a previous review [4] and developed with the PAC, was produced to cover aspects of HRQoL potentially relevant to hypoglycaemia (see our online supplementary files, hosted by University of Sheffield; https://doi.org/10.15131/shef.data.23295284.v2, supplementary file A). Interviews with 31 people with diabetes were conducted between September 2020 and April 2021. The mean duration of each interview was 42 min (range 20–77 min). Participant demographics are shown in Table 1.

Table 1 Participant demographics

Interviews were conducted by two senior HRQoL researchers with qualitative experience. Interviews were audio recorded, transcribed verbatim and anonymised. Data was coded iteratively alongside data collection using Framework Analysis [14], following Gale et al [15]. Transcripts were analysed independently, with both researchers analysing their own interviews and dual coding 50%. An initial codebook informed by the draft HRQoL framework was used and refined based on emerging themes. Groups of four transcripts were coded at any one time, before the researchers met to discuss their coding and revise the working framework.

A final thematic framework, developed after all transcripts were coded, was used to inform draft PROM content in consultation with the PAC and Scientific Group. Where appropriate, multiple potential items were included to be tested in Stage 2. Item development followed a set of rules developed in previous research [16].

Stage 2: cognitive debriefing

Cognitive debriefing interviews were conducted with individuals with diabetes and clinicians to assess and refine the content validity of the draft PROM and reduce redundancy. Interviews were conducted online or by telephone in three iterative waves and facilitated by a topic guide (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file B).

Adult participants (≥18 years) were recruited from a large NHS site in the UK and from the Diabetologist’s Office, Diabetes Association, social media and word of mouth in Germany. Participants were purposively sampled across age, sex, type and duration of diabetes. Demographic data was taken from hospital records. Participants received the draft items in advance. Relevance (are the items, response options and recall period appropriate?), comprehensiveness (are all key concepts included?) and comprehensibility (is the PROM content understood as intended?) were assessed [7, 17]. Where items covered overlapping constructs (e.g. ‘I felt sad’ and ‘I felt depressed’), participants’ preference was elicited. Items were split into three aspects of HRQoL (physical, social, psychological), mirroring the Stage 1 framework. Cognisant of potential response burden, each participant was asked to review up to 40 items. Interviews were recorded and transcribed verbatim. Detailed notes and transcripts were used to refine draft PROM content. Eighty-four interviews were conducted (70 people with diabetes and 14 clinicians) between August 2021 and March 2022 across three waves. Participant demographics are shown in Table 1.

Stage 3: quantitative survey and finalising the PROM

The draft PROM content was administered as part of a survey to assess psychometric performance. The survey was completed by a large UK sample of people with diabetes (≥18 years) who self-reported at least one hypoglycaemic episode in the past 12 months, recruited from 25 NHS sites in the UK (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file C). Sampling involved a mixture of targeted and convenience strategies, including using existing patient databases to distribute e-mails and letters, putting up posters and giving invitations in clinic. To ensure sufficient sampling breadth for the psychometric analyses, a minimum sample of 1000 participants was targeted (at least 200 with type 2 diabetes).

The draft Hypo-RESOLVE QoL was presented alongside sociodemographic questions including self-reported gender, clinical background measures and other measures of HRQoL in a fixed order. The following measures were included: the Gold Score [18] and hypoglycaemia awareness questionnaire (HypoA-Q) [19] measures of hypoglycaemia awareness; a 100-point visual analogue scale (VAS) of hypoglycaemia-related QoL (‘at the moment’ and ‘past 4-weeks’ versions); DAWN2 Impact of Diabetes Profile (DIDP) [20] (‘currently’ and ‘past 4-weeks’ versions); and a generic measure of health-related QoL (EQ-5D-5L) [21]. A copy of the survey, with further details on measures used, is available (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file D).

The survey was hosted online, with paper surveys provided upon request. To assess test–retest reliability of the Hypo-RESOLVE QoL, a convenience sample, from two NHS sites, of online participants who expressed an interest were invited to take part again, a minimum of 4 weeks later. Survey responses at these sites were linked to participants’ clinical data obtained from medical records (HbA1c and continuous glucose monitoring data, where available) but these data were not used in the development of the PROM.

Survey data were cleaned and subject to a priori defined quality checks [12]. Psychometric analyses were conducted in R (v4.2.2; https://cran.r-project.org/bin/windows/base/old/4.2.2/) iteratively, using techniques described by Dima [22]. A sample of 1246 participants was used for analysis. Participant demographics are shown in Table 1. Hypo-RESOLVE QoL data was summarised descriptively, including the distribution of responses (to explore potential ceiling and/or floor effects) and missing data. Classical test theory (i.e. correlations, confirmatory factor analysis with maximum likelihood estimation [one-factor and three-factor models], Cronbach’s α) and item response theory (IRT; i.e. Mokken scale analysis based on a monotone homogeneity model; IRT partial credit model; differential item functioning [DIF]) analyses of the draft Hypo-RESOLVE QoL were undertaken to help inform final item selection. A partial credit model showed superior fit to a rating scale model in a likelihood ratio test. DIF was assessed by diabetes type (type 1 vs type 2) and gender (woman vs man) using logistic regression with IRT θ estimates as the conditioning variable.

A priori criteria used for assessing psychometric performance were developed and used to interpret results (Table 2). Tests that necessitated item grouping were based on the underlying theoretical HRQoL model (i.e. physical, social, psychological). Prior to parametric IRT analyses, sufficient unidimensionality was assessed using Mokken scale analysis homogeneity coefficients of the subscales. Mokken scale analysis was also used to assess assumptions of local dependence and monotonicity. Towards the latter stages of item selection (n=18 items), the same tests were also conducted on the overall scale as a total HRQoL score, which was assessed for sufficient homogeneity.

Table 2 Psychometric analyses used to help inform item selection (Stage 3)

Following final item selection (Stage 2.3), the Hypo-RESOLVE QoL was scored summatively, and Pearson correlations were estimated. This facilitated an initial analysis of the construct validity of the Hypo-RESOLVE QoL based on a priori hypotheses about its associations with other measures [12]. We expected to demonstrate convergent validity by the Hypo-RESOLVE QoL correlating moderately (at r≥0.3) with the VAS, DIDP and EQ-5D-5L, with the size of coefficient larger for more similar constructs (i.e. VAS > DIDP > EQ-5D-5L).

Test–retest reliability for the Hypo-RESOLVE QoL was estimated with the intraclass correlation coefficient (ICC) on a subset of repeat participants who demonstrated sufficient stability in the construct of interest (whose differences in scores on the VAS were within 1 SD of the mean). A two-way random effects model with absolute agreement and single-unit parameters was estimated. ICC values suggest reliability as follows: 0.5–0.75 moderate reliability; 0.75–0.9 good reliability; and >0.9 excellent reliability [23].

In addition to psychometrics, to produce the final version of the Hypo-RESOLVE QoL multiple sources of evidence were considered. This included: Stage 2 findings; a translatability assessment of draft PROM content in eight languages selected by the Hypo-RESOLVE Consortium (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file E); published criteria for selecting items for use in health economics [16]; and PAC and expert Advisory Group consultations. To facilitate decision-making, a traffic-light system was used to indicate whether responses to each item (from each source) were positive (green), mixed (amber) or negative (red) (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file E) [24, 25]. Decisions were made iteratively (allowing for updates to the psychometric information when changes were made) to help define the final PROM.

Results

Stage 1: item generation

Three higher-level themes (physical, social, psychological) were used to organise the qualitative data, consisting of 19 subthemes (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file F), from which items were generated. The full list of draft items (n=76) and how they mapped onto underlying themes is available (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file G). Items were rephrased to allow subsequent testing of severity and frequency response options. Five-item response scales and a 7 day recall period were initially selected for testing at Stage 2.

Stage 2: initial item testing

Reductions in the draft items were made in each wave, due to perceived redundancy or overlapping items (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file H). In the first two waves, participants provided suggestions for rewording of some items (e.g. ‘I could do what I wanted to do [in my life]’) and identified new items (e.g. ‘I had interrupted sex’). A preference for frequency-based response options was determined after Wave 2. Participants favoured the use of the five-point response scale. Most participants preferred a longer recall period than 7 days, so a 1 month recall period was tested at Wave 2 and approved by participants (rephrased as ‘4 weeks’ for consistency across months). Minor modifications were made to the instructions following Waves 1 and 2 to improve clarity (e.g. to emphasise that participants should consider the overall impact of hypoglycaemia). At the end of Wave 2, decisions on the final draft PROM content were made and tested in Wave 3. Six items were dropped in Wave 3 and no further substantive problems with the draft PROM were identified. All items were understood as intended and considered relevant to HRQoL in hypoglycaemia. The draft PROM was deemed sufficiently comprehensive and a revised draft 40-item version was agreed (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file I).

Stage 3: final item selection

The PAC rated two items and the Hypo-RESOLVE Advisory Group nine items as potentially redundant (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file E). The translatability assessment identified two problematic items (‘I felt ashamed’ and ‘my sex life was negatively affected’) if the content of the measure was adapted for use in other countries and/or cultures.

The online survey was accessed 2562 times and 213 paper surveys were returned. Of these, 821 online accesses did not result in participation, 307 responses were incomplete (280 left the survey prior to the Hypo-RESOLVE QoL and 27 started but did not complete the Hypo-RESOLVE QoL), 40 responses were duplicates and 321 responses were screened out (did not meet the inclusion criteria; including 18 paper surveys from which data were not inputted). To ensure data quality, 19 responses were excluded for being quicker than the estimated survey reading speed (at 300 words per minute [26]) or taking longer than 24 h, and 21 responses were excluded for straight-lining >25% of the Hypo-RESOLVE QoL, including opposing items (psychometric results are also presented with these participants included, https://doi.org/10.15131/shef.data.23295284.v2, supplementary file E). Seventy-five responses were examined for potential multivariate outliers but no suspicious or implausible response patterns were observed. This resulted in a sample of 1246 for analysis. A breakdown of responses per recruiting site is available (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file C).

Clinical and sociodemographic characteristics of the sample are given in Table 1. Mean age was 48.87 (SD 16.29) years, with a slight majority of women (n=684, 54.9%). Almost all participants were White (n=1175, 94.3%), with the majority having type 1 diabetes (n=993, 79.7%). The mean self-reported duration of diabetes was 22.68 (SD=15.70) years.

An initial analysis of the 40-item Hypo-RESOLVE QoL showed an unacceptable fit to the tripartite HRQoL model (with physical, social and psychological factors): χ2=5624.42, df=737, p<0.001; CFI=0.754; RMSEA=0.087 (90% CI 0.085, 0.090), p<0.001. Several potential problems with items were identified in the floor/ceiling, Mokken and IRT analysis. Several high intercorrelations between items were observed, particularly in the psychological domain, suggesting potential redundancy. A full set of psychometric results is available (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file E).

Eleven items were dropped from the Hypo-RESOLVE QoL. These included items that substantially overlapped with other items (i.e. intercorrelations ≥0.7) and some of the worst psychometric performers (unless there was a strong theoretical rationale for keeping the item). All decisions on which items were kept and dropped and the rationale are available in an item-tracking matrix (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file H).

The 29-item revised Hypo-RESOLVE QoL displayed an improved fit but this was still below acceptable levels: χ2=2482.54, df=374, p<0.001; CFI=0.840; RMSEA=0.079 (90% CI 0.076, 0.082), p<0.001. Notable problems also remained at the item level. A further 11 items were dropped from the scale, which showed a range of violations in the Mokken and/or IRT analysis (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file J).

The 18-item version of the measure revealed an acceptable fit to the HRQoL model: χ2=966.69, df=132, p<0.001; CFI=0.910; RMSEA=0.075 (90% CI 0.070, 0.079), p<0.001. Fewer potential problems were evident at the item level. However, some problems remained, including violations of local independence, potential floor effects and problematic DIF by diabetes type. At this point, an increasing trade-off was being made between item coverage (comprehensiveness) and statistical performance. In consultation with the Scientific Group and PAC, four final items were dropped from the scale (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file K).

The final, 14-item Hypo-RESOLVE QoL demonstrated an acceptable fit to the three-factor HRQoL model: χ2=472.27, df=74, p<0.001; CFI=0.943; RMSEA=0.069 (90% CI 0.063, 0.075), p<0.001. This showed a superior fit to a one-factor model: χ2=836.41, df=77, p<0.001; CFI=0.891; RMSEA=0.093 (90% CI 0.088, 0.099), p<0.001. Potential psychometric problems were minimised but some minor statistical issues remained (https://doi.org/10.15131/shef.data.23295284.v2, supplementary file L). Three items violated local independence at the domain level, although not when considered as part of the overall scale. One item at the domain level and two items at the overall scale level showed disordered thresholds. However, this was a small proportion of the total items, so the original response options were retained. One item showed a potential clustering of responses at the floor (54.2%) and another item was just below the threshold for good IRT item fit at the domain level and showed potential non-uniform DIF for diabetes type on the overall scale (McFadden R2 0.030; see electronic supplementary material [ESM] Fig. 1). It was deemed important to retain both to retain to facilitate the calculation of a social HRQoL domain score (which requires three or more items). All items were endorsed by at least 50% of participants in the cognitive debriefing study and PAC consultation and had a positive translatability assessment. Ultimately, no further revisions were considered justified.

Scoring and relationship with other variables

Simple summative scores were calculated for the three subdomains and total score of the Hypo-RESOLVE QoL so that a higher score represented better HRQoL (4 = none of the time, 0 = most or all of the time; the item ‘I could do what I wanted to do in my life’ is reverse-scored). Cronbach’s α for the Hypo-RESOLVE QoL total score was 0.91, with values of 0.85, 0.78 and 0.81 for the physical, social and psychological subscales, respectively. Scores for the HRQoL measures included in the study are in shown in Table 3.

Table 3 HRQoL measures used in Stage 3

The Hypo-RESOLVE QoL total score did not significantly differ between people with type 1 diabetes or type 2 diabetes (mean ± SD 34.56 ± 10.54 vs 35.31 ± 12.64, respectively; t=−0.73, df=217.46, p=0.464, d=−0.07). Men reported a higher total HRQoL than women (mean ± SD 36.38 ± 11.10 vs 33.44 ± 10.53, respectively; t=−4.52, df=1038.1, p<0.001, d=0.27). The Hypo-RESOLVE QoL total score was significantly greater in those who did not report having a hypoglycaemic event in the last week (n=224) than those who did (n=1022) (mean ± SD 39.6 ± 10.78 vs SD 33.75 ± 10.66, respectively; t=6.73, df=255.32, p<0.001, d=0.55), suggesting provisional known-group validity.

Correlations (Pearson and point-biserial) between the HRQoL measures and background characteristics are shown in Table 4. As hypothesised [12], moderate-to-large correlations were observed between the Hypo-RESOLVE QoL total score and the VAS (4 weeks) (r=0.59), DIDP (r=−0.55) and the EQ-5D-5L (r=0.46), in the expected order of magnitude. Thus, the measure showed preliminary convergent validity.

Table 4 Pearson and point-biserial correlations between HRQoL measures and background characteristics (Stage 3)

Test–retest reliability

Three hundred and seventeen online participants consented to being re-contacted and the second survey was accessed 169 times. Of these participants, seven did not go on to complete the survey and 18 left the survey prior to the Hypo-RESOLVE QoL. One participant was excluded due to completion time (see above), leaving 143 for test–retest analysis. In a subsample that had a difference in scores on the VAS (4 weeks) within 1 SD of the mean (n=108; median 34 days between surveys), the estimated ICC was 0.87 (95% CI 0.80, 0.91), which represents good reliability [23].

Discussion

Valid and reliable PROMs are indispensable tools for capturing patient-centred outcomes in clinical care and for assessing the effectiveness of healthcare interventions [27]. The Hypo-RESOLVE QoL is a novel 14-item measure, fit to an underlying theoretical framework of HRQoL, featuring a hierarchical three-domain structure of physical, social and psychological aspects and a superordinate HRQoL factor. The measure has good content validity and initial good supporting psychometric evidence, including structural validity, convergent validity, internal consistency and test–retest reliability.

Strengths and weaknesses

The Hypo-RESOLVE QoL was developed in accordance with best practice, using qualitative and quantitative data synthesised from three studies with a combined sample of 1347 people with diabetes. The development process incorporated collaborative input throughout, including a high degree of patient engagement, clinician input and wider stakeholder engagement [28]. Iterative qualitative development work with individuals with diabetes, incorporating techniques designed to enhance trustworthiness, has produced a PROM with evident content validity. The absence of DIF by gender in the final PROM suggests the measure will function similarly across men and women. The resultant 14-item instrument was designed to help minimise respondent burden while balancing content coverage and psychometric performance, and facilitate subsequent work on producing a preference-based scoring system for use in cost-effectiveness trials for hypoglycaemia [12].

PROM development is iterative and, despite its evident strengths, there are some limitations of the Hypo-RESOLVE QoL that may be addressed in future studies. It would be inaccurate to claim that the reduced 14-item version of the Hypo-RESOLVE QoL would capture everything of interest to all individuals with diabetes. The comprehensiveness of the 14-item version cannot be determined without further qualitative research. A trade-off must be acknowledged between achieving adequate comprehensiveness and satisfactory psychometric performance. The importance of different items to people with diabetes who experience hypoglycaemia was incorporated into the item selection, based on information from underlying qualitative work and consultation with the PAC, to ensure important aspects were captured. The 14-item measure was approved by the PAC. However, it should be noted that the content validity of the final version was not assessed qualitatively, including its comprehensiveness. Further, some concepts identified as important to people with diabetes (e.g. sexual functioning) were not included in the final instrument due to unsatisfactory psychometric performance. Future work should assess the content validity of the PROM in independent samples, including the additive benefit of additional ‘bolt-on’ items to the core scale for use in particular groups.

While psychometric results in the development of the measure were generally positive, some minor issues should be acknowledged, such as the significant χ2 test when assessing goodness-of-fit. Nevertheless, it is well known that the χ2 test is of limited value in large samples, as it is almost always significant, even in the cases of trivial misfit [29, 30]. Further, only one item retained in the Hypo-RESOLVE QoL is positively framed. While efforts have been made to make this clear in the final questionnaire design, subsequent work should assess whether this affects responses.

The Hypo-RESOLVE QoL was developed predominantly in the UK and the sample was predominantly White. The PROM benefits from supplementary cognitive debriefing in another European country, Germany and a formal translatability assessment on the included content. Evidence from these sources is positive. Nevertheless, as is standard in the adaptation of PROMs to new countries and cultures [11], future research is required to ensure the PROM works well in other settings throughout Europe and internationally. This should include work with more ethnically diverse samples. Furthermore, future PROM development projects may consider involving linguistic experts earlier in item generation.

In relation to other studies and PROMs used in hypoglycaemia-related research, the Hypo-RESOLVE QoL benefits from assessing HRQoL as a holistic concept rather than having a narrow focus on specific QoL impacts, such as fear [31, 32]. We provide supporting evidence for the content and structural validity of the scale, which is absent in other hypoglycaemia-specific measures [4]. A weakness of the Hypo-RESOLVE QoL relative to other PROMs is a current lack of data on responsiveness to change in clinical trials, which is available for some diabetes-specific PROMs (e.g. [33]).

Implications and future research

The Hypo-RESOLVE QoL can be used by clinicians and researchers in research and practice, including as a HRQoL outcome in clinical trials. A copy of the Hypo-RESOLVE QoL can be requested from the corresponding author. A subsequent preference-based scoring system will demonstrate utility in informing healthcare resource allocation decisions in hypoglycaemia, relevant to policymakers [12]. While a comprehensive body of development work is reported for the Hypo-RESOLVE QoL, including supporting psychometric evidence, studies in independent samples are required to further develop the evidence base for the measurement properties of the scale. This includes assessing the psychometric properties of the standalone 14-item version and, in particular, the responsiveness of the measure in clinical trials of intervention(s) for hypoglycaemia and its final comprehensiveness.

Conclusion

The Hypo-RESOLVE QoL has been designed as a flagship output from the international Hypo-RESOLVE Consortium. The scale was developed in collaboration with, and for measuring HRQoL in, people with diabetes who experience hypoglycaemia, to demonstrate good content validity relative to existing hypoglycaemia-specific PROMs. Evidence from qualitative and quantitative studies suggest that the Hypo-RESOLVE QoL performs well in people with diabetes. The generation of additional data obtained from using the Hypo-RESOLVE QoL will be of value in furthering our understanding of the impacts of hypoglycaemia and how these may be best managed to optimise patient outcomes.