Introduction

Over the past decade, there has been a 13% increase in mental health conditions globally, with one in five years of disability attributed to mental health issues [1]. Before the COVID-19 pandemic, global anxiety disorder prevalence was 3,824.9 per 100,000 population (298 million people), rising to 4802.4 (374 million people) post-pandemic, indicating an additional 76.2 million cases (25.6% increase). Similarly, for major depressive disorder, pre-pandemic prevalence was 2470.5 cases per 100,000 (193 million people), increasing to 3152.9 (246 million people) post-pandemic, with an additional 53.2 million cases (27.6% increase) [2]. Low- and middle-income countries (LMICs) have been disproportionately impacted by this health problem as evidenced by highest age standardised prevalence [3]. Globally, mental healthcare is underfunded and underprovided, with less than 2% of government health expenditures being allocated to mental health [1]. This is in stark contrast to the economic burden imposed by disorders such as depression and anxiety, which costs the global economy USD 1 trillion [1].

Anxiety and depression can significantly impact individuals’ health-related quality of life (HRQoL) [4], which is usually defined as a subjective assessment of an individual’s physical, emotional and social well-being [5]. HRQoL measures have gained increasing importance as an outcome measure to provide a comprehensive evaluation of the impact of illness on patients’ lives [6]. One of the challenges in assessing the impact of health interventions or programs on patient outcomes is the need to convert subjective responses about HRQoL into quantifiable measures that can be compared across different interventions or populations. Methods to measure ‘utility values’ that denote the relative value of different levels of HRQoL have been developed to address this challenge [7].

Utilities are used to calculate the quality-adjusted life year (QALY) metric, which is a commonly used outcome measure in cost-utility analysis (CUA) [8]. The QALY combines extensions to a person’s life expectancy and their HRQoL into a single metric. It can be used to evaluate health outcomes produced by different interventions and services across a variety of diseases and population groups. The length of life is adjusted using a utility value (a.k.a., utility weight) that numerically represents HRQoL on a scale between 0 (denoting death) and 1 (denoting perfect health). The most commonly used approach for estimating utility values is through the use of preference-based HRQoL instruments that are conventionally referred to as multi-attribute utility instrument (MAUI). MAUIs ask respondents to rate their health across several dimensions (i.e., their health state). Preference weights are then applied using a pre-determined scoring algorithm to estimate the corresponding utility value [7, 9]. Utility values are an important outcome measure and can help estimate the health benefits of different interventions and treatments [10]. To this end, several generic and disease-specific tools have been developed to measure HRQoL and calculate utility weights [11]. One of the most widely used and accepted measures for establishing utility weights is the EuroQol − 5 dimension (EQ-5D) [12]. The EQ-5D is a standardised questionnaire that measures health status across five dimensions on a three-level or five-level response scales: Mobility (Mob), Self-care (SC), usual activities (U/A), pain/discomfort (P/D), and anxiety/depression (A/D). It has value sets that have been validated in multiple countries and is available in over 200 languages. The tool contains the EQ-5D descriptive system and the EQ visual analogue scale (EQ VAS) [13].

In many resource-constrained settings, healthcare decision-making requires evidence from cost-effectiveness analyses to facilitate the efficient allocation of limited resources across competing healthcare interventions. Utility values are important elements in both trial- and model-based economic evaluations that use QALYs to guide resource allocation decisions [7]. Within the context of anxiety and depression, previous studies have provided such utility values for different disease severity levels [14]. However, existing utility values have been largely derived from developed countries, limiting their applicability for modelled economic studies conducted for LMICs [15]. Further, given that utilities can be derived using different HRQoL questionnaires that produce different scores, it is imperative to estimate utility values for different severity levels using the same instrument to inform modelled economic analysis [16, 17]. To address these gaps, the aim of this study was to describe the HRQoL, estimate the associated health state utility values (HSUVs) and explore factors associated with HRQoL of patients with anxiety and/or depression in a resource-limited hospital setting.

Method

Study design

This cross-sectional study, conducted in a tertiary care mental health hospital, recruited patients with anxiety and/or depression. The study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines to ensure accurate and complete reporting of the study’s methods and results [18]. A copy of the completed STROBE checklist for this study is available in Supplementary File [Supplementary file 1].

Study setting and recruitment

This hospital-based survey recruited participants who were receiving outpatient services at Amanuel Mental Specialised Hospital. It is the only specialised psychiatric hospital in Ethiopia, situated in Addis Ababa, the country’s capital. Patients from all regions of the country, both urban and rural, visit the hospital for inpatient and outpatient psychiatric care. This ensured the inclusion of participants with diverse demographic backgrounds, making the sample more representative. This study involved patients with anxiety and/or depression who attended outpatient follow-up visits at the hospital. Patients treated in an inpatient setting were excluded from this study. Diagnosis of anxiety and depression was based on clinician assessments reported in the patient’s medical records (patient case notes). In the hospital, clinicians primarily use The Diagnostic and Statistical Manual of Mental Disorders (DSM-5) to diagnose and confirm cases of anxiety and depression [19]. After completing their outpatient visit, research nurses who had access to patients’ medical records invited eligible patients to an interview, which was done in a separate room where clinicians or other hospital staff were not involved. Eligible patients with anxiety and/or depression were asked to provide an informed consent before the interview. Further information about recruitment procedure is provided in previous publication (Belay et al., 2023) [20]. The sample size (462) was determined using a single proportion formula, considering a 95% confidence level, 5% margin of error, a proportion (P) of 0.5 and a 20% non-response rate [21]. Participants were consecutively recruited using a convenience sampling method until the target sample size was reached, comprising approximately equal thirds: 149 with anxiety, 159 with depression, and 154 with both anxiety and depression.

Data collection and instrument

The data collection was carried out between June and October 2021 by five research nurses using an interviewer-administered approach to ensure the participation of individuals with no formal education who might struggle with self-complete tasks. The Amharic version of the interviewer-administered EQ-5D-5 L, which is widely used as the official working language in Ethiopia, was employed to assess HRQoL. This translation, conducted as part of the Ethiopian valuation study, followed the standardised approach recommended by the EuroQol Group [22, 23]. Validated Amharic versions of instruments were used to collect data on participant characteristics and mental health status. This included the Patient Health Questionnaire-9 (PHQ-9), a nine-item questionnaire rating depressive symptoms in the last two weeks on a scale of zero (not at all) to 4 (nearly every day) [24, 25], and the Generalized Anxiety Disorder-7 (GAD-7), seven-item questionnaire rating anxiety symptoms on a scale of zero (not at all) to 4 (nearly every day) in the last two weeks [26, 27]. All HRQoL measures, participants’ demographic (gender, age, education, employment status, income, and marital status), behavioural characteristics (smoking and alcohol) and symptom severity (PHQ-9 and GAD-7), were self-reported by patients. Patient clinical profiles (diagnosis, type of anxiety or depression, duration of illness, comorbidity, and medication) were collected from patient medical records (patient case notes) that had been documented by clinicians.

Analysis

The analysis incorporated data from all 462 participants, with no missing responses, due to the data collection approach using in person interviews. Demographic and clinical characteristics were described using frequency and percentage. The EQ-5D-5 L responses were converted into a HSUVs using value set developed for Ethiopia [23]. To derive the HSUVs value from each EQ-5D-5 L health profile, we applied a formula that subtracts the disutility weight of each level in each dimension from 1 (representing full health). This approach is outlined in the methods for analysing and reporting EQ-5D data by Devlin et al.,2020 [28].

The study presented the proportions of reported problems (level 2 to level 5) for each dimension of the EQ-5D-5 L across demographic and clinical characteristics of the population. The PHQ-9 scores were categorised for depression severity: 1–4 (minimal), 5–9 (mild), 10–14 (moderate), 15–19 (moderately severe), and 20–27 (severe) [29]. GAD-7 scores were categorised for anxiety severity: 0–4 (minimal), 5–9 (mild), 10–14 (moderate), and 15–21 (severe) [27]. Internal consistency was assessed using Cronbach’s alpha and was considered acceptable only if it exceeded 0.70.

HSUVs by severity level and other clinical and demographic profile were described using means and standard deviations (SD). T-test and one-way ANOVA analyses were used to compare the mean difference in HSUVs across different groups. Scheffe’s post hoc comparisons and effect sizes (Cohen’s d statistic) were used to assess the magnitude of group differences. Cohen’s d used to quantify differences by converting the mean difference between two groups into standard deviation units (mean of group 1 - mean of group 2/ SD). Cohen’s d scores were interpreted as follows: < 0.2 = small effect size, approximately 0.5 = moderate effect size, and > 0.80 = large effect size [30]. To explore factors associated with HRQoL, the univariate association between independent variables and HRQoL measures (HSUV and EQ VAS) was examined using ordinary least squares regression. A hierarchical multiple regression analysis was then conducted to determine the relationship between demographic variables, clinical variables and behavioural variables with HSUVs and EQ VAS. In step 1, a regression model was constructed to evaluate the relationship between HSUVs or EQ VAS scores and demographic variables. In step 2, the analysis was extended to include clinical variables and in step 3, behavioural variables were added to the model. Model selection was performed based on the adjusted R-squared, Akaike information criterion (AIC), and Bayesian information criterion (BIC). The model with the highest adjusted R-squared and the lowest AIC and BIC was considered the best-fitting model. Model 3 demonstrated better performance based on the performance indices of the hierarchical regression analysis. The significance level was set at p < 0.05. The analysis was performed using Stata version 17.

Result

Participant characteristics

We recruited 462 participants with comparable numbers of patients diagnosed with anxiety only (149), depression only (159) and comorbid anxiety and depression (154). The most commonly reported types of anxiety and depression was generalised anxiety disorders (56.1%) and major depressive disorder (64.7%) respectively. Most participants (53.9%) were below the age of 35 years, 45.2% were unmarried, 30.7% had primary education, 66.7% reported insufficient income, 79.7% did not smoke and 61.4% reported having never consumed alcohol. According to the PHQ-9 scores, 33.6% of respondents had moderate depression, with the rest distributed among mild (23.5%), moderately severe (21.0%), and severe (21.8%) depression. Based on the GAD-7 scores 37.5% of participants reported mild anxiety, 40.0% had moderate, and 22.5% reported severe anxiety (See Tables 1 and 2).

Table 1 Summary score of HSUVs and EQ VAS by socio-demographic characteristics (n = 462)
Table 2 Summary score of HSUVs and EQ VAS by clinical characteristics (n = 462)

Patient-reported health problems on the EQ-5D-5 L

Regardless of differences in respondent demographic and clinical characteristics, a higher proportion of reported problems were observed in the P/D and A/D dimensions of the EQ-5D-5 L measure. The older age group and pensioners exhibited a higher prevalence of problems across various dimensions of the EQ-5D instrument (Fig. 1). Additionally, individuals with severe anxiety or depression symptoms reported more problems compared to those with milder symptoms (Fig. 2).

Fig. 1
figure 1

Number of participants (n) reported problems (levels 2–5) on EQ-5D-5 L health dimensions across socio-demographic and behavioural profiles (n = 462)

Fig. 2
figure 2

Number of participants (n) reported problems on EQ-5D-5 L health dimensions across clinical profiles (n = 462)

Patient-reported HSUVs and EQ VAS by respondent characteristics

The PHQ-9, GAD-7 and EQ-5D-5 L demonstrated good internal consistency with a Cronbach’s alpha 0.90, 0.77 and 0.80 respectively. The mean HSUV and EQ VAS score of the total sample were 0.87 (SD = 0.17) and 71.4 (SD = 19.1), respectively. The age group of 18–35 years had the highest mean HSUVs (0.9) and EQ VAS score (73.8), while individuals over the age of 55 had the lowest mean scores for both HSUVs (0.78) and EQ VAS scores (61.8). The differences in reported HRQoL between the younger and older populations were large, as demonstrated by Cohen’s d effect sizes of 0.83 for HSUVs and 0.67 for EQ VAS scores (see Table 1). Patients experiencing both anxiety and depression scored significantly lower on HRQoL measures (HSUV = 0.83 [0.16], EQ VAS = 64.4 [17.9]) compared to those with either anxiety only (HSUV = 0.88 [0.17], EQ VAS = 75.3 [17.9]) or depression only (HSUV = 0.89 [0.18], EQ VAS 74.4 [19.7]). However, the observed effect sizes of these differences were small to moderate. Significant differences in the reported HSUVs and EQ VAS were observed across the various clinical variables, with the largest differences occurring between mild and severe conditions (see Table 2).

Patient-reported HSUVs, EQ VAS and associated respondents’ characteristics

The results of the univariate analysis revealed significant negative associations between HSUVs and factors such as age (β = -0.003 [-0.004,-0.002]), GAD-7 score (β = -0.01 [-0.013, − 0.007]), PHQ-9 score (β = -0.009 [-0.011, − 0.007]), duration of illness (β = -0.004 [-0.007, − 0.001]), comorbid diagnosis (HTN: β =-0.103 [-0.172, − 0.034]), and multiple medication use (β = -0.047 [-0.079, − 0.014]). In contrast, having ‘more than enough’ income and being a non-smoker was associated with a higher score in HSUVs. Age was consistently and significantly associated with a lower HSUVs across the three models. The clinical profiles, including higher PHQ-9 scores, comorbid hypertension and multiple medication use, were also consistently negatively associated with the HSUVs across model 2 and model 3 (see Table 3).

Table 3 Regression analysis results with HSUVs as the outcome variable

With regard to the EQ VAS, the univariate analysis presented in Table 4 demonstrate that being a non-smoker and having ‘more than enough’ income was associated to higher EQ VAS scores. Conversely, having a frequent alcohol drinking habit, a diagnosis of both anxiety and depression, a higher in PHQ-9 score, or having comorbid hypertension were associated with a decrease in EQ VAS scores. In the multivariate analysis, consistent positive associations with EQ VAS scores were observed for those who reported having enough income across all models (Model 3: β = 11.8 [4.5,19]). Clinical variables such as PHQ-9 score (Model 3: β = -0.87 [-1.16, -0.58]), having comorbid hypertension (Model 3: β = -7.14 [-14.28, -0.14]) and taking multiple medications (Model 3: β = -4.27 [-7.74, -0.8]) demonstrated an association with EQ VAS scores in both univariate and multivariate analyses.

Table 4 Regression analysis results with EQ VAS scores as the outcome variable

Discussion

Main findings

The majority of respondents reported problems on the A/D (more than two thirds of respondents) and P/D (over half of the respondents) dimensions. The frequency of reported problems was higher among respondents experiencing more severity of anxiety or depression. This is consistent with a previous study, which reported that patients with mental illness, such as anxiety disorders, more frequently reported problems on the EQ-5D’s A/D and P/D dimensions compared to other dimensions [31]. The mobility and self-care dimensions had the least reported problems. However, it is important to note that the problems reported on mobility and selfcare should not be ignored. Adopting a comprehensive approach to treating anxiety and depression disorders that addresses both the psychological and physical aspects of the condition might improve the prevalence of reported problems across all the EQ-5D dimensions. Specifically, incorporating interventions that promote physical activity and self-care practices alongside treatments that primarily focus on patients’ mental health may lead to better outcomes in enhancing overall health outcome [32]. Our study findings indicate that older adults tend to report more problems across all five EQ-5D-5 L dimensions. This is supported by existing literature, which demonstrates a decrease in EQ-5D-5 L scores as individuals age increases [33].

The study results indicate that the reported mean score on the EQ-5D-5 L and EQ VAS were 0.87 (SD = 0.17) and 71.4 (SD = 19.1), respectively. These values were lower than the utility values of Ethiopia’s general population, which were 0.94 (SD = 0.10) and 87.26 (SD = 13.64) on the EQ-5D and EQ VAS scores, respectively [23]. The observed difference is expected since the general population includes both healthy individuals and those with disease conditions. However, the mean HSUVs reported by participants in the current study were lower than the values reported for patients with diabetes (mean EQ-5D-5 L = 0.95 and mean EQ VAS = 80) [34] and HIV/AIDS (mean EQ-5D-5 L = 0.95 and mean EQ VAS = 80) [35] in Ethiopia. Studies consistently indicate a minimum clinically important difference of 0.037 to 0.069 for the EQ-5D-5 L instrument [36, 37]. Therefore, in the context of chronic disease, the observed score differences hold significant clinical importance. The measurement scales used in this study demonstrated good internal consistency, with Cronbach’s alpha coefficients indicating strong reliability: PHQ-9 (α = 0.90) and GAD-7 (α = 0.77). These values are consistent with those previously reported in validation studies, where PHQ-9 had an alpha of 0.81 [24] and GAD-7 had an alpha of 0.77 [26].

The overall mean HSUV (Mean (SD) 0.87 (0.17) is surprisingly high compared to studies in high-income settings using the EQ-5D (mean (SD) 0.685 (0.218) for respondents with depression [16]. When looking at different levels of severity, our study found that the mean HSUVs for depression varied from 0.91 for moderate to 0.78 for to severe depression. In contrast, a study conducted in Sweden reported lower means ranging from 0.46 for moderate to 0.27 for severe depression [38]. This calls for a study to look at how people in high-income and low-income settings understand the EQ-5D questionnaire, which could potentially influence the instrument’s validity. Furthermore, the use of an interviewer administered version of the tool in our study may have introduced potential bias due to the presence of an interviewer, which may have led to positive responding bias.

The results suggest that self-reported health status may be influenced by both age and gender, with younger and male individuals reporting better health status. The literature also previously indicated that women are more likely than men to experience depression and that older adults are at a higher risk of developing mental health issues, including depression [39]. This highlights the potential need for targeted interventions to improve health outcomes for female and older populations.

In addition to comparing the mean scores, this study explored the association of individuals’ demographic, clinical and behavioural characteristics on the reported HSUVs and EQ VAS score. In the univariate analysis, a strong negative association was found between reported HSUVs and having an anxiety disorder (β=-0.139) or being a pensioner (β=-0.13), while a strong positive association was observed in non-smokers (β = 0.091) when compared to daily smokers. This observation is consistent with existing studies, which have shown that quitting smoking can have a substantial positive impact on reducing anxiety and depression, as well as improving overall quality of life [40] Therefore, it is imperative to emphasise and enhance behavioural education for smoking cessation, in line with the Ethiopian mental health policy [41].

A multiple regression analysis was used to investigate how various characteristics of the respondents are associated with the reported HSUVs and EQ VAS scores. Older age consistently showed a significant negative association with the HSUVs across all models. These findings highlight that it might be important to consider age, along with clinical and behavioural factors, when assessing the HSUVs. The pattern of association of variables with EQ VAS score showed some minor differences compared with the HSUVs. For instance, age showed significance only in the univariate analysis and Model 1 of the multiple regression analysis; however, it did not exhibit significance in Models 2 and 3. In addition to age, marital status and income were associated with EQ VAS scores. The discrepancy in results between the EQ VAS and EQ-5D-5 L may be attributed to the fact that these measures capture different concepts. The EQ VAS provides a subjective assessment of overall health using a single scale, while the EQ-5D-5 L encompasses multiple dimensions of HRQoL and assigns a value to different health states based on societal preferences [42].

Our study investigated the association between depression severity (PHQ-9), anxiety severity (GAD-7), and HSUVs as measured by the EQ-5D index. Despite the potential overlap between anxiety/depressive symptoms and health-related quality of life (HRQOL) measures, our primary objective was to establish HSUVs across varying levels of anxiety and depression severity. The PHQ-9 and GAD-7 are well-validated tools for assessing the severity of depression and anxiety, respectively, within the Ethiopian context [24, 26]. While these tools primarily focus on psychological well-being, the EQ-5D-5 L addresses multiple health dimensions, with only one item directly related to mental well-being (anxiety/depression). This led to some conceptual overlap due to shared constructs between the measures. This overlap might reflect a structural association, given the inclusion of mental health dimensions in both the PHQ-9/GAD-7 and the EQ-5D-5 L. Analytically, the correlation analysis revealed moderate correlations (r = -0.28 to -0.38) between the EQ-5–5 L and the symptom scales, aligning with previous studies that reported correlations in the range of r = -0.39 to -0.41, primarily driven by the anxiety/depression dimension of the EQ-5D-5 L [43]. These moderate correlations are consistent with the existing literature, suggesting that while some overlap exists, the tools also measure distinct constructs.

Policy implications

This analysis has shown that older age or severity of the symptom and the presence of comorbidities, associated with a lower self-reported HRQoL, as depicted with reduced HSUVs and EQ VAS scores. However, marital status and income, as well as duration of illness and types of anxiety also have an association with HRQoL scores. Therefore, mental health service needs to be tailored to account for improvements in disease severity. Furthermore, social support packages that improve the income of individuals and counselling services related to marital or family issues might assist with improving the perceived HRQoL of patients. Income might be linked to access to healthcare, particularly in Ethiopia, where high out-of-pocket expenses are prevalent, potentially affecting patients’ HRQoL [44]. Our study has further important implications for the health technology assessment process in Ethiopia. The utility values reported in this study for various health states could be beneficial in conducting economic evaluations of anxiety and depression interventions in Ethiopia and similar resource-constrained settings.

Strengths and limitations

This study is the first to establish HSUVs based on the severity of anxiety and depression and generate a coefficient based on symptom severity scores for Ethiopia. The study incorporated the preferences of the Ethiopian general population to estimate HSUVs for anxiety and depression [23]. This approach of estimating HSUVs offers an ethically acceptable foundation for resource allocation decisions, as it integrates societal preferences [7]. The inclusion of patients from different diagnostic groups and the use of a validated tool to assess symptom severity are also strengths of this study. However, there are some study limitations. While we observe variations in HSUVs based on the severity measured by the PHQ-9 and GAD-7, it is essential to recognise the conceptual overlap between these measures and EQ-5D-5 L particularly with the anxiety/depression dimension, which could contribute to this observed correlation [45]. The study’s applicability is limited to tertiary care service users and outpatients, as it did not include primary or secondary care service users or inpatients, in accordance with the country’s healthcare referral system. Participants were recruited using a convenience sampling approach within a single hospital. While this approach may have limited the generalisability of the study, it is important to note that the hospital from which participants were recruited is the only specialised psychiatric hospital in Ethiopia, representing the majority of patients affected by anxiety and/or depression. However, our findings should be interpreted within the context of this sampling approach, acknowledging the potential for selection bias. The reported literacy rate in our study was relatively higher (only 11% reported no formal education) compared to the low literacy rates in Ethiopia, with adult literacy at 52% [46]. Therefore, it is important to consider this limitation when extrapolating the findings within the country’s context. Administering the EQ-5D-5 L questionnaire through an interviewer may have introduced potential bias due to the presence of an interviewer, thereby limiting the applicability of the results to self-complete scenarios.

Conclusion

In conclusion, this study found that overall, a higher proportion of reported problems was observed in the P/D and A/D dimensions of the EQ-5D-5 L measure. The age and clinical profiles of respondents were associated with differences in reported problems across EQ-5D-5 L health dimensions. The reported mean HSUVs and EQ VAS scores of respondents were lower than those reported in the general population and among individuals with other physical health conditions. Older age, higher levels of anxiety or depression severity and the presence of comorbid diagnoses were consistently associated with lower HSUVs and EQ VAS scores. To enhance health outcomes, clinicians managing anxiety or depression should consider age alongside clinical factors.